Record Linkage and Deduplicating Data with ML
Regular Session
Machine learning and fuzzy matching can enable us to identify duplicate or linked records across datasets, even when the records don’t have a common unique identifier.
Ahmad Firjani will explain how he used machine learning algorithms to link matching records from clinic dataset to other patient datasets. These data include significant overlap, but neither Patient IDs nor most names or addresses match exactly.