The Impact of Name Transformation on Match Rates Within a Large Consumer Database.


Journal

AMIA ... Annual Symposium proceedings. AMIA Symposium
ISSN: 1942-597X
Abbreviated title: AMIA Annu Symp Proc
Country: United States
NLM ID: 101209213

Publication information

Publication date:
2022
History:
medline: 3 5 2023
pubmed: 2 5 2023
entrez: 2 5 2023
Status: epublish

Abstract

Accurate record linkage depends on the availability and quality of features such as first name and last name. Privacy-preserving record linkage methods that use tokenization are sensitive to perturbations in the patient features used as inputs. In this study we evaluated the impact of name transformations on the accuracy of patient matching using a large commercial dataset. We used a set of 68 million records representing 59 million unique individuals, implemented eight name transformation strategies, and computed precision, recall and F1 scores for each. Transforming names to include the most common nicknames resulted in a significant gain in recall while maintaining precision, and generated the highest F1 score compared with no name transformation (0.905 vs 0.807). Strategies tailored to transforming patient features can improve the precision and recall of patient matching, and make it possible to create high-quality linked datasets for research purposes.
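
To make the idea in the abstract concrete, the following is a minimal sketch of how a nickname transformation might be applied before tokenization, and how pairwise precision, recall and F1 could then be computed. It is not the authors' implementation: the nickname table, the token recipe (a SHA-256 hash of first name, last name and birth date), the helper names and the five toy records are all assumptions made for illustration.

```python
import hashlib
from itertools import combinations

# Hypothetical nickname table (assumption); a real table would be far larger.
NICKNAMES = {
    "bill": "william",
    "will": "william",
    "liz": "elizabeth",
    "beth": "elizabeth",
}


def normalize_first_name(first_name, use_nicknames):
    """Lowercase and trim the name; optionally map a nickname to its formal form."""
    cleaned = first_name.strip().lower()
    if use_nicknames:
        return NICKNAMES.get(cleaned, cleaned)
    return cleaned


def make_token(first_name, last_name, birth_date, use_nicknames):
    """Build an illustrative token by hashing the normalized features (assumed recipe)."""
    key = "|".join([
        normalize_first_name(first_name, use_nicknames),
        last_name.strip().lower(),
        birth_date,
    ])
    return hashlib.sha256(key.encode("utf-8")).hexdigest()


def pairwise_scores(records, use_nicknames):
    """Return (precision, recall, F1) over all record pairs.

    Each record is (true_person_id, first_name, last_name, birth_date).
    Two records are linked when their tokens are identical.
    """
    tokens = [make_token(f, l, d, use_nicknames) for _, f, l, d in records]
    tp = fp = fn = 0
    for i, j in combinations(range(len(records)), 2):
        same_person = records[i][0] == records[j][0]
        linked = tokens[i] == tokens[j]
        if linked and same_person:
            tp += 1
        elif linked:
            fp += 1
        elif same_person:
            fn += 1
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Toy records (assumption): person 1 appears under three name variants.
RECORDS = [
    (1, "Bill", "Smith", "1980-01-01"),
    (1, "William", "Smith", "1980-01-01"),
    (1, "Billy", "Smith", "1980-01-01"),  # variant missing from the table
    (2, "Liz", "Jones", "1975-05-05"),
    (2, "Elizabeth", "Jones", "1975-05-05"),
]

if __name__ == "__main__":
    for flag in (False, True):
        print(flag, pairwise_scores(RECORDS, use_nicknames=flag))
```

Because exact-match tokens change completely when any input character changes, "Bill" and "William" never link without the transformation. Mapping both to one form recovers those pairs, while variants absent from the table (here "Billy") remain unlinked, which is why the coverage of the nickname table bounds the recall gain.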

Identifiers

pubmed: 37128403
pii: 447
pmc: PMC10148307

Publication types

Journal Article

Languages

eng

Citation subsets

IM

Pagination

692-699

Copyright information

©2022 AMIA - All rights reserved.

Authors

Jonah Leshin (J)

Datavant, San Francisco, CA.

Arjun Sanghvi (A)

Datavant, San Francisco, CA.

Kavi Ravuri (K)

Datavant, San Francisco, CA.

Matthew Owen (M)

Datavant, San Francisco, CA.

Abel Kho (A)

Northwestern University, Chicago, IL.
