Assessing the suitability of general practice electronic health records for clinical prediction model development: a data quality assessment.
Clinical prediction model development
Data linkage
Data quality assessment
Electronic health records
Electronic medical records
General practice
Primary care
Journal
BMC medical informatics and decision making
ISSN: 1472-6947
Titre abrégé: BMC Med Inform Decis Mak
Pays: England
ID NLM: 101088682
Informations de publication
Date de publication:
30 10 2021
30 10 2021
Historique:
received:
05
07
2021
accepted:
20
10
2021
entrez:
31
10
2021
pubmed:
1
11
2021
medline:
24
11
2021
Statut:
epublish
Résumé
The use of general practice electronic health records (EHRs) for research purposes is in its infancy in Australia. Given these data were collected for clinical purposes, questions remain around data quality and whether these data are suitable for use in prediction model development. In this study we assess the quality of data recorded in 201,462 patient EHRs from 483 Australian general practices to determine its usefulness in the development of a clinical prediction model for total knee replacement (TKR) surgery in patients with osteoarthritis (OA). Variables to be used in model development were assessed for completeness and plausibility. Accuracy for the outcome and competing risk were assessed through record level linkage with two gold standard national registries, Australian Orthopaedic Association National Joint Replacement Registry (AOANJRR) and National Death Index (NDI). The validity of the EHR data was tested using participant characteristics from the 2014-15 Australian National Health Survey (NHS). There were substantial missing data for body mass index and weight gain between early adulthood and middle age. TKR and death were recorded with good accuracy, however, year of TKR, year of death and side of TKR were poorly recorded. Patient characteristics recorded in the EHR were comparable to participant characteristics from the NHS, except for OA medication and metastatic solid tumour. In this study, data relating to the outcome, competing risk and two predictors were unfit for prediction model development. This study highlights the need for more accurate and complete recording of patient data within EHRs if these data are to be used to develop clinical prediction models. Data linkage with other gold standard data sets/registries may in the meantime help overcome some of the current data quality challenges in general practice EHRs when developing prediction models.
Sections du résumé
BACKGROUND
The use of general practice electronic health records (EHRs) for research purposes is in its infancy in Australia. Given these data were collected for clinical purposes, questions remain around data quality and whether these data are suitable for use in prediction model development. In this study we assess the quality of data recorded in 201,462 patient EHRs from 483 Australian general practices to determine its usefulness in the development of a clinical prediction model for total knee replacement (TKR) surgery in patients with osteoarthritis (OA).
METHODS
Variables to be used in model development were assessed for completeness and plausibility. Accuracy for the outcome and competing risk were assessed through record level linkage with two gold standard national registries, Australian Orthopaedic Association National Joint Replacement Registry (AOANJRR) and National Death Index (NDI). The validity of the EHR data was tested using participant characteristics from the 2014-15 Australian National Health Survey (NHS).
RESULTS
There were substantial missing data for body mass index and weight gain between early adulthood and middle age. TKR and death were recorded with good accuracy, however, year of TKR, year of death and side of TKR were poorly recorded. Patient characteristics recorded in the EHR were comparable to participant characteristics from the NHS, except for OA medication and metastatic solid tumour.
CONCLUSIONS
In this study, data relating to the outcome, competing risk and two predictors were unfit for prediction model development. This study highlights the need for more accurate and complete recording of patient data within EHRs if these data are to be used to develop clinical prediction models. Data linkage with other gold standard data sets/registries may in the meantime help overcome some of the current data quality challenges in general practice EHRs when developing prediction models.
Identifiants
pubmed: 34717599
doi: 10.1186/s12911-021-01669-6
pii: 10.1186/s12911-021-01669-6
pmc: PMC8557028
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
297Informations de copyright
© 2021. The Author(s).
Références
J Med Internet Res. 2018 May 29;20(5):e185
pubmed: 29844010
Int J Epidemiol. 2019 Dec 1;48(6):1741-1741h
pubmed: 31292616
Pharmacoepidemiol Drug Saf. 2020 Jan;29(1):9-17
pubmed: 31736248
EGEMS (Wash DC). 2016 Sep 11;4(1):1244
pubmed: 27713905
J Chronic Dis. 1987;40(5):373-83
pubmed: 3558716
Aust N Z J Public Health. 2005 Dec;29(6):565-71
pubmed: 16366069
J Comp Eff Res. 2016 Jul;5(4):345-54
pubmed: 27346480
Fam Pract. 2006 Apr;23(2):253-63
pubmed: 16368704
Inform Prim Care. 2006;14(3):203-9
pubmed: 17288707
Med J Aust. 2019 Apr;210 Suppl 6:S12-S16
pubmed: 30927466