Challenges associated with missing data in electronic health records: A case study of a risk prediction model for diabetes using data from Slovenian primary care.
Case-Control Studies
Cross-Sectional Studies
Data Accuracy
Diabetes Mellitus, Type 2
/ diagnosis
Electronic Health Records
/ standards
Female
Humans
Male
Middle Aged
Primary Health Care
/ statistics & numerical data
Quality Improvement
Risk Assessment
/ methods
Slovenia
/ epidemiology
Surveys and Questionnaires
databases and data mining
electronic health records
missing data
primary care
quality control
type 2 diabetes
Journal
Health informatics journal
ISSN: 1741-2811
Titre abrégé: Health Informatics J
Pays: England
ID NLM: 100883604
Informations de publication
Date de publication:
09 2019
09 2019
Historique:
pubmed:
14
10
2017
medline:
17
7
2020
entrez:
14
10
2017
Statut:
ppublish
Résumé
The increasing availability of data stored in electronic health records brings substantial opportunities for advancing patient care and population health. This is, however, fundamentally dependant on the completeness and quality of data in these electronic health records. We sought to use electronic health record data to populate a risk prediction model for identifying patients with undiagnosed type 2 diabetes mellitus. We, however, found substantial (up to 90%) amounts of missing data in some healthcare centres. Attempts at imputing for these missing data or using reduced dataset by removing incomplete records resulted in a major deterioration in the performance of the prediction model. This case study illustrates the substantial wasted opportunities resulting from incomplete records by simulation of missing and incomplete records in predictive modelling process. Government and professional bodies need to prioritise efforts to address these data shortcomings in order to ensure that electronic health record data are maximally exploited for patient and population benefit.
Identifiants
pubmed: 29027512
doi: 10.1177/1460458217733288
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM