Demonstrating the consequences of learning missingness patterns in early warning systems for preventative health care: A novel simulation and solution.

Early warning systems Machine learning Patient deterioration Simulation Tree-based methods Variational autoencoder

Journal

Journal of biomedical informatics
ISSN: 1532-0480
Titre abrégé: J Biomed Inform
Pays: United States
ID NLM: 100970413

Informations de publication

Date de publication:
10 2020
Historique:
received: 17 01 2020
revised: 20 05 2020
accepted: 03 08 2020
pubmed: 17 8 2020
medline: 29 7 2021
entrez: 16 8 2020
Statut: ppublish

Résumé

When using tree-based methods to develop predictive analytics and early warning systems for preventive healthcare, it is important to use an appropriate imputation method to prevent learning the missingness pattern. To demonstrate this, we developed a novel simulation that generated synthetic electronic health record data using a variational autoencoder with a custom loss function, which took into account the high missing rate of electronic health data. We showed that when tree-based methods learn missingness patterns (correlated with adverse events) in electronic health record data, this leads to decreased performance if the system is used in a new setting that has different missingness patterns. Performance is worst in this scenario when the missing rate between those with and without an adverse event is the greatest. We found that randomized and Bayesian regression imputation methods mitigate the issue of learning the missingness pattern for tree-based methods. We used this information to build a novel early warning system for predicting patient deterioration in general wards and telemetry units: PICTURE (Predicting Intensive Care Transfers and other UnfoReseen Events). To develop, tune, and test PICTURE, we used labs and vital signs from electronic health records of adult patients over four years (n = 133,089 encounters). We analyzed primary outcomes of unplanned intensive care unit transfer, emergency vasoactive medication administration, cardiac arrest, and death. We compared PICTURE with existing early warning systems and logistic regression at multiple levels of granularity. When analyzing PICTURE on the testing set using all observations within a hospital encounter (event rate = 3.4%), PICTURE had an area under the receiver operating characteristic curve (AUROC) of 0.83 and an adjusted (event rate = 4%) area under the precision-recall curve (AUPR) of 0.27, while the next best tested method-regularized logistic regression-had an AUROC of 0.80 and an adjusted AUPR of 0.22. To ensure system interpretability, we applied a state-of-the-art prediction explainer that provided a ranked list of features contributing most to the prediction. Though it is currently difficult to compare machine learning-based early warning systems, a rudimentary comparison with published scores demonstrated that PICTURE is on par with state-of-the-art machine learning systems. To facilitate more robust comparisons and development of early warning systems in the future, we have released our variational autoencoder's code and weights so researchers can (a) test their models on data similar to our institution and (b) make their own synthetic datasets.

Identifiants

pubmed: 32795506
pii: S1532-0464(20)30156-8
doi: 10.1016/j.jbi.2020.103528
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

103528

Informations de copyright

Copyright © 2020 Elsevier Inc. All rights reserved.

Auteurs

Christopher E Gillies (CE)

Department of Emergency Medicine, United States; Michigan Center for Integrative Research in Critical Care (MCIRCC), United States; Michigan Institute for Data Science (MIDAS), University of Michigan, Ann Arbor, United States. Electronic address: cgillies@med.umich.edu.

Daniel F Taylor (DF)

Department of Emergency Medicine, United States; Michigan Center for Integrative Research in Critical Care (MCIRCC), United States.

Brandon C Cummings (BC)

Department of Emergency Medicine, United States; Michigan Center for Integrative Research in Critical Care (MCIRCC), United States.

Sardar Ansari (S)

Department of Emergency Medicine, United States; Michigan Center for Integrative Research in Critical Care (MCIRCC), United States.

Fadi Islim (F)

School of Nursing, United States; Michigan Dialysis Services, Canton, MI, United States; Michigan Center for Integrative Research in Critical Care (MCIRCC), United States.

Steven L Kronick (SL)

Department of Emergency Medicine, United States; Michigan Center for Integrative Research in Critical Care (MCIRCC), United States.

Richard P Medlin (RP)

Department of Emergency Medicine, United States; Michigan Center for Integrative Research in Critical Care (MCIRCC), United States.

Kevin R Ward (KR)

Department of Emergency Medicine, United States; Department of Biomedical Engineering, United States; Michigan Center for Integrative Research in Critical Care (MCIRCC), United States; Michigan Institute for Data Science (MIDAS), University of Michigan, Ann Arbor, United States.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH