Implications of the Use of Artificial Intelligence Predictive Models in Health Care Settings : A Simulation Study.


Journal

Annals of internal medicine
ISSN: 1539-3704
Titre abrégé: Ann Intern Med
Pays: United States
ID NLM: 0372351

Informations de publication

Date de publication:
10 2023
Historique:
medline: 30 10 2023
pubmed: 9 10 2023
entrez: 9 10 2023
Statut: ppublish

Résumé

Substantial effort has been directed toward demonstrating uses of predictive models in health care. However, implementation of these models into clinical practice may influence patient outcomes, which in turn are captured in electronic health record data. As a result, deployed models may affect the predictive ability of current and future models. To estimate changes in predictive model performance with use through 3 common scenarios: model retraining, sequentially implementing 1 model after another, and intervening in response to a model when 2 are simultaneously implemented. Simulation of model implementation and use in critical care settings at various levels of intervention effectiveness and clinician adherence. Models were either trained or retrained after simulated implementation. Admissions to the intensive care unit (ICU) at Mount Sinai Health System (New York, New York) and Beth Israel Deaconess Medical Center (Boston, Massachusetts). 130 000 critical care admissions across both health systems. Across 3 scenarios, interventions were simulated at varying levels of clinician adherence and effectiveness. Statistical measures of performance, including threshold-independent (area under the curve) and threshold-dependent measures. At fixed 90% sensitivity, in scenario 1 a mortality prediction model lost 9% to 39% specificity after retraining once and in scenario 2 a mortality prediction model lost 8% to 15% specificity when created after the implementation of an acute kidney injury (AKI) prediction model; in scenario 3, models for AKI and mortality prediction implemented simultaneously, each led to reduced effective accuracy of the other by 1% to 28%. In real-world practice, the effectiveness of and adherence to model-based recommendations are rarely known in advance. Only binary classifiers for tabular ICU admissions data were simulated. In simulated ICU settings, a universally effective model-updating approach for maintaining model performance does not seem to exist. Model use may have to be recorded to maintain viability of predictive modeling. National Center for Advancing Translational Sciences.

Sections du résumé

BACKGROUND
Substantial effort has been directed toward demonstrating uses of predictive models in health care. However, implementation of these models into clinical practice may influence patient outcomes, which in turn are captured in electronic health record data. As a result, deployed models may affect the predictive ability of current and future models.
OBJECTIVE
To estimate changes in predictive model performance with use through 3 common scenarios: model retraining, sequentially implementing 1 model after another, and intervening in response to a model when 2 are simultaneously implemented.
DESIGN
Simulation of model implementation and use in critical care settings at various levels of intervention effectiveness and clinician adherence. Models were either trained or retrained after simulated implementation.
SETTING
Admissions to the intensive care unit (ICU) at Mount Sinai Health System (New York, New York) and Beth Israel Deaconess Medical Center (Boston, Massachusetts).
PATIENTS
130 000 critical care admissions across both health systems.
INTERVENTION
Across 3 scenarios, interventions were simulated at varying levels of clinician adherence and effectiveness.
MEASUREMENTS
Statistical measures of performance, including threshold-independent (area under the curve) and threshold-dependent measures.
RESULTS
At fixed 90% sensitivity, in scenario 1 a mortality prediction model lost 9% to 39% specificity after retraining once and in scenario 2 a mortality prediction model lost 8% to 15% specificity when created after the implementation of an acute kidney injury (AKI) prediction model; in scenario 3, models for AKI and mortality prediction implemented simultaneously, each led to reduced effective accuracy of the other by 1% to 28%.
LIMITATIONS
In real-world practice, the effectiveness of and adherence to model-based recommendations are rarely known in advance. Only binary classifiers for tabular ICU admissions data were simulated.
CONCLUSION
In simulated ICU settings, a universally effective model-updating approach for maintaining model performance does not seem to exist. Model use may have to be recorded to maintain viability of predictive modeling.
PRIMARY FUNDING SOURCE
National Center for Advancing Translational Sciences.

Identifiants

pubmed: 37812781
doi: 10.7326/M23-0949
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

1358-1369

Subventions

Organisme : NCATS NIH HHS
ID : UL1TR004419
Pays : United States

Auteurs

Akhil Vaid (A)

Division of Data-Driven and Digital Medicine, Department of Medicine, and The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (A.V., P.J.).

Ashwin Sawant (A)

Division of Data-Driven and Digital Medicine, Department of Medicine; The Charles Bronfman Institute of Personalized Medicine; and Division of Hospital Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (A.S.).

Mayte Suarez-Farinas (M)

Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, New York (M.S., J.L.).

Juhee Lee (J)

Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, New York (M.S., J.L.).

Sanjeev Kaul (S)

Department of Surgery, Hackensack Meridian School of Medicine, Nutley, New Jersey (S.K.).

Patricia Kovatch (P)

Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York (P.K., B.G.).

Robert Freeman (R)

Division of Data-Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (R.F.).

Joy Jiang (J)

The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (J.J.).

Pushkala Jayaraman (P)

Division of Data-Driven and Digital Medicine, Department of Medicine, and The Charles Bronfman Institute of Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (A.V., P.J.).

Zahi Fayad (Z)

BioMedical Engineering and Imaging Institute and Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, New York (Z.F.).

Edgar Argulian (E)

Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, New York (E.A., S.L., J.N.).

Stamatios Lerakis (S)

Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, New York (E.A., S.L., J.N.).

Alexander W Charney (AW)

The Charles Bronfman Institute of Personalized Medicine and Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York, and Department of Surgery, Hackensack Meridian School of Medicine, Nutley, New Jersey (A.W.C.).

Fei Wang (F)

Department of Population Health Sciences, Weill Cornell Medicine, New York, New York (F.W.).

Matthew Levin (M)

Department of Anesthesiology, Perioperative and Pain Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (M.L.).

Benjamin Glicksberg (B)

Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York (P.K., B.G.).

Jagat Narula (J)

Mount Sinai Heart, Icahn School of Medicine at Mount Sinai, New York, New York (E.A., S.L., J.N.).

Ira Hofer (I)

Division of Data-Driven and Digital Medicine, Department of Medicine; The Charles Bronfman Institute of Personalized Medicine; and Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York (I.H.).

Karandeep Singh (K)

Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, Michigan (K.S.).

Girish N Nadkarni (GN)

Division of Data-Driven and Digital Medicine, Department of Medicine; The Charles Bronfman Institute of Personalized Medicine; and Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York (G.N.N.).

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH