Evaluation of crowdsourced mortality prediction models as a framework for assessing artificial intelligence in medicine.

evaluation health informatics machine learning

Journal

Journal of the American Medical Informatics Association : JAMIA

ISSN: 1527-974X

Titre abrégé: J Am Med Inform Assoc

Pays: England

ID NLM: 9430800

Informations de publication

Date de publication:
18 Aug 2023

Historique:

received: 18 04 2023

revised: 05 07 2023

accepted: 08 08 2023

medline: 22 8 2023

pubmed: 22 8 2023

entrez: 21 8 2023

Statut: aheadofprint

Résumé

Applications of machine learning in healthcare are of high interest and have the potential to improve patient care. Yet, the real-world accuracy of these models in clinical practice and on different patient subpopulations remains unclear. To address these important questions, we hosted a community challenge to evaluate methods that predict healthcare outcomes. We focused on the prediction of all-cause mortality as the community challenge question. Using a Model-to-Data framework, 345 registered participants, coalescing into 25 independent teams, spread over 3 continents and 10 countries, generated 25 accurate models all trained on a dataset of over 1.1 million patients and evaluated on patients prospectively collected over a 1-year observation of a large health system. The top performing team achieved a final area under the receiver operator curve of 0.947 (95% CI, 0.942-0.951) and an area under the precision-recall curve of 0.487 (95% CI, 0.458-0.499) on a prospectively collected patient cohort. Post hoc analysis after the challenge revealed that models differ in accuracy on subpopulations, delineated by race or gender, even when they are trained on the same data. This is the largest community challenge focused on the evaluation of state-of-the-art machine learning methods in a healthcare system performed to date, revealing both opportunities and pitfalls of clinical AI.

Identifiants

DOI: 10.1093/jamia/ocad159 PMID: 37604111

pubmed: 37604111

pii: 7246871

doi: 10.1093/jamia/ocad159

pii:

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Subventions

Organisme : NCATS NIH HHS

ID : UL1 TR002319

Pays : United States

Organisme : NIH HHS

ID : U24TR002306

Pays : United States

Investigateurs

Aaron Lee (A)

Ali Salehzadeh-Yazdi (A)

Alidivinas Prusokas (A)

Anand Basu (A)

Anas Belouali (A)

Ann-Kristin Becker (AK)

Ariel Israel (A)

Augustinas Prusokas (A)

B Winter (B)

Carlos Vega Moreno (CV)

Christoph Kurz (C)

Dagmar Waltemath (D)

Darius Schweinoch (D)

Enrico Glaab (E)

Gang Luo (G)

Guanhua Chen (G)

Helena U Zacharias (HU)

Hezhe Qiao (H)

Inggeol Lee (I)

Ivan Brugere (I)

Jaewoo Kang (J)

Jifan Gao (J)

Julia Truthmann (J)

JunSeok Choe (J)

Kari A Stephens (KA)

Lars Kaderali (L)

Lav R Varshney (LR)

Marcus Vollmer (M)

Maria-Theodora Pandi (MT)

Martin L Gunn (ML)

Meliha Yetisgen (M)

Neetika Nath (N)

Noah Hammarlund (N)

Oliver Müller-Stricker (O)

Panagiotis Togias (P)

Patrick J Heagerty (PJ)

Peter Muir (P)

Peter Banda (P)

Renata Retkute (R)

Ron Henkel (R)

Sagar Madgi (S)

Samir Gupta (S)

Sanghoon Lee (S)

Sean Mooney (S)

Shabeeb Kannattikuni (S)

Shamim Sarhadi (S)

Shikhar Omar (S)

Shuo Wang (S)

Soumyabrata Ghosh (S)

Stefan Neumann (S)

Stefan Simm (S)

Subha Madhavan (S)

Sunkyu Kim (S)

Thomas Von Yu (T)

Venkata Satagopam (V)

Vikas Pejaver (V)

Yachee Gupta (Y)

Yonghwa Choi (Y)

Zofia Nawalany (Z)

Łukasz Charzewski (Ł)