Evaluation of crowdsourced mortality prediction models as a framework for assessing artificial intelligence in medicine.
evaluation
health informatics
machine learning
Journal
Journal of the American Medical Informatics Association : JAMIA
ISSN: 1527-974X
Titre abrégé: J Am Med Inform Assoc
Pays: England
ID NLM: 9430800
Informations de publication
Date de publication:
18 Aug 2023
18 Aug 2023
Historique:
received:
18
04
2023
revised:
05
07
2023
accepted:
08
08
2023
medline:
22
8
2023
pubmed:
22
8
2023
entrez:
21
8
2023
Statut:
aheadofprint
Résumé
Applications of machine learning in healthcare are of high interest and have the potential to improve patient care. Yet, the real-world accuracy of these models in clinical practice and on different patient subpopulations remains unclear. To address these important questions, we hosted a community challenge to evaluate methods that predict healthcare outcomes. We focused on the prediction of all-cause mortality as the community challenge question. Using a Model-to-Data framework, 345 registered participants, coalescing into 25 independent teams, spread over 3 continents and 10 countries, generated 25 accurate models all trained on a dataset of over 1.1 million patients and evaluated on patients prospectively collected over a 1-year observation of a large health system. The top performing team achieved a final area under the receiver operator curve of 0.947 (95% CI, 0.942-0.951) and an area under the precision-recall curve of 0.487 (95% CI, 0.458-0.499) on a prospectively collected patient cohort. Post hoc analysis after the challenge revealed that models differ in accuracy on subpopulations, delineated by race or gender, even when they are trained on the same data. This is the largest community challenge focused on the evaluation of state-of-the-art machine learning methods in a healthcare system performed to date, revealing both opportunities and pitfalls of clinical AI.
Identifiants
pubmed: 37604111
pii: 7246871
doi: 10.1093/jamia/ocad159
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : NCATS NIH HHS
ID : UL1 TR002319
Pays : United States
Organisme : NIH HHS
ID : U24TR002306
Pays : United States
Investigateurs
Aaron Lee
(A)
Ali Salehzadeh-Yazdi
(A)
Alidivinas Prusokas
(A)
Anand Basu
(A)
Anas Belouali
(A)
Ann-Kristin Becker
(AK)
Ariel Israel
(A)
Augustinas Prusokas
(A)
B Winter
(B)
Carlos Vega Moreno
(CV)
Christoph Kurz
(C)
Dagmar Waltemath
(D)
Darius Schweinoch
(D)
Enrico Glaab
(E)
Gang Luo
(G)
Guanhua Chen
(G)
Helena U Zacharias
(HU)
Hezhe Qiao
(H)
Inggeol Lee
(I)
Ivan Brugere
(I)
Jaewoo Kang
(J)
Jifan Gao
(J)
Julia Truthmann
(J)
JunSeok Choe
(J)
Kari A Stephens
(KA)
Lars Kaderali
(L)
Lav R Varshney
(LR)
Marcus Vollmer
(M)
Maria-Theodora Pandi
(MT)
Martin L Gunn
(ML)
Meliha Yetisgen
(M)
Neetika Nath
(N)
Noah Hammarlund
(N)
Oliver Müller-Stricker
(O)
Panagiotis Togias
(P)
Patrick J Heagerty
(PJ)
Peter Muir
(P)
Peter Banda
(P)
Renata Retkute
(R)
Ron Henkel
(R)
Sagar Madgi
(S)
Samir Gupta
(S)
Sanghoon Lee
(S)
Sean Mooney
(S)
Shabeeb Kannattikuni
(S)
Shamim Sarhadi
(S)
Shikhar Omar
(S)
Shuo Wang
(S)
Soumyabrata Ghosh
(S)
Stefan Neumann
(S)
Stefan Simm
(S)
Subha Madhavan
(S)
Sunkyu Kim
(S)
Thomas Von Yu
(T)
Venkata Satagopam
(V)
Vikas Pejaver
(V)
Yachee Gupta
(Y)
Yonghwa Choi
(Y)
Zofia Nawalany
(Z)
Łukasz Charzewski
(Ł)
Informations de copyright
© The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.