Digital medicine and the curse of dimensionality.
Journal
NPJ digital medicine
ISSN: 2398-6352
Titre abrégé: NPJ Digit Med
Pays: England
ID NLM: 101731738
Informations de publication
Date de publication:
28 Oct 2021
28 Oct 2021
Historique:
received:
13
04
2021
accepted:
27
09
2021
entrez:
29
10
2021
pubmed:
30
10
2021
medline:
30
10
2021
Statut:
epublish
Résumé
Digital health data are multimodal and high-dimensional. A patient's health state can be characterized by a multitude of signals including medical imaging, clinical variables, genome sequencing, conversations between clinicians and patients, and continuous signals from wearables, among others. This high volume, personalized data stream aggregated over patients' lives has spurred interest in developing new artificial intelligence (AI) models for higher-precision diagnosis, prognosis, and tracking. While the promise of these algorithms is undeniable, their dissemination and adoption have been slow, owing partially to unpredictable AI model performance once deployed in the real world. We posit that one of the rate-limiting factors in developing algorithms that generalize to real-world scenarios is the very attribute that makes the data exciting-their high-dimensional nature. This paper considers how the large number of features in vast digital health data can challenge the development of robust AI models-a phenomenon known as "the curse of dimensionality" in statistical learning theory. We provide an overview of the curse of dimensionality in the context of digital health, demonstrate how it can negatively impact out-of-sample performance, and highlight important considerations for researchers and algorithm designers.
Identifiants
pubmed: 34711924
doi: 10.1038/s41746-021-00521-5
pii: 10.1038/s41746-021-00521-5
pmc: PMC8553745
doi:
Types de publication
Journal Article
Review
Langues
eng
Pagination
153Subventions
Organisme : NIGMS NIH HHS
ID : R01 GM140468
Pays : United States
Organisme : U.S. Department of Health & Human Services | National Institutes of Health (NIH)
ID : 5R01DC006859
Organisme : United States Department of Defense | United States Navy | Office of Naval Research (ONR)
ID : N00014-17-1-2826
Informations de copyright
© 2021. The Author(s).
Références
BMC Neurol. 2017 Mar 21;17(1):55
pubmed: 28327094
PLoS Med. 2005 Aug;2(8):e124
pubmed: 16060722
Philos Trans A Math Phys Eng Sci. 2016 Nov 13;374(2080):
pubmed: 27698035
Lancet Digit Health. 2021 May;3(5):e275-e276
pubmed: 33858816
PLoS One. 2019 Nov 7;14(11):e0224365
pubmed: 31697686
Cogn Neuropsychol. 2012;29(1-2):34-55
pubmed: 23017085
Front Nutr. 2018 Nov 29;5:117
pubmed: 30555829
Neuroimage. 2017 Jan 15;145(Pt B):137-165
pubmed: 27012503
J Voice. 2018 Sep;32(5):644.e1-644.e9
pubmed: 28864082
J Am Med Inform Assoc. 2020 Nov 1;27(11):1784-1797
pubmed: 32929494
Digit Biomark. 2020 Dec 2;4(3):109-122
pubmed: 33442573
Front Aging Neurosci. 2018 Jan 09;9:437
pubmed: 29375365
Amyotroph Lateral Scler Frontotemporal Degener. 2013 Dec;14(7-8):494-500
pubmed: 23898888
J Speech Hear Disord. 1987 Nov;52(4):367-87
pubmed: 3312817
JAMA. 2020 Sep 22;324(12):1212-1213
pubmed: 32960230
BMJ Health Care Inform. 2020 Oct;27(3):
pubmed: 33106330
IEEE Trans Signal Process. 2016 Feb 1;64(3):580-591
pubmed: 26807014
J Alzheimers Dis. 2020;78(4):1547-1574
pubmed: 33185605
Nat Med. 2019 Jan;25(1):44-56
pubmed: 30617339
Stat Med. 2015 Sep 10;34(20):2781-93
pubmed: 25988604
Science. 2015 Aug 7;349(6248):636-8
pubmed: 26250683
Science. 2014 Oct 31;346(6209):583-7
pubmed: 25359966
BMC Med Inform Decis Mak. 2012 Feb 15;12:8
pubmed: 22336388
Med Biol Eng Comput. 2006 Dec;44(12):1031-51
pubmed: 17111118
Neuropsychopharmacology. 2021 Jul;46(8):1510-1517
pubmed: 33958703