Multi-modality machine learning predicting Parkinson's disease.


Journal

NPJ Parkinson's disease
ISSN: 2373-8057
Abbreviated title: NPJ Parkinsons Dis
Country: United States
NLM ID: 101675390

Publication information

Publication date:
01 Apr 2022
History:
received: 01 09 2021
accepted: 01 02 2022
entrez: 2 4 2022
pubmed: 3 4 2022
medline: 3 4 2022
Status: epublish

Abstract

Personalized medicine promises individualized disease prediction and treatment. The convergence of machine learning (ML) and available multimodal data is key moving forward. We build upon previous work to deliver multimodal predictions of Parkinson's disease (PD) risk and systematically develop a model using GenoML, an automated ML package, to make improved multi-omic predictions of PD, validated in an external cohort. We investigated top features, constructed hypothesis-free disease-relevant networks, and investigated drug-gene interactions. We performed automated ML on multimodal data from the Parkinson's Progression Markers Initiative (PPMI). After selecting the best-performing algorithm, all PPMI data were used to tune the selected model. The model was validated in the Parkinson's Disease Biomarkers Program (PDBP) dataset. Our initial model showed an area under the curve (AUC) of 89.72% for the diagnosis of PD. The tuned model was then tested for validation on external data (PDBP, AUC 85.03%). Optimizing thresholds for classification increased the diagnosis prediction accuracy and other metrics. Finally, networks were built to identify gene communities specific to PD. Combining data modalities outperforms the single-biomarker paradigm. UPSIT and PRS contributed most to the predictive power of the model, but their accuracy is supplemented by many smaller-effect transcripts and risk SNPs. Our model is best suited to identifying large groups of individuals to monitor within a health registry or biobank to prioritize for further testing. This approach allows complex predictive models to be reproducible and accessible to the community, with the package, code, and results publicly available.
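
As an illustration of the two evaluation ideas the abstract mentions, the AUC and the optimization of the classification threshold, the following is a minimal sketch in plain Python. It is not the authors' code and does not use GenoML. The abstract does not say which criterion was used to optimize thresholds; maximizing Youden's J (sensitivity + specificity - 1) is an assumption made here for illustration. The function names and the toy labels and scores are hypothetical.

```python
from typing import List, Tuple


def roc_auc(y_true: List[int], scores: List[float]) -> float:
    """AUC as the probability that a random case (label 1) scores higher
    than a random control (label 0); ties count as one half."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def youden_j(y_true: List[int], scores: List[float], threshold: float) -> float:
    """Sensitivity + specificity - 1 when predicting 'case' if score >= threshold."""
    n_pos = sum(1 for y in y_true if y == 1)
    n_neg = len(y_true) - n_pos
    tp = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= threshold)
    tn = sum(1 for y, s in zip(y_true, scores) if y == 0 and s < threshold)
    return tp / n_pos + tn / n_neg - 1.0


def best_threshold(y_true: List[int], scores: List[float]) -> Tuple[float, float]:
    """Scan every observed score as a candidate cut-off and keep the first
    one that maximizes Youden's J (an assumed criterion, see lead-in)."""
    best_t, best_j = 0.0, -1.0
    for t in sorted(set(scores)):
        j = youden_j(y_true, scores, t)
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j


# Hypothetical toy data: 4 controls (0) and 4 cases (1) with predicted probabilities.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
probs = [0.05, 0.10, 0.15, 0.30, 0.25, 0.35, 0.40, 0.80]

if __name__ == "__main__":
    print("AUC:", roc_auc(labels, probs))
    print("J at default 0.5 cut-off:", youden_j(labels, probs, 0.5))
    print("Best (threshold, J):", best_threshold(labels, probs))
```

In this toy example the default 0.5 cut-off misses most cases because the predicted probabilities are skewed low, while the scanned threshold recovers them at the cost of one false positive. In practice the threshold should be chosen on training or tuning data only and then applied unchanged to the external validation cohort.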

Identifiers

pubmed: 35365675
doi: 10.1038/s41531-022-00288-w
pii: 10.1038/s41531-022-00288-w
pmc: PMC8975993

Publication types

Journal Article

Languages

eng

Pagination

35

Grants

Agency: NINDS NIH HHS
ID: U01 NS082151
Country: United States
Agency: U.S. Department of Health & Human Services | NIH | National Institute of Neurological Disorders and Stroke (NINDS)
ID: Z01-AG000949-02
Agency: NINDS NIH HHS
ID: U01 NS082157
Country: United States
Agency: Intramural NIH HHS
ID: Z01 AG000949
Country: United States
Agency: NINDS NIH HHS
ID: U01 NS082133
Country: United States
Agency: NINDS NIH HHS
ID: U01 NS082137
Country: United States
Agency: Medical Research Council
ID: G0701075
Country: United Kingdom
Agency: NINDS NIH HHS
ID: U01 NS082148
Country: United States
Agency: NINDS NIH HHS
ID: U01 NS082134
Country: United States
Agency: Medical Research Council
ID: G0901254
Country: United Kingdom

Copyright information

© 2022. This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply.

Authors

Mary B Makarious (MB)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK.
UCL Movement Disorders Centre, University College London, London, UK.

Hampton L Leonard (HL)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA.
Data Tecnica International LLC, Glen Echo, MD, USA.
German Center for Neurodegenerative Diseases (DZNE), Tübingen, Germany.

Dan Vitale (D)

Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA.
Data Tecnica International LLC, Glen Echo, MD, USA.

Hirotaka Iwaki (H)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA.
Data Tecnica International LLC, Glen Echo, MD, USA.

Lana Sargent (L)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA.
School of Nursing, Virginia Commonwealth University, Richmond, VA, USA.
Geriatric Pharmacotherapy Program, School of Pharmacy, Virginia Commonwealth University, Richmond, VA, USA.

Anant Dadu (A)

Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.

Ivo Violich (I)

Institute of Translational Genomics, University of Southern California, Los Angeles, CA, USA.

Elizabeth Hutchins (E)

Neurogenomics Division, Translational Genomics Research Institute (TGen), Phoenix, AZ, USA.

David Saffo (D)

Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA.

Sara Bandres-Ciga (S)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.

Jonggeol Jeff Kim (JJ)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Queen Mary University of London, London, UK.

Yeajin Song (Y)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
Data Tecnica International LLC, Glen Echo, MD, USA.

Melina Maleknia (M)

Georgia Institute of Technology, Atlanta, GA, USA.

Matt Bookman (M)

Verily Life Sciences, South San Francisco, CA, USA.

Willy Nojopranoto (W)

Verily Life Sciences, South San Francisco, CA, USA.

Roy H Campbell (RH)

Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.

Sayed Hadi Hashemi (SH)

Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.

Juan A Botia (JA)

Department of Molecular Neuroscience, UCL Queen Square Institute of Neurology, London, UK.
Departamento de Ingeniería de la Información y las Comunicaciones, Universidad de Murcia, Murcia, Spain.

John F Carter (JF)

ModelOp, Chicago, IL, USA.

David W Craig (DW)

Institute of Translational Genomics, University of Southern California, Los Angeles, CA, USA.

Kendall Van Keuren-Jensen (K)

Neurogenomics Division, Translational Genomics Research Institute (TGen), Phoenix, AZ, USA.

Huw R Morris (HR)

Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK.
UCL Movement Disorders Centre, University College London, London, UK.

John A Hardy (JA)

Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK.
UCL Movement Disorders Centre, University College London, London, UK.
UK Dementia Research Institute and Department of Neurodegenerative Disease and Reta Lila Weston Institute, London, UK.
Institute for Advanced Study, The Hong Kong University of Science and Technology, Hong Kong, Hong Kong SAR, China.

Cornelis Blauwendraat (C)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.

Andrew B Singleton (AB)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA.
Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA.

Faraz Faghri (F)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA. faraz@datatecnica.com.
Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA. faraz@datatecnica.com.
Data Tecnica International LLC, Glen Echo, MD, USA. faraz@datatecnica.com.

Mike A Nalls (MA)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, USA. mike@datatecnica.com.
Center for Alzheimer's and Related Dementias, National Institutes of Health, Bethesda, MD, USA. mike@datatecnica.com.
Data Tecnica International LLC, Glen Echo, MD, USA. mike@datatecnica.com.

MeSH classifications