Federated learning for multi-omics: A performance evaluation in Parkinson's disease.

Parkinson’s disease diagnosis federated learning machine learning omics data analysis

Journal

Patterns (New York, N.Y.)
ISSN: 2666-3899
Titre abrégé: Patterns (N Y)
Pays: United States
ID NLM: 101767765

Informations de publication

Date de publication:
08 Mar 2024
Historique:
received: 09 10 2023
revised: 29 01 2024
accepted: 02 02 2024
medline: 15 3 2024
pubmed: 15 3 2024
entrez: 15 3 2024
Statut: epublish

Résumé

While machine learning (ML) research has recently grown more in popularity, its application in the omics domain is constrained by access to sufficiently large, high-quality datasets needed to train ML models. Federated learning (FL) represents an opportunity to enable collaborative curation of such datasets among participating institutions. We compare the simulated performance of several models trained using FL against classically trained ML models on the task of multi-omics Parkinson's disease prediction. We find that FL model performance tracks centrally trained ML models, where the most performant FL model achieves an AUC-PR of 0.876 ± 0.009, 0.014 ± 0.003 less than its centrally trained variation. We also determine that the dispersion of samples within a federation plays a meaningful role in model performance. Our study implements several open-source FL frameworks and aims to highlight some of the challenges and opportunities when applying these collaborative methods in multi-omics studies.

Identifiants

pubmed: 38487808
doi: 10.1016/j.patter.2024.100945
pii: S2666-3899(24)00044-8
pmc: PMC10935499
doi:

Types de publication

Journal Article

Langues

eng

Pagination

100945

Subventions

Organisme : NINDS NIH HHS
ID : U01 NS082151
Pays : United States
Organisme : NINDS NIH HHS
ID : U01 NS082157
Pays : United States
Organisme : NINDS NIH HHS
ID : U01 NS082133
Pays : United States
Organisme : NINDS NIH HHS
ID : U01 NS082137
Pays : United States
Organisme : Intramural NIH HHS
ID : ZIA AG000534
Pays : United States
Organisme : NINDS NIH HHS
ID : U01 NS082134
Pays : United States
Organisme : NINDS NIH HHS
ID : U01 NS082148
Pays : United States

Informations de copyright

© 2024 The Authors.

Déclaration de conflit d'intérêts

B.P.D., A.D., D.V., M.A.N., and F.F. declare the following competing financial interests, as their participation in this project was part of a competitive contract awarded to Data Tecnica LLC by the National Institutes of Health to support open science research. M.A.N. also currently serves on the scientific advisory board for Character Bio and is an advisor to Neuron23 Inc. The study’s funders had no role in the study design, data collection, data analysis, data interpretation, or writing of the report. F.F. takes final responsibility for the decision to submit the paper for publication.

Références

Nature. 2020 Oct;586(7831):683-692
pubmed: 33116284
Proc Natl Acad Sci U S A. 2010 Sep 14;107(37):16222-7
pubmed: 20798349
Genes (Basel). 2022 Apr 21;13(5):
pubmed: 35627112
Front Big Data. 2020 Jun 02;3:19
pubmed: 33693393
Biomark Med. 2017 May;11(6):451-473
pubmed: 28644039
Nat Commun. 2018 Oct 2;9(1):4038
pubmed: 30279509
Quant Imaging Med Surg. 2021 Feb;11(2):852-857
pubmed: 33532283
NPJ Parkinsons Dis. 2022 Dec 16;8(1):172
pubmed: 36526647
J Healthc Inform Res. 2021;5(1):1-19
pubmed: 33204939
Nat Commun. 2022 Dec 5;13(1):7346
pubmed: 36470898
Nat Commun. 2021 Oct 11;12(1):5910
pubmed: 34635645
Sci Rep. 2020 Jul 28;10(1):12598
pubmed: 32724046
J Neurol. 2019 Aug;266(8):1897-1906
pubmed: 31053960
Nat Med. 2021 Oct;27(10):1735-1743
pubmed: 34526699
NPJ Parkinsons Dis. 2022 Apr 1;8(1):35
pubmed: 35365675
Bioinformatics. 2017 Sep 01;33(17):2776-2778
pubmed: 28475694
Lancet Neurol. 2019 Dec;18(12):1091-1102
pubmed: 31701892
J Med Internet Res. 2020 Oct 26;22(10):e20891
pubmed: 33104011
Nucleic Acids Res. 2015 Apr 20;43(7):e47
pubmed: 25605792
Phys Med Biol. 2022 Oct 19;67(21):
pubmed: 36198326
Int J Med Inform. 2016 Jun;90:13-21
pubmed: 27103193
Mov Disord. 2021 Aug;36(8):1795-1804
pubmed: 33960523
IEEE Trans Med Imaging. 2015 Oct;34(10):1993-2024
pubmed: 25494501
Sleep Breath. 2022 Jun;26(2):633-640
pubmed: 34236578
Heliyon. 2023 Jun 02;9(6):e16925
pubmed: 37332922

Auteurs

Benjamin P Danek (BP)

Department of Computer Science, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA.
Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
DataTecnica, Washington, DC 20037, USA.

Mary B Makarious (MB)

Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.
Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK.
UCL Movement Disorders Centre, University College London, London, UK.

Anant Dadu (A)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
DataTecnica, Washington, DC 20037, USA.

Dan Vitale (D)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
DataTecnica, Washington, DC 20037, USA.

Paul Suhwan Lee (PS)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.

Andrew B Singleton (AB)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Mike A Nalls (MA)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
DataTecnica, Washington, DC 20037, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Jimeng Sun (J)

Department of Computer Science, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA.
Carle Illinois College of Medicine, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA.

Faraz Faghri (F)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
DataTecnica, Washington, DC 20037, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Classifications MeSH