Federated Learning for multi-omics: a performance evaluation in Parkinson's disease.
Federated learning
Machine Learning
Omics Data Analysis
Parkinson’s Disease Diagnosis
Journal
bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187
Informations de publication
Date de publication:
06 Oct 2023
06 Oct 2023
Historique:
pubmed:
21
11
2023
medline:
21
11
2023
entrez:
21
11
2023
Statut:
epublish
Résumé
While machine learning (ML) research has recently grown more in popularity, its application in the omics domain is constrained by access to sufficiently large, high-quality datasets needed to train ML models. Federated Learning (FL) represents an opportunity to enable collaborative curation of such datasets among participating institutions. We compare the simulated performance of several models trained using FL against classically trained ML models on the task of multi-omics Parkinson's Disease prediction. We find that FL model performance tracks centrally trained ML models, where the most performant FL model achieves an AUC-PR of 0.876 ± 0.009, 0.014 ± 0.003 less than its centrally trained variation. We also determine that the dispersion of samples within a federation plays a meaningful role in model performance. Our study implements several open source FL frameworks and aims to highlight some of the challenges and opportunities when applying these collaborative methods in multi-omics studies.
Identifiants
pubmed: 37986893
doi: 10.1101/2023.10.04.560604
pmc: PMC10659429
pii:
doi:
Types de publication
Preprint
Langues
eng
Subventions
Organisme : Intramural NIH HHS
ID : Z01 AG000949
Pays : United States
Organisme : Intramural NIH HHS
ID : ZIA AG000534
Pays : United States
Organisme : Intramural NIH HHS
ID : ZIA NS003154
Pays : United States
Déclaration de conflit d'intérêts
B.D., A.D., D.V., M.A.N., and F.F.’s declare no competing non-financial interests but the following competing financial interests as their participation in this project was part of a competitive contract awarded to Data Tecnica LLC by the National Institutes of Health to support open science research. M.A.N. also currently serves on the scientific advisory board for Character Bio and is an advisor to Neuron23 Inc. The study’s funders had no role in the study design, data collection, data analysis, data interpretation, or writing of the report. Authors M.B.M, and J.S. declare no competing financial or non-financial interests. All authors and the public can access all data and statistical programming code used in this project for the analyses and results generation. F.F. takes final responsibility for the decision to submit the paper for publication.
Références
Nature. 2020 Oct;586(7831):683-692
pubmed: 33116284
Proc Natl Acad Sci U S A. 2010 Sep 14;107(37):16222-7
pubmed: 20798349
Genes (Basel). 2022 Apr 21;13(5):
pubmed: 35627112
Front Big Data. 2020 Jun 02;3:19
pubmed: 33693393
J Med Internet Res. 2020 Oct 26;22(10):e20891
pubmed: 33104011
Quant Imaging Med Surg. 2021 Feb;11(2):852-857
pubmed: 33532283
NPJ Parkinsons Dis. 2022 Dec 16;8(1):172
pubmed: 36526647
J Healthc Inform Res. 2021;5(1):1-19
pubmed: 33204939
Nat Commun. 2022 Dec 5;13(1):7346
pubmed: 36470898
Sci Rep. 2020 Jul 28;10(1):12598
pubmed: 32724046
J Neurol. 2019 Aug;266(8):1897-1906
pubmed: 31053960
Nat Med. 2021 Oct;27(10):1735-1743
pubmed: 34526699
NPJ Parkinsons Dis. 2022 Apr 1;8(1):35
pubmed: 35365675
Bioinformatics. 2017 Sep 01;33(17):2776-2778
pubmed: 28475694
Phys Med Biol. 2022 Oct 19;67(21):
pubmed: 36198326
Int J Med Inform. 2016 Jun;90:13-21
pubmed: 27103193
Mov Disord. 2021 Aug;36(8):1795-1804
pubmed: 33960523
IEEE Trans Med Imaging. 2015 Oct;34(10):1993-2024
pubmed: 25494501
Sleep Breath. 2022 Jun;26(2):633-640
pubmed: 34236578
Heliyon. 2023 Jun 02;9(6):e16925
pubmed: 37332922