Multisite learning of high-dimensional heterogeneous data with applications to opioid use disorder study of 15,000 patients across 5 clinical sites.


Journal

Scientific reports
ISSN: 2045-2322
Abbreviated title: Sci Rep
Country: England
NLM ID: 101563288

Publication information

Publication date:
30 06 2022
History:
received: 12 01 2022
accepted: 31 05 2022
entrez: 30 6 2022
pubmed: 1 7 2022
medline: 6 7 2022
Status: epublish

Abstract

Integrating data across institutions can improve learning efficiency. To integrate data efficiently while protecting privacy, we propose A one-shot, summary-statistics-based, Distributed Algorithm for fitting Penalized (ADAP) regression models across multiple datasets. ADAP uses patient-level data from a lead site and incorporates the first-order (ADAP1) and second-order (ADAP2) gradients of the objective function from collaborating sites to construct a surrogate objective function at the lead site, where the model is then fitted with appropriate regularization. We evaluate the performance of the proposed method using both simulations and a real-world application that studies risk factors for opioid use disorder (OUD) using data from 15,000 patients in the OneFlorida Clinical Research Consortium. Our results show that ADAP performs nearly as well as the pooled estimator and achieves higher estimation accuracy and better variable selection than the local and average estimators. Moreover, ADAP2 successfully handles heterogeneity in covariate distributions.
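The first-order idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the squared-error loss, the lasso penalty, the proximal-gradient solver, the local lasso fit used as the initial estimate, and all function names (grad_squared_loss, lasso_ista, first_order_surrogate_fit) are assumptions chosen for illustration. Each collaborating site shares only a gradient vector evaluated at the initial estimate, and the lead site adds the difference between the pooled gradient and its own gradient as a linear correction to its local loss before fitting the penalized model.

```python
import numpy as np

def grad_squared_loss(X, y, beta):
    """Gradient of the average squared-error loss (1/2n)||y - X beta||^2."""
    return X.T @ (X @ beta - y) / len(y)

def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1 (elementwise soft-thresholding)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def lasso_ista(X, y, lam, shift=None, n_iter=2000):
    """Minimize (1/2n)||y - X b||^2 + <shift, b> + lam * ||b||_1
    by proximal gradient descent (ISTA)."""
    n, p = X.shape
    shift = np.zeros(p) if shift is None else shift
    # Step size = 1 / Lipschitz constant of the smooth part's gradient.
    step = n / (np.linalg.norm(X, 2) ** 2)
    b = np.zeros(p)
    for _ in range(n_iter):
        g = grad_squared_loss(X, y, b) + shift
        b = soft_threshold(b - step * g, step * lam)
    return b

def first_order_surrogate_fit(sites, lam, lead=0):
    """One-shot fit at the lead site using only gradients from other sites.

    sites: list of (X, y) pairs, one per site (hypothetical data layout).
    """
    X1, y1 = sites[lead]
    # Assumed initial estimate: a local lasso fit at the lead site.
    beta_init = lasso_ista(X1, y1, lam)
    # Each site contributes a length-p gradient at beta_init (summary statistics only).
    n_total = sum(len(y) for _, y in sites)
    pooled_grad = sum(
        len(y) * grad_squared_loss(X, y, beta_init) for X, y in sites
    ) / n_total
    # Linear correction: pooled gradient minus the lead site's own gradient.
    shift = pooled_grad - grad_squared_loss(X1, y1, beta_init)
    # Penalized fit of the surrogate objective at the lead site.
    return lasso_ista(X1, y1, lam, shift=shift)
```

Calling first_order_surrogate_fit with a list of per-site (X, y) arrays and a penalty level lam returns one coefficient vector. With a single site the correction term vanishes and the result coincides with the local lasso fit. The sketch assumes more observations than covariates at the lead site; the second-order variant mentioned in the abstract would additionally use Hessian information from collaborating sites and is not shown here.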

Identifiers

pubmed: 35773438
doi: 10.1038/s41598-022-14029-9
pii: 10.1038/s41598-022-14029-9
pmc: PMC9245877

Publication types

Journal Article
Multicenter Study
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, P.H.S.
Research Support, N.I.H., Extramural

Languages

eng

Citation subsets

IM

Pagination

11073

Grants

Agency: NIH HHS
ID: LM010098
Country: United States
Agency: NIH HHS
ID: R01AI130460
Country: United States
Agency: NCCDPHP CDC HHS
ID: U18 DP006512
Country: United States
Agency: NLM NIH HHS
ID: R01 LM013519
Country: United States
Agency: Patient-Centered Outcomes Research Institute
ID: ME-2019C3-18315
Country: United States
Agency: NIH HHS
ID: R01CA246518
Country: United States
Agency: NIA NIH HHS
ID: R56 AG074604
Country: United States
Agency: ACL HHS
ID: U18DP006512
Country: United States
Agency: NIA NIH HHS
ID: R01 AG073435
Country: United States
Agency: CDC HHS
ID: U18DP006512
Country: United States
Agency: NIA NIH HHS
ID: R56 AG069880
Country: United States

Copyright information

© 2022. The Author(s).

Authors

Xiaokang Liu (X)

Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, 423 Guardian Drive, Philadelphia, PA, 19104, USA.

Rui Duan (R)

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA.

Chongliang Luo (C)

Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, 423 Guardian Drive, Philadelphia, PA, 19104, USA.
Division of Public Health Sciences, Washington University School of Medicine in St. Louis, St. Louis, MO, USA.

Alexis Ogdie (A)

Department of Medicine, Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.

Jason H Moore (JH)

Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, 90096, USA.

Henry R Kranzler (HR)

Department of Psychiatry, University of Pennsylvania Perelman School of Medicine and the VISN 4 MIRECC, Crescenz VAMC, Philadelphia, PA, USA.

Jiang Bian (J)

Department of Health Outcomes and Biomedical Informatics, University of Florida Health Cancer Center, Gainesville, FL, USA.

Yong Chen (Y)

Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, 423 Guardian Drive, Philadelphia, PA, 19104, USA. ychen123@upenn.edu.
