Distributed learning for heterogeneous clinical data with application to integrating COVID-19 data across 230 sites.
Journal
NPJ digital medicine
ISSN: 2398-6352
Titre abrégé: NPJ Digit Med
Pays: England
ID NLM: 101731738
Informations de publication
Date de publication:
14 Jun 2022
14 Jun 2022
Historique:
received:
09
08
2021
accepted:
19
05
2022
entrez:
14
6
2022
pubmed:
15
6
2022
medline:
15
6
2022
Statut:
epublish
Résumé
Integrating real-world data (RWD) from several clinical sites offers great opportunities to improve estimation with a more general population compared to analyses based on a single clinical site. However, sharing patient-level data across sites is practically challenging due to concerns about maintaining patient privacy. We develop a distributed algorithm to integrate heterogeneous RWD from multiple clinical sites without sharing patient-level data. The proposed distributed conditional logistic regression (dCLR) algorithm can effectively account for between-site heterogeneity and requires only one round of communication. Our simulation study and data application with the data of 14,215 COVID-19 patients from 230 clinical sites in the UnitedHealth Group Clinical Research Database demonstrate that the proposed distributed algorithm provides an estimator that is robust to heterogeneity in event rates when efficiently integrating data from multiple clinical sites. Our algorithm is therefore a practical alternative to both meta-analysis and existing distributed algorithms for modeling heterogeneous multi-site binary outcomes.
Identifiants
pubmed: 35701668
doi: 10.1038/s41746-022-00615-8
pii: 10.1038/s41746-022-00615-8
pmc: PMC9198031
doi:
Types de publication
Journal Article
Langues
eng
Pagination
76Subventions
Organisme : U.S. Department of Health & Human Services | NIH | National Institute of Allergy and Infectious Diseases (NIAID)
ID : 1R01AI130460
Organisme : U.S. Department of Health & Human Services | National Institutes of Health (NIH)
ID : 1R56AG069880
Organisme : Patient-Centered Outcomes Research Institute (PCORI)
ID : ME-2019C3-18315
Organisme : Patient-Centered Outcomes Research Institute (PCORI)
ID : ME-2018C3-14899
Informations de copyright
© 2022. The Author(s).
Références
Pac Symp Biocomput. 2019;24:30-41
pubmed: 30864308
Sci Rep. 2021 Oct 4;11(1):19647
pubmed: 34608222
Nat Commun. 2020 Oct 29;11(1):5467
pubmed: 33122624
J Am Med Inform Assoc. 2014 Jul-Aug;21(4):621-6
pubmed: 24780722
J Am Med Inform Assoc. 2018 Mar 1;25(3):275-288
pubmed: 29036387
Med Care. 2010 Jun;48(6 Suppl):S45-51
pubmed: 20473204
Annu Rev Biomed Data Sci. 2018 Jul;1:115-129
pubmed: 31058261
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):684-7
pubmed: 22542813
Biostatistics. 2015 Oct;16(4):727-39
pubmed: 25813646
J Am Med Inform Assoc. 2020 Mar 1;27(3):376-385
pubmed: 31816040
JAMA. 2016 Dec 20;316(23):2481-2482
pubmed: 27997662
Pharmacoepidemiol Drug Saf. 2012 Jan;21 Suppl 1:1-8
pubmed: 22262586
J Am Med Inform Assoc. 2010 Mar-Apr;17(2):169-77
pubmed: 20190059
J Am Med Inform Assoc. 2014 Jul-Aug;21(4):578-82
pubmed: 24821743
J Chronic Dis. 1987;40(5):373-83
pubmed: 3558716
Biometrika. 2013;100(1):
pubmed: 24179236
Perspect Health Inf Manag. 2010 Oct 01;7:1d
pubmed: 21063545
Biometrics. 1987 Jun;43(2):289-99
pubmed: 3607201
JAMA Netw Open. 2018 Aug 3;1(4):e181755
pubmed: 30646124
J Am Med Inform Assoc. 2014 Jul-Aug;21(4):602-6
pubmed: 24821737
J Am Med Inform Assoc. 2012 Jan-Feb;19(1):54-60
pubmed: 22037893
NPJ Digit Med. 2020 Aug 19;3:109
pubmed: 32864472
Ann Intern Med. 2009 Sep 1;151(5):341-4
pubmed: 19638403
JAMA Intern Med. 2021 Apr 1;181(4):471-478
pubmed: 33351068
Stud Health Technol Inform. 2015;216:574-8
pubmed: 26262116
Pac Symp Biocomput. 2020;25:695-706
pubmed: 31797639
J Am Med Inform Assoc. 2015 Nov;22(6):1212-9
pubmed: 26159465
Sci Transl Med. 2010 Nov 10;2(57):57cm29
pubmed: 21068440
J Am Med Inform Assoc. 2013 Jan 1;20(1):29-34
pubmed: 22735615
Nat Commun. 2022 Mar 30;13(1):1678
pubmed: 35354802
Biometrika. 2012 Mar;99(1):223-229
pubmed: 24421412
J Am Med Inform Assoc. 2020 Jul 1;27(7):1028-1036
pubmed: 32626900
J Am Med Inform Assoc. 2010 May-Jun;17(3):322-7
pubmed: 20442151
J Am Med Inform Assoc. 2022 Jul 12;29(8):1366-1371
pubmed: 35579348
BioData Min. 2020 May 12;13:3
pubmed: 32419848
Proc Natl Acad Sci U S A. 2016 Jul 5;113(27):7329-36
pubmed: 27274072
Sci Rep. 2022 Apr 22;12(1):6627
pubmed: 35459767
J Am Med Inform Assoc. 2012 Sep-Oct;19(5):758-64
pubmed: 22511014
IARC Sci Publ. 1980;(32):5-338
pubmed: 7216345
Ann Intern Med. 2012 Aug 7;157(3):207-10
pubmed: 22868839
J Biomed Inform. 2022 Jul;131:104097
pubmed: 35643272
N Engl J Med. 2016 Dec 8;375(23):2293-2297
pubmed: 27959688