Privacy-first health research with federated learning.


Journal

NPJ digital medicine
ISSN: 2398-6352
Titre abrégé: NPJ Digit Med
Pays: England
ID NLM: 101731738

Informations de publication

Date de publication:
07 Sep 2021
Historique:
received: 08 03 2021
accepted: 21 07 2021
entrez: 8 9 2021
pubmed: 9 9 2021
medline: 9 9 2021
Statut: epublish

Résumé

Privacy protection is paramount in conducting health research. However, studies often rely on data stored in a centralized repository, where analysis is done with full access to the sensitive underlying content. Recent advances in federated learning enable building complex machine-learned models that are trained in a distributed fashion. These techniques facilitate the calculation of research study endpoints such that private data never leaves a given device or healthcare system. We show-on a diverse set of single and multi-site health studies-that federated models can achieve similar accuracy, precision, and generalizability, and lead to the same interpretation as standard centralized statistical models while achieving considerably stronger privacy protections and without significantly raising computational costs. This work is the first to apply modern and general federated learning methods that explicitly incorporate differential privacy to clinical and epidemiological research-across a spectrum of units of federation, model architectures, complexity of learning tasks and diseases. As a result, it enables health research participants to remain in control of their data and still contribute to advancing science-aspects that used to be at odds with each other.

Identifiants

pubmed: 34493770
doi: 10.1038/s41746-021-00489-2
pii: 10.1038/s41746-021-00489-2
pmc: PMC8423792
doi:

Types de publication

Journal Article

Langues

eng

Pagination

132

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM109718
Pays : United States
Organisme : ACL HHS
ID : U01CK000589
Pays : United States

Informations de copyright

© 2021. The Author(s).

Références

PLoS One. 2019 Jan 9;14(1):e0209650
pubmed: 30625188
BMC Med Inform Decis Mak. 2020 Feb 3;20(1):16
pubmed: 32013925
J Biomed Inform. 2019 Nov;99:103291
pubmed: 31560949
IEEE/ACM Trans Comput Biol Bioinform. 2016 May-Jun;13(3):431-44
pubmed: 26761861
Euro Surveill. 2011 Aug 11;16(32):
pubmed: 21871222
Int J Med Inform. 2018 Apr;112:59-67
pubmed: 29500022
Stud Health Technol Inform. 2015;216:574-8
pubmed: 26262116
Antimicrob Resist Infect Control. 2017 Jan 21;6:14
pubmed: 28127422
AMIA Annu Symp Proc. 2020 Mar 04;2019:313-322
pubmed: 32308824
Sci Data. 2016 May 24;3:160035
pubmed: 27219127
Sci Rep. 2020 Jul 28;10(1):12598
pubmed: 32724046
JMIR Med Inform. 2018 Apr 13;6(2):e20
pubmed: 29653917
PLoS Negl Trop Dis. 2018 Nov 12;12(11):e0006950
pubmed: 30419040
NPJ Digit Med. 2020 Sep 14;3:119
pubmed: 33015372

Auteurs

Adam Sadilek (A)

Google, Mountain View, CA, USA. adsa@google.com.

Luyang Liu (L)

Google, Mountain View, CA, USA.

Dung Nguyen (D)

Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA.
Department of Computer Science, University of Virginia, Charlottesville, VA, USA.

Methun Kamruzzaman (M)

Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA.

Benjamin Rader (B)

Computational Epidemiology Lab, Boston Children's Hospital, Boston, MA, USA.
Department of Epidemiology, Boston University, Boston, MA, USA.

Alex Ingerman (A)

Google, Mountain View, CA, USA.

Stefan Mellem (S)

Google, Mountain View, CA, USA.

Peter Kairouz (P)

Google, Mountain View, CA, USA.

Elaine O Nsoesie (EO)

Department of Global Health, Boston University, Boston, MA, USA.

Jamie MacFarlane (J)

Google, Mountain View, CA, USA.

Anil Vullikanti (A)

Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA.
Department of Computer Science, University of Virginia, Charlottesville, VA, USA.

Madhav Marathe (M)

Biocomplexity Institute, University of Virginia, Charlottesville, VA, USA.
Department of Computer Science, University of Virginia, Charlottesville, VA, USA.

Paul Eastham (P)

Google, Mountain View, CA, USA.

John S Brownstein (JS)

Computational Epidemiology Lab, Boston Children's Hospital, Boston, MA, USA.
Harvard Medical School, Boston, MA, USA.

Blaise Aguera Y Arcas (BAY)

Google, Mountain View, CA, USA.

Michael D Howell (MD)

Google, Mountain View, CA, USA.

John Hernandez (J)

Google, Mountain View, CA, USA. johnbhernandez@google.com.

Classifications MeSH