Privacy-Preserving Collaborative Prediction using Random Forests.
Journal
AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
ISSN: 2153-4063
Titre abrégé: AMIA Jt Summits Transl Sci Proc
Pays: United States
ID NLM: 101539486
Informations de publication
Date de publication:
2019
2019
Historique:
entrez:
2
7
2019
pubmed:
2
7
2019
medline:
2
7
2019
Statut:
epublish
Résumé
We study the problem of privacy-preserving machine learning (PPML) for ensemble methods, focusing our effort on random forests. In collaborative analysis, PPML attempts to solve the conflict between the need for data sharing and privacy. This is especially important in privacy sensitive applications such as learning predictive models for clinical decision support from EHR data from different clinics, where each clinic has a responsibility for its patients' privacy. We propose a new approach for ensemble methods: each entity learns a model, from its own data, and then when a client asks the prediction for a new private instance, the answers from all the locally trained models are used to compute the prediction in such a way that no extra information is revealed. We implement this approach for random forests and we demonstrate its high efficiency and potential accuracy benefit via experiments on real-world datasets, including actual EHR data.
Types de publication
Journal Article
Langues
eng
Pagination
248-257Subventions
Organisme : NLM NIH HHS
ID : T15 LM007359
Pays : United States
Organisme : NIAID NIH HHS
ID : U54 AI117924
Pays : United States
Organisme : NCATS NIH HHS
ID : UL1 TR002373
Pays : United States
Références
PLoS Genet. 2008 Aug 29;4(8):e1000167
pubmed: 18769715
N Engl J Med. 2009 Feb 19;360(8):753-64
pubmed: 19228618
Int J Med Inform. 2018 Apr;112:59-67
pubmed: 29500022