Privacy preserving validation for multiomic prediction models.

machine learning model validation privacy reproducibility transcriptomics translational research

Journal

Briefings in bioinformatics
ISSN: 1477-4054
Titre abrégé: Brief Bioinform
Pays: England
ID NLM: 100912837

Informations de publication

Date de publication:
13 05 2022
Historique:
received: 26 10 2021
revised: 17 02 2022
accepted: 05 03 2022
pubmed: 8 4 2022
medline: 24 5 2022
entrez: 7 4 2022
Statut: ppublish

Résumé

Reproducibility of results obtained using ribonucleic acid (RNA) data across labs remains a major hurdle in cancer research. Often, molecular predictors trained on one dataset cannot be applied to another due to differences in RNA library preparation and quantification, which inhibits the validation of predictors across labs. While current RNA correction algorithms reduce these differences, they require simultaneous access to patient-level data from all datasets, which necessitates the sharing of training data for predictors when sharing predictors. Here, we describe SpinAdapt, an unsupervised RNA correction algorithm that enables the transfer of molecular models without requiring access to patient-level data. It computes data corrections only via aggregate statistics of each dataset, thereby maintaining patient data privacy. Despite an inherent trade-off between privacy and performance, SpinAdapt outperforms current correction methods, like Seurat and ComBat, on publicly available cancer studies, including TCGA and ICGC. Furthermore, SpinAdapt can correct new samples, thereby enabling unbiased evaluation on validation cohorts. We expect this novel correction paradigm to enhance research reproducibility and to preserve patient privacy.

Identifiants

pubmed: 35388408
pii: 6564350
doi: 10.1093/bib/bbac110
pmc: PMC9116386
pii:
doi:

Substances chimiques

RNA 63231-63-0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2022. Published by Oxford University Press.

Références

Cell. 2014 Aug 14;158(4):929-944
pubmed: 25109877
Breast Cancer Res Treat. 2012 Aug;135(1):301-6
pubmed: 22752290
Sci Rep. 2020 Jul 21;10(1):12123
pubmed: 32694712
Nucleic Acids Res. 2014 Dec 1;42(21):
pubmed: 25294822
Nat Biotechnol. 2019 Jun;37(6):685-691
pubmed: 31061482
Proc Natl Acad Sci U S A. 2019 May 14;116(20):9775-9784
pubmed: 31028141
Neural Comput. 1998 Sep 15;10(7):1895-1923
pubmed: 9744903
Genome Biol. 2020 Jan 16;21(1):12
pubmed: 31948481
Nat Methods. 2019 Aug;16(8):715-721
pubmed: 31363220
Cell. 2019 Jun 13;177(7):1888-1902.e21
pubmed: 31178118
Cell. 2018 Apr 5;173(2):291-304.e6
pubmed: 29625048
Nat Rev Genet. 2010 Oct;11(10):733-9
pubmed: 20838408
NAR Genom Bioinform. 2020 Sep;2(3):lqaa078
pubmed: 33015620
Nat Med. 2015 Nov;21(11):1350-6
pubmed: 26457759
Biostatistics. 2007 Jan;8(1):118-27
pubmed: 16632515
BMC Cancer. 2018 May 29;18(1):603
pubmed: 29843660
Genome Med. 2015 Feb 02;7(1):20
pubmed: 25722745
Eur Urol. 2020 Apr;77(4):420-433
pubmed: 31563503
Nucleic Acids Res. 2015 Apr 20;43(7):e47
pubmed: 25605792
Nat Methods. 2019 Dec;16(12):1289-1296
pubmed: 31740819
Nature. 2016 Mar 3;531(7592):47-52
pubmed: 26909576
Clin Cancer Res. 2009 Dec 15;15(24):7642-7651
pubmed: 19996206
Nat Biotechnol. 2018 Jun;36(5):421-427
pubmed: 29608177

Auteurs

Talal Ahmed (T)

Tempus Labs Inc., Chicago, IL 60654, USA.

Mark A Carty (MA)

Tempus Labs Inc., Chicago, IL 60654, USA.

Stephane Wenric (S)

Tempus Labs Inc., Chicago, IL 60654, USA.

Jonathan R Dry (JR)

Tempus Labs Inc., Chicago, IL 60654, USA.

Ameen A Salahudeen (AA)

Tempus Labs Inc., Chicago, IL 60654, USA.

Aly A Khan (AA)

Tempus Labs Inc., Chicago, IL 60654, USA.

Eric Lefkofsky (E)

Tempus Labs Inc., Chicago, IL 60654, USA.

Martin C Stumpe (MC)

Tempus Labs Inc., Chicago, IL 60654, USA.

Raphael Pelossof (R)

Tempus Labs Inc., Chicago, IL 60654, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH