Leveraging expression from multiple tissues using sparse canonical correlation analysis and aggregate tests improves the power of transcriptome-wide association studies.


Journal

PLoS genetics
ISSN: 1553-7404
Titre abrégé: PLoS Genet
Pays: United States
ID NLM: 101239074

Informations de publication

Date de publication:
04 2021
Historique:
received: 29 06 2020
accepted: 16 03 2021
revised: 20 04 2021
pubmed: 9 4 2021
medline: 28 8 2021
entrez: 8 4 2021
Statut: epublish

Résumé

Transcriptome-wide association studies (TWAS) test the association between traits and genetically predicted gene expression levels. The power of a TWAS depends in part on the strength of the correlation between a genetic predictor of gene expression and the causally relevant gene expression values. Consequently, TWAS power can be low when expression quantitative trait locus (eQTL) data used to train the genetic predictors have small sample sizes, or when data from causally relevant tissues are not available. Here, we propose to address these issues by integrating multiple tissues in the TWAS using sparse canonical correlation analysis (sCCA). We show that sCCA-TWAS combined with single-tissue TWAS using an aggregate Cauchy association test (ACAT) outperforms traditional single-tissue TWAS. In empirically motivated simulations, the sCCA+ACAT approach yielded the highest power to detect a gene associated with phenotype, even when expression in the causal tissue was not directly measured, while controlling the Type I error when there is no association between gene expression and phenotype. For example, when gene expression explains 2% of the variability in outcome, and the GWAS sample size is 20,000, the average power difference between the ACAT combined test of sCCA features and single-tissue, versus single-tissue combined with Generalized Berk-Jones (GBJ) method, single-tissue combined with S-MultiXcan, UTMOST, or summarizing cross-tissue expression patterns using Principal Component Analysis (PCA) approaches was 5%, 8%, 5% and 38%, respectively. The gain in power is likely due to sCCA cross-tissue features being more likely to be detectably heritable. When applied to publicly available summary statistics from 10 complex traits, the sCCA+ACAT test was able to increase the number of testable genes and identify on average an additional 400 additional gene-trait associations that single-trait TWAS missed. Our results suggest that aggregating eQTL data across multiple tissues using sCCA can improve the sensitivity of TWAS while controlling for the false positive rate.

Identifiants

pubmed: 33831007
doi: 10.1371/journal.pgen.1008973
pii: PGENETICS-D-20-01011
pmc: PMC8057593
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1008973

Subventions

Organisme : NCI NIH HHS
ID : R01 CA194393
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG009120
Pays : United States
Organisme : NCI NIH HHS
ID : R01 CA227237
Pays : United States
Organisme : NCI NIH HHS
ID : R35 CA197449
Pays : United States
Organisme : NHGRI NIH HHS
ID : U01 HG009088
Pays : United States

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

J Stat Softw. 2010;33(1):1-22
pubmed: 20808728
Nat Genet. 2012 Sep;44(9):981-90
pubmed: 22885922
Transl Psychiatry. 2018 May 18;8(1):99
pubmed: 29777097
Nat Genet. 2018 Jul;50(7):906-908
pubmed: 29892013
Am J Hum Genet. 2017 Apr 6;100(4):605-616
pubmed: 28343628
PLoS Genet. 2019 Mar 15;15(3):e1007530
pubmed: 30875371
Am J Hum Genet. 2011 Jan 7;88(1):76-82
pubmed: 21167468
Nat Genet. 2017 Sep;49(9):1385-1391
pubmed: 28714975
Nature. 2018 Jan 25;553(7689):530
pubmed: 29258290
Am J Hum Genet. 2019 Mar 7;104(3):410-421
pubmed: 30849328
Nat Genet. 2020 Nov;52(11):1239-1246
pubmed: 33020666
Nat Genet. 2019 Mar;51(3):568-576
pubmed: 30804563
PLoS Genet. 2019 Jan 22;15(1):e1007889
pubmed: 30668570
Nat Genet. 2015 Sep;47(9):1091-8
pubmed: 26258848
Nature. 2017 Nov 2;551(7678):92-94
pubmed: 29059683
Biostatistics. 2009 Jul;10(3):515-34
pubmed: 19377034
Nat Genet. 2019 Apr;51(4):675-682
pubmed: 30926970
Nat Genet. 2016 Mar;48(3):245-52
pubmed: 26854917
Am J Hum Genet. 2017 Jul 6;101(1):5-22
pubmed: 28686856
Nat Genet. 2018 Sep;50(9):1318-1326
pubmed: 30104760
Am J Hum Genet. 2014 May 1;94(5):662-76
pubmed: 24746957
Science. 2015 May 8;348(6235):648-60
pubmed: 25954001
Nat Genet. 2018 Apr;50(4):621-629
pubmed: 29632380
Cell. 2018 Jun 14;173(7):1705-1715.e16
pubmed: 29906448

Auteurs

Helian Feng (H)

Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.

Nicholas Mancuso (N)

Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America.
Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America.

Alexander Gusev (A)

Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, Massachusetts, United States of America.
Division of Genetics, Brigham & Women's Hospital, Boston, MA, United States of America.
Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America.

Arunabha Majumdar (A)

Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America.
Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, California, United States of America.

Megan Major (M)

Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, California, United States of America.

Bogdan Pasaniuc (B)

Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America.
Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, California, United States of America.

Peter Kraft (P)

Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH