A computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional quantitative trait loci discovery.

Bayesian computation Markov chain Monte Carlo covariance reparametrisation graphical models metabolomics quantitative trait loci

Journal

Journal of the Royal Statistical Society. Series C, Applied statistics
ISSN: 0035-9254
Titre abrégé: J R Stat Soc Ser C Appl Stat
Pays: England
ID NLM: 101086541

Informations de publication

Date de publication:
Aug 2021
Historique:
entrez: 10 1 2022
pubmed: 11 1 2022
medline: 11 1 2022
Statut: ppublish

Résumé

Our work is motivated by the search for metabolite quantitative trait loci (QTL) in a cohort of more than 5000 people. There are 158 metabolites measured by NMR spectroscopy in the 31-year follow-up of the Northern Finland Birth Cohort 1966 (NFBC66). These metabolites, as with many multivariate phenotypes produced by high-throughput biomarker technology, exhibit strong correlation structures. Existing approaches for combining such data with genetic variants for multivariate QTL analysis generally ignore phenotypic correlations or make restrictive assumptions about the associations between phenotypes and genetic loci. We present a computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional data, with cell-sparse variable selection and sparse graphical structure for covariance selection. Cell sparsity allows different phenotype responses to be associated with different genetic predictors and the graphical structure is used to represent the conditional dependencies between phenotype variables. To achieve feasible computation of the large model space, we exploit a factorisation of the covariance matrix. Applying the model to the NFBC66 data with 9000 directly genotyped single nucleotide polymorphisms, we are able to simultaneously estimate genotype-phenotype associations and the residual dependence structure among the metabolites. The R package BayesSUR with full documentation is available at https://cran.r-project.org/web/packages/BayesSUR/.

Identifiants

pubmed: 35001978
doi: 10.1111/rssc.12490
pmc: PMC7612194
mid: EMS140622
doi:

Types de publication

Journal Article

Langues

eng

Pagination

886-908

Subventions

Organisme : Medical Research Council
ID : MC_UP_0801/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/M013138/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/M013138/2
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/S019669/1
Pays : United Kingdom

Références

Am J Hum Genet. 2021 Jun 3;108(6):983-1000
pubmed: 33909991
J Comput Graph Stat. 2009 Dec 1;18(4):930-940
pubmed: 20981245
Bioinformatics. 2012 May 15;28(10):1353-8
pubmed: 22492648
Am J Epidemiol. 2017 Nov 1;186(9):1084-1096
pubmed: 29106475
Biostatistics. 2017 Oct 1;18(4):618-636
pubmed: 28334312
Bioinformatics. 2014 Jul 15;30(14):2026-34
pubmed: 24665129
Stat Appl Genet Mol Biol. 2012 Jul 12;11(4):
pubmed: 22850063
Nat Commun. 2016 Mar 23;7:11122
pubmed: 27005778
Genetics. 2011 Dec;189(4):1449-59
pubmed: 21926303
Bioinformatics. 2018 Mar 1;34(5):890-892
pubmed: 28961702
Genetics. 2007 May;176(1):611-23
pubmed: 17339210
Bayesian Anal. 2019 Dec;14(4):1221-1244
pubmed: 33859772
PLoS Comput Biol. 2010 Apr 08;6(4):e1000737
pubmed: 20386736
Analyst. 2009 Sep;134(9):1781-5
pubmed: 19684899
PLoS Comput Biol. 2012 Jan;8(1):e1002330
pubmed: 22241974
J Comput Graph Stat. 2010 Fall;19(4):947-962
pubmed: 24963268
Nat Genet. 2012 Jan 29;44(3):269-76
pubmed: 22286219
PLoS Genet. 2012;8(8):e1002907
pubmed: 22916037
Nat Genet. 2009 Jan;41(1):35-46
pubmed: 19060910
Bioinformatics. 2016 Feb 15;32(4):523-32
pubmed: 26504141
Bioinformatics. 2016 Jul 1;32(13):1981-9
pubmed: 27153689
Biometrics. 2013 Jun;69(2):447-57
pubmed: 23607608

Auteurs

Leonardo Bottolo (L)

Department of Medical Genetics, University of Cambridge, Cambridge, UK.
The Alan Turing Institute, London, UK.
MRC Biostatistics Unit, Cambridge, UK.

Marco Banterle (M)

Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK.

Sylvia Richardson (S)

The Alan Turing Institute, London, UK.
MRC Biostatistics Unit, Cambridge, UK.

Mika Ala-Korpela (M)

Computational Medicine, Faculty of Medicine, University of Oulu and Biocenter Oulu, Oulu, Finland.
NMR Metabolomics Laboratory, School of Pharmacy, University of Eastern Finland, Kuopio, Finland.

Marjo-Riitta Järvelin (MR)

Center for Life Course Health Research, University of Oulu, Oulu, Finland.
Biocenter Oulu, University of Oulu, Oulu, Finland.
Department of Epidemiology and Biostatistics, Imperial College London, London, UK.
MRC-PHE Centre for Environment and Health, Imperial College London, London, UK.
Department of Life Sciences, Brunel University London, Uxbridge, UK.

Alex Lewin (A)

Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK.

Classifications MeSH