A computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional quantitative trait loci discovery.
Bayesian computation
Markov chain Monte Carlo
covariance reparametrisation
graphical models
metabolomics
quantitative trait loci
Journal
Journal of the Royal Statistical Society. Series C, Applied statistics
ISSN: 0035-9254
Titre abrégé: J R Stat Soc Ser C Appl Stat
Pays: England
ID NLM: 101086541
Informations de publication
Date de publication:
Aug 2021
Aug 2021
Historique:
entrez:
10
1
2022
pubmed:
11
1
2022
medline:
11
1
2022
Statut:
ppublish
Résumé
Our work is motivated by the search for metabolite quantitative trait loci (QTL) in a cohort of more than 5000 people. There are 158 metabolites measured by NMR spectroscopy in the 31-year follow-up of the Northern Finland Birth Cohort 1966 (NFBC66). These metabolites, as with many multivariate phenotypes produced by high-throughput biomarker technology, exhibit strong correlation structures. Existing approaches for combining such data with genetic variants for multivariate QTL analysis generally ignore phenotypic correlations or make restrictive assumptions about the associations between phenotypes and genetic loci. We present a computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional data, with cell-sparse variable selection and sparse graphical structure for covariance selection. Cell sparsity allows different phenotype responses to be associated with different genetic predictors and the graphical structure is used to represent the conditional dependencies between phenotype variables. To achieve feasible computation of the large model space, we exploit a factorisation of the covariance matrix. Applying the model to the NFBC66 data with 9000 directly genotyped single nucleotide polymorphisms, we are able to simultaneously estimate genotype-phenotype associations and the residual dependence structure among the metabolites. The R package BayesSUR with full documentation is available at https://cran.r-project.org/web/packages/BayesSUR/.
Identifiants
pubmed: 35001978
doi: 10.1111/rssc.12490
pmc: PMC7612194
mid: EMS140622
doi:
Types de publication
Journal Article
Langues
eng
Pagination
886-908Subventions
Organisme : Medical Research Council
ID : MC_UP_0801/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/M013138/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/M013138/2
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/S019669/1
Pays : United Kingdom
Références
Am J Hum Genet. 2021 Jun 3;108(6):983-1000
pubmed: 33909991
J Comput Graph Stat. 2009 Dec 1;18(4):930-940
pubmed: 20981245
Bioinformatics. 2012 May 15;28(10):1353-8
pubmed: 22492648
Am J Epidemiol. 2017 Nov 1;186(9):1084-1096
pubmed: 29106475
Biostatistics. 2017 Oct 1;18(4):618-636
pubmed: 28334312
Bioinformatics. 2014 Jul 15;30(14):2026-34
pubmed: 24665129
Stat Appl Genet Mol Biol. 2012 Jul 12;11(4):
pubmed: 22850063
Nat Commun. 2016 Mar 23;7:11122
pubmed: 27005778
Genetics. 2011 Dec;189(4):1449-59
pubmed: 21926303
Bioinformatics. 2018 Mar 1;34(5):890-892
pubmed: 28961702
Genetics. 2007 May;176(1):611-23
pubmed: 17339210
Bayesian Anal. 2019 Dec;14(4):1221-1244
pubmed: 33859772
PLoS Comput Biol. 2010 Apr 08;6(4):e1000737
pubmed: 20386736
Analyst. 2009 Sep;134(9):1781-5
pubmed: 19684899
PLoS Comput Biol. 2012 Jan;8(1):e1002330
pubmed: 22241974
J Comput Graph Stat. 2010 Fall;19(4):947-962
pubmed: 24963268
Nat Genet. 2012 Jan 29;44(3):269-76
pubmed: 22286219
PLoS Genet. 2012;8(8):e1002907
pubmed: 22916037
Nat Genet. 2009 Jan;41(1):35-46
pubmed: 19060910
Bioinformatics. 2016 Feb 15;32(4):523-32
pubmed: 26504141
Bioinformatics. 2016 Jul 1;32(13):1981-9
pubmed: 27153689
Biometrics. 2013 Jun;69(2):447-57
pubmed: 23607608