Fully Bayesian analysis of RNA-seq counts for the detection of gene expression heterosis.

CUDA empirical Bayes graphics processing unit hierarchical model hybrid vigor negative-binomial

Journal

Journal of the American Statistical Association
ISSN: 0162-1459
Titre abrégé: J Am Stat Assoc
Pays: United States
ID NLM: 01510020R

Informations de publication

Date de publication:
2019
Historique:
entrez: 30 7 2019
pubmed: 30 7 2019
medline: 30 7 2019
Statut: ppublish

Résumé

Heterosis, or hybrid vigor, is the enhancement of the phenotype of hybrid progeny relative to their inbred parents. Heterosis is extensively used in agriculture, and the underlying mechanisms are unclear. To investigate the molecular basis of phenotypic heterosis, researchers search tens of thousands of genes for heterosis with respect to expression in the transcriptome. Difficulty arises in the assessment of heterosis due to composite null hypotheses and non-uniform distributions for p-values under these null hypotheses. Thus, we develop a general hierarchical model for count data and a fully Bayesian analysis in which an efficient parallelized Markov chain Monte Carlo algorithm ameliorates the computational burden. We use our method to detect gene expression heterosis in a two-hybrid plant-breeding scenario, both in a real RNA-seq maize dataset and in simulation studies. In the simulation studies, we show our method has well-calibrated posterior probabilities and credible intervals when the model assumed in analysis matches the model used to simulate the data. Although model misspecification can adversely affect calibration, the methodology is still able to accurately rank genes. Finally, we show that hyperparameter posteriors are extremely narrow and an empirical Bayes (eBayes) approach based on posterior means from the fully Bayesian analysis provides virtually equivalent posterior probabilities, credible intervals, and gene rankings relative to the fully Bayesian solution. This evidence of equivalence provides support for the use of eBayes procedures in RNA-seq data analysis if accurate hyperparameter estimates can be obtained.

Identifiants

pubmed: 31354180
doi: 10.1080/01621459.2018.1497496
pmc: PMC6660196
mid: NIHMS990037
doi:

Types de publication

Journal Article

Langues

eng

Pagination

610-621

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM109458
Pays : United States

Références

Stat Med. 2009 Nov 10;28(25):3049-67
pubmed: 19630097
J Agric Biol Environ Stat. 2015 Dec;20(4):614-628
pubmed: 27147815
Plant Physiol. 2001 Apr;125(4):2189-202
pubmed: 11299398
Genetics. 2006 Jan;172(1):507-17
pubmed: 16172500
Proc Natl Acad Sci U S A. 2006 May 2;103(18):6805-10
pubmed: 16641103
Genome Biol. 2010;11(12):220
pubmed: 21176179
Proc Natl Acad Sci U S A. 1997 Aug 19;94(17):9226-31
pubmed: 11038567
Genome Biol. 2010;11(10):R106
pubmed: 20979621
Genome Res. 2012 Dec;22(12):2445-54
pubmed: 23086286
Bayesian Anal. 2013;8(2):
pubmed: 24368932
BMC Bioinformatics. 2018 Apr 11;19(1):127
pubmed: 29642837
Trends Genet. 2007 Feb;23(2):60-6
pubmed: 17188398
Brief Funct Genomics. 2010 Mar;9(2):118-28
pubmed: 20130067
Nucleic Acids Res. 2012 May;40(10):4288-97
pubmed: 22287627
Science. 2009 Nov 20;326(5956):1112-5
pubmed: 19965430
PLoS One. 2013 Dec 09;8(12):e81415
pubmed: 24349066
Theor Appl Genet. 2006 Feb;112(3):421-9
pubmed: 16362278
Bioinformatics. 2013 Apr 15;29(8):1035-43
pubmed: 23428641
Genome Biol. 2010;11(3):R25
pubmed: 20196867
Biometrics. 2013 Sep;69(3):594-605
pubmed: 23889143
Biostatistics. 2013 Apr;14(2):232-43
pubmed: 23001152
J Agric Biol Environ Stat. 2015 Dec;20(4):598-613
pubmed: 27110090
J Agric Biol Environ Stat. 2014 Sep;19(3):319-337
pubmed: 25435758
J Agric Biol Environ Stat. 2015 Dec;20(4):555-576
pubmed: 27570441
Nat Genet. 2010 May;42(5):459-63
pubmed: 20348958
BMC Bioinformatics. 2010 Aug 10;11:422
pubmed: 20698981
Genome Res. 2007 Mar;17(3):264-75
pubmed: 17255553
BMC Bioinformatics. 2014 Apr 26;15:116
pubmed: 24766777

Auteurs

Will Landau (W)

Department of Statistics, Iowa State University.

Jarad Niemi (J)

Department of Statistics, Iowa State University.

Dan Nettleton (D)

Department of Statistics, Iowa State University.

Classifications MeSH