A hierarchical Bayesian mixture model for inferring the expression state of genes in transcriptomes.


Journal

Proceedings of the National Academy of Sciences of the United States of America
ISSN: 1091-6490
Titre abrégé: Proc Natl Acad Sci U S A
Pays: United States
ID NLM: 7505876

Informations de publication

Date de publication:
11 08 2020
Historique:
pubmed: 28 7 2020
medline: 21 10 2020
entrez: 26 7 2020
Statut: ppublish

Résumé

Transcriptomes are key to understanding the relationship between genotype and phenotype. The ability to infer the expression state (active or inactive) of genes in the transcriptome offers unique benefits for addressing this issue. For example, qualitative changes in gene expression may underly the origin of novel phenotypes, and expression states are readily comparable between tissues and species. However, inferring the expression state of genes is a surprisingly difficult problem, owing to the complex biological and technical processes that give rise to observed transcriptomic datasets. Here, we develop a hierarchical Bayesian mixture model that describes this complex process and allows us to infer expression state of genes from replicate transcriptomic libraries. We explore the statistical behavior of this method with analyses of simulated datasets-where we demonstrate its ability to correctly infer true (known) expression states-and empirical-benchmark datasets, where we demonstrate that the expression states inferred from RNA-sequencing (RNA-seq) datasets using our method are consistent with those based on independent evidence. The power of our method to correctly infer expression states is generally high and remarkably, approaches the maximum possible power for this inference problem. We present an empirical analysis of primate-brain transcriptomes, which identifies genes that have a unique expression state in humans. Our method is implemented in the freely available R package zigzag.

Identifiants

pubmed: 32709743
pii: 1919748117
doi: 10.1073/pnas.1919748117
pmc: PMC7431084
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

19339-19346

Subventions

Organisme : NIGMS NIH HHS
ID : F32 GM125107
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM122592
Pays : United States

Informations de copyright

Copyright © 2020 the Author(s). Published by PNAS.

Déclaration de conflit d'intérêts

The authors declare no competing interest.

Références

Science. 2015 Feb 6;347(6222):664-7
pubmed: 25657249
Ups J Med Sci. 2010 Feb;115(1):11-20
pubmed: 20187846
Pflugers Arch. 2004 Feb;447(5):629-35
pubmed: 12811560
Mol Syst Biol. 2012;8:608
pubmed: 22968442
Nat Rev Genet. 2012 Mar 13;13(4):227-32
pubmed: 22411467
Proc Natl Acad Sci U S A. 2020 Aug 11;117(32):19339-19346
pubmed: 32709743
Nature. 2013 Jul 4;499(7456):79-82
pubmed: 23676674
Genome Biol. 2018 Jun 27;19(1):83
pubmed: 29950183
Mol Syst Biol. 2011 Jun 07;7:497
pubmed: 21654674
Mol Cell. 2013 Nov 21;52(4):473-84
pubmed: 24267449
Genome Res. 2014 Mar;24(3):411-21
pubmed: 24318729
BMC Syst Biol. 2015 Mar 21;9:16
pubmed: 25888718
Neuron. 2017 Apr 5;94(1):138-152.e5
pubmed: 28384468
PLoS Biol. 2010 May 18;8(5):e1000371
pubmed: 20502517
Bioinformatics. 2018 Sep 1;34(17):i647-i655
pubmed: 30423089
PLoS Genet. 2017 Jan 25;13(1):e1006554
pubmed: 28121980
Nature. 2012 Sep 6;489(7414):101-8
pubmed: 22955620
Bioinformatics. 2018 Oct 1;34(19):3340-3348
pubmed: 29688282
Theory Biosci. 2013 Sep;132(3):159-64
pubmed: 23615947
Mol Biol Evol. 2018 Jun 1;35(6):1422-1436
pubmed: 29554332
BMC Genomics. 2013 Nov 11;14:778
pubmed: 24215113
Cell. 2016 Apr 21;165(3):535-50
pubmed: 27104977
Elife. 2014 Jul 29;3:e03023
pubmed: 25073701
Nature. 2015 Feb 19;518(7539):317-30
pubmed: 25693563
Genome Res. 2014 Mar;24(3):422-30
pubmed: 24318730
Nat Methods. 2008 Jul;5(7):621-8
pubmed: 18516045
Proc Natl Acad Sci U S A. 2013 Oct 29;110(44):17778-83
pubmed: 24128763
Nat Struct Mol Biol. 2007 Feb;14(2):103-5
pubmed: 17277804
PLoS Comput Biol. 2009 Dec;5(12):e1000598
pubmed: 20011106
PLoS Genet. 2014 Jul 24;10(7):e1004525
pubmed: 25057982
Science. 2013 Nov 29;342(6162):1100-4
pubmed: 24136357
PLoS Genet. 2010 Sep 02;6(9):e1001090
pubmed: 20824076
Science. 2015 May 8;348(6235):660-5
pubmed: 25954002
Nature. 2011 May 5;473(7345):43-9
pubmed: 21441907
Science. 2017 Nov 24;358(6366):1027-1032
pubmed: 29170230
Proteomics. 2010 Dec;10(23):4209-12
pubmed: 21089048
Cell Rep. 2015 Mar 3;10(8):1398-409
pubmed: 25732829
Cell. 2012 Oct 26;151(3):476-82
pubmed: 23101621
Nature. 2011 May 19;473(7347):337-42
pubmed: 21593866
Science. 2015 May 8;348(6235):648-60
pubmed: 25954001
Cell. 2012 Oct 26;151(3):671-83
pubmed: 23101633

Auteurs

Ammon Thompson (A)

Department of Evolution and Ecology, University of California, Davis, CA 95616 ammonthompson@gmail.com.

Michael R May (MR)

Department of Evolution and Ecology, University of California, Davis, CA 95616.

Brian R Moore (BR)

Department of Evolution and Ecology, University of California, Davis, CA 95616.

Artyom Kopp (A)

Department of Evolution and Ecology, University of California, Davis, CA 95616.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH