Enumeration of compact coalescent histories for matching gene trees and species trees.


Journal

Journal of mathematical biology
ISSN: 1432-1416
Titre abrégé: J Math Biol
Pays: Germany
ID NLM: 7502105

Informations de publication

Date de publication:
01 2019
Historique:
received: 03 01 2018
revised: 12 07 2018
pubmed: 18 8 2018
medline: 23 6 2020
entrez: 18 8 2018
Statut: ppublish

Résumé

Compact coalescent histories are combinatorial structures that describe for a given gene tree G and species tree S possibilities for the numbers of coalescences of G that take place on the various branches of S. They have been introduced as a data structure for evaluating probabilities of gene tree topologies conditioning on species trees, reducing computation time compared to standard coalescent histories. When gene trees and species trees have a matching labeled topology [Formula: see text], the compact coalescent histories of t are encoded by particular integer labelings of the branches of t, each integer specifying the number of coalescent events of G present in a branch of S. For matching gene trees and species trees, we investigate enumerative properties of compact coalescent histories. We report a recursion for the number of compact coalescent histories for matching gene trees and species trees, using it to study the numbers of compact coalescent histories for small trees. We show that the number of compact coalescent histories equals the number of coalescent histories if and only if the labeled topology is a caterpillar or a bicaterpillar. The number of compact coalescent histories is seen to increase with tree imbalance: we prove that as the number of taxa n increases, the exponential growth of the number of compact coalescent histories follows [Formula: see text] in the case of caterpillar or bicaterpillar labeled topologies and approximately [Formula: see text] and [Formula: see text] for lodgepole and balanced topologies, respectively. We prove that the mean number of compact coalescent histories of a labeled topology of size n selected uniformly at random grows with [Formula: see text]. Our results contribute to the analysis of the computational complexity of algorithms for computing gene tree probabilities, and to the combinatorial study of gene trees and species trees more generally.

Identifiants

pubmed: 30116881
doi: 10.1007/s00285-018-1271-5
pii: 10.1007/s00285-018-1271-5
pmc: PMC7661175
mid: NIHMS1643740
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

155-188

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM117590
Pays : United States
Organisme : NIH HHS
ID : R01 GM117590
Pays : United States

Références

IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):913-925
pubmed: 26452289
Bioinformatics. 2016 Jun 15;32(12):i225-i233
pubmed: 27307621
PLoS Comput Biol. 2009 Sep;5(9):e1000501
pubmed: 19749978
Evolution. 2005 Jan;59(1):24-37
pubmed: 15792224
Theor Popul Biol. 2010 May;77(3):145-51
pubmed: 20064540
Math Biosci. 2012 Jan;235(1):45-55
pubmed: 22075548
Syst Biol. 2008 Feb;57(1):131-40
pubmed: 18300026
Bull Math Biol. 2019 Feb;81(2):384-407
pubmed: 28913585
Theor Popul Biol. 2015 Nov;105:17-23
pubmed: 26365816
PLoS Genet. 2006 May;2(5):e68
pubmed: 16733550
Evolution. 2012 Mar;66(3):763-775
pubmed: 22380439
J Comput Biol. 2015 Oct;22(10):918-29
pubmed: 25973633
J Comput Biol. 2007 Apr;14(3):360-77
pubmed: 17563317
IEEE/ACM Trans Comput Biol Bioinform. 2013 Sep-Oct;10(5):1253-62
pubmed: 24524157
J Comput Biol. 2007 May;14(4):517-35
pubmed: 17572027
J Comput Biol. 2017 Sep;24(9):831-850
pubmed: 28437136

Auteurs

Filippo Disanto (F)

Department of Mathematics, University of Pisa, Pisa, 56126, Italy. filippo.disanto@unipi.it.

Noah A Rosenberg (NA)

Department of Biology, Stanford University, Stanford, CA, 94305, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Amaryllidaceae Alkaloids Lycoris NADPH-Ferrihemoprotein Reductase Gene Expression Regulation, Plant Plant Proteins

Classifications MeSH