On the Number of Non-equivalent Ancestral Configurations for Matching Gene Trees and Species Trees.


Journal

Bulletin of mathematical biology
ISSN: 1522-9602
Titre abrégé: Bull Math Biol
Pays: United States
ID NLM: 0401404

Informations de publication

Date de publication:
02 2019
Historique:
received: 15 03 2017
accepted: 31 08 2017
pubmed: 16 9 2017
medline: 7 2 2020
entrez: 16 9 2017
Statut: ppublish

Résumé

An ancestral configuration is one of the combinatorially distinct sets of gene lineages that, for a given gene tree, can reach a given node of a specified species tree. Ancestral configurations have appeared in recursive algebraic computations of the conditional probability that a gene tree topology is produced under the multispecies coalescent model for a given species tree. For matching gene trees and species trees, we study the number of ancestral configurations, considered up to an equivalence relation introduced by Wu (Evolution 66:763-775, 2012) to reduce the complexity of the recursive probability computation. We examine the largest number of non-equivalent ancestral configurations possible for a given tree size n. Whereas the smallest number of non-equivalent ancestral configurations increases polynomially with n, we show that the largest number increases with [Formula: see text], where k is a constant that satisfies [Formula: see text]. Under a uniform distribution on the set of binary labeled trees with a given size n, the mean number of non-equivalent ancestral configurations grows exponentially with n. The results refine an earlier analysis of the number of ancestral configurations considered without applying the equivalence relation, showing that use of the equivalence relation does not alter the exponential nature of the increase with tree size.

Identifiants

pubmed: 28913585
doi: 10.1007/s11538-017-0342-x
pii: 10.1007/s11538-017-0342-x
pmc: PMC5851864
mid: NIHMS906480
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

384-407

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM117590
Pays : United States

Références

IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):913-925
pubmed: 26452289
Evolution. 2005 Jan;59(1):24-37
pubmed: 15792224
Theor Popul Biol. 2010 May;77(3):145-51
pubmed: 20064540
J Math Biol. 2011 Jun;62(6):833-62
pubmed: 20652704
Evolution. 2012 Mar;66(3):763-775
pubmed: 22380439
J Comput Biol. 2015 Oct;22(10):918-29
pubmed: 25973633
J Comput Biol. 2007 Apr;14(3):360-77
pubmed: 17563317
IEEE/ACM Trans Comput Biol Bioinform. 2013 Sep-Oct;10(5):1253-62
pubmed: 24524157
J Comput Biol. 2007 May;14(4):517-35
pubmed: 17572027
J Comput Biol. 2017 Sep;24(9):831-850
pubmed: 28437136

Auteurs

Filippo Disanto (F)

Department of Biology, Stanford University, Stanford, CA, USA. filippo.disanto@unipi.it.
Department of Mathematics, University of Pisa, Pisa, Italy. filippo.disanto@unipi.it.

Noah A Rosenberg (NA)

Department of Biology, Stanford University, Stanford, CA, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Amaryllidaceae Alkaloids Lycoris NADPH-Ferrihemoprotein Reductase Gene Expression Regulation, Plant Plant Proteins

Classifications MeSH