Fidelity of hyperbolic space for Bayesian phylogenetic inference.


Journal

PLoS computational biology
ISSN: 1553-7358
Abbreviated title: PLoS Comput Biol
Country: United States
NLM ID: 101238922

Publication information

Publication date:
04 2023
History:
received: 23 08 2022
accepted: 08 04 2023
revised: 08 05 2023
medline: 10 05 2023
pubmed: 26 04 2023
entrez: 26 04 2023
Status: epublish

Abstract

Bayesian inference is a gold standard for computing distributions over phylogenies. However, Bayesian phylogenetics faces the challenging computational problem of moving through the high-dimensional space of trees. Fortunately, hyperbolic space offers a low-dimensional representation of tree-like data. In this paper, we embed genomic sequences as points in hyperbolic space and perform hyperbolic Markov chain Monte Carlo for Bayesian inference in this space. The posterior probability of an embedding is computed by decoding a neighbour-joining tree from the embedding locations of the sequences. We empirically demonstrate the fidelity of this method on eight data sets. The sampled posterior distribution recovers the splits and branch lengths to a high degree over a range of curvatures and dimensions. We systematically investigated the effects of the embedding space's curvature and dimension on the Markov chain's performance in these data sets, demonstrating the suitability of hyperbolic space for phylogenetic inference.
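
The decoding step described in the abstract starts from pairwise distances between the embedded sequences. The following is a minimal sketch of that distance computation only, not the authors' implementation. It assumes the Poincaré ball model of hyperbolic space, which the abstract does not specify, and the function names `poincare_distance` and `distance_matrix` and the `curvature` parameter are hypothetical. The resulting matrix is the kind of input a neighbour-joining routine would take.

```python
import math


def poincare_distance(x, y, curvature=-1.0):
    """Geodesic distance between two points of a Poincare ball.

    Assumption: the ball has constant negative curvature `curvature`,
    so its radius is 1 / sqrt(-curvature). Points are coordinate lists.
    """
    if curvature >= 0:
        raise ValueError("curvature must be negative for hyperbolic space")
    k = -curvature
    sq_x = sum(a * a for a in x)
    sq_y = sum(b * b for b in y)
    if k * sq_x >= 1.0 or k * sq_y >= 1.0:
        raise ValueError("points must lie strictly inside the ball")
    sq_diff = sum((a - b) ** 2 for a, b in zip(x, y))
    # Closed-form distance: arcosh(1 + 2k|x-y|^2 / ((1-k|x|^2)(1-k|y|^2))) / sqrt(k)
    arg = 1.0 + 2.0 * k * sq_diff / ((1.0 - k * sq_x) * (1.0 - k * sq_y))
    return math.acosh(arg) / math.sqrt(k)


def distance_matrix(points, curvature=-1.0):
    """Symmetric matrix of pairwise hyperbolic distances between embedded points."""
    n = len(points)
    return [
        [poincare_distance(points[i], points[j], curvature) for j in range(n)]
        for i in range(n)
    ]


if __name__ == "__main__":
    # Three hypothetical two-dimensional embedding locations.
    embedding = [[0.0, 0.0], [0.5, 0.0], [0.0, -0.3]]
    for row in distance_matrix(embedding, curvature=-1.0):
        print(["%.4f" % d for d in row])
```

Points near the boundary of the ball are far apart in hyperbolic distance even when their Euclidean coordinates are close, which is one way to see why few dimensions can suffice for tree-like data. Changing `curvature` rescales this effect.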

Identifiers

pubmed: 37099595
doi: 10.1371/journal.pcbi.1011084
pii: PCOMPBIOL-D-22-01264
pmc: PMC10166537

Publication types

Journal Article; Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

e1011084

Copyright information

Copyright: © 2023 Macaulay et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Conflict of interest statement

The authors have declared that no competing interests exist.

Authors

Matthew Macaulay (M)

University of Technology Sydney, Australian Institute for Microbiology & Infection, Sydney, Australia.

Aaron Darling (A)

Illumina Australia Pty Ltd, Sydney, Australia.

Mathieu Fourment (M)

University of Technology Sydney, Australian Institute for Microbiology & Infection, Sydney, Australia.
