Scalable Bayesian Divergence Time Estimation With Ratio Transformations.
Bayesian inference
Hamiltonian Monte Carlo
divergence time estimation
effective sample size
pathogens
phylogenetics
ratio transformation
Journal
Systematic biology
ISSN: 1076-836X
Titre abrégé: Syst Biol
Pays: England
ID NLM: 9302532
Informations de publication
Date de publication:
01 Nov 2023
01 Nov 2023
Historique:
received:
02
01
2022
revised:
13
06
2023
accepted:
30
06
2023
pmc-release:
17
07
2024
medline:
8
11
2023
pubmed:
17
7
2023
entrez:
17
7
2023
Statut:
ppublish
Résumé
Divergence time estimation is crucial to provide temporal signals for dating biologically important events from species divergence to viral transmissions in space and time. With the advent of high-throughput sequencing, recent Bayesian phylogenetic studies have analyzed hundreds to thousands of sequences. Such large-scale analyses challenge divergence time reconstruction by requiring inference on highly correlated internal node heights that often become computationally infeasible. To overcome this limitation, we explore a ratio transformation that maps the original $N-1$ internal node heights into a space of one height parameter and $N-2$ ratio parameters. To make the analyses scalable, we develop a collection of linear-time algorithms to compute the gradient and Jacobian-associated terms of the log-likelihood with respect to these ratios. We then apply Hamiltonian Monte Carlo sampling with the ratio transform in a Bayesian framework to learn the divergence times in 4 pathogenic viruses (West Nile virus, rabies virus, Lassa virus, and Ebola virus) and the coralline red algae. Our method both resolves a mixing issue in the West Nile virus example and improves inference efficiency by at least 5-fold for the Lassa and rabies virus examples as well as for the algae example. Our method now also makes it computationally feasible to incorporate mixed-effects molecular clock models for the Ebola virus example, confirms the findings from the original study, and reveals clearer multimodal distributions of the divergence times of some clades of interest.
Identifiants
pubmed: 37458991
pii: 7225327
doi: 10.1093/sysbio/syad039
pmc: PMC10636426
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
1136-1153Subventions
Organisme : NIAID NIH HHS
ID : U19 AI135995
Pays : United States
Organisme : NIH HHS
ID : U19 AI135995
Pays : United States
Informations de copyright
© The Author(s) 2023. Published by Oxford University Press on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Références
Proc Natl Acad Sci U S A. 2007 May 8;104(19):7993-8
pubmed: 17470818
Proc Natl Acad Sci U S A. 2012 Sep 11;109(37):15066-71
pubmed: 22927414
Mol Ecol. 2015 Dec;24(24):6007-12
pubmed: 26769402
Mol Biol Evol. 2020 Oct 1;37(10):3047-3060
pubmed: 32458974
Bioinformatics. 2017 Jun 15;33(12):1798-1805
pubmed: 28200071
PLoS Comput Biol. 2014 Dec 04;10(12):e1003919
pubmed: 25474353
Mol Biol Evol. 1998 Dec;15(12):1647-57
pubmed: 9866200
PeerJ. 2019 Dec 18;7:e8272
pubmed: 31976168
Philos Trans R Soc Lond B Biol Sci. 2016 Jul 19;371(1699):
pubmed: 27325829
J Theor Biol. 2010 Dec 7;267(3):396-404
pubmed: 20851708
Wellcome Open Res. 2020 Mar 30;5:53
pubmed: 32923688
Virus Evol. 2018 Jun 08;4(1):vey016
pubmed: 29942656
PeerJ Comput Sci. 2023 Sep 1;9:e1516
pubmed: 37705656
Science. 2011 Nov 25;334(6059):1091-7
pubmed: 22116879
PLoS Biol. 2006 May;4(5):e88
pubmed: 16683862
Syst Biol. 2007 Jun;56(3):453-66
pubmed: 17558967
Mol Biol Evol. 2006 Jan;23(1):212-26
pubmed: 16177230
Mol Biol Evol. 2019 Aug 1;36(8):1793-1803
pubmed: 31004175
N Engl J Med. 2021 Apr 1;384(13):1240-1247
pubmed: 33789012
Syst Biol. 2016 Nov;65(6):1041-1056
pubmed: 27368344
Mol Biol Evol. 2010 Aug;27(8):1877-85
pubmed: 20203288
PLoS Pathog. 2019 Oct 31;15(10):e1008042
pubmed: 31671157
Science. 2011 Oct 28;334(6055):521-4
pubmed: 21940861
Science. 2020 Jun 19;368(6497):1367-1370
pubmed: 32554594
Mol Biol Evol. 2001 Mar;18(3):352-61
pubmed: 11230536
Virus Evol. 2019 Sep 05;5(2):vez036
pubmed: 31720009
Genetics. 2003 Aug;164(4):1645-56
pubmed: 12930768
BMC Evol Biol. 2015 Jun 26;15:119
pubmed: 26111824
Mol Phylogenet Evol. 2020 Sep;150:106845
pubmed: 32360706
Cell. 2015 Aug 13;162(4):738-50
pubmed: 26276630
Nat Commun. 2020 Oct 9;11(1):5110
pubmed: 33037213
Syst Biol. 2019 Nov 1;68(6):1052-1061
pubmed: 31034053
Syst Biol. 2018 Jul 1;67(4):594-615
pubmed: 29342307
Syst Biol. 2021 Feb 10;70(2):258-267
pubmed: 32687171