NetRAX: accurate and fast maximum likelihood phylogenetic network inference.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
02 08 2022
02 08 2022
Historique:
received:
16
12
2021
revised:
11
05
2022
accepted:
14
06
2022
pubmed:
18
6
2022
medline:
15
11
2022
entrez:
17
6
2022
Statut:
ppublish
Résumé
Phylogenetic networks can represent non-treelike evolutionary scenarios. Current, actively developed approaches for phylogenetic network inference jointly account for non-treelike evolution and incomplete lineage sorting (ILS). Unfortunately, this induces a very high computational complexity and current tools can only analyze small datasets. We present NetRAX, a tool for maximum likelihood (ML) inference of phylogenetic networks in the absence of ILS. Our tool leverages state-of-the-art methods for efficiently computing the phylogenetic likelihood function on trees, and extends them to phylogenetic networks via the notion of 'displayed trees'. NetRAX can infer ML phylogenetic networks from partitioned multiple sequence alignments and returns the inferred networks in Extended Newick format. On simulated data, our results show a very low relative difference in Bayesian Information Criterion (BIC) score and a near-zero unrooted softwired cluster distance to the true, simulated networks. With NetRAX, a network inference on a partitioned alignment with 8000 sites, 30 taxa and 3 reticulations completes within a few minutes on a standard laptop. Our implementation is available under the GNU General Public License v3.0 at https://github.com/lutteropp/NetRAX. Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 35713506
pii: 6609768
doi: 10.1093/bioinformatics/btac396
pmc: PMC9344847
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
3725-3733Subventions
Organisme : Klaus Tschira Foundation
ID : STA 860/6-2
Organisme : French Agence Nationale de la Recherche program (CoCoAlSeq project
ID : ANR-19-CE45-0012
Informations de copyright
© The Author(s) 2022. Published by Oxford University Press.
Références
PLoS Comput Biol. 2015 Apr 07;11(4):e1004135
pubmed: 25849429
Proc IEEE Comput Syst Bioinform Conf. 2005;:93-102
pubmed: 16447967
PLoS Comput Biol. 2017 Aug 1;13(8):e1005611
pubmed: 28763439
Mol Biol Evol. 2018 Feb 1;35(2):504-517
pubmed: 29220490
Sci Adv. 2019 May 01;5(5):eaav9188
pubmed: 31049399
Mol Phylogenet Evol. 2017 Jun;111:206-218
pubmed: 28347887
Bioinformatics. 2006 Nov 1;22(21):2604-11
pubmed: 16928736
J Mol Evol. 1981;17(6):368-76
pubmed: 7288891
Syst Biol. 2018 Jul 1;67(4):735-740
pubmed: 29514307
BMC Bioinformatics. 2012;13 Suppl 19:S12
pubmed: 23281614
Syst Biol. 2020 May 1;69(3):593-601
pubmed: 31432090
Syst Biol. 2018 Sep 1;67(5):743-744
pubmed: 29534232
Bioinformatics. 2019 Nov 1;35(21):4453-4455
pubmed: 31070718
Mol Biol Evol. 2017 Dec 1;34(12):3292-3298
pubmed: 28961984
PLoS Genet. 2016 Mar 07;12(3):e1005896
pubmed: 26950302
BMC Bioinformatics. 2016 Oct 13;17(1):422
pubmed: 27737628
Comput Appl Biosci. 1997 Jun;13(3):235-8
pubmed: 9183526