HPC: Hierarchical phylogeny construction.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2019
Historique:
received: 06 03 2019
accepted: 05 08 2019
entrez: 23 8 2019
pubmed: 23 8 2019
medline: 5 3 2020
Statut: epublish

Résumé

Rapid improvements in DNA sequencing technology have resulted in long genome sequences for a large number of similar isolates with a wide range of single nucleotide polymorphism (SNP) rates, where some isolates can have thousands of times lower SNP rates than others. Genome sequences of this kind are a challenge to existing methods for construction of phylogenetic trees. We address the issues by developing a hierarchical approach to phylogeny construction. In this method, the construction is performed at multiple levels, where at each level, groups of isolates with similar levels of similarity are identified and their phylogenetic trees are constructed. Time savings are achieved by using a sufficiently large number of columns from the input alignment, instead of all its columns. Our results show that the new approach is 20-60 times more efficient than existing programs and more accurate in situations where highly similar isolates have a wide range of SNP rates.

Identifiants

pubmed: 31437210
doi: 10.1371/journal.pone.0221357
pii: PONE-D-19-06578
pmc: PMC6705828
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0221357

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

J Comput Biol. 2008 Mar;15(2):129-38
pubmed: 18312146
PLoS One. 2010 Mar 10;5(3):e9490
pubmed: 20224823
Syst Biol. 2010 May;59(3):307-21
pubmed: 20525638
BMC Genomics. 2012;13 Suppl 7:S6
pubmed: 23281601
Bioinformatics. 2014 May 1;30(9):1312-3
pubmed: 24451623
BMC Genomics. 2014 Feb 26;15:162
pubmed: 24571581
Mol Biol Evol. 2015 Jan;32(1):268-74
pubmed: 25371430
PLoS One. 2016 Jun 24;11(6):e0158183
pubmed: 27341103
Genome Biol. 2017 Oct 3;18(1):186
pubmed: 28974235
Genes (Basel). 2019 Jan 22;10(2):
pubmed: 30678245
Mol Biol Evol. 1987 Jul;4(4):406-25
pubmed: 3447015
J Mol Evol. 1981;17(6):368-76
pubmed: 7288891
Comput Appl Biosci. 1997 Jun;13(3):235-8
pubmed: 9183526

Auteurs

Anindya Das (A)

Department of Computer Science, Iowa State University, Ames, Iowa, United States of America.

Xiaoqiu Huang (X)

Department of Computer Science, Iowa State University, Ames, Iowa, United States of America.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Classifications MeSH