BEAGLE 3: Improved Performance, Scaling, and Usability for a High-Performance Computing Library for Statistical Phylogenetics.

Bayesian phylogenetics GPU maximum likelihood multicore processing parallel computing

Journal

Systematic biology
ISSN: 1076-836X
Titre abrégé: Syst Biol
Pays: England
ID NLM: 9302532

Informations de publication

Date de publication:
01 11 2019
Historique:
received: 27 04 2018
revised: 10 04 2019
accepted: 10 04 2019
pubmed: 30 4 2019
medline: 10 1 2020
entrez: 30 4 2019
Statut: ppublish

Résumé

BEAGLE is a high-performance likelihood-calculation library for phylogenetic inference. The BEAGLE library defines a simple, but flexible, application programming interface (API), and includes a collection of efficient implementations for calculation under a variety of evolutionary models on different hardware devices. The library has been integrated into recent versions of popular phylogenetics software packages including BEAST and MrBayes and has been widely used across a diverse range of evolutionary studies. Here, we present BEAGLE 3 with new parallel implementations, increased performance for challenging data sets, improved scalability, and better usability. We have added new OpenCL and central processing unit-threaded implementations to the library, allowing the effective utilization of a wider range of modern hardware. Further, we have extended the API and library to support concurrent computation of independent partial likelihood arrays, for increased performance of nucleotide-model analyses with greater flexibility of data partitioning. For better scalability and usability, we have improved how phylogenetic software packages use BEAGLE in multi-GPU (graphics processing unit) and cluster environments, and introduced an automated method to select the fastest device given the data set, evolutionary model, and hardware. For application developers who wish to integrate the library, we also have developed an online tutorial. To evaluate the effect of the improvements, we ran a variety of benchmarks on state-of-the-art hardware. For a partitioned exemplar analysis, we observe run-time performance improvements as high as 5.9-fold over our previous GPU implementation. BEAGLE 3 is free, open-source software licensed under the Lesser GPL and available at https://beagle-dev.github.io.

Identifiants

pubmed: 31034053
pii: 5477405
doi: 10.1093/sysbio/syz020
pmc: PMC6802572
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

1052-1061

Subventions

Organisme : NIAID NIH HHS
ID : U19 AI135995
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG006139
Pays : United States
Organisme : NIAID NIH HHS
ID : R01 AI107034
Pays : United States
Organisme : Wellcome Trust
ID : 206298/Z/17/Z
Pays : United Kingdom

Informations de copyright

© The Author(s) 2019. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

Références

Virus Evol. 2018 Jun 08;4(1):vey016
pubmed: 29942656
J Mol Evol. 1985;22(2):160-74
pubmed: 3934395
J Mol Evol. 1981;17(6):368-76
pubmed: 7288891
Syst Biol. 2012 Jan;61(1):170-3
pubmed: 21963610
Syst Biol. 2015 Mar;64(2):356-62
pubmed: 25358969
Bioinformatics. 2017 Jun 15;33(12):1798-1805
pubmed: 28200071
Bioinformatics. 2009 Jun 1;25(11):1370-6
pubmed: 19369496
PLoS Comput Biol. 2014 Apr 10;10(4):e1003537
pubmed: 24722319
Bioinformatics. 2004 Feb 12;20(3):407-15
pubmed: 14960467
Syst Biol. 2012 May;61(3):539-42
pubmed: 22357727
Methods Mol Biol. 2019;1910:691-722
pubmed: 31278682
Trends Ecol Evol. 1996 Sep;11(9):367-72
pubmed: 21237881
Syst Biol. 2017 Mar 01;66(2):205-217
pubmed: 27576546
Syst Biol. 2010 May;59(3):307-21
pubmed: 20525638

Auteurs

Daniel L Ayres (DL)

Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA.

Michael P Cummings (MP)

Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA.

Guy Baele (G)

Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven - University of Leuven, 3000 Leuven, Belgium.

Aaron E Darling (AE)

The ithree Institute, University of Technology Sydney, Ultimo, New South Wales 2007, Australia.

Paul O Lewis (PO)

Department of Ecology and Evolutionary Biology, University of Connecticut, Unit 3043, Storrs, CT 06269, USA.

David L Swofford (DL)

Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA.

John P Huelsenbeck (JP)

Department of Integrative Biology, University of California, Berkeley, CA 94720 USA.

Philippe Lemey (P)

Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven - University of Leuven, 3000 Leuven, Belgium.

Andrew Rambaut (A)

Institute of Evolutionary Biology, University of Edinburgh, King's Buildings, Edinburgh EH9 3FL, UK.
Fogarty International Center, National Institutes of Health, Bethesda, MD 20892, USA.

Marc A Suchard (MA)

Department of Biomathematics University of California, Los Angeles, CA 90095, USA.
Department of Biostatistics, University of California, Los Angeles, CA 90095, USA.
Department of Human Genetics, University of California, Los Angeles, CA 90095, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Amaryllidaceae Alkaloids Lycoris NADPH-Ferrihemoprotein Reductase Gene Expression Regulation, Plant Plant Proteins

Classifications MeSH