DeepNOG: fast and accurate protein orthologous group assignment.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 Apr 2021
Historique:
received: 24 04 2020
revised: 02 12 2020
accepted: 10 12 2020
medline: 29 12 2020
pubmed: 29 12 2020
entrez: 28 12 2020
Statut: ppublish

Résumé

Protein orthologous group databases are powerful tools for evolutionary analysis, functional annotation or metabolic pathway modeling across lineages. Sequences are typically assigned to orthologous groups with alignment-based methods, such as profile hidden Markov models, which have become a computational bottleneck. We present DeepNOG, an extremely fast and accurate, alignment-free orthology assignment method based on deep convolutional networks. We compare DeepNOG against state-of-the-art alignment-based (HMMER, DIAMOND) and alignment-free methods (DeepFam) on two orthology databases (COG, eggNOG 5). DeepNOG can be scaled to large orthology databases like eggNOG, for which it outperforms DeepFam in terms of precision and recall by large margins. While alignment-based methods still provide the most accurate assignments among the investigated methods, computing time of DeepNOG is an order of magnitude lower on CPUs. Optional GPU usage further increases throughput massively. A command-line tool enables rapid adoption by users. Source code and packages are freely available at https://github.com/univieCUBE/deepnog. Install the platform-independent Python program with $pip install deepnog. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 33367584
pii: 6050698
doi: 10.1093/bioinformatics/btaa1051
pmc: PMC8016488
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

5304-5312

Subventions

Organisme : Austrian Science Fund
ID : P27703
Organisme : GPU
Organisme : Nvidia corporation

Informations de copyright

© The Author(s) 2020. Published by Oxford University Press.

Auteurs

Roman Feldbauer (R)

Department of Microbiology and Ecosystem Science, University of Vienna, Vienna 1090, Austria.

Lukas Gosch (L)

Department of Microbiology and Ecosystem Science, University of Vienna, Vienna 1090, Austria.

Lukas Lüftinger (L)

Department of Microbiology and Ecosystem Science, University of Vienna, Vienna 1090, Austria.
Ares Genetics GmbH, Vienna 1030, Austria.

Patrick Hyden (P)

Department of Microbiology and Ecosystem Science, University of Vienna, Vienna 1090, Austria.

Arthur Flexer (A)

Institute of Computational Perception, Johannes Kepler University Linz, Linz 4040, Austria.

Thomas Rattei (T)

Department of Microbiology and Ecosystem Science, University of Vienna, Vienna 1090, Austria.

Classifications MeSH