VIDHOP, viral host prediction with deep learning.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
20 04 2021
Historique:
received: 07 02 2020
revised: 17 07 2020
accepted: 03 08 2020
pubmed: 11 8 2020
medline: 29 4 2021
entrez: 11 8 2020
Statut: ppublish

Résumé

Zoonosis, the natural transmission of infections from animals to humans, is a far-reaching global problem. The recent outbreaks of Zikavirus, Ebolavirus and Coronavirus are examples of viral zoonosis, which occur more frequently due to globalization. In case of a virus outbreak, it is helpful to know which host organism was the original carrier of the virus to prevent further spreading of viral infection. Recent approaches aim to predict a viral host based on the viral genome, often in combination with the potential host genome and arbitrarily selected features. These methods are limited in the number of different hosts they can predict or the accuracy of the prediction. Here, we present a fast and accurate deep learning approach for viral host prediction, which is based on the viral genome sequence only. We tested our deep neural network (DNN) on three different virus species (influenza A virus, rabies lyssavirus and rotavirus A). We achieved for each virus species an AUC between 0.93 and 0.98, allowing highly accurate predictions while using only fractions (100-400 bp) of the viral genome sequences. We show that deep neural networks are suitable to predict the host of a virus, even with a limited amount of sequences and highly unbalanced available data. The trained DNNs are the core of our virus-host prediction tool VIrus Deep learning HOst Prediction (VIDHOP). VIDHOP also allows the user to train and use models for other viruses. VIDHOP is freely available under https://github.com/flomock/vidhop. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 32777818
pii: 5890682
doi: 10.1093/bioinformatics/btaa705
pmc: PMC7454304
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

318-325

Informations de copyright

© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Auteurs

Florian Mock (F)

RNA Bioinformatics/High Throughput Analysis, Faculty of Mathematics and Computer Science, Jena 07743, Germany.

Adrian Viehweger (A)

RNA Bioinformatics/High Throughput Analysis, Faculty of Mathematics and Computer Science, Jena 07743, Germany.

Emanuel Barth (E)

Bioinformatics Core Facility Jena, Friedrich Schiller University Jena, Jena 07743, Germany.

Manja Marz (M)

RNA Bioinformatics/High Throughput Analysis, Faculty of Mathematics and Computer Science, Jena 07743, Germany.
RNA Bioinformatics/High Throughput Analysis, Leibnitz Institute for Age Research - Fritz Lipmann Institute (FLI), Jena 07743, Germany.
RNA Bioinformatics/High Throughput Analysis, German Center for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig 04103, Germany.
RNA Bioinformatics/High Throughput Analysis, European Virus Bioinformatics Center (EVBC), Jena 07743, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH