Pred-BVP-Unb: Fast prediction of bacteriophage Virion proteins using un-biased multi-perspective properties with recursive feature elimination.

Bacteriophage virion proteins Bi-profile evolutionary information Recursive feature elimination Split amino acid composition Support vector machine

Journal

Genomics
ISSN: 1089-8646
Titre abrégé: Genomics
Pays: United States
ID NLM: 8800135

Informations de publication

Date de publication:
03 2020
Historique:
received: 19 07 2019
revised: 27 08 2019
accepted: 11 09 2019
pubmed: 19 9 2019
medline: 5 1 2021
entrez: 19 9 2019
Statut: ppublish

Résumé

Bacteriophage virion proteins (BVPs) are bacterial viruses that have a great impact on different biological functions of bacteria. They are significantly used in genetic engineering and phage therapy applications. Correct identification of BVP through conventional pathogen methods are slow and expensive. Thus, designing a Bioinformatics predictor is urgently desirable to accelerate correct identification of BVPs within a huge volume of proteins. However, available prediction tools performance is inadequate due to the lack of useful feature representation and severe imbalance issue. In the present study, we propose an intelligent model, called Pred-BVP-Unb for discrimination of BVPs that employed three nominal sequences-driven descriptors, i.e. Bi-PSSM evolutionary information, composition & translation, and split amino acid composition. The imbalance phenomena between classes were coped with the help of a synthetic minority oversampling technique. The essential attributes are selected by a robust algorithm called recursive feature elimination. Finally, the optimal feature space is provided to support vector machine classifier using a radial base kernel in order to train the model. Our predictor remarkably outperforms than existing approaches in the literature by achieving the highest accuracy of 92.54% and 83.06% respectively on the benchmark and independent datasets. We expect that Pred-BVP-Unb tool can provide useful hints for designing antibacterial drugs and also helpful to expedite large scale discovery of new bacteriophage virion proteins. The source code and all datasets are publicly available at https://github.com/Muhammad-Arif-NUST/BVP_Pred_Unb.

Identifiants

pubmed: 31526842
pii: S0888-7543(19)30467-7
doi: 10.1016/j.ygeno.2019.09.006
pii:
doi:

Substances chimiques

Viral Structural Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1565-1574

Informations de copyright

Copyright © 2019 Elsevier Inc. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare no conflict of interest.

Auteurs

Muhammad Arif (M)

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China; Department of Computer Science, Abdul Wali Khan University Mardan, KP, Pakistan. Electronic address: mdarif@njust.edu.cn.

Farman Ali (F)

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China. Electronic address: farmanali@njust.edu.cn.

Saeed Ahmad (S)

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Muhammad Kabir (M)

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Zakir Ali (Z)

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Maqsood Hayat (M)

Department of Computer Science, Abdul Wali Khan University Mardan, KP, Pakistan. Electronic address: m.hayat@awkum.edu.pk.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Genome, Viral Ralstonia Composting Solanum lycopersicum Bacteriophages
Cephalometry Humans Anatomic Landmarks Software Internet

Classifications MeSH