StackCBPred: A stacking based prediction of protein-carbohydrate binding sites from sequence.

Binding prediction Machine learning Protein-carbohydrate binding Stacking

Journal

Carbohydrate research
ISSN: 1873-426X
Titre abrégé: Carbohydr Res
Pays: Netherlands
ID NLM: 0043535

Informations de publication

Date de publication:
01 Dec 2019
Historique:
received: 25 06 2019
revised: 05 10 2019
accepted: 23 10 2019
pubmed: 5 11 2019
medline: 24 3 2020
entrez: 5 11 2019
Statut: ppublish

Résumé

Carbohydrate-binding proteins play vital roles in many important biological processes. The study of these protein-carbohydrate interactions, at residue level, is useful in treating many critical diseases. Analyzing the local sequential environments of the binding and non-binding regions to predict the protein-carbohydrate binding sites is one of the challenging problems in molecular and computational biology. Existing experimental methods for identifying protein-carbohydrate binding sites are laborious and expensive. Thus, prediction of such binding sites, directly from sequences, using computational methods, can be useful to fast annotate the binding sites and guide the experimental process. Because the number of carbohydrate-binding residues is significantly lower than the number of non-carbohydrate-binding residues, most of the methods developed for the prediction of protein-carbohydrate binding sites are biased towards over predicting the negative class (or non-carbohydrate-binding). Here, we propose a balanced predictor, called StackCBPred, which utilizes features, extracted from evolution-driven sequence profile, called the position-specific scoring matrix (PSSM) and several predicted structural properties of amino acids to effectively train a Stacking-based machine learning method for the accurate prediction of protein-carbohydrate binding sites (https://bmll.cs.uno.edu/).

Identifiants

pubmed: 31683069
pii: S0008-6215(19)30390-8
doi: 10.1016/j.carres.2019.107857
pii:
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

107857

Informations de copyright

Published by Elsevier Ltd.

Auteurs

Suraj Gattani (S)

Department of Computer Science, University of New Orleans, 2000 Lakeshore Dr, New Orleans, LA, 70148, USA. Electronic address: sggattan@uno.edu.

Avdesh Mishra (A)

Department of Electrical Engineering and Computer Science, Texas A&M University-Kingsville, 700 University Blvd, Kingsville, TX, 78363, USA. Electronic address: avdesh.mishra@tamuk.edu.

Md Tamjidul Hoque (MT)

Department of Computer Science, University of New Orleans, 2000 Lakeshore Dr, New Orleans, LA, 70148, USA. Electronic address: thoque@uno.edu.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
alpha-Synuclein Humans Animals Mice Lewy Body Disease
Adenosine Triphosphate Adenosine Diphosphate Mitochondrial ADP, ATP Translocases Binding Sites Mitochondria

Conservation of the cooling agent binding pocket within the TRPM subfamily.

Kate Huffer, Matthew C S Denley, Elisabeth V Oskoui et al.
1.00
TRPM Cation Channels Animals Binding Sites Mice Pyrimidinones

Classifications MeSH