RBProkCNN: Deep learning on appropriate contextual evolutionary information for RNA binding protein discovery in prokaryotes.
Computational biology
Evolutionary feature
Machine learning
Prediction model
RNA-binding proteins
Journal
Computational and structural biotechnology journal
ISSN: 2001-0370
Titre abrégé: Comput Struct Biotechnol J
Pays: Netherlands
ID NLM: 101585369
Informations de publication
Date de publication:
Dec 2024
Dec 2024
Historique:
received:
16
02
2024
revised:
12
04
2024
accepted:
12
04
2024
medline:
25
4
2024
pubmed:
25
4
2024
entrez:
25
4
2024
Statut:
epublish
Résumé
RNA-binding proteins (RBPs) are central to key functions such as post-transcriptional regulation, mRNA stability, and adaptation to varied environmental conditions in prokaryotes. While the majority of research has concentrated on eukaryotic RBPs, recent developments underscore the crucial involvement of prokaryotic RBPs. Although computational methods have emerged in recent years to identify RBPs, they have fallen short in accurately identifying prokaryotic RBPs due to their generic nature. To bridge this gap, we introduce RBProkCNN, a novel machine learning-driven computational model meticulously designed for the accurate prediction of prokaryotic RBPs. The prediction process involves the utilization of eight shallow learning algorithms and four deep learning models, incorporating PSSM-based evolutionary features. By leveraging a convolutional neural network (CNN) and evolutionarily significant features selected through extreme gradient boosting variable importance measure, RBProkCNN achieved the highest accuracy in five-fold cross-validation, yielding 98.04% auROC and 98.19% auPRC. Furthermore, RBProkCNN demonstrated robust performance with an independent dataset, showcasing a commendable 95.77% auROC and 95.78% auPRC. Noteworthy is its superior predictive accuracy when compared to several state-of-the-art existing models. RBProkCNN is available as an online prediction tool (https://iasri-sg.icar.gov.in/rbprokcnn/), offering free access to interested users. This tool represents a substantial contribution, enriching the array of resources available for the accurate and efficient prediction of prokaryotic RBPs.
Identifiants
pubmed: 38660008
doi: 10.1016/j.csbj.2024.04.034
pii: S2001-0370(24)00117-X
pmc: PMC11039349
doi:
Types de publication
Journal Article
Langues
eng
Pagination
1631-1640Informations de copyright
© 2024 The Authors.
Déclaration de conflit d'intérêts
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.