RNAelem: an algorithm for discovering sequence-structure motifs in RNA bound by RNA-binding proteins.
Journal
Bioinformatics advances
ISSN: 2635-0041
Titre abrégé: Bioinform Adv
Pays: England
ID NLM: 9918282081306676
Informations de publication
Date de publication:
2024
2024
Historique:
received:
25
04
2024
revised:
08
09
2024
accepted:
26
09
2024
medline:
14
10
2024
pubmed:
14
10
2024
entrez:
14
10
2024
Statut:
epublish
Résumé
RNA-binding proteins (RBPs) play a crucial role in the post-transcriptional regulation of RNA. Given their importance, analyzing the specific RNA patterns recognized by RBPs has become a significant research focus in bioinformatics. Deep Neural Networks have enhanced the accuracy of prediction for RBP-binding sites, yet understanding the structural basis of RBP-binding specificity from these models is challenging due to their limited interpretability. To address this, we developed RNAelem, which combines profile context-free grammar and the Turner energy model for RNA secondary structure to predict sequence-structure motifs in RBP-binding regions. RNAelem exhibited superior detection accuracy compared to existing tools for RNA sequences with structural motifs. Upon applying RNAelem to the eCLIP database, we were not only able to reproduce many known primary sequence motifs in the absence of secondary structures, but also discovered many secondary structural motifs that contained sequence-nonspecific insertion regions. Furthermore, the high interpretability of RNAelem yielded insightful findings such as long-range base-pairing interactions in the binding region of the U2AF protein. The code is available at https://github.com/iyak/RNAelem.
Identifiants
pubmed: 39399375
doi: 10.1093/bioadv/vbae144
pii: vbae144
pmc: PMC11471262
doi:
Types de publication
Journal Article
Langues
eng
Pagination
vbae144Informations de copyright
© The Author(s) 2024. Published by Oxford University Press.
Déclaration de conflit d'intérêts
None declared.