RNAelem: an algorithm for discovering sequence-structure motifs in RNA bound by RNA-binding proteins.


Journal

Bioinformatics advances
ISSN: 2635-0041
Titre abrégé: Bioinform Adv
Pays: England
ID NLM: 9918282081306676

Informations de publication

Date de publication:
2024
Historique:
received: 25 04 2024
revised: 08 09 2024
accepted: 26 09 2024
medline: 14 10 2024
pubmed: 14 10 2024
entrez: 14 10 2024
Statut: epublish

Résumé

RNA-binding proteins (RBPs) play a crucial role in the post-transcriptional regulation of RNA. Given their importance, analyzing the specific RNA patterns recognized by RBPs has become a significant research focus in bioinformatics. Deep Neural Networks have enhanced the accuracy of prediction for RBP-binding sites, yet understanding the structural basis of RBP-binding specificity from these models is challenging due to their limited interpretability. To address this, we developed RNAelem, which combines profile context-free grammar and the Turner energy model for RNA secondary structure to predict sequence-structure motifs in RBP-binding regions. RNAelem exhibited superior detection accuracy compared to existing tools for RNA sequences with structural motifs. Upon applying RNAelem to the eCLIP database, we were not only able to reproduce many known primary sequence motifs in the absence of secondary structures, but also discovered many secondary structural motifs that contained sequence-nonspecific insertion regions. Furthermore, the high interpretability of RNAelem yielded insightful findings such as long-range base-pairing interactions in the binding region of the U2AF protein. The code is available at https://github.com/iyak/RNAelem.

Identifiants

pubmed: 39399375
doi: 10.1093/bioadv/vbae144
pii: vbae144
pmc: PMC11471262
doi:

Types de publication

Journal Article

Langues

eng

Pagination

vbae144

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press.

Déclaration de conflit d'intérêts

None declared.

Auteurs

Hiroshi Miyake (H)

Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Chiba 277-8561, Japan.

Risa Karakida Kawaguchi (RK)

Department of Life Science Frontiers, Center for iPS Cell Research and Application (CiRA), Kyoto University, Sakyo-ku 606-8507, Japan.

Hisanori Kiryu (H)

Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Chiba 277-8561, Japan.

Classifications MeSH