NanoMUD: Profiling of pseudouridine and N1-methylpseudouridine using Oxford Nanopore direct RNA sequencing.
Deep learning
Epi-transcriptome
N1-methylpseudouridine
Nanopore direct RNA sequencing
Pseudouridine
mRNA vaccines
Journal
International journal of biological macromolecules
ISSN: 1879-0003
Titre abrégé: Int J Biol Macromol
Pays: Netherlands
ID NLM: 7909578
Informations de publication
Date de publication:
15 May 2024
15 May 2024
Historique:
received:
27
02
2024
revised:
13
05
2024
accepted:
14
05
2024
medline:
18
5
2024
pubmed:
18
5
2024
entrez:
17
5
2024
Statut:
aheadofprint
Résumé
Nanopore direct RNA sequencing provided a promising solution for unraveling the landscapes of modifications on single RNA molecules. Here, we proposed NanoMUD, a computational framework for predicting the RNA pseudouridine modification (Ψ) and its methylated analog N1-methylpseudouridine (m1Ψ), which have critical application in mRNA vaccination, at single-base and single-molecule resolution from direct RNA sequencing data. Electric signal features were fed into a bidirectional LSTM neural network to achieve improved accuracy and predictive capabilities. Motif-specific models (NNUNN, N = A, C, U or G) were trained based on features extracted from designed dataset and achieved superior performance on molecule-level modification prediction (Ψ models: min AUC = 0.86, max AUC = 0.99; m1Ψ models: min AUC = 0.87, max AUC = 0.99). We then aggregated read-level predictions for site stoichiometry estimation. Given the observed sequence-dependent bias in model performance, we trained regression models based on the distribution of modification probabilities for sites with known stoichiometry. The distribution-based site stoichiometry estimation method allows unbiased comparison between different contexts. To demonstrate the feasibility of our work, three case studies on both in vitro and in vivo transcribed RNAs were presented. NanoMUD will make a powerful tool to facilitate the research on modified therapeutic IVT RNAs and provides useful insight to the landscape and stoichiometry of pseudouridine and N1-pseudouridine on in vivo transcribed RNA species.
Identifiants
pubmed: 38759861
pii: S0141-8130(24)03238-0
doi: 10.1016/j.ijbiomac.2024.132433
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
132433Informations de copyright
Copyright © 2024 Elsevier B.V. All rights reserved.
Déclaration de conflit d'intérêts
Declaration of competing interest The authors declare no competing interests.