SHAPE-guided RNA structure homology search and motif discovery.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
31 03 2022
Historique:
received: 10 12 2021
accepted: 11 03 2022
entrez: 1 4 2022
pubmed: 2 4 2022
medline: 5 4 2022
Statut: epublish

Résumé

The rapidly growing popularity of RNA structure probing methods is leading to increasingly large amounts of available RNA structure information. This demands the development of efficient tools for the identification of RNAs sharing regions of structural similarity by direct comparison of their reactivity profiles, hence enabling the discovery of conserved structural features. We here introduce SHAPEwarp, a largely sequence-agnostic SHAPE-guided algorithm for the identification of structurally-similar regions in RNA molecules. Analysis of Dengue, Zika and coronavirus genomes recapitulates known regulatory RNA structures and identifies novel highly-conserved structural elements. This work represents a preliminary step towards the model-free search and identification of shared and conserved RNA structural features within transcriptomes.

Identifiants

pubmed: 35361788
doi: 10.1038/s41467-022-29398-y
pii: 10.1038/s41467-022-29398-y
pmc: PMC8971488
doi:

Substances chimiques

RNA, Guide 0
RNA 63231-63-0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1722

Informations de copyright

© 2022. The Author(s).

Références

Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinform. Oxf. Engl. 29, 2933–2935 (2013).
doi: 10.1093/bioinformatics/btt509
Eddy, S. R. & Durbin, R. RNA sequence analysis using covariance models. Nucleic Acids Res. 22, 2079–2088 (1994).
pubmed: 8029015 pmcid: 308124 doi: 10.1093/nar/22.11.2079
Strobel, E. J., Yu, A. M. & Lucks, J. B. High-throughput determination of RNA structures. Nat. Rev. Genet. 19, 615–634 (2018).
pubmed: 30054568 pmcid: 7388734 doi: 10.1038/s41576-018-0034-x
Incarnato, D. & Oliviero, S. The RNA epistructurome: uncovering RNA function by studying structure and post-transcriptional modifications. Trends Biotechnol. 35, 318–333 (2017).
pubmed: 27988057 doi: 10.1016/j.tibtech.2016.11.002
Wells, S. E., Hughes, J. M., Igel, A. H. & Ares, M. Use of dimethyl sulfate to probe RNA structure in vivo. Methods Enzymol. 318, 479–493 (2000).
pubmed: 10890007 doi: 10.1016/S0076-6879(00)18071-1
Spitale, R. C. et al. RNA SHAPE analysis in living cells. Nat. Chem. Biol. 9, 18–20 (2013).
pubmed: 23178934 doi: 10.1038/nchembio.1131
Marinus, T., Fessler, A. B., Ogle, C. A. & Incarnato, D. A novel SHAPE reagent enables the analysis of RNA structure in living cells with unprecedented accuracy. Nucleic Acids Res. 49, e34 (2021).
pubmed: 33398343 pmcid: 8034653 doi: 10.1093/nar/gkaa1255
Busan, S., Weidmann, C. A., Sengupta, A. & Weeks, K. M. Guidelines for SHAPE reagent choice and detection strategy for RNA structure probing studies. Biochemistry 58, 2655–2664 (2019).
pubmed: 31117385 doi: 10.1021/acs.biochem.8b01218
Turner, D. H. & Mathews, D. H. NNDB: the nearest neighbor parameter database for predicting stability of nucleic acid secondary structure. Nucleic Acids Res. 38, D280–D282 (2010).
pubmed: 19880381 doi: 10.1093/nar/gkp892
Deigan, K. E., Li, T. W., Mathews, D. H. & Weeks, K. M. Accurate SHAPE-directed RNA structure determination. Proc. Natl Acad. Sci. 106, 97–102 (2009).
pubmed: 19109441 doi: 10.1073/pnas.0806929106
Zarringhalam, K., Meyer, M. M., Dotu, I., Chuang, J. H. & Clote, P. Integrating chemical footprinting data into RNA secondary structure prediction. PLoS ONE 7, e45160 (2012).
pubmed: 23091593 pmcid: 3473038 doi: 10.1371/journal.pone.0045160
Washietl, S., Hofacker, I. L., Stadler, P. F. & Kellis, M. RNA folding with soft constraints: reconciliation of probing data and thermodynamic secondary structure prediction. Nucleic Acids Res. 40, 4261–4272 (2012).
pubmed: 22287623 pmcid: 3378861 doi: 10.1093/nar/gks009
Eddy, S. R. Computational analysis of conserved RNA secondary structure in transcriptomes and genomes. Biophysics 43, 433–456 (2014).
Ouyang, Z., Snyder, M. P. & Chang, H. Y. SeqFold: genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data. Genome Res. 23, 377–387 (2013).
pubmed: 23064747 pmcid: 3561878 doi: 10.1101/gr.138545.112
Warner, K. D., Hajdin, C. E. & Weeks, K. M. Principles for targeting RNA with drug-like small molecules. Nat. Rev. Drug Discov. 17, 547–558 (2018).
pubmed: 29977051 pmcid: 6420209 doi: 10.1038/nrd.2018.93
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
pubmed: 2231712 doi: 10.1016/S0022-2836(05)80360-2
Yeh, C.-C. M. et al. Matrix profile I: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. 2016 IEEE 16th Int Conf Data Min Icdm 1317–1322 https://doi.org/10.1109/icdm.2016.0179 (2016).
Wittebolle, L. et al. Initial community evenness favours functionality under selective stress. Nature 458, 623–626 (2009).
pubmed: 19270679 doi: 10.1038/nature07840
Rouskin, S., Zubradt, M., Washietl, S., Kellis, M. & Weissman, J. S. Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo. Nature 505, 701–705 (2014).
pubmed: 24336214 doi: 10.1038/nature12894
Gotoh, O. An improved algorithm for matching biological sequences. J. Mol. Biol. 162, 705–708 (1982).
pubmed: 7166760 doi: 10.1016/0022-2836(82)90398-9
Pearson, W. R. Empirical statistical estimates for sequence similarity searches. J. Mol. Biol. 276, 71–84 (1998).
pubmed: 9514730 doi: 10.1006/jmbi.1997.1525
Bernhart, S. H., Hofacker, I. L., Will, S., Gruber, A. R. & Stadler, P. F. RNAalifold: improved consensus structure prediction for RNA alignments. Bmc Bioinform. 9, 474–474 (2008).
doi: 10.1186/1471-2105-9-474
Huber, R. G. et al. Structure mapping of dengue and Zika viruses reveals functional long-range interactions. Nat. Commun. 10, 1408 (2019).
pubmed: 30926818 pmcid: 6441010 doi: 10.1038/s41467-019-09391-8
Manfredonia, I. et al. Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements. Nucleic Acids Res. 48, 12436–12452 (2020).
pubmed: 33166999 pmcid: 7736786 doi: 10.1093/nar/gkaa1053
Morandi, E. et al. Genome-scale deconvolution of RNA structure ensembles. Nat. Methods 18, 249–252 (2021).
pubmed: 33619392 doi: 10.1038/s41592-021-01075-w
Liu, Z. & Qin, C. Structure and function of cis‐acting RNA elements of flavivirus. Rev. Med Virol. 30, e2092 (2020).
pubmed: 31777997 doi: 10.1002/rmv.2092
Liu, Z.-Y. et al. Novel cis-acting element within the capsid-coding region enhances flavivirus viral-RNA replication by regulating genome cyclization. J. Virol. 87, 6804–6818 (2013).
pubmed: 23576500 pmcid: 3676100 doi: 10.1128/JVI.00243-13
Kalvari, I. et al. Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families. Nucleic Acids Res. 46, D335–D342 (2018).
pubmed: 29112718 doi: 10.1093/nar/gkx1038
Li, P. et al. Integrative analysis of Zika virus genome RNA structure reveals critical determinants of viral infectivity. Cell Host Microbe 24, 875–886.e5 (2018).
pubmed: 30472207 doi: 10.1016/j.chom.2018.10.011
Rivas, E., Clements, J. & Eddy, S. R. A statistical test for conserved RNA structure shows lack of evidence for structure in lncRNAs. Nat. Methods 14, 45–48 (2017).
pubmed: 27819659 doi: 10.1038/nmeth.4066
Ziv, O. et al. COMRADES determines in vivo RNA structures and interactions. Nat. Methods 15, 785–788 (2018).
pubmed: 30202058 pmcid: 6168409 doi: 10.1038/s41592-018-0121-0
Ziv, O. et al. The short- and long-range RNA-RNA Interactome of SARS-CoV-2. Mol. Cell 80, 1067–1077.e5 (2020).
pubmed: 33259809 pmcid: 7643667 doi: 10.1016/j.molcel.2020.11.004
Zubradt, M. et al. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo. Nat. Methods 14, 75–82 (2017).
pubmed: 27819661 doi: 10.1038/nmeth.4057
Incarnato, D., Morandi, E., Simon, L. M. & Oliviero, S. RNA Framework: an all-in-one toolkit for the analysis of RNA structures and post-transcriptional modifications. Nucleic Acids Res. 46, e97–e97 (2018).
pubmed: 29893890 pmcid: 6144828 doi: 10.1093/nar/gky486
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 17, 10–12 (2011).
doi: 10.14806/ej.17.1.200
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
pubmed: 22388286 pmcid: 3322381 doi: 10.1038/nmeth.1923
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943 pmcid: 2723002 doi: 10.1093/bioinformatics/btp352
Lorenz, R. et al. ViennaRNA package 2.0. Algorithms Mol. Biol. 6, 26 (2011).
pubmed: 22115189 pmcid: 3319429 doi: 10.1186/1748-7188-6-26
Hemert, M. J.van et al. SARS-coronavirus replication/transcription complexes are membrane-protected and need a host factor for activity in vitro. Plos Pathog. 4, e1000054 (2008).
pubmed: 18451981 pmcid: 2322833 doi: 10.1371/journal.ppat.1000054
Simon, L. M. et al. In vivo analysis of influenza A mRNA secondary structures identifies critical regulatory motifs. Nucleic Acids Res. 47, 7003–7017 (2019).
pubmed: 31053845 pmcid: 6648356 doi: 10.1093/nar/gkz318
Pickett, B. E. et al. ViPR: an open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 40, D593–D598 (2011).
pubmed: 22006842 pmcid: 3245011 doi: 10.1093/nar/gkr859
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
pubmed: 23104886 doi: 10.1093/bioinformatics/bts635
Weinberg, Z. & Breaker, R. R. R2R—software to speed the depiction of aesthetic consensus RNA secondary structures. Bmc Bioinform. 12, 3–3 (2011).
doi: 10.1186/1471-2105-12-3

Auteurs

Edoardo Morandi (E)

Department of Molecular Genetics, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Groningen, The Netherlands.

Martijn J van Hemert (MJ)

Department of Medical Microbiology, Molecular Virology Laboratory, Leiden University Medical Center, Leiden, The Netherlands.

Danny Incarnato (D)

Department of Molecular Genetics, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Groningen, The Netherlands. d.incarnato@rug.nl.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH