A computational platform for high-throughput analysis of RNA sequences and modifications by mass spectrometry.
Base Sequence
/ genetics
Databases, Factual
/ statistics & numerical data
Datasets as Topic
Epigenomics
/ methods
High-Throughput Screening Assays
/ methods
Humans
Oligonucleotides
/ chemistry
RNA Processing, Post-Transcriptional
/ genetics
RNA, Transfer
/ chemistry
Reproducibility of Results
Search Engine
Tandem Mass Spectrometry
/ methods
Journal
Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555
Informations de publication
Date de publication:
17 02 2020
17 02 2020
Historique:
received:
11
02
2019
accepted:
22
01
2020
entrez:
19
2
2020
pubmed:
19
2
2020
medline:
6
5
2020
Statut:
epublish
Résumé
The field of epitranscriptomics continues to reveal how post-transcriptional modification of RNA affects a wide variety of biological phenomena. A pivotal challenge in this area is the identification of modified RNA residues within their sequence contexts. Mass spectrometry (MS) offers a comprehensive solution by using analogous approaches to shotgun proteomics. However, software support for the analysis of RNA MS data is inadequate at present and does not allow high-throughput processing. Existing software solutions lack the raw performance and statistical grounding to efficiently handle the numerous modifications found on RNA. We present a free and open-source database search engine for RNA MS data, called NucleicAcidSearchEngine (NASE), that addresses these shortcomings. We demonstrate the capability of NASE to reliably identify a wide range of modified RNA sequences in four original datasets of varying complexity. In human tRNA, we characterize over 20 different modification types simultaneously and find many cases of incomplete modification.
Identifiants
pubmed: 32066737
doi: 10.1038/s41467-020-14665-7
pii: 10.1038/s41467-020-14665-7
pmc: PMC7026122
doi:
Substances chimiques
Oligonucleotides
0
RNA, Transfer
9014-25-9
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Validation Study
Langues
eng
Sous-ensembles de citation
IM
Pagination
926Subventions
Organisme : NIEHS NIH HHS
ID : P30 ES013508
Pays : United States
Références
Borek, E. & Srinivasan, P. R. The methylation of nucleic acids. Annu. Rev. Biochem. 35, 275–298 (1966).
doi: 10.1146/annurev.bi.35.070166.001423
Davis, F. F. & Allen, F. W. Ribonucleic acids from yeast which contain a fifth nucleotide. J. Biol. Chem. 227, 907–915 (1957).
pubmed: 13463012
pmcid: 13463012
Bertero, A. et al. The SMAD2/3 interactome reveals that TGFβ controls m6A mRNA methylation in pluripotency. Nature 555, 256–259 (2018).
pubmed: 29489750
pmcid: 5951268
doi: 10.1038/nature25784
Kirino, Y. et al. Codon-specific translational defect caused by a wobble modification deficiency in mutant tRNA from a human mitochondrial disease. Proc. Natl Acad. Sci. USA 101, 15070–15075 (2004).
pubmed: 15477592
doi: 10.1073/pnas.0405173101
pmcid: 15477592
Jia, G., Fu, Y. & He, C. Reversible RNA adenosine methylation in biological regulation. Trends Genet. 29, 108–115 (2013).
pubmed: 23218460
doi: 10.1016/j.tig.2012.11.003
Weng, Y.-L. et al. Epitranscriptomic m 6 a regulation of axon regeneration in the adult mammalian nervous system. Neuron 97, 313–325.e6 (2018).
pubmed: 29346752
pmcid: 5777326
doi: 10.1016/j.neuron.2017.12.036
Blanco, S. et al. Aberrant methylation of tRNAs links cellular stress to neuro-developmental disorders. EMBO J. 33, 2020–2039 (2014).
pubmed: 25063673
pmcid: 4195770
doi: 10.15252/embj.201489282
Li, X., Xiong, X. & Yi, C. Epitranscriptome sequencing technologies: decoding RNA modifications. Nat. Methods 14, 23–31 (2016).
pubmed: 28032622
doi: 10.1038/nmeth.4110
Helm, M. & Motorin, Y. Detecting RNA modifications in the epitranscriptome: predict and validate. Nat. Rev. Genet. 18, 275–291 (2017).
pubmed: 28216634
doi: 10.1038/nrg.2016.169
Linder, B. et al. Single-nucleotide-resolution mapping of m6A and m6Am throughout the transcriptome. Nat. Methods 12, 767–772 (2015).
pubmed: 26121403
pmcid: 26121403
doi: 10.1038/nmeth.3453
Vilfan, I. D. et al. Analysis of RNA base modification and structural rearrangement by single-molecule real-time detection of reverse transcription. J. Nanobiotechnol. 11, 8 (2013).
doi: 10.1186/1477-3155-11-8
Smith, A. M., Jain, M., Mulroney, L., Garalde, D. R. & Akeson, M. Reading canonical and modified nucleobases in 16S ribosomal RNA using nanopore native RNA sequencing. PLoS ONE 14, e0216709 (2019).
pubmed: 31095620
pmcid: 6522004
doi: 10.1371/journal.pone.0216709
Rhoads, A. & Au, K. F. PacBio sequencing and its applications. Genomics Proteom. Bioinforma. 13, 278–289 (2015).
doi: 10.1016/j.gpb.2015.08.002
Dominissini, D. et al. The dynamic N1-methyladenosine methylome in eukaryotic messenger RNA. Nature 530, 441–446 (2016).
pubmed: 26863196
pmcid: 4842015
doi: 10.1038/nature16998
Safra, M. et al. The m1A landscape on cytosolic and mitochondrial mRNA at single-base resolution. Nature 551, 251–255 (2017).
pubmed: 29072297
doi: 10.1038/nature24456
Su, D. et al. Quantitative analysis of ribonucleoside modifications in tRNA by HPLC-coupled mass spectrometry. Nat. Protoc. 9, 828–841 (2014).
pubmed: 24625781
pmcid: 4313537
doi: 10.1038/nprot.2014.047
Yu, B. & Chen, X. Analysis of miRNA Modifications. Methods Mol. Biol. 592, 137–148 (2010).
pubmed: 19802594
pmcid: 5134323
doi: 10.1007/978-1-60327-005-2_10
Kullolli, M., Knouf, E., Arampatzidou, M., Tewari, M. & Pitteri, S. J. Intact MicroRNA ametry. J. Am. Soc. Mass. Spectrom. 25, 80–87 (2014).
pubmed: 24174127
doi: 10.1007/s13361-013-0759-x
Huber, C. G. & Oberacher, H. Analysis of nucleic acids by on-line liquid chromatography–mass spectrometry. Mass. Spectrom. Rev. 20, 310–343 (2001).
pubmed: 11948655
doi: 10.1002/mas.10011
Rozenski, J. & McCloskey, J. A. SOS: a simple interactive program for ab initio oligonucleotide sequencing by mass spectrometry. J. Am. Soc. Mass. Spectrom. 13, 200–203 (2002).
pubmed: 11908799
doi: 10.1016/S1044-0305(01)00354-3
Nakayama, H. et al. Ariadne: a database search engine for identification and chemical analysis of RNA using tandem mass spectrometry data. Nucleic Acids Res. 37, e47–e47 (2009).
pubmed: 19270066
pmcid: 2665244
doi: 10.1093/nar/gkp099
Nyakas, A., Blum, L. C., Stucki, S. R., Reymond, J.-L. & Schürch, S. OMA and OPA—software-supported mass spectra analysis of native and modified nucleic acids. J. Am. Soc. Mass. Spectrom. 24, 249–256 (2012).
pubmed: 23264149
doi: 10.1007/s13361-012-0529-1
Yu, N., Lobue, P. A., Cao, X. & Limbach, P. A. RNAModMapper: RNA modification mapping software for analysis of liquid chromatography tandem mass spectrometry data. Anal. Chem. 89, 10744–10752 (2017).
pubmed: 28942636
doi: 10.1021/acs.analchem.7b01780
Gillet, L. C., Leitner, A. & Aebersold, R. Mass spectrometry applied to bottom-up proteomics: entering the high-throughput era for hypothesis testing. Annu. Rev. Anal. Chem. 9, 449–472 (2016).
doi: 10.1146/annurev-anchem-071015-041535
Röst, H. L. et al. OpenMS: a flexible open-source software platform for mass spectrometry data analysis. Nat. Methods 13, 741–748 (2016).
pubmed: 27575624
doi: 10.1038/nmeth.3959
Kohlbacher, O. et al. TOPP—the OpenMS proteomics pipeline. Bioinforma 23, e191–e197 (2007).
doi: 10.1093/bioinformatics/btl299
Sturm, M. & Kohlbacher, O. TOPPView: an open-source viewer for mass spectrometry data. J. Proteome Res. 8, 3760–3763 (2009).
pubmed: 19425593
doi: 10.1021/pr900171m
Junker, J. et al. TOPPAS: a graphical workflow editor for the analysis of high-throughput proteomics data. J. Proteome Res. 11, 3914–3920 (2012).
pubmed: 22583024
doi: 10.1021/pr300187f
Mcluckey, S. A., Berkel, G. J. & Glish, G. L. Tandem mass spectrometry of small, multiply charged oligonucleotides. J. Am. Soc. Mass. Spectrom. 3, 60–70 (1992).
pubmed: 24242838
doi: 10.1016/1044-0305(92)85019-G
Lobue, P. A., Yu, N., Jora, M., Abernathy, S. & Limbach, P. A. Improved application of RNAModMapper – an RNA modification mapping software tool – for analysis of liquid chromatography tandem mass spectrometry (LC-MS/MS) data. Methods 156, 128–138 (2019).
pubmed: 30366097
doi: 10.1016/j.ymeth.2018.10.012
pmcid: 30366097
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
pubmed: 19029910
doi: 10.1038/nbt.1511
pmcid: 19029910
Taoka, M. et al. Landscape of the complete RNA chemical modifications in the human 80S ribosome. Nucleic Acids Res. 46, 9289–9298 (2018).
pubmed: 30202881
pmcid: 6182160
doi: 10.1093/nar/gky811
Pan, T. Modifications and functional genomics of human transfer RNA. Cell Res. 28, 395–404 (2018).
pubmed: 29463900
pmcid: 5939049
doi: 10.1038/s41422-018-0013-y
Machnicka, M. A., Olchowik, A., Grosjean, H. & Bujnicki, J. M. Distribution and frequencies of post-transcriptional modifications in tRNAs. RNA Biol. 11, 1619–1629 (2015).
pmcid: 4615829
doi: 10.4161/15476286.2014.992273
Jühling, F. et al. tRNAdb 2009: compilation of tRNA sequences and tRNA genes. Nucleic Acids Res. 37, D159–D162 (2009).
pubmed: 18957446
doi: 10.1093/nar/gkn772
pmcid: 18957446
de Crécy-Lagard, V. et al. Matching tRNA modifications in humans to their known and predicted enzymes. Nucleic Acids Res. 47, 2143–2159 (2019).
pubmed: 30698754
pmcid: 6412123
doi: 10.1093/nar/gkz011
Rapino, F. et al. Codon-specific translation reprogramming promotes resistance to targeted therapy. Nature 558, 605–609 (2018).
pubmed: 29925953
doi: 10.1038/s41586-018-0243-7
pmcid: 29925953
Roundtree, I. A., Evans, M. E., Pan, T. & He, C. Dynamic RNA modifications in gene expression regulation. Cell 169, 1187–1200 (2017).
pubmed: 28622506
pmcid: 5657247
doi: 10.1016/j.cell.2017.05.045
Reinhart, B. J. et al. The 21-nucleotide let-7 RNA regulates developmental timing in Caenorhabditis elegans. Nature 403, 901–906 (2000).
pubmed: 10706289
doi: 10.1038/35002607
Motorin, Y. & Grosjean, H. Multisite-specific tRNA:m5C-methyltransferase (Trm4) in yeast Saccharomyces cerevisiae: identification of the gene and substrate specificity of the enzyme. RNA 5, 1105–1118 (1999).
pubmed: 10445884
pmcid: 1369833
doi: 10.1017/S1355838299982201
Cai, W. M. et al. A platform for discovery and quantification of modified ribonucleosides in RNA: application to stress-induced reprogramming of trna modifications. Methods Enzymol. 560, 29–71 (2015).
pubmed: 26253965
pmcid: 4774897
doi: 10.1016/bs.mie.2015.03.004
Boccaletto, P. et al. MODOMICS: a database of RNA modification pathways. 2017 update. Nucleic Acids Res. 46, D303–D307 (2018).
pubmed: 29106616
doi: 10.1093/nar/gkx1030
Fenyö, D. & Beavis, R. C. A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes. Anal. Chem. 75, 768–774 (2003).
pubmed: 12622365
doi: 10.1021/ac0258709
Käll, L., Storey, J. D., MacCoss, M. J. & Noble, W. S. Assigning significance to peptides identified by tandem mass spectrometry using decoy databases. J. Proteome Res. 7, 29–34 (2008).
pubmed: 18067246
doi: 10.1021/pr700600n
Griss, J. et al. The mzTab data exchange format: communicating mass-spectrometry-based proteomics and metabolomics experimental results to a wider audience. Mol. Cell. Proteom. 13, 2765–2775 (2014).
doi: 10.1074/mcp.O113.036681
Weisser, H. & Choudhary, J. S. Targeted feature detection for data-dependent shotgun proteomics. J. Proteome Res. 16, 2964–2974 (2017).
pubmed: 28673088
pmcid: 5547443
doi: 10.1021/acs.jproteome.7b00248
Martens, L. et al. mzML—a community standard for mass spectrometry data. Mol. Cell. Proteom. 10, R110.000133 (2011).
doi: 10.1074/mcp.R110.000133
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
pubmed: 23051804
pmcid: 3471674
doi: 10.1038/nbt.2377
Taoka, M. et al. The complete chemical structure of Saccharomyces cerevisiae rRNA: partial pseudouridylation of U2345 in 25S rRNA by snoRNA snR9. Nucleic Acids Res. 44, 8951–8961 (2016).
pubmed: 27325748
pmcid: 5062969
doi: 10.1093/nar/gkw564
R Core Team. R: A language and environment for statistical computing. (R Foundation for Statistical Computing, 2018).
Vizcaíno, J. A. et al. 2016 update of the PRIDE database and its related tools. Nucleic Acids Res. 44, 11033 (2016).
pubmed: 27683222
pmcid: 5159556
doi: 10.1093/nar/gkw880