A computational platform for high-throughput analysis of RNA sequences and modifications by mass spectrometry.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
17 02 2020
Historique:
received: 11 02 2019
accepted: 22 01 2020
entrez: 19 2 2020
pubmed: 19 2 2020
medline: 6 5 2020
Statut: epublish

Résumé

The field of epitranscriptomics continues to reveal how post-transcriptional modification of RNA affects a wide variety of biological phenomena. A pivotal challenge in this area is the identification of modified RNA residues within their sequence contexts. Mass spectrometry (MS) offers a comprehensive solution by using analogous approaches to shotgun proteomics. However, software support for the analysis of RNA MS data is inadequate at present and does not allow high-throughput processing. Existing software solutions lack the raw performance and statistical grounding to efficiently handle the numerous modifications found on RNA. We present a free and open-source database search engine for RNA MS data, called NucleicAcidSearchEngine (NASE), that addresses these shortcomings. We demonstrate the capability of NASE to reliably identify a wide range of modified RNA sequences in four original datasets of varying complexity. In human tRNA, we characterize over 20 different modification types simultaneously and find many cases of incomplete modification.

Identifiants

pubmed: 32066737
doi: 10.1038/s41467-020-14665-7
pii: 10.1038/s41467-020-14665-7
pmc: PMC7026122
doi:

Substances chimiques

Oligonucleotides 0
RNA, Transfer 9014-25-9

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't Validation Study

Langues

eng

Sous-ensembles de citation

IM

Pagination

926

Subventions

Organisme : NIEHS NIH HHS
ID : P30 ES013508
Pays : United States

Références

Borek, E. & Srinivasan, P. R. The methylation of nucleic acids. Annu. Rev. Biochem. 35, 275–298 (1966).
doi: 10.1146/annurev.bi.35.070166.001423
Davis, F. F. & Allen, F. W. Ribonucleic acids from yeast which contain a fifth nucleotide. J. Biol. Chem. 227, 907–915 (1957).
pubmed: 13463012 pmcid: 13463012
Bertero, A. et al. The SMAD2/3 interactome reveals that TGFβ controls m6A mRNA methylation in pluripotency. Nature 555, 256–259 (2018).
pubmed: 29489750 pmcid: 5951268 doi: 10.1038/nature25784
Kirino, Y. et al. Codon-specific translational defect caused by a wobble modification deficiency in mutant tRNA from a human mitochondrial disease. Proc. Natl Acad. Sci. USA 101, 15070–15075 (2004).
pubmed: 15477592 doi: 10.1073/pnas.0405173101 pmcid: 15477592
Jia, G., Fu, Y. & He, C. Reversible RNA adenosine methylation in biological regulation. Trends Genet. 29, 108–115 (2013).
pubmed: 23218460 doi: 10.1016/j.tig.2012.11.003
Weng, Y.-L. et al. Epitranscriptomic m 6 a regulation of axon regeneration in the adult mammalian nervous system. Neuron 97, 313–325.e6 (2018).
pubmed: 29346752 pmcid: 5777326 doi: 10.1016/j.neuron.2017.12.036
Blanco, S. et al. Aberrant methylation of tRNAs links cellular stress to neuro-developmental disorders. EMBO J. 33, 2020–2039 (2014).
pubmed: 25063673 pmcid: 4195770 doi: 10.15252/embj.201489282
Li, X., Xiong, X. & Yi, C. Epitranscriptome sequencing technologies: decoding RNA modifications. Nat. Methods 14, 23–31 (2016).
pubmed: 28032622 doi: 10.1038/nmeth.4110
Helm, M. & Motorin, Y. Detecting RNA modifications in the epitranscriptome: predict and validate. Nat. Rev. Genet. 18, 275–291 (2017).
pubmed: 28216634 doi: 10.1038/nrg.2016.169
Linder, B. et al. Single-nucleotide-resolution mapping of m6A and m6Am throughout the transcriptome. Nat. Methods 12, 767–772 (2015).
pubmed: 26121403 pmcid: 26121403 doi: 10.1038/nmeth.3453
Vilfan, I. D. et al. Analysis of RNA base modification and structural rearrangement by single-molecule real-time detection of reverse transcription. J. Nanobiotechnol. 11, 8 (2013).
doi: 10.1186/1477-3155-11-8
Smith, A. M., Jain, M., Mulroney, L., Garalde, D. R. & Akeson, M. Reading canonical and modified nucleobases in 16S ribosomal RNA using nanopore native RNA sequencing. PLoS ONE 14, e0216709 (2019).
pubmed: 31095620 pmcid: 6522004 doi: 10.1371/journal.pone.0216709
Rhoads, A. & Au, K. F. PacBio sequencing and its applications. Genomics Proteom. Bioinforma. 13, 278–289 (2015).
doi: 10.1016/j.gpb.2015.08.002
Dominissini, D. et al. The dynamic N1-methyladenosine methylome in eukaryotic messenger RNA. Nature 530, 441–446 (2016).
pubmed: 26863196 pmcid: 4842015 doi: 10.1038/nature16998
Safra, M. et al. The m1A landscape on cytosolic and mitochondrial mRNA at single-base resolution. Nature 551, 251–255 (2017).
pubmed: 29072297 doi: 10.1038/nature24456
Su, D. et al. Quantitative analysis of ribonucleoside modifications in tRNA by HPLC-coupled mass spectrometry. Nat. Protoc. 9, 828–841 (2014).
pubmed: 24625781 pmcid: 4313537 doi: 10.1038/nprot.2014.047
Yu, B. & Chen, X. Analysis of miRNA Modifications. Methods Mol. Biol. 592, 137–148 (2010).
pubmed: 19802594 pmcid: 5134323 doi: 10.1007/978-1-60327-005-2_10
Kullolli, M., Knouf, E., Arampatzidou, M., Tewari, M. & Pitteri, S. J. Intact MicroRNA ametry. J. Am. Soc. Mass. Spectrom. 25, 80–87 (2014).
pubmed: 24174127 doi: 10.1007/s13361-013-0759-x
Huber, C. G. & Oberacher, H. Analysis of nucleic acids by on-line liquid chromatography–mass spectrometry. Mass. Spectrom. Rev. 20, 310–343 (2001).
pubmed: 11948655 doi: 10.1002/mas.10011
Rozenski, J. & McCloskey, J. A. SOS: a simple interactive program for ab initio oligonucleotide sequencing by mass spectrometry. J. Am. Soc. Mass. Spectrom. 13, 200–203 (2002).
pubmed: 11908799 doi: 10.1016/S1044-0305(01)00354-3
Nakayama, H. et al. Ariadne: a database search engine for identification and chemical analysis of RNA using tandem mass spectrometry data. Nucleic Acids Res. 37, e47–e47 (2009).
pubmed: 19270066 pmcid: 2665244 doi: 10.1093/nar/gkp099
Nyakas, A., Blum, L. C., Stucki, S. R., Reymond, J.-L. & Schürch, S. OMA and OPA—software-supported mass spectra analysis of native and modified nucleic acids. J. Am. Soc. Mass. Spectrom. 24, 249–256 (2012).
pubmed: 23264149 doi: 10.1007/s13361-012-0529-1
Yu, N., Lobue, P. A., Cao, X. & Limbach, P. A. RNAModMapper: RNA modification mapping software for analysis of liquid chromatography tandem mass spectrometry data. Anal. Chem. 89, 10744–10752 (2017).
pubmed: 28942636 doi: 10.1021/acs.analchem.7b01780
Gillet, L. C., Leitner, A. & Aebersold, R. Mass spectrometry applied to bottom-up proteomics: entering the high-throughput era for hypothesis testing. Annu. Rev. Anal. Chem. 9, 449–472 (2016).
doi: 10.1146/annurev-anchem-071015-041535
Röst, H. L. et al. OpenMS: a flexible open-source software platform for mass spectrometry data analysis. Nat. Methods 13, 741–748 (2016).
pubmed: 27575624 doi: 10.1038/nmeth.3959
Kohlbacher, O. et al. TOPP—the OpenMS proteomics pipeline. Bioinforma 23, e191–e197 (2007).
doi: 10.1093/bioinformatics/btl299
Sturm, M. & Kohlbacher, O. TOPPView: an open-source viewer for mass spectrometry data. J. Proteome Res. 8, 3760–3763 (2009).
pubmed: 19425593 doi: 10.1021/pr900171m
Junker, J. et al. TOPPAS: a graphical workflow editor for the analysis of high-throughput proteomics data. J. Proteome Res. 11, 3914–3920 (2012).
pubmed: 22583024 doi: 10.1021/pr300187f
Mcluckey, S. A., Berkel, G. J. & Glish, G. L. Tandem mass spectrometry of small, multiply charged oligonucleotides. J. Am. Soc. Mass. Spectrom. 3, 60–70 (1992).
pubmed: 24242838 doi: 10.1016/1044-0305(92)85019-G
Lobue, P. A., Yu, N., Jora, M., Abernathy, S. & Limbach, P. A. Improved application of RNAModMapper – an RNA modification mapping software tool – for analysis of liquid chromatography tandem mass spectrometry (LC-MS/MS) data. Methods 156, 128–138 (2019).
pubmed: 30366097 doi: 10.1016/j.ymeth.2018.10.012 pmcid: 30366097
Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).
pubmed: 19029910 doi: 10.1038/nbt.1511 pmcid: 19029910
Taoka, M. et al. Landscape of the complete RNA chemical modifications in the human 80S ribosome. Nucleic Acids Res. 46, 9289–9298 (2018).
pubmed: 30202881 pmcid: 6182160 doi: 10.1093/nar/gky811
Pan, T. Modifications and functional genomics of human transfer RNA. Cell Res. 28, 395–404 (2018).
pubmed: 29463900 pmcid: 5939049 doi: 10.1038/s41422-018-0013-y
Machnicka, M. A., Olchowik, A., Grosjean, H. & Bujnicki, J. M. Distribution and frequencies of post-transcriptional modifications in tRNAs. RNA Biol. 11, 1619–1629 (2015).
pmcid: 4615829 doi: 10.4161/15476286.2014.992273
Jühling, F. et al. tRNAdb 2009: compilation of tRNA sequences and tRNA genes. Nucleic Acids Res. 37, D159–D162 (2009).
pubmed: 18957446 doi: 10.1093/nar/gkn772 pmcid: 18957446
de Crécy-Lagard, V. et al. Matching tRNA modifications in humans to their known and predicted enzymes. Nucleic Acids Res. 47, 2143–2159 (2019).
pubmed: 30698754 pmcid: 6412123 doi: 10.1093/nar/gkz011
Rapino, F. et al. Codon-specific translation reprogramming promotes resistance to targeted therapy. Nature 558, 605–609 (2018).
pubmed: 29925953 doi: 10.1038/s41586-018-0243-7 pmcid: 29925953
Roundtree, I. A., Evans, M. E., Pan, T. & He, C. Dynamic RNA modifications in gene expression regulation. Cell 169, 1187–1200 (2017).
pubmed: 28622506 pmcid: 5657247 doi: 10.1016/j.cell.2017.05.045
Reinhart, B. J. et al. The 21-nucleotide let-7 RNA regulates developmental timing in Caenorhabditis elegans. Nature 403, 901–906 (2000).
pubmed: 10706289 doi: 10.1038/35002607
Motorin, Y. & Grosjean, H. Multisite-specific tRNA:m5C-methyltransferase (Trm4) in yeast Saccharomyces cerevisiae: identification of the gene and substrate specificity of the enzyme. RNA 5, 1105–1118 (1999).
pubmed: 10445884 pmcid: 1369833 doi: 10.1017/S1355838299982201
Cai, W. M. et al. A platform for discovery and quantification of modified ribonucleosides in RNA: application to stress-induced reprogramming of trna modifications. Methods Enzymol. 560, 29–71 (2015).
pubmed: 26253965 pmcid: 4774897 doi: 10.1016/bs.mie.2015.03.004
Boccaletto, P. et al. MODOMICS: a database of RNA modification pathways. 2017 update. Nucleic Acids Res. 46, D303–D307 (2018).
pubmed: 29106616 doi: 10.1093/nar/gkx1030
Fenyö, D. & Beavis, R. C. A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes. Anal. Chem. 75, 768–774 (2003).
pubmed: 12622365 doi: 10.1021/ac0258709
Käll, L., Storey, J. D., MacCoss, M. J. & Noble, W. S. Assigning significance to peptides identified by tandem mass spectrometry using decoy databases. J. Proteome Res. 7, 29–34 (2008).
pubmed: 18067246 doi: 10.1021/pr700600n
Griss, J. et al. The mzTab data exchange format: communicating mass-spectrometry-based proteomics and metabolomics experimental results to a wider audience. Mol. Cell. Proteom. 13, 2765–2775 (2014).
doi: 10.1074/mcp.O113.036681
Weisser, H. & Choudhary, J. S. Targeted feature detection for data-dependent shotgun proteomics. J. Proteome Res. 16, 2964–2974 (2017).
pubmed: 28673088 pmcid: 5547443 doi: 10.1021/acs.jproteome.7b00248
Martens, L. et al. mzML—a community standard for mass spectrometry data. Mol. Cell. Proteom. 10, R110.000133 (2011).
doi: 10.1074/mcp.R110.000133
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
pubmed: 23051804 pmcid: 3471674 doi: 10.1038/nbt.2377
Taoka, M. et al. The complete chemical structure of Saccharomyces cerevisiae rRNA: partial pseudouridylation of U2345 in 25S rRNA by snoRNA snR9. Nucleic Acids Res. 44, 8951–8961 (2016).
pubmed: 27325748 pmcid: 5062969 doi: 10.1093/nar/gkw564
R Core Team. R: A language and environment for statistical computing. (R Foundation for Statistical Computing, 2018).
Vizcaíno, J. A. et al. 2016 update of the PRIDE database and its related tools. Nucleic Acids Res. 44, 11033 (2016).
pubmed: 27683222 pmcid: 5159556 doi: 10.1093/nar/gkw880

Auteurs

Samuel Wein (S)

Epigenetics Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
Center for Bioinformatics Tübingen, University of Tübingen, Tübingen, Germany.

Byron Andrews (B)

STORM Therapeutics Limited, Moneta Building, Babraham Research Campus, Cambridge, UK.

Timo Sachsenberg (T)

Applied Bioinformatics, Department for Computer Science, University of Tübingen, Tübingen, Germany.

Helena Santos-Rosa (H)

Gurdon Institute, University of Cambridge, Cambridge, UK.

Oliver Kohlbacher (O)

Center for Bioinformatics Tübingen, University of Tübingen, Tübingen, Germany.
Applied Bioinformatics, Department for Computer Science, University of Tübingen, Tübingen, Germany.
Quantitative Biology Center, University of Tübingen, Tübingen, Germany.
Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, Germany.
Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany.

Tony Kouzarides (T)

Gurdon Institute, University of Cambridge, Cambridge, UK.

Benjamin A Garcia (BA)

Epigenetics Program, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA. bgarci@pennmedicine.upenn.edu.

Hendrik Weisser (H)

STORM Therapeutics Limited, Moneta Building, Babraham Research Campus, Cambridge, UK. hendrik.weisser@stormtherapeutics.com.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH