Detection of m6A from direct RNA sequencing using a multiple instance learning framework.


Journal

Nature methods
ISSN: 1548-7105
Titre abrégé: Nat Methods
Pays: United States
ID NLM: 101215604

Informations de publication

Date de publication:
12 2022
Historique:
received: 07 09 2021
accepted: 27 09 2022
pubmed: 11 11 2022
medline: 7 12 2022
entrez: 10 11 2022
Statut: ppublish

Résumé

RNA modifications such as m6A methylation form an additional layer of complexity in the transcriptome. Nanopore direct RNA sequencing can capture this information in the raw current signal for each RNA molecule, enabling the detection of RNA modifications using supervised machine learning. However, experimental approaches provide only site-level training data, whereas the modification status for each single RNA molecule is missing. Here we present m6Anet, a neural-network-based method that leverages the multiple instance learning framework to specifically handle missing read-level modification labels in site-level training data. m6Anet outperforms existing computational methods, shows similar accuracy as experimental approaches, and generalizes with high accuracy to different cell lines and species without retraining model parameters. In addition, we demonstrate that m6Anet captures the underlying read-level stoichiometry, which can be used to approximate differences in modification rates. Overall, m6Anet offers a tool to capture the transcriptome-wide identification and quantification of m6A from a single run of direct RNA sequencing.

Identifiants

pubmed: 36357692
doi: 10.1038/s41592-022-01666-1
pii: 10.1038/s41592-022-01666-1
pmc: PMC9718678
doi:

Substances chimiques

RNA 63231-63-0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1590-1598

Commentaires et corrections

Type : CommentIn

Informations de copyright

© 2022. The Author(s).

Références

Cohn, W. E. & Volkin, E. Nucleoside-5′-phosphates from ribonucleic acid. Nature 167, 483–484 (1951).
doi: 10.1038/167483a0
Kemp, J. W. & Allen, F. W. Ribonucleic acids from pancreas which contain new components. Biochim. Biophys. Acta 28, 51–58 (1958).
pubmed: 13535676 doi: 10.1016/0006-3002(58)90426-8
Boccaletto, P. et al. MODOMICS: a database of RNA modification pathways. 2017 update. Nucleic Acids Res. 46, D303–D307 (2018).
pubmed: 29106616 doi: 10.1093/nar/gkx1030
Dunin-Horkawicz, S. MODOMICS: a database of RNA modification pathways. Nucleic Acids Res. 34, D145–D149 (2006).
pubmed: 16381833 doi: 10.1093/nar/gkj084
Perry, R. P. & Kelley, D. E. Existence of methylated messenger RNA in mouse L cells. Cell 1, 37–42 (1974).
doi: 10.1016/0092-8674(74)90153-6
Roundtree, I. A., Evans, M. E., Pan, T. & He, C. Dynamic RNA modifications in gene expression regulation. Cell 169, 1187–1200 (2017).
pubmed: 28622506 pmcid: 5657247 doi: 10.1016/j.cell.2017.05.045
Liu, N. et al. N
pubmed: 25719671 pmcid: 4355918 doi: 10.1038/nature14234
Wang, X. et al. N
pubmed: 24284625 doi: 10.1038/nature12730
Ke, S. et al. m
pubmed: 28637692 pmcid: 5495127 doi: 10.1101/gad.301036.117
Xiao, W. et al. Nuclear m
doi: 10.1016/j.molcel.2016.03.004
Wang, X. et al. N
pubmed: 26046440 pmcid: 4825696 doi: 10.1016/j.cell.2015.05.014
Wang, Y. et al. N
pubmed: 24394384 pmcid: 4640932 doi: 10.1038/ncb2902
Weng, H. et al. METTL14 inhibits hematopoietic stem/progenitor differentiation and promotes leukemogenesis via mRNA m
pubmed: 29290617 doi: 10.1016/j.stem.2017.11.016
Xu, K. et al. Mettl3-mediated m6A regulates spermatogonial differentiation and meiosis initiation. Cell Res. 27, 1100–1114 (2017).
pubmed: 28809392 pmcid: 5587845 doi: 10.1038/cr.2017.100
Zhang, C. et al. Hypoxia induces the breast cancer stem cell phenotype by HIF-dependent and ALKBH5-mediated m
pubmed: 27001847 pmcid: 4833258
Yankova, E. et al. Small-molecule inhibition of METTL3 as a strategy against myeloid leukaemia. Nature 593, 597–601 (2021).
pubmed: 33902106 pmcid: 7613134 doi: 10.1038/s41586-021-03536-w
Vu, L. P. et al. The N
pubmed: 28920958 pmcid: 5677536 doi: 10.1038/nm.4416
Batista, P. J. et al. m
pubmed: 25456834 pmcid: 4278749 doi: 10.1016/j.stem.2014.09.019
Yoon, K.-J. et al. Temporal control of mammalian cortical neurogenesis by m
pubmed: 28965759 pmcid: 5679435 doi: 10.1016/j.cell.2017.09.003
Hsu, P. J., Shi, H. & He, C. Epitranscriptomic influences on development and disease. Genome Biol. 18, 197 (2017).
pubmed: 29061143 pmcid: 5654102 doi: 10.1186/s13059-017-1336-6
Jonkhout, N. et al. The RNA modification landscape in human disease. RNA 23, 1754–1769 (2017).
pubmed: 28855326 pmcid: 5688997 doi: 10.1261/rna.063503.117
Meyer, K. D. et al. Comprehensive analysis of mRNA methylation reveals enrichment in 3′ UTRs and near stop codons. Cell 149, 1635–1646 (2012).
pubmed: 22608085 pmcid: 3383396 doi: 10.1016/j.cell.2012.05.003
Dominissini, D. et al. Topology of the human and mouse m
pubmed: 22575960 doi: 10.1038/nature11112
Chen, K. et al. High-resolution N
pubmed: 25491922 doi: 10.1002/anie.201410647
Ke, S. et al. A majority of m
pubmed: 26404942 pmcid: 4604345 doi: 10.1101/gad.269415.115
Linder, B. et al. Single-nucleotide-resolution mapping of m
pubmed: 26121403 pmcid: 4487409 doi: 10.1038/nmeth.3453
Molinie, B. et al. m
pubmed: 27376769 pmcid: 5704921 doi: 10.1038/nmeth.3898
Koh, C. W. Q., Goh, Y.T & Sho Goh, W. S. Atlas of quantitative single-base-resolution N
Dierks, D. et al. Multiplexed profiling facilitates robust m
pubmed: 34480159 doi: 10.1038/s41592-021-01242-z
Carlile, T. M. et al. Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells. Nature 515, 143–146 (2014).
pubmed: 25192136 pmcid: 4224642 doi: 10.1038/nature13802
Marchand, V. et al. AlkAniline‐Seq: profiling of m
doi: 10.1002/anie.201810946
Garcia-Campos, M. A. et al. Deciphering the ‘m
pubmed: 31257032 doi: 10.1016/j.cell.2019.06.013
Zhang, Z. H. et al. Single-base mapping of m
doi: 10.1126/sciadv.aax0250
Meyer, K. D. DART-seq: an antibody-free method for global m
pubmed: 31548708 pmcid: 6884681 doi: 10.1038/s41592-019-0570-0
Ryvkin, P. et al. HAMR: high-throughput annotation of modified ribonucleotides. RNA 19, 1684–1692 (2013).
pubmed: 24149843 pmcid: 3884653 doi: 10.1261/rna.036806.112
Garalde, D. R. et al. Highly parallel direct on an array of nanopores. Nat. Methods 15, 201–206 (2018).
pubmed: 29334379 doi: 10.1038/nmeth.4577
Wan, Y. K., Hendra, C., Pratanwanich, P. N. & Göke, J. Beyond sequencing: machine learning algorithms extract biology hidden in Nanopore signal data. Trends Genet. 38, 246–257 (2022).
pubmed: 34711425 doi: 10.1016/j.tig.2021.09.001
Stoiber, M. et al. De novo identification of DNA modifications enabled by genome-guided nanopore signal processing. Preprint at bioRxiv https://doi.org/10.1101/094672 (2017).
Price, A. M. et al. Direct RNA sequencing reveals m
pubmed: 33243990 pmcid: 7691994 doi: 10.1038/s41467-020-19787-6
Ueda, H. nanoDoc: RNA modification detection using Nanopore raw reads with deep one-class classification. Preprint at bioRxiv https://doi.org/10.1101/2020.09.13.295089 (2021).
Leger, A. et al. RNA modifications detection by comparative Nanopore direct RNA sequencing. Nat. Commun. 12, 7198 (2021).
pubmed: 34893601 pmcid: 8664944 doi: 10.1038/s41467-021-27393-3
Jenjaroenpun, P. et al. Decoding the epitranscriptional landscape from native RNA sequences. Nucleic Acids Res. 49, e7 (2021).
pubmed: 32710622 doi: 10.1093/nar/gkaa620
Pratanwanich, P. N. et al. Identification of differential RNA modifications from nanopore direct RNA sequencing with xPore. Nat. Biotechnol. 13, 1394–1402 (2021).
doi: 10.1038/s41587-021-00949-w
Parker, M. T., Barton, G. J. & Simpson, G. G. Yanocomp: robust prediction of m6A modifications in individual nanopore direct RNA reads. Preprint at bioRxiv https://doi.org/10.1101/2021.06.15.448494 (2021).
Liu, H. et al. Accurate detection of m
doi: 10.1038/s41467-019-11713-9
Liu, H., Begik, O. & Novoa, E. M. EpiNano: detection of m
pubmed: 34085237 doi: 10.1007/978-1-0716-1374-0_3
Lorenz, D. A., Sathe, S., Einstein, J. M. & Yeo, G. W. Direct RNA sequencing enables m
pubmed: 31624092 pmcid: 6913132 doi: 10.1261/rna.072785.119
Gao, Y. et al. Quantitative profiling of N-methyladenosine at single-base resolution in stem-differentiating xylem of Populus trichocarpa using Nanopore direct RNA sequencing. Genome Biol. 22, 22 (2021).
pubmed: 33413586 pmcid: 7791831 doi: 10.1186/s13059-020-02241-7
Begik, O. et al. Quantitative profiling of pseudouridylation dynamics in native RNAs with nanopore sequencing. Nat. Biotechnol. 39, 1278–1291 (2021).
pubmed: 33986546 doi: 10.1038/s41587-021-00915-6
Dietterich, T. G., Lathrop, R. H. & Lozano-Pérez, T. Solving the multiple instance problem with axis-parallel rectangles. Artif. Intell. 89, 31–71 (1997).
doi: 10.1016/S0004-3702(96)00034-3
Maron, O. & Lozano-Pérez, T. A Framework for Multiple-Instance Learning. in Advances in Neural Information Processing Systems 10 (eds Jordan, M. I., Kearns, M. J. & Solla, S. A.) 570–576 (MIT Press, 1998).
Loman, N. J., Quick, J. & Simpson, J. T. A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat. Methods 12, 733–735 (2015).
pubmed: 26076426 doi: 10.1038/nmeth.3444
Chen, Y. et al. A systematic benchmark of Nanopore long read RNA sequencing for transcript level analysis in human cell lines. Preprint at bioRxiv https://doi.org/10.1101/2021.04.21.440736 (2021).
Parker, M. T. et al. Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and m
doi: 10.7554/eLife.49658
Zhang, T. et al. RNALocate: a resource for RNA subcellular localizations. Nucleic Acids Res. 45, D135–D138 (2017).
pubmed: 27543076
Pratanwanich, P. N. et al. Detection of differential RNA modifications from direct RNA sequencing of human cell lines. Nat. Biotechnol. 39, 1394–1402 (2021).
pubmed: 34282325 doi: 10.1038/s41587-021-00949-w
Grozhik, A. V. & Jaffrey, S. R. Distinguishing RNA modifications from noise in epitranscriptome maps. Nat. Chem. Biol. 14, 215–225 (2018).
pubmed: 29443978 doi: 10.1038/nchembio.2546
McIntyre, A. B. R. et al. Limits in the detection of m
pubmed: 32313079 pmcid: 7170965 doi: 10.1038/s41598-020-63355-3
Miladi, M. et al. The landscape of SARS-CoV-2 RNA modifications. Preprint at https://doi.org/10.1101/2020.07.18.204362 (2020).
Aw, J. G. A. et al. Determination of isoform-specific RNA structure with nanopore long reads. Nat. Biotechnol. 39, 336–346 (2021).
pubmed: 33106685 doi: 10.1038/s41587-020-0712-z
Ilse, M., Tomczak, J. M. & Welling, M. Attention-based deep multiple instance learning. Preprint at arXiv https://doi.org/10.48550/arXiv.1802.04712 (2018).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at arXiv https://doi.org/10.48550/arXiv.1412.6980 (2014).
Reddi, S. J., Kale, S. & Kumar, S. On the convergence of Adam and beyond. Preprint at arXiv https://doi.org/10.48550/arXiv.1904.09237 (2019).
Paszke, A. et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library. in Advances in Neural Information Processing Systems (eds Wallach, H. et al.) Vol. 32 (Curran Associates, 2019).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).

Auteurs

Christopher Hendra (C)

Institute of Data Science, National University of Singapore, Singapore, Singapore.
Genome Institute of Singapore, A*STAR, Singapore, Singapore.
Department of Statistics and Data Science, National University of Singapore, Singapore, Singapore.

Ploy N Pratanwanich (PN)

Genome Institute of Singapore, A*STAR, Singapore, Singapore.
Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Chulalongkorn, Thailand.
Chula Intelligent and Complex Systems Research Unit, Chulalongkorn University, Chulalongkorn, Thailand.

Yuk Kei Wan (YK)

Genome Institute of Singapore, A*STAR, Singapore, Singapore.
Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.

W S Sho Goh (WSS)

Institute of Molecular Physiology, Shenzhen Bay Laboratory, Shenzhen, China.

Alexandre Thiery (A)

Department of Statistics and Data Science, National University of Singapore, Singapore, Singapore. a.h.thiery@nus.edu.sg.

Jonathan Göke (J)

Genome Institute of Singapore, A*STAR, Singapore, Singapore. gokej@gis.a-star.edu.sg.
Department of Statistics and Data Science, National University of Singapore, Singapore, Singapore. gokej@gis.a-star.edu.sg.
National Cancer Center of Singapore, Singapore, Singapore. gokej@gis.a-star.edu.sg.

Articles similaires

Arabidopsis Arabidopsis Proteins Osmotic Pressure Cytoplasm RNA, Messenger
Animals Lung India Sheep Transcriptome

Revealing molecular mechanisms of early-onset tongue cancer by spatial transcriptomics.

Marina R Patysheva, Elena S Kolegova, Anna A Khozyainova et al.
1.00
Humans Tongue Neoplasms Male Adult Middle Aged
Humans RNA, Circular Exosomes Cell Proliferation Epithelial-Mesenchymal Transition

Classifications MeSH