IsoMiRmap: fast, deterministic and exhaustive mining of isomiRs from short RNA-seq datasets.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Abbreviated title: Bioinformatics
Country: England
NLM ID: 9808944
Publication information
Publication date:
27 Jul 2021
History:
received: 10 Jul 2020
revised: 30 Nov 2020
accepted: 10 Jan 2021
medline: 21 Jan 2021
pubmed: 21 Jan 2021
entrez: 20 Jan 2021
Status:
ppublish
Abstract
MicroRNA (miRNA) precursor arms simultaneously give rise to multiple isoforms, called 'isomiRs.' IsomiRs from the same arm typically differ by a few nucleotides at their 5' terminus, their 3' terminus, or both. In humans, the identities and abundances of isomiRs depend on a person's sex and genetic ancestry as well as on tissue type, tissue state and disease type/subtype. Moreover, nearly half of the time the most abundant isomiR differs from the miRNA sequence found in public databases. Accurate mining of isomiRs from deep sequencing data is thus important. We developed isoMiRmap, a fast, standalone, user-friendly mining tool that identifies and quantifies all isomiRs by directly processing short RNA-seq datasets. IsoMiRmap is a portable 'plug-and-play' tool that requires minimal setup, has modest computing and storage requirements, and can process an RNA-seq dataset with 50 million reads in just a few minutes on an average laptop. IsoMiRmap deterministically and exhaustively reports all isomiRs in a given deep sequencing dataset and quantifies them accurately (no double-counting). IsoMiRmap comprehensively reports all miRNA precursor locations from which an isomiR may be transcribed, tags as 'ambiguous' those isomiRs whose sequences exist both inside and outside the space of known miRNA sequences, and reports the public identifiers of common single-nucleotide polymorphisms and documented somatic mutations that may be present in an isomiR. IsoMiRmap also identifies isomiRs with 3' non-templated post-transcriptional additions. Compared to similar tools, isoMiRmap is the fastest, reports more bona fide isomiRs, and provides the most comprehensive information related to an isomiR's transcriptional origin. The code for isoMiRmap is freely available at https://cm.jefferson.edu/isoMiRmap/ and https://github.com/TJU-CMC-Org/isoMiRmap/.
IsomiR profiles for the datasets of the 1000 Genomes Project, spanning five population groups, and The Cancer Genome Atlas (TCGA), spanning 33 cancer studies, are also available at https://cm.jefferson.edu/isoMiRmap/. Supplementary data are available at Bioinformatics online.
Identifiers
pubmed: 33471076
pii: 6104842
doi: 10.1093/bioinformatics/btab016
pmc: PMC8317110
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Pagination
1828-1838
Grants
Agency: NHLBI NIH HHS
ID: R01 HL141424
Country: United States
Agency: NIH HHS
ID: R01 HL141424
Country: United States
Agency: Institutional Funds
Copyright information
© The Author(s) 2021. Published by Oxford University Press.