Unification of miRNA and isomiR research: the mirGFF3 format and the mirtop API.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 02 2020
Historique:
received: 19 03 2019
revised: 17 07 2019
accepted: 28 08 2019
pubmed: 11 9 2019
medline: 18 9 2020
entrez: 11 9 2019
Statut: ppublish

Résumé

MicroRNAs (miRNAs) are small RNA molecules (∼22 nucleotide long) involved in post-transcriptional gene regulation. Advances in high-throughput sequencing technologies led to the discovery of isomiRs, which are miRNA sequence variants. While many miRNA-seq analysis tools exist, the diversity of output formats hinders accurate comparisons between tools and precludes data sharing and the development of common downstream analysis methods. To overcome this situation, we present here a community-based project, miRNA Transcriptomic Open Project (miRTOP) working towards the optimization of miRNA analyses. The aim of miRTOP is to promote the development of downstream isomiR analysis tools that are compatible with existing detection and quantification tools. Based on the existing GFF3 format, we first created a new standard format, mirGFF3, for the output of miRNA/isomiR detection and quantification results from small RNA-seq data. Additionally, we developed a command line Python tool, mirtop, to create and manage the mirGFF3 format. Currently, mirtop can convert into mirGFF3 the outputs of commonly used pipelines, such as seqbuster, isomiR-SEA, sRNAbench, Prost! as well as BAM files. Some tools have also incorporated the mirGFF3 format directly into their code, such as, miRge2.0, IsoMIRmap and OptimiR. Its open architecture enables any tool or pipeline to output or convert results into mirGFF3. Collectively, this isomiR categorization system, along with the accompanying mirGFF3 and mirtop API, provide a comprehensive solution for the standardization of miRNA and isomiR annotation, enabling data sharing, reporting, comparative analyses and benchmarking, while promoting the development of common miRNA methods focusing on downstream steps of miRNA detection, annotation and quantification. https://github.com/miRTop/mirGFF3/ and https://github.com/miRTop/mirtop. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 31504201
pii: 5556118
doi: 10.1093/bioinformatics/btz675
pmc: PMC7566869
doi:

Substances chimiques

MicroRNAs 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

698-703

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM130564
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL137811
Pays : United States

Informations de copyright

© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Références

Sci Rep. 2018 May 23;8(1):8072
pubmed: 29795287
Bioinformatics. 2016 Jul 1;32(13):2069-71
pubmed: 27153728
Bioinformatics. 2009 Jun 1;25(11):1422-3
pubmed: 19304878
Genome Biol. 2005;6(5):R44
pubmed: 15892872
Cell. 1993 Dec 3;75(5):843-54
pubmed: 8252621
BMC Genomics. 2019 Jun 21;20(1):513
pubmed: 31226924
Int J Cancer. 2014 Mar 15;134(6):1359-68
pubmed: 23999999
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Bioinformatics. 2019 May 1;35(9):1576-1578
pubmed: 30295744
Bioinformatics. 2016 Sep 1;32(17):2722-4
pubmed: 27153626
Sci Rep. 2017 Feb 21;7:41184
pubmed: 28220888
Gene. 2018 Oct 5;673:181-193
pubmed: 29913239
Nucleic Acids Res. 2014 Sep;42(15):10050-60
pubmed: 25056317
Nucleic Acids Res. 2014 Jan;42(Database issue):D68-73
pubmed: 24275495
Genome Biol. 2009;10(3):R25
pubmed: 19261174
Nature. 2004 Nov 11;432(7014):231-5
pubmed: 15531879
Sci Rep. 2018 Mar 28;8(1):5314
pubmed: 29593348
Genome Biol. 2008;9(6):R102
pubmed: 18570664
Cell. 2012 Nov 9;151(4):900-911
pubmed: 23141545
Nucleic Acids Res. 2015 Oct 30;43(19):9158-75
pubmed: 26400174
Nature. 2008 Oct 23;455(7216):1124-8
pubmed: 18806776
BMC Bioinformatics. 2018 Jul 23;19(1):275
pubmed: 30153801
Bioinformatics. 2013 May 15;29(10):1325-32
pubmed: 23479348
Sci Rep. 2019 Mar 8;9(1):3913
pubmed: 30850632
Nucleic Acids Res. 2018 Jan 4;46(D1):D160-D167
pubmed: 29036653
Nat Biotechnol. 2018 Sep;36(8):746-757
pubmed: 30010675
Sci Rep. 2018 Nov 12;8(1):16699
pubmed: 30420640
Bioinformatics. 2011 Dec 15;27(24):3423-4
pubmed: 21949271
Genome Res. 2008 Apr;18(4):610-21
pubmed: 18285502
Science. 2007 Jan 5;315(5808):97-100
pubmed: 17204650
Annu Rev Genet. 2015;49:213-42
pubmed: 26473382
Nucleic Acids Res. 2010 Mar;38(5):e34
pubmed: 20008100
Carcinogenesis. 2019 Jul 4;40(5):643-650
pubmed: 30428030
Nucleic Acids Res. 2016 Jan 4;44(D1):D126-32
pubmed: 26578589
Trends Genet. 2015 Nov;31(11):613-626
pubmed: 26453491
Nucleic Acids Res. 2019 Jul 2;47(W1):W530-W535
pubmed: 31114926
Nucleic Acids Res. 2014 Aug;42(14):9424-35
pubmed: 25056318
Cell Rep. 2019 Jan 8;26(2):447-459.e4
pubmed: 30625327
Front Biosci. 2008 Jan 01;13:2537-47
pubmed: 17981733
Oncotarget. 2014 Sep 30;5(18):8790-802
pubmed: 25229428
RNA. 2019 Jun;25(6):657-668
pubmed: 30819774
Cell. 2018 Mar 22;173(1):20-51
pubmed: 29570994
Chem Biol. 2004 Dec;11(12):1619-23
pubmed: 15610845
Mol Cell. 2019 Aug 8;75(3):511-522.e4
pubmed: 31178353
Nucleic Acids Res. 2017 Apr 7;45(6):2973-2985
pubmed: 28206648
J Biotechnol. 2017 Nov 10;261:157-168
pubmed: 28888961
RNA. 2018 May;24(5):643-655
pubmed: 29445025
Science. 2007 Feb 23;315(5815):1137-40
pubmed: 17322061
Nat Biotechnol. 2019 Apr;37(4):358-367
pubmed: 30940948
Proc Natl Acad Sci U S A. 2015 Mar 10;112(10):E1106-15
pubmed: 25713380
RNA. 2014 Sep;20(9):1431-9
pubmed: 24984703
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Nucleic Acids Res. 2018 Jan 4;46(D1):D754-D761
pubmed: 29155950
Nucleic Acids Res. 2019 Mar 18;47(5):2630-2640
pubmed: 30605524
Avicenna J Med Biotechnol. 2010 Oct;2(4):161-79
pubmed: 23407304
Curr Protoc Bioinformatics. 2014 Sep 08;47:11.12.1-34
pubmed: 25199790
Nature. 2004 Oct 21;431(7011):931-45
pubmed: 15496913
BMC Bioinformatics. 2016 Mar 31;17:148
pubmed: 27036505
Sci Rep. 2017 Aug 28;7(1):9655
pubmed: 28848235
BMC Cancer. 2018 Aug 20;18(1):834
pubmed: 30126353
Cell. 2004 Jan 23;116(2):281-97
pubmed: 14744438
Bioinformatics. 2016 Aug 15;32(16):2481-9
pubmed: 27153631
RNA. 2011 Feb;17(2):312-26
pubmed: 21177881
Front Mol Biosci. 2018 Jul 13;5:61
pubmed: 30057901
Nucleic Acids Res. 2019 Jan 8;47(D1):D221-D229
pubmed: 30395267
Cancer Res. 2018 Mar 1;78(5):1140-1154
pubmed: 29229607
Brief Bioinform. 2013 Mar;14(2):178-92
pubmed: 22517427

Auteurs

Thomas Desvignes (T)

Institute of Neuroscience, University of Oregon, Eugene, OR 97403, USA.

Phillipe Loher (P)

Computational Medicine Center, Thomas Jefferson University, Philadelphia, PA 19144, USA.

Karen Eilbeck (K)

University of Utah, Biomedical Informatics, Salt Lake City, UT 84108, USA.

Jeffery Ma (J)

Computational Medicine Center, Thomas Jefferson University, Philadelphia, PA 19144, USA.

Gianvito Urgese (G)

Department of Control and Computer Engineering, Politecnico di Torino, Torino 10129, Italy.

Bastian Fromm (B)

Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm 114 18, Sweden.

Jason Sydes (J)

Institute of Neuroscience, University of Oregon, Eugene, OR 97403, USA.

Ernesto Aparicio-Puerta (E)

Computational Epigenomics Laboratory, Genetics Department and Biotechnology Institute and Biosanitary Institute, University of Granada, Granada 18002, Spain.

Victor Barrera (V)

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA.

Roderic Espín (R)

Universitat Oberta de Catalunya, Barcelona 08018, Spain.

Florian Thibord (F)

Sorbonne Université, Pierre Louis Doctoral School of Public Health, Paris 75006, France.
Institut National pour la Santé et la Recherche Médicale (INSERM) Unité Mixte de Recherche en Santé (UMR_S), University of Bordeaux, Bordeaux 33076, France.

Xavier Bofill-De Ros (X)

RNA Biology Laboratory, Center for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA.

Eric Londin (E)

Computational Medicine Center, Thomas Jefferson University, Philadelphia, PA 19144, USA.

Aristeidis G Telonis (AG)

Computational Medicine Center, Thomas Jefferson University, Philadelphia, PA 19144, USA.

Elisa Ficarra (E)

Department of Control and Computer Engineering, Politecnico di Torino, Torino 10129, Italy.

Marc R Friedländer (MR)

Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm 114 18, Sweden.

John H Postlethwait (JH)

Institute of Neuroscience, University of Oregon, Eugene, OR 97403, USA.

Isidore Rigoutsos (I)

Computational Medicine Center, Thomas Jefferson University, Philadelphia, PA 19144, USA.

Michael Hackenberg (M)

Computational Epigenomics Laboratory, Genetics Department and Biotechnology Institute and Biosanitary Institute, University of Granada, Granada 18002, Spain.

Ioannis S Vlachos (IS)

Non-coding Research Lab, Department of Pathology, Cancer Research Institute, Harvard Medical School Initiative for RNA Medicine, Beth Israel Deaconess Medical Center, Boston, MA 02115, USA.

Marc K Halushka (MK)

Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.

Lorena Pantano (L)

Bioinformatics Core, The Picower Institute for Learning and Memory, Cambridge, MA 02139, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Humans Endoribonucleases RNA, Messenger RNA Caps Gene Expression Regulation
Coal Metagenome Phylogeny Bacteria Genome, Bacterial

Classifications MeSH