Automatic curation of large comparative animal MicroRNA datasets.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 11 2019
Historique:
received: 12 12 2018
revised: 11 03 2019
accepted: 10 04 2019
pubmed: 18 4 2019
medline: 2 7 2020
entrez: 18 4 2019
Statut: ppublish

Résumé

MicroRNAs form an important class of RNA regulators that has been studied extensively. The miRBase and Rfam database provide rich, frequently updated information on both pre-miRNAs and their mature forms. These data sources, however, rely on individual data submission and thus are neither complete nor consistent in their coverage across different miRNA families. Quantitative studies of miRNA evolution therefore are difficult or impossible on this basis. We present here a workflow and a corresponding implementation, MIRfix, that automatically curates miRNA datasets by improving alignments of their precursors, the consistency of the annotation of mature miR and miR* sequence, and the phylogenetic coverage. MIRfix produces alignments that are comparable across families and sets the stage for improved homology search as well as quantitative analyses. MIRfix can be downloaded from https://github.com/Bierinformatik/MIRfix. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 30993337
pii: 5472340
doi: 10.1093/bioinformatics/btz271
doi:

Substances chimiques

MicroRNAs 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

4553-4559

Informations de copyright

© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Auteurs

Ali M Yazbeck (AM)

Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, D-04107 Leipzig, Germany.
Doctoral School of Science and Technology, Center for Biotechnology Research, Lebanese University, Hadath Campus, Beirut, Lebanon.
Helmholtz Centre for Environmental Research - UFZ, Young Investigators Group Bioinformatics and Transcriptomics, D-04318 Leipzig, Germany.

Peter F Stadler (PF)

Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, D-04107 Leipzig, Germany.
German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Center for Scalable Data Services and Solutions, and Leipzig Research Center for Civilization Diseases, University Leipzig, D-04107 Leipzig, Germany.
Max Planck Institute for Mathematics in the Sciences, D-04103 Leipzig, Germany.
Institute for Theoretical Chemistry, University of Vienna, A-1090 Wien, Austria.
Facultad de Ciencias, Universidad National de Colombia, Sede Bogotá, Colombia.
Santa Fe Institute, Santa Fe, NM 87501, USA.

Kifah Tout (K)

Doctoral School of Science and Technology, Center for Biotechnology Research, Lebanese University, Hadath Campus, Beirut, Lebanon.

Jörg Fallmann (J)

Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, D-04107 Leipzig, Germany.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH