RepeatCraft: a meta-pipeline for repetitive element de-fragmentation and annotation.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
15 03 2019
Historique:
received: 27 06 2018
revised: 27 07 2018
accepted: 23 08 2018
pubmed: 31 8 2018
medline: 1 1 2020
entrez: 31 8 2018
Statut: ppublish

Résumé

Repetitive elements comprise large proportion of many genomes. They have impact on both genome evolution and regulation. Their classification and the study of evolutionary history is a major emerging field. Various software exist to-date to classify and map repeats across genomes. The major unresolved drawback, however, is the fragmented nature of many identified repeat loci. This ultimately makes the classification of novel repeats and their evolutionary analyses difficult. To improve on this, we developed a pipeline (RepeatCraft) that integrates results from several repeat element classification tools based on both sequence similarity and structural features. The pipeline de-fragments closely spaced repeat loci in the genomes, reconstructing longer copies, thus allowing for a better annotation and sequence comparisons. The pipeline also includes a user interface that can run in a web browser allowing for an easy access and exploration of the repeat data. RepeatCraft is implemented in Python and the web application is implemented in R. Download and documentation is freely available at https://github.com/niccw/repeatCraftp. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 30165587
pii: 5079332
doi: 10.1093/bioinformatics/bty745
pmc: PMC6419915
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1051-1052

Informations de copyright

© The Author(s) 2018. Published by Oxford University Press.

Références

J Exp Zool B Mol Dev Evol. 2014 Sep;322(6):322-33
pubmed: 23908136
Proc Natl Acad Sci U S A. 2001 Jul 17;98(15):8714-9
pubmed: 11447285
Nature. 2010 Mar 25;464(7288):592-6
pubmed: 20228792
Annu Rev Genet. 2007;41:331-68
pubmed: 18076328
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W265-8
pubmed: 17485477

Auteurs

Wai Yee Wong (WY)

Department of Molecular Evolution and Development, Faculty of Life Science, University of Vienna, A-1090 Vienna, Austria.

Oleg Simakov (O)

Department of Molecular Evolution and Development, Faculty of Life Science, University of Vienna, A-1090 Vienna, Austria.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Coal Metagenome Phylogeny Bacteria Genome, Bacterial
Cephalometry Humans Anatomic Landmarks Software Internet

Classifications MeSH