RepBox: a toolbox for the identification of repetitive elements.

Bioinformatics pipeline Detection software Genomics Transposable elements

Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
22 Aug 2023
Historique:
received: 05 08 2021
accepted: 18 07 2023
medline: 24 8 2023
pubmed: 23 8 2023
entrez: 23 8 2023
Statut: epublish

Résumé

Transposable elements (TEs) are short, mobile DNA elements that are known to play important roles in the genomes of many eukaryotic species. The identification and categorization of these elements is a critical task for many genomic studies, and the continued increase in the number of de novo assembled genomes demands new tools to improve the efficiency of this process. For this reason, we developed RepBox, a suite of Python scripts that combine several pre-existing family-specific TE detection methods into a single user-friendly pipeline. Based on comparisons of RepBox with the standard TE detection software RepeatModeler, we find that RepBox consistently classifies more elements and is also able to identify a more diverse array of TE families than the existing methods in plant genomes. The performance of RepBox on two different plant genomes indicates that our toolbox represents a significant improvement over existing TE detection methods, and should facilitate future TE annotation efforts in additional species.

Sections du résumé

BACKGROUND BACKGROUND
Transposable elements (TEs) are short, mobile DNA elements that are known to play important roles in the genomes of many eukaryotic species. The identification and categorization of these elements is a critical task for many genomic studies, and the continued increase in the number of de novo assembled genomes demands new tools to improve the efficiency of this process. For this reason, we developed RepBox, a suite of Python scripts that combine several pre-existing family-specific TE detection methods into a single user-friendly pipeline.
RESULTS RESULTS
Based on comparisons of RepBox with the standard TE detection software RepeatModeler, we find that RepBox consistently classifies more elements and is also able to identify a more diverse array of TE families than the existing methods in plant genomes.
CONCLUSIONS CONCLUSIONS
The performance of RepBox on two different plant genomes indicates that our toolbox represents a significant improvement over existing TE detection methods, and should facilitate future TE annotation efforts in additional species.

Identifiants

pubmed: 37608271
doi: 10.1186/s12859-023-05419-5
pii: 10.1186/s12859-023-05419-5
pmc: PMC10463291
doi:

Substances chimiques

DNA Transposable Elements 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

317

Subventions

Organisme : U.S. Department of Education
ID : P200A150266

Informations de copyright

© 2023. BioMed Central Ltd., part of Springer Nature.

Références

Brief Bioinform. 2007 Nov;8(6):382-92
pubmed: 17932080
Curr Biol. 1996 Jul 1;6(7):817-20
pubmed: 8805293
BMC Bioinformatics. 2019 Jun 24;20(1):354
pubmed: 31234777
Nat Rev Genet. 2017 May;18(5):292-308
pubmed: 28286338
Proc Natl Acad Sci U S A. 2020 Apr 28;117(17):9451-9457
pubmed: 32300014
Plant Cell Physiol. 2013 Feb;54(2):e6
pubmed: 23299411
Methods Mol Biol. 2019;1910:177-207
pubmed: 31278665
Genomics Inform. 2014 Sep;12(3):87-97
pubmed: 25317107
BMC Med Genomics. 2018 Nov 20;11(Suppl 5):101
pubmed: 30453969
PeerJ. 2016 Oct 18;4:e2584
pubmed: 27781170
Nucleic Acids Res. 2020 Jan 8;48(D1):D689-D695
pubmed: 31598706
Genome Biol. 2019 Dec 16;20(1):275
pubmed: 31843001
Bioinformatics. 2017 Mar 1;33(5):743-745
pubmed: 28062442
BMC Bioinformatics. 2018 Oct 3;19(1):348
pubmed: 30285604
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Genome Res. 2007 Jul;17(7):1072-81
pubmed: 17556529
Bioinformatics. 2016 Aug 1;32(15):2366-8
pubmed: 27153580
Nucleic Acids Res. 2008 Jan;36(Database issue):D1009-14
pubmed: 17986450
Proc Natl Acad Sci U S A. 2014 Jul 15;111(28):10263-8
pubmed: 24982153
Nat Rev Genet. 2017 Feb;18(2):71-86
pubmed: 27867194
Proc Natl Acad Sci U S A. 1997 Jul 22;94(15):7704-11
pubmed: 9223252

Auteurs

Shelvasha Burkes-Patton (S)

Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA.

Elizabeth A Cooper (EA)

Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA.
North Carolina Research Campus, Kannapolis, NC, 28081, USA.

Jessica Schlueter (J)

Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA. jschluet@charlotte.edu.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH