MetaCOXI: an integrated collection of metazoan mitochondrial cytochrome oxidase subunit-I DNA sequences.


Journal

Database: the journal of biological databases and curation
ISSN: 1758-0463
Abbreviated title: Database (Oxford)
Country: England
NLM ID: 101517697

Publication information

Publication date:
3 February 2022
History:
received: 19 May 2021
revised: 15 December 2021
accepted: 7 January 2022
entrez: 8 February 2022
pubmed: 9 February 2022
medline: 30 April 2022
Status: ppublish

Abstract

Nucleotide sequence reference collections, or databases, are fundamental components of DNA barcoding and metabarcoding data analysis pipelines. In such analyses, accurate taxonomic assignment is crucial and relies directly on the availability of comprehensive, curated reference sequence collections and their taxonomic information. The wide use of mitochondrial cytochrome oxidase subunit-I (COXI) as a standard DNA barcode marker in metazoan biodiversity studies highlights the need to clarify which relevant information is available from different data sources and how it can be integrated. Data integration must address several aspects, starting with DNA sequence curation, followed by taxonomy alignment to a solid reference backbone and metadata harmonization according to universal standards. Here, we present MetaCOXI, an integrated collection of curated metazoan COXI DNA sequences with their associated harmonized taxonomy and metadata. This collection is built on the two most extensive data resources available, namely the European Nucleotide Archive (ENA) and the Barcode of Life Data System (BOLD). The current release contains more than 5.6 million entries (39.1% unique to BOLD, 3.6% unique to ENA and 57.2% shared between the two), their taxonomic classification based on the NCBI reference taxonomy, and the main available metadata relevant to environmental DNA studies, such as geographical coordinates, sampling country and host species. MetaCOXI is available in standard universal formats ('fasta' for sequences and 'tsv' for taxonomy and metadata), which can be easily incorporated into standard or purpose-specific DNA barcoding and/or metabarcoding data analysis pipelines. Database URL: https://github.com/bachob5/MetaCOXI.
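
Because the abstract describes sequences distributed as 'fasta' and taxonomy and metadata as 'tsv', a pipeline would typically join the two by sequence identifier. The following is a minimal sketch of such a join using only the Python standard library. The column names (`seq_id`, `species`, `country`), the record identifiers and the toy data are assumptions made for illustration; the actual MetaCOXI file layout is not given in this record and should be checked against the repository.

```python
import csv
import io


def read_fasta(handle):
    """Yield (sequence_id, sequence) pairs from a FASTA-formatted text stream."""
    seq_id, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if seq_id is not None:
                yield seq_id, "".join(chunks)
            # The first whitespace-delimited token of the header is taken as the ID.
            tokens = line[1:].split()
            seq_id, chunks = (tokens[0] if tokens else ""), []
        else:
            chunks.append(line)
    if seq_id is not None:
        yield seq_id, "".join(chunks)


def read_tsv_by_id(handle, id_column):
    """Index the rows of a tab-separated table by the value in id_column."""
    reader = csv.DictReader(handle, delimiter="\t")
    return {row[id_column]: row for row in reader}


def join_sequences_with_metadata(fasta_handle, tsv_handle, id_column="seq_id"):
    """Yield one dict per sequence that has a matching metadata row.

    `id_column` is a hypothetical column name, not taken from MetaCOXI.
    Sequences without a metadata row are skipped.
    """
    metadata = read_tsv_by_id(tsv_handle, id_column)
    for seq_id, sequence in read_fasta(fasta_handle):
        row = metadata.get(seq_id)
        if row is not None:
            yield {**row, "sequence": sequence}


# Toy in-memory data standing in for the real files (hypothetical content).
EXAMPLE_FASTA = ">rec1\nACGT\nTTGA\n>rec2\nGGCC\n"
EXAMPLE_TSV = "seq_id\tspecies\tcountry\nrec1\tApis mellifera\tItaly\n"

if __name__ == "__main__":
    joined = join_sequences_with_metadata(
        io.StringIO(EXAMPLE_FASTA), io.StringIO(EXAMPLE_TSV)
    )
    for record in joined:
        print(record["seq_id"], record["species"], len(record["sequence"]))
```

With real files, the `io.StringIO` objects would be replaced by handles from `open(...)`. For a collection of several million entries, loading the whole metadata table into a dictionary may be memory-hungry, so a streaming or indexed approach could be preferable.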

Identifiers

pubmed: 35134858
pii: 6521297
doi: 10.1093/database/baab084
pmc: PMC9216479

Chemical substances

DNA, Mitochondrial 0
Electron Transport Complex IV EC 1.9.3.1

Publication types

Journal Article
Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Copyright information

© The Author(s) 2022. Published by Oxford University Press.
