Repositories for Taxonomic Data: Where We Are and What is Missing.


Journal

Systematic biology
ISSN: 1076-836X
Abbreviated title: Syst Biol
Country: England
NLM ID: 9302532

Publication information

Publication date:
01 11 2020
History:
received: 13 11 2019
revised: 20 02 2020
accepted: 24 03 2020
pubmed: 17 4 2020
medline: 6 1 2021
entrez: 17 4 2020
Status: ppublish

Abstract

Natural history collections are leading successful large-scale projects of specimen digitization (images, metadata, DNA barcodes), thereby transforming taxonomy into a big data science. Yet, little effort has been directed towards safeguarding and subsequently mobilizing the considerable amount of original data generated during the process of naming 15,000-20,000 species every year. From the perspective of alpha-taxonomists, we provide a review of the properties and diversity of taxonomic data, assess their volume and use, and establish criteria for optimizing data repositories. We surveyed 4113 alpha-taxonomic studies in representative journals for 2002, 2010, and 2018, and found an increasing yet comparatively limited use of molecular data in species diagnosis and description. In 2018, of the 2661 papers published in specialized taxonomic journals, molecular data were widely used in mycology (94%), regularly in vertebrates (53%), but rarely in botany (15%) and entomology (10%). Images play an important role in taxonomic research on all taxa, with photographs used in >80% and drawings in 58% of the surveyed papers. The use of omics (high-throughput) approaches or 3D documentation is still rare. Improved archiving strategies for metabarcoding consensus reads, genome and transcriptome assemblies, and chemical and metabolomic data could help to mobilize the wealth of high-throughput data for alpha-taxonomy. Because long-term (ideally perpetual) data storage is of particular importance for taxonomy, energy footprint reduction via less storage-demanding formats is a priority if their information content suffices for the purpose of taxonomic studies. Whereas taxonomic assignments are quasifacts for most biological disciplines, they remain hypotheses pertaining to evolutionary relatedness of individuals for alpha-taxonomy. For this reason, an improved reuse of taxonomic data, including machine-learning-based species identification and delimitation pipelines, requires a cyberspecimen approach: linking data via unique specimen identifiers, and thereby making them findable, accessible, interoperable, and reusable for taxonomic research. This poses both qualitative challenges to adapt the existing infrastructure of data centers to a specimen-centered concept and quantitative challenges to host and connect an estimated ≤2 million images produced per year by alpha-taxonomic studies, plus many millions of images from digitization campaigns. Of the 30,000-40,000 taxonomists globally, many are thought to be nonprofessionals, and capturing the data for online storage and reuse therefore requires low-complexity submission workflows and cost-free repository use. Expert taxonomists are the main stakeholders able to identify and formalize the needs of the discipline; their expertise is needed to implement the envisioned virtual collections of cyberspecimens. [Big data; cyberspecimen; new species; omics; repositories; specimen identifier; taxonomy; taxonomic data.]

Identifiers

pubmed: 32298457
pii: 5820981
doi: 10.1093/sysbio/syaa026
pmc: PMC7584136

Publication types

Journal Article; Research Support, Non-U.S. Gov't; Review

Languages

eng

Citation subsets

IM

Pagination

1231-1253

Copyright information

© The Author(s) 2020. Published by Oxford University Press.

Authors

Aurélien Miralles (A)

Departement Origins and Evolution, Institut Systématique, Evolution, Biodiversité (ISYEB), Muséum national d'Histoire naturelle, CNRS, Sorbonne Université, EPHE, 57 rue Cuvier, CP50, 75005 Paris, France.
Systematic Botany and Mycology, University of Munich (LMU), Menzingerstraße 67, 80638 Munich, Germany.

Teddy Bruy (T)

Departement Origins and Evolution, Institut Systématique, Evolution, Biodiversité (ISYEB), Muséum national d'Histoire naturelle, CNRS, Sorbonne Université, EPHE, 57 rue Cuvier, CP50, 75005 Paris, France.
Systematic Botany and Mycology, University of Munich (LMU), Menzingerstraße 67, 80638 Munich, Germany.

Katherine Wolcott (K)

Systematic Botany and Mycology, University of Munich (LMU), Menzingerstraße 67, 80638 Munich, Germany.
National Museum of Natural History, Smithsonian Institution, Washington, DC, USA.

Mark D Scherz (MD)

Department of Herpetology, Zoologische Staatssammlung München (ZSM-SNSB), Münchhausenstraße 21, 81247 München, Germany.
Department of Biology, Universität Konstanz, Universitätstraße 10, 78464 Konstanz, Germany.

Dominik Begerow (D)

Department of Geobotany, Ruhr-University Bochum, Universitätsstraße 150, 44780 Bochum, Germany.

Bank Beszteri (B)

Department of Phycology, Faculty of Biology, University of Duisburg-Essen, Universitätsstraße 2, 45141 Essen, Germany.

Michael Bonkowski (M)

Department of Terrestrial Ecology, Center of Excellence in Plant Sciences (CEPLAS), Terrestrial Ecology, Institute of Zoology, University of Cologne, 50674 Köln, Germany.

Janine Felden (J)

MARUM - Center for Marine Environmental Sciences, University of Bremen, Leobenerstraße 8, 28359 Bremen, Germany.
Alfred Wegener Institute - Helmholtz Center for Polar- and Marine Research, Am Handelshafen 12, 27570 Bremerhaven, Germany.

Birgit Gemeinholzer (B)

Department of Systematic Botany, Justus Liebig University Gießen, Heinrich-Buff Ring 38, 35392 Giessen, Germany.

Frank Glaw (F)

Department of Herpetology, Zoologische Staatssammlung München (ZSM-SNSB), Münchhausenstraße 21, 81247 München, Germany.

Frank Oliver Glöckner (FO)

Alfred Wegener Institute - Helmholtz Center for Polar- and Marine Research, Am Handelshafen 12, 27570 Bremerhaven, Germany.

Oliver Hawlitschek (O)

Department of Herpetology, Zoologische Staatssammlung München (ZSM-SNSB), Münchhausenstraße 21, 81247 München, Germany.
Department of Scientific Infrastructure, Centrum für Naturkunde (CeNak), Universität Hamburg, Martin-Luther-King-Platz 3, 20146 Hamburg, Germany.

Ivaylo Kostadinov (I)

GFBio - Gesellschaft für Biologische Daten e.V., c/o Research II, Campus Ring 1, 28759 Bremen, Germany.

Tim W Nattkemper (TW)

Biodata Mining Group, Center of Biotechnology (CeBiTec), Bielefeld University, PO Box 100131, 33501 Bielefeld, Germany.

Christian Printzen (C)

Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, 60325 Frankfurt/Main, Germany.

Jasmin Renz (J)

Zooplankton Research Group, DZMB - Senckenberg am Meer, Martin-Luther-King Platz 3, 20146 Hamburg, Germany.

Nataliya Rybalka (N)

Department of Experimental Phycology and Culture Collection of Algae, University Göttingen, Nikolausberger-Weg 18, 37073 Göttingen, Germany.

Marc Stadler (M)

Department Microbial Drugs, Helmholtz Centre for Infection Research (HZI), and German Centre for Infection Research (DZIF), Partner Site Hannover-Braunschweig, Inhoffenstrasse 7, 38124 Braunschweig, Germany.

Tanja Weibulat (T)

GFBio - Gesellschaft für Biologische Daten e.V., c/o Research II, Campus Ring 1, 28759 Bremen, Germany.

Thomas Wilke (T)

Department of Animal Ecology and Systematics, Justus Liebig University Gießen, Heinrich-Buff Ring 26, 35392 Giessen, Germany.

Susanne S Renner (SS)

Systematic Botany and Mycology, University of Munich (LMU), Menzingerstraße 67, 80638 Munich, Germany.

Miguel Vences (M)

Department of Evolutionary Biology, Zoological Institute, Technische Universität Braunschweig, Mendelssohnstraße 4, 38106 Braunschweig, Germany.
