The Herbarium 2021 Half-Earth Challenge Dataset and Machine Learning Competition.
datasets
fine-grained visual categorization
herbarium specimen image
hierarchical classification
machine learning competition
Journal
Frontiers in plant science
ISSN: 1664-462X
Titre abrégé: Front Plant Sci
Pays: Switzerland
ID NLM: 101568200
Informations de publication
Date de publication:
2021
2021
Historique:
received:
30
09
2021
accepted:
20
12
2021
entrez:
18
2
2022
pubmed:
19
2
2022
medline:
19
2
2022
Statut:
epublish
Résumé
Herbarium sheets present a unique view of the world's botanical history, evolution, and biodiversity. This makes them an all-important data source for botanical research. With the increased digitization of herbaria worldwide and advances in the domain of fine-grained visual classification which can facilitate automatic identification of herbarium specimen images, there are many opportunities for supporting and expanding research in this field. However, existing datasets are either too small, or not diverse enough, in terms of represented taxa, geographic distribution, and imaging protocols. Furthermore, aggregating datasets is difficult as taxa are recognized under a multitude of names and must be aligned to a common reference. We introduce the Herbarium 2021 Half-Earth dataset: the largest and most diverse dataset of herbarium specimen images, to date, for automatic taxon recognition. We also present the results of the Herbarium 2021 Half-Earth challenge, a competition that was part of the Eighth Workshop on Fine-Grained Visual Categorization (FGVC8) and hosted by Kaggle to encourage the development of models to automatically identify taxa from herbarium sheet images.
Identifiants
pubmed: 35178056
doi: 10.3389/fpls.2021.787127
pmc: PMC8846375
doi:
Types de publication
Journal Article
Langues
eng
Pagination
787127Informations de copyright
Copyright © 2022 de Lutio, Park, Watson, D'Aronco, Wegner, Wieringa, Tulig, Pyle, Gallaher, Brown, Guymer, Franks, Ranatunga, Baba, Belongie, Michelangeli, Ambrose and Little.
Déclaration de conflit d'intérêts
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling editor declared a shared research group [FGVC virtual lab] with one of the authors [SB] at time of review.
Références
Appl Plant Sci. 2019 Mar 20;7(3):e01233
pubmed: 30937225
Ecol Lett. 2013 Aug;16(8):1037-44
pubmed: 23786499
BMC Evol Biol. 2016 Nov 16;16(1):248
pubmed: 27852219
Bioscience. 2020 Jul 1;70(6):610-620
pubmed: 32665738
Appl Plant Sci. 2019 Feb 28;7(4):e01223
pubmed: 31024779
Sci Data. 2020 Nov 26;7(1):416
pubmed: 33243996
Biodivers Data J. 2019 Feb 08;(7):e31817
pubmed: 30833825
Biodivers Data J. 2017 Nov 02;(5):e21139
pubmed: 29200929
New Phytol. 2021 Apr;230(2):433-450
pubmed: 33280123
Arch Comput Methods Eng. 2018;25(2):507-543
pubmed: 29962832
Plant Divers. 2022 May 27;44(4):335-339
pubmed: 35967255
Proc Biol Sci. 2011 Feb 22;278(1705):554-9
pubmed: 20610425
Proc Natl Acad Sci U S A. 2010 Dec 21;107(51):22169-71
pubmed: 21135225
Appl Plant Sci. 2020 Jul 01;8(6):e11368
pubmed: 32626610
Proc Natl Acad Sci U S A. 2021 Feb 9;118(6):
pubmed: 33526679
Appl Plant Sci. 2020 Jul 01;8(6):e11365
pubmed: 32626608
PLoS One. 2019 Jun 26;14(6):e0217779
pubmed: 31242215
BMC Evol Biol. 2017 Aug 11;17(1):181
pubmed: 28797242
Philos Trans R Soc Lond B Biol Sci. 2018 Nov 19;374(1763):
pubmed: 30455209
Front Plant Sci. 2020 Jul 31;11:1129
pubmed: 32849691
New Phytol. 2019 Jan;221(1):110-122
pubmed: 30160314
Appl Plant Sci. 2020 Jul 01;8(6):e11370
pubmed: 32626612
Trends Ecol Evol. 2017 Jul;32(7):531-546
pubmed: 28465044
Proc Natl Acad Sci U S A. 2016 Mar 22;113(12):3305-10
pubmed: 26951664
Front Plant Sci. 2017 Jul 07;8:1190
pubmed: 28736569
Appl Plant Sci. 2020 Jul 01;8(6):e11372
pubmed: 32626613