A new R package to parse plant species occurrence records into unique collection events efficiently reduces data redundancy.

Digital duplicates Gatherings Global biodiversity information facility Myrtaceae Species taxonomic names Synonyms Taxonomic inflation World checklist of vascular plants

Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
05 Mar 2024
Historique:
received: 08 11 2023
accepted: 02 03 2024
medline: 6 3 2024
pubmed: 6 3 2024
entrez: 5 3 2024
Statut: epublish

Résumé

Biodiversity data aggregators, such as Global Biodiversity Information Facility (GBIF) suffer from inflation of the number of occurrence records when data from different databases are merged but not fully reconciled. The ParseGBIF workflow is designed to parse duplicate GBIF species occurrence records into unique collection events (gatherings) and to optimise the quality of the spatial data associated with them. ParseGBIF provides tools to verify and standardize species scientific names according to the World Checklist of Vascular Plants taxonomic backbone, and to parse duplicate records into unique 'collection events', in the process compiling the most informative spatial data, where more than one duplicate is available, and providing crude estimates of taxonomic and spatial data quality. When GBIF occurrence records for a medium-sized vascular plant family, the Myrtaceae, were processed by ParseGBIF, the average number of records useful for spatial analysis increased by 180%. ParseGBIF could therefore be valuable in the evaluation of species' occurrences at the national scale in support for national biodiversity plans, identification of plant areas important for biodiversity, sample bias estimation to inform future sampling efforts, and to forecast species range shifts in response to global climate change.

Identifiants

pubmed: 38443673
doi: 10.1038/s41598-024-56158-3
pii: 10.1038/s41598-024-56158-3
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

5450

Informations de copyright

© 2024. The Author(s).

Références

Nic Lughadha, E. et al. Extinction risk and threats to plants and fungi. Plants People Planet 2, 389–408 (2020).
doi: 10.1002/ppp3.10146
Meyer, C., Weigelt, P. & Kreft, H. Multidimensional biases, gaps and uncertainties in global plant occurrence information. Ecol. Lett. 19, 992–1006 (2016).
doi: 10.1111/ele.12624 pubmed: 27250865
Higino, G. T. et al. Mismatch between IUCN range maps and species interactions data illustrated using the Serengeti food web. PeerJ 11, 1–19 (2023).
doi: 10.7717/peerj.14620
Oliver, R. Y., Meyer, C., Ranipeta, A., Winner, K. & Jetz, W. Global and national trends, gaps, and opportunities in documenting and monitoring species distributions. PLoS Biol. 19, 1–14 (2021).
doi: 10.1371/journal.pbio.3001336
García-Roselló, E., González-Dacosta, J. & Lobo, J. M. The biased distribution of existing information on biodiversity hinders its use in conservation, and we need an integrative approach to act urgently. Biol. Conserv. 283, 110118 (2023).
doi: 10.1016/j.biocon.2023.110118
Wyborn, C. & Evans, M. C. Conservation needs to break free from global priority mapping. Nat. Ecol. Evol. 5, 1322–1324 (2021).
doi: 10.1038/s41559-021-01540-x pubmed: 34426678
Richard-Bollans, A. et al. Machine learning enhances prediction of plants as potential sources of antimalarials. Front. Plant Sci. 14, 1–14 (2023).
doi: 10.3389/fpls.2023.1173328
Cai, L. et al. Global models and predictions of plant diversity based on advanced machine learning techniques. New Phytol. 237, 1432–1445 (2023).
doi: 10.1111/nph.18533 pubmed: 36375492
Feng, X. et al. A review of the heterogeneous landscape of biodiversity databases: Opportunities and challenges for a synthesized biodiversity knowledge base. Glob. Ecol. Biogeogr. 31, 1242–1260 (2022).
doi: 10.1111/geb.13497
Schellenberger Costa, D. et al. The big four of plant taxonomy—A comparison of global checklists of vascular plant names. New Phytol. 240, 1687–1702 (2023).
doi: 10.1111/nph.18961 pubmed: 37243532
Mesibov, R. An audit of some processing effects in aggregated occurrence records. Zookeys 2018, 129–146 (2018).
doi: 10.3897/zookeys.751.24791
Isaac, N. J. B., Mallet, J. & Mace, G. M. Taxonomic inflation: Its influence on macroecology and conservation. Trends Ecol. Evol. 19, 464–469 (2004).
doi: 10.1016/j.tree.2004.06.004 pubmed: 16701308
Govaerts, R., Nic Lughadha, E., Black, N., Turner, R. & Paton, A. The World Checklist of Vascular Plants, a continuously updated resource for exploring global plant diversity. Sci. Data 8, 1–10 (2021).
doi: 10.1038/s41597-021-00997-6
GBIF Secretariat. GBIF Backbone Taxonomy. (2022). Available at: https://doi.org/10.15468/39omei . (Accessed: 16th June 2023).
Govaerts, R., Sobral, M., Ashton, P. & Barrie, F. World Checklist of Myrtaceae (The University of Chicago Press, 2008).
Ribeiro, B. R. et al. bdc: A toolkit for standardizing, integrating and cleaning biodiversity data. Methods Ecol. Evol. 13, 1421–1428 (2022).
doi: 10.1111/2041-210X.13868
Nicolson, N., Paton, A., Phillips, S. & Tucker, A. Specimens as research objects: Reconciliation across distributed repositories to enable metadata propagation. in 2018 IEEE 14th International Conference on e-Science (e-Science) 125–135 (2018). doi: https://doi.org/10.1109/eScience.2018.00028
Grenié, M. et al. Harmonizing taxon names in biodiversity data: A review of tools, databases and best practices. Methods Ecol. Evol. 2023, 12–25 (2022).
de Moura, C. O., de Melo, P. H. A., de Amorim, E. T., Marcusso, G. M. & Carvalho-Silva, M. Peperomia (Piperaceae) endemic to Brazil: Distribution, richness, and conservation status. Flora Morphol. Distrib. Funct. Ecol. Plants 297, 152170 (2022).
doi: 10.1016/j.flora.2022.152170
Jardim Botânico do Rio de Janeiro. Flora e Funga do Brasil. Available at: http://floradobrasil.jbrj.gov.br/ . (Accessed: 29th October 2023)
POWO. Plants of the World Online, facilitated by the Royal Botanic Gardens, Kew. (2023). Available at: http://www.plantsoftheworldonline.org/ .

Auteurs

Pablo Hendrigo Alves de Melo (PHA)

IFMG - Instituto Federal de Educação, Ciência e Tecnologia de Minas Gerais, Campus Avançado Piumhi, Rua Severo Veloso, 1880 - Bairro Bela Vista, Piumhi, Minas Gerais, 37925-000, Brazil.

Nadia Bystriakova (N)

The Natural History Museum, Cromwell Road, London, SW7 5BD, UK. n.bystriakova@nhm.ac.uk.

Eve Lucas (E)

Royal Botanic Gardens, Kew, Richmond, London, TW9 3AE, UK.

Alexandre K Monro (AK)

Royal Botanic Gardens, Kew, Richmond, London, TW9 3AE, UK.

Classifications MeSH