Slow improvement to the archiving quality of open datasets shared by researchers in ecology and evolution.
data sharing
fair data
metascience
open science
public data archiving
reproducibility
Journal
Proceedings. Biological sciences
ISSN: 1471-2954
Titre abrégé: Proc Biol Sci
Pays: England
ID NLM: 101245157
Informations de publication
Date de publication:
25 05 2022
25 05 2022
Historique:
entrez:
18
5
2022
pubmed:
19
5
2022
medline:
21
5
2022
Statut:
ppublish
Résumé
Many leading journals in ecology and evolution now mandate open data upon publication. Yet, there is very little oversight to ensure the completeness and reusability of archived datasets, and we currently have a poor understanding of the factors associated with high-quality data sharing. We assessed 362 open datasets linked to first- or senior-authored papers published by 100 principal investigators (PIs) in the fields of ecology and evolution over a period of 7 years to identify predictors of data completeness and reusability (data archiving quality). Datasets scored low on these metrics: 56.4% were complete and 45.9% were reusable. Data reusability, but not completeness, was slightly higher for more recently archived datasets and PIs with less seniority. Journal open data policy, PI gender and PI corresponding author status were unrelated to data archiving quality. However, PI identity explained a large proportion of the variance in data completeness (27.8%) and reusability (22.0%), indicating consistent inter-individual differences in data sharing practices by PIs across time and contexts. Several PIs consistently shared data of either high or low archiving quality, but most PIs were inconsistent in how well they shared. One explanation for the high intra-individual variation we observed is that PIs often conduct research through students and postdoctoral researchers, who may be responsible for the data collection, curation and archiving. Levels of data literacy vary among trainees and PIs may not regularly perform quality control over archived files. Our findings suggest that research data management training and culture within a PI's group are likely to be more important determinants of data archiving quality than other factors such as a journal's open data policy. Greater incentives and training for individual researchers at all career stages could improve data sharing practices and enhance data transparency and reusability.
Identifiants
pubmed: 35582791
doi: 10.1098/rspb.2021.2780
pmc: PMC9114975
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
20212780Références
PLoS One. 2020 Mar 11;15(3):e0229003
pubmed: 32160189
Biol Rev Camb Philos Soc. 2010 Nov;85(4):935-56
pubmed: 20569253
J Exp Biol. 2022 Mar 8;225(Suppl_1):
pubmed: 35258604
R Soc Open Sci. 2018 Aug 15;5(8):180448
pubmed: 30225032
Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244
Nature. 2014 Apr 3;508(7494):44
pubmed: 24695306
Integr Comp Biol. 2017 Aug 1;57(2):362-371
pubmed: 28859406
PLoS Biol. 2014 Jan 28;12(1):e1001779
pubmed: 24492920
Nature. 2020 Feb;578(7796):491
pubmed: 32099131
Evolution. 2011 Jan;65(1):1-2
pubmed: 21070223
FASEB J. 2013 Apr;27(4):1304-8
pubmed: 23288929
Gigascience. 2019 May 1;8(5):
pubmed: 30715291
J Evol Biol. 2010 Apr;23(4):659-60
pubmed: 20149022
PLoS Biol. 2015 Nov 10;13(11):e1002295
pubmed: 26556502
J Exp Biol. 2016 Dec 15;219(Pt 24):3832-3843
pubmed: 27852750
Proc Biol Sci. 2022 May 25;289(1975):20212780
pubmed: 35582791
J Anim Ecol. 2013 Jan;82(1):39-54
pubmed: 23171297
Proc Biol Sci. 2021 Mar 10;288(1946):20202830
pubmed: 33653143
PLoS One. 2020 Mar 25;15(3):e0230281
pubmed: 32210449
Nature. 2016 Oct 05;538(7623):41
pubmed: 27708293
PLoS One. 2011;6(6):e21101
pubmed: 21738610
Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2557-2560
pubmed: 29487213
Conserv Biol. 2022 Jun;36(3):e13835
pubmed: 34476839
PLoS One. 2018 May 2;13(5):e0194768
pubmed: 29719004
Behav Res Methods. 2021 Aug;53(4):1455-1468
pubmed: 33179123
PLoS Biol. 2020 Jul 28;18(7):e3000763
pubmed: 32722681
BMC Biol. 2021 Apr 9;19(1):68
pubmed: 33836762
Trends Ecol Evol. 2019 Feb;34(2):95-98
pubmed: 30573193
Sci Data. 2021 Jul 27;8(1):192
pubmed: 34315906
Ecol Evol. 2021 Oct 13;11(21):14344-14350
pubmed: 34765110
PLoS One. 2018 Jul 6;13(7):e0199789
pubmed: 29979709
Trends Ecol Evol. 2015 Oct;30(10):581-589
pubmed: 26411615
Trends Ecol Evol. 2005 Jul;20(7):362-3
pubmed: 16701396
PLoS One. 2015 Aug 26;10(8):e0134826
pubmed: 26308551
Sci Data. 2020 Mar 24;7(1):106
pubmed: 32210236