Curating and extending data for language comparison in Concepticon and NoRaRe.
Cross-linguistic database
Language comparison
Lexical data
Test-driven data curation
Word properties
Journal
Open research Europe
ISSN: 2732-5121
Titre abrégé: Open Res Eur
Pays: Belgium
ID NLM: 9918230081006676
Informations de publication
Date de publication:
2022
2022
Historique:
accepted:
16
05
2023
medline:
30
8
2023
pubmed:
30
8
2023
entrez:
30
8
2023
Statut:
epublish
Résumé
Language comparison requires user-friendly tools that facilitate the standardization of linguistic data. We present two resources built on the basis of a standardized cross-linguistic format and show how the data is curated and extended. The first resource, the Concepticon, is a reference catalog for standardized concepts from linguistic research. While curating the Concepticon, we found that a variety of studies in distinct research fields collected information on word properties. However, until recently, no resource existed that contained these data to enable the comparison of the different word properties across languages. This gap was filled by the Database of Norms, Ratings, and Relations (NoRaRe), which is an extension of the Concepticon. Here, we present the major release of both resources - Concepticon Version 3.0 and NoRaRe Version 1.0 - which represents an important step in our data development. We show that extending and adapting the data curation workflow in Concepticon to NoRaRe is useful for the standardization of cross-linguistic datasets. In addition, combining datasets from different research fields enables studies grounded in language comparison. Concepticon and NoRaRe include lexical data for various languages, tools for test-driven data curation, and the possibility for data reuse. The first major release of NoRaRe is also accompanied by a new web application that allows convenient access to the data.
Identifiants
pubmed: 37645322
doi: 10.12688/openreseurope.15380.3
pmc: PMC10446050
doi:
Types de publication
Journal Article
Langues
eng
Pagination
141Informations de copyright
Copyright: © 2023 Tjuka A et al.
Déclaration de conflit d'intérêts
No competing interests were disclosed.
Références
Behav Res Methods. 2009 Nov;41(4):977-90
pubmed: 19897807
PLoS One. 2019 Aug 8;14(8):e0220611
pubmed: 31393919
Arch Clin Neuropsychol. 2007 Mar;22(3):297-307
pubmed: 17303376
Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244
Sci Data. 2018 Oct 16;5:180205
pubmed: 30325347
Proc Natl Acad Sci U S A. 2016 Nov 29;113(48):13666-13671
pubmed: 27849594
Proc Natl Acad Sci U S A. 2013 May 21;110(21):8471-6
pubmed: 23650390
PLoS One. 2010 Jun 02;5(6):e10729
pubmed: 20532192
Behav Res Methods. 2022 Apr;54(2):864-884
pubmed: 34357536
Behav Res Methods. 2020 Jun;52(3):1271-1291
pubmed: 31832879
Behav Res Methods. 2012 Dec;44(4):978-90
pubmed: 22581493
Behav Res Methods. 2014 Dec;46(4):1128-37
pubmed: 24366716
PLoS Comput Biol. 2017 Jun 22;13(6):e1005510
pubmed: 28640806
Top Cogn Sci. 2015 Oct;7(4):570-94
pubmed: 26466949
Proc Natl Acad Sci U S A. 2019 May 21;116(21):10317-10322
pubmed: 31061123
Nature. 2007 Oct 11;449(7163):717-20
pubmed: 17928860
Wiley Interdiscip Rev Cogn Sci. 2013 Nov;4(6):583-597
pubmed: 26304265