A revised digital edition of Wurm & Hattori's Language Atlas of the Pacific Area.
Journal
Scientific data
ISSN: 2052-4463
Titre abrégé: Sci Data
Pays: England
ID NLM: 101640192
Informations de publication
Date de publication:
29 Aug 2024
29 Aug 2024
Historique:
received:
02
07
2024
accepted:
21
08
2024
medline:
1
9
2024
pubmed:
1
9
2024
entrez:
29
8
2024
Statut:
epublish
Résumé
Wurm & Hattori's Language Atlas of the Pacific Area describes the geographic speaker areas of the languages and language varieties spoken in the Pacific. Thanks to the efforts of the Electronic Cultural Atlas Initiative, this monumental piece of work has been available in digital form for over 15 years. But lacking proper identification of language varieties, this digitized data was largely unusable for today's research methods. We turned ECAI's digitized artefacts of the Language Atlas into an open, reusable geo-referenced dataset of speaker area polygons for a quarter of the world's languages. This allows for much more refined analysis methods to, for example, analyse language contact in the area of the world with the highest linguistic diversity. We also describe a number of tool applications and quality checks which may be useful for methodological development in similar digitization efforts.
Identifiants
pubmed: 39209857
doi: 10.1038/s41597-024-03816-w
pii: 10.1038/s41597-024-03816-w
pmc: PMC11362564
doi:
Types de publication
Dataset
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
949Informations de copyright
© 2024. The Author(s).
Références
Proc Natl Acad Sci U S A. 2015 Feb 3;112(5):1322-7
pubmed: 25605876
Sci Data. 2018 Oct 16;5:180205
pubmed: 30325347
PLoS One. 2020 Oct 7;15(10):e0239359
pubmed: 33027273
PLoS One. 2022 Jun 8;17(6):e0269648
pubmed: 35675367