A studyforrest extension, an annotation of spoken language in the German dubbed movie "Forrest Gump" and its audio-description.
annotation
fMRI
language
narrative
naturalistic stimulus
speech
studyforrest
Journal
F1000Research
ISSN: 2046-1402
Titre abrégé: F1000Res
Pays: England
ID NLM: 101594320
Informations de publication
Date de publication:
2021
2021
Historique:
accepted:
12
01
2021
entrez:
18
3
2021
pubmed:
19
3
2021
medline:
3
6
2021
Statut:
epublish
Résumé
Here we present an annotation of speech in the audio-visual movie "Forrest Gump" and its audio-description for a visually impaired audience, as an addition to a large public functional brain imaging dataset ( studyforrest.org). The annotation provides information about the exact timing of each of the more than 2500 spoken sentences, 16,000 words (including 202 non-speech vocalizations), 66,000 phonemes, and their corresponding speaker. Additionally, for every word, we provide lemmatization, a simple part-of-speech-tagging (15 grammatical categories), a detailed part-of-speech tagging (43 grammatical categories), syntactic dependencies, and a semantic analysis based on word embedding which represents each word in a 300-dimensional semantic space. To validate the dataset's quality, we build a model of hemodynamic brain activity based on information drawn from the annotation. Results suggest that the annotation's content and quality enable independent researchers to create models of brain activity correlating with a variety of linguistic aspects under conditions of near-real-life complexity.
Identifiants
pubmed: 33732435
doi: 10.12688/f1000research.27621.1
pmc: PMC7921887
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
54Informations de copyright
Copyright: © 2021 Häusler CO and Hanke M.
Déclaration de conflit d'intérêts
No competing interests were disclosed.
Références
Neuroimage. 2020 Aug 15;217:116860
pubmed: 32376301
Hum Brain Mapp. 2004 Feb;21(2):75-85
pubmed: 14755595
F1000Res. 2016 Sep 8;5:2273
pubmed: 27781092
Neuroimage. 2015 Apr 15;110:136-48
pubmed: 25662868
Science. 2014 Feb 28;343(6174):1006-10
pubmed: 24482117
Nat Commun. 2019 Dec 5;10(1):5568
pubmed: 31804504
Neuroimage. 2004;23 Suppl 1:S208-19
pubmed: 15501092
Lang Cogn Neurosci. 2018 Jul 22;35(5):573-582
pubmed: 32656294
Neuroimage. 2012 Aug 15;62(2):816-47
pubmed: 22584224
Proc Natl Acad Sci U S A. 2014 Oct 28;111(43):E4687-96
pubmed: 25267658
Neuroimage. 2005 May 1;25(4):1325-35
pubmed: 15850749
Science. 2004 Mar 12;303(5664):1634-40
pubmed: 15016991
Cereb Cortex. 2008 Jan;18(1):230-42
pubmed: 17504783
Trends Cogn Sci. 2019 Aug;23(8):699-714
pubmed: 31257145
J Neurosci. 2011 Feb 23;31(8):2906-15
pubmed: 21414912
Neuroimage. 2008 Jun;41(2):286-301
pubmed: 18407525
Hear Res. 2014 Jan;307:42-52
pubmed: 23916753
PLoS One. 2012;7(4):e35215
pubmed: 22496909
Sci Data. 2016 Oct 25;3:160092
pubmed: 27779621
Neuroimage. 2004 Apr;21(4):1732-47
pubmed: 15050594
Sci Data. 2014 May 27;1:140003
pubmed: 25977761
Neuroimage. 2020 Aug 1;216:116128
pubmed: 31473349
J Neurosci. 2012 Oct 31;32(44):15277-83
pubmed: 23115166
Neuroimage. 2007 Jul 1;36(3):511-21
pubmed: 17499520
F1000Res. 2015 Apr 13;4:92
pubmed: 25977755
Neuroimage. 2006 Jul 1;31(3):968-80
pubmed: 16530430
Sci Data. 2020 Oct 13;7(1):347
pubmed: 33051448
Sci Data. 2016 Jun 21;3:160044
pubmed: 27326542
Neuroimage. 2001 Dec;14(6):1370-86
pubmed: 11707093
J Neurosci. 2015 Jan 14;35(2):634-42
pubmed: 25589757
Physiol Rev. 2011 Oct;91(4):1357-92
pubmed: 22013214
Neuroimage. 2003 Oct;20(2):1052-63
pubmed: 14568475
Neuroimage. 2004 May;22(1):419-33
pubmed: 15110035
Nat Rev Neurosci. 2007 May;8(5):393-402
pubmed: 17431404