Identification of perceived sentences using deep neural networks in EEG.

EEG brain computer interfaces deep neural networks sentence identification speech decoding

Journal

Journal of neural engineering
ISSN: 1741-2552
Titre abrégé: J Neural Eng
Pays: England
ID NLM: 101217933

Informations de publication

Date de publication:
18 Oct 2024
Historique:
medline: 19 10 2024
pubmed: 19 10 2024
entrez: 18 10 2024
Statut: aheadofprint

Résumé

Decoding speech from brain activity can enable communication for individuals with speech disorders. Deep neural networks have shown great potential for speech decoding applications. However, the limited availability of large datasets containing neural recordings from speech-impaired subjects poses a challenge. Leveraging data from healthy participants can mitigate this limitation and expedite the development of speech neuroprostheses while minimizing the need for patient-specific training data. 
Approach. In this study, we collected a substantial dataset consisting of recordings from 56 healthy participants using 64 EEG channels. Multiple neural networks were trained to classify perceived sentences in the Spanish language using subject-independent, mixed-subjects, and fine-tuning approaches. The dataset has been made publicly available to foster further research in this area.
Main results. Our results demonstrate a remarkable level of accuracy in distinguishing sentence identity across 30 classes, showcasing the feasibility of training Deep Neural Networks (DNNs) to decode sentence identity from perceived speech using EEG. Notably, the subject-independent approach rendered accuracy comparable to the mixed-subjects approach, although with higher variability among subjects. Additionally, our fine-tuning approach yielded even higher accuracy, indicating an improved capability to adapt to individual subject characteristics, which enhances performance. This suggests that DNNs have effectively learned to decode universal features of brain activity across individuals while also being adaptable to specific participant data. Furthermore, our analyses indicate that EEGNet and DeepConvNet exhibit comparable performance, outperforming ShallowConvNet for sentence identity decoding. Finally, our Grad-CAM visualization analysis identifies key areas influencing the network's predictions, offering valuable insights into the neural processes underlying language perception and comprehension.
Significance. These findings advance our understanding of EEG-based speech perception decoding and hold promise for the development of speech neuroprostheses, particularly in scenarios where subjects cannot provide their own training data.

Identifiants

pubmed: 39423829
doi: 10.1088/1741-2552/ad88a3
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© 2024 IOP Publishing Ltd. All rights, including for text and data mining, AI training, and similar technologies, are reserved.

Auteurs

Carlos Valle (C)

Pontificia Universidad Catolica de Chile, Avda.Vicuña Mackenna 4860, Macul, IIBM, Pontificia Universidad Catolica de Chile, Santiago, Metropolitana, 7820436, CHILE.

Carolina Mendez-Orellana (C)

Pontificia Universidad Catolica de Chile, Faculty of Medicine, Santiago, 7820436, CHILE.

Christian Herff (C)

Maastricht University, Minderbroedersberg 4-6, Maastricht, Limburg, 6211 LK, NETHERLANDS.

Maria Rodriguez-Fernandez (M)

Institute for Biological and Medical Engineering, Pontificia Universidad Catolica de Chile, Avda. Vicuña Mackenna 4860, Macul, Santiago, 7820436, CHILE.

Classifications MeSH