Text and image generation from intracranial electroencephalography using an embedding space for text and images.
image generation
intracranial EEG
neural decoding
text generation
Journal
Journal of neural engineering
ISSN: 1741-2552
Titre abrégé: J Neural Eng
Pays: England
ID NLM: 101217933
Informations de publication
Date de publication:
22 Apr 2024
22 Apr 2024
Historique:
medline:
23
4
2024
pubmed:
23
4
2024
entrez:
22
4
2024
Statut:
aheadofprint
Résumé

Invasive brain-computer interfaces (BCIs) are promising communication devices for severely paralyzed patients. Recent advances in intracranial electroencephalography (iEEG) coupled with natural language processing have enhanced communication speed and accuracy. It should be noted that such a speech BCI uses signals from the motor cortex. However, BCIs based on motor cortical activities may experience signal deterioration in users with motor cortical degenerative diseases such as amyotrophic lateral sclerosis (ALS). An alternative approach to using iEEG of the motor cortex is necessary to support patients with such conditions.
Approach:
In this study, a multimodal embedding of text and images was used to decode visual semantic information from iEEG signals of the visual cortex to generate text and images. We used contrastive language-image pretraining (CLIP) embedding to represent images presented to 17 patients implanted with electrodes in the occipital and temporal cortexes. A CLIP image vector was inferred from the high-γ power of the iEEG signals recorded while viewing the images.
Main results:
Text was generated by CLIPCAP from the inferred CLIP vector with better-than-chance accuracy. Then, an image was created from the generated text using StableDiffusion with significant accuracy.
Significance:
The text and images generated from iEEG through the CLIP embedding vector can be used for improved communication.
.
Identifiants
pubmed: 38648781
doi: 10.1088/1741-2552/ad417a
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
Creative Commons Attribution license.