Text and image generation from intracranial electroencephalography using an embedding space for text and images.

image generation intracranial EEG neural decoding text generation

Journal

Journal of neural engineering
ISSN: 1741-2552
Titre abrégé: J Neural Eng
Pays: England
ID NLM: 101217933

Informations de publication

Date de publication:
22 Apr 2024
Historique:
medline: 23 4 2024
pubmed: 23 4 2024
entrez: 22 4 2024
Statut: aheadofprint

Résumé


Invasive brain-computer interfaces (BCIs) are promising communication devices for severely paralyzed patients. Recent advances in intracranial electroencephalography (iEEG) coupled with natural language processing have enhanced communication speed and accuracy. It should be noted that such a speech BCI uses signals from the motor cortex. However, BCIs based on motor cortical activities may experience signal deterioration in users with motor cortical degenerative diseases such as amyotrophic lateral sclerosis (ALS). An alternative approach to using iEEG of the motor cortex is necessary to support patients with such conditions.
Approach:
In this study, a multimodal embedding of text and images was used to decode visual semantic information from iEEG signals of the visual cortex to generate text and images. We used contrastive language-image pretraining (CLIP) embedding to represent images presented to 17 patients implanted with electrodes in the occipital and temporal cortexes. A CLIP image vector was inferred from the high-γ power of the iEEG signals recorded while viewing the images.
Main results:
Text was generated by CLIPCAP from the inferred CLIP vector with better-than-chance accuracy. Then, an image was created from the generated text using StableDiffusion with significant accuracy.
Significance:
The text and images generated from iEEG through the CLIP embedding vector can be used for improved communication.&#xD.

Identifiants

pubmed: 38648781
doi: 10.1088/1741-2552/ad417a
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

Creative Commons Attribution license.

Auteurs

Yuya Ikegawa (Y)

Institute for advanced co-creation studies, Osaka University, 2-2 Yamadaoka, Suita, Suita, Osaka, 5670871, JAPAN.

Ryohei Fukuma (R)

Institute for Advanced co-creation studies, Osaka University, 2-2 Yamadaoka, Suita, Suita, Osaka, 5670871, JAPAN.

Hidenori Sugano (H)

Department of Neurosurgery, Juntendo University, Bunkyo-ku, Tokyo, Tokyo, Tokyo, 113-8421, JAPAN.

Satoru Oshino (S)

Department of Neurosurgery, Osaka University Faculty of Medicine Graduate School of Medicine, 2-2 Yamadaoka, suita, Osaka, Japan, Osaka University Graduate School of Medicine, Dept of Neurosurgery, Osaka, Osaka, 5670871, JAPAN.

Naoki Tani (N)

Department of Neurosurgery, Osaka University Faculty of Medicine Graduate School of Medicine, 2-2 Yamadaoka, suita, Osaka, Japan, Osaka University Graduate School of Medicine, Dept of Neurosurgery, Osaka, Osaka, 5670871, JAPAN.

Kentaro Tamura (K)

Department of Neurosurgery, Nara Medical University, 840 Shijyocho, Kashihara, Kashihara, Nara, 634-8523, JAPAN.

Yasushi Iimura (Y)

Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, 113-8421, JAPAN.

Hiroharu Suzuki (H)

Department of Neurosurgery, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, 113-8421, JAPAN.

Shota Yamamoto (S)

Department of Neurosurgery, Osaka University, 2-2 Yamadaoka, Suita, Suita, Osaka, 5670871, JAPAN.

Yuya Fujita (Y)

Institute for Advanced co-creation studies, Osaka University, 2-2 Yamadaoka Suita Osaka Japan, Suita, 565-0871, JAPAN.

Shinji Nishimoto (S)

Graduate School of Frontier Biosciences, Osaka University, 1-3 Yamadaoka, Suita, Suita, Osaka, 565-0871, JAPAN.

Haruhiko Kishima (H)

Department neurosurgery, Osaka University, 2-2 Yamadaoka, Suita, Suita, Osaka, 5650871, JAPAN.

Takufumi Yanagisawa (T)

Institute for Advanced co-creation studies, Osaka University, 2-2 Yamadaoka Suita Osaka Japan, Suita, 565-0871, JAPAN.

Classifications MeSH