Deep Representation Learning for Open Vocabulary Electroencephalography-to-Text Decoding

Hamza Amrani, Daniela Micucci, Paolo Napoletano

2023-11-15

Abstract: Previous research has demonstrated the potential of using pre-trained language models for decoding open vocabulary Electroencephalography (EEG) signals captured through a non-invasive Brain-Computer Interface (BCI). However, the impact of embedding EEG signals in the context of language models and the effect of subjectivity remain unexplored, leading to uncertainty about the best approach to enhance decoding performance. Additionally, current evaluation metrics used to assess decoding effectiveness are predominantly syntactic and do not provide insights into the comprehensibility of the decoded output for human understanding. We present an end-to-end deep learning framework for non-invasive brain recordings that brings modern representational learning approaches to neuroscience. Our proposal introduces the following innovations: 1) an end-to-end deep learning architecture for open vocabulary EEG decoding, incorporating a subject-dependent representation learning module for raw EEG encoding, a BART language model, and a GPT-4 sentence refinement module; 2) a more comprehensive sentence-level evaluation metric based on the BERTScore; 3) an ablation study that analyses the contributions of each module within our proposal, providing valuable insights for future research. We evaluate our approach on two publicly available datasets, ZuCo v1.0 and v2.0, comprising EEG recordings of 30 subjects engaged in natural reading tasks. Our model achieves a BLEU-1 score of 42.75%, a ROUGE-1-F of 33.28%, and a BERTScore-F of 53.86%, outperforming the previous state-of-the-art methods by 3.38%, 8.43%, and 6.31%, respectively.

Signal Processing, Computation and Language, Human-Computer Interaction, Machine Learning

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve

This paper aims to address several key issues in decoding non-invasive electroencephalography (EEG) signals into open vocabulary text:

1. **Improving Decoding Performance**:
   - Prior work has shown that pre-trained language models can decode open vocabulary EEG signals, but how EEG signals should be embedded for a language model, and how differences between subjects affect decoding, remain unexplored.
   - Existing evaluation metrics are predominantly syntactic and do not show how comprehensible the decoded output is to a human reader.
2. **Introducing Innovative Methods**:
   - Proposes an end-to-end deep learning framework for open vocabulary decoding of non-invasive EEG signals.
   - Combines a subject-dependent representation learning module for raw EEG encoding with a BART language model and a GPT-4 sentence refinement module.
   - Uses a sentence-level evaluation metric based on BERTScore to better reflect semantic similarity.
3. **Analyzing Module Contributions**:
   - Conducts an ablation study to analyze the contribution of each module to overall performance, providing insights for future research.
4. **Validating Method Effectiveness**:
   - Evaluates the approach on two public datasets, ZuCo v1.0 and v2.0, which contain EEG recordings of 30 subjects during natural reading tasks.
   - The model reaches a BLEU-1 of 42.75%, a ROUGE-1-F of 33.28%, and a BERTScore-F of 53.86%, outperforming previous state-of-the-art methods by 3.38%, 8.43%, and 6.31%, respectively.

### Summary

The main goal of the paper is to improve the decoding of non-invasive EEG signals into open vocabulary text by introducing a new end-to-end deep learning framework that combines a subject-dependent representation learning module with pre-trained language models. The paper also evaluates the approach with a more comprehensive, semantically oriented metric, and its ablation study examines the contribution of each module to overall performance, providing a reference point for future research.
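The contrast drawn above between syntactic metrics and a BERTScore-based metric can be illustrated with a small sketch. It is not the paper's evaluation code: whitespace tokenization, the function names `bleu1`, `rouge1_f`, and `greedy_match_f`, and the 4-dimensional `TOY_EMBEDDINGS` table are all assumptions made for illustration. The real BERTScore uses contextual embeddings from a pre-trained model, which the hand-written toy vectors here only imitate.

```python
import math
from collections import Counter


def bleu1(candidate: str, reference: str) -> float:
    """BLEU-1: clipped unigram precision multiplied by the brevity penalty."""
    cand, ref = candidate.lower().split(), reference.lower().split()
    if not cand:
        return 0.0
    overlap = sum((Counter(cand) & Counter(ref)).values())
    precision = overlap / len(cand)
    brevity = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    return brevity * precision


def rouge1_f(candidate: str, reference: str) -> float:
    """ROUGE-1 F-measure: harmonic mean of unigram precision and recall."""
    cand, ref = candidate.lower().split(), reference.lower().split()
    overlap = sum((Counter(cand) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    p, r = overlap / len(cand), overlap / len(ref)
    return 2 * p * r / (p + r)


def cosine(u, v) -> float:
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


def greedy_match_f(cand_vecs, ref_vecs) -> float:
    """BERTScore-style F: each token is matched to its most similar counterpart."""
    p = sum(max(cosine(c, r) for r in ref_vecs) for c in cand_vecs) / len(cand_vecs)
    r = sum(max(cosine(r_, c) for c in cand_vecs) for r_ in ref_vecs) / len(ref_vecs)
    return 2 * p * r / (p + r)


# Hypothetical toy vectors (assumption): synonyms point in nearly the same direction.
TOY_EMBEDDINGS = {
    "the": (1.0, 0.0, 0.0, 0.0),
    "was": (0.0, 1.0, 0.0, 0.0),
    "film": (0.0, 0.0, 1.0, 0.0),
    "movie": (0.0, 0.0, 0.9, 0.1),
    "great": (0.0, 0.0, 0.0, 1.0),
    "good": (0.0, 0.0, 0.1, 0.9),
}

reference = "the film was great"
decoded = "the movie was good"

print(bleu1(decoded, reference))     # 0.5
print(rouge1_f(decoded, reference))  # 0.5
print(greedy_match_f(
    [TOY_EMBEDDINGS[t] for t in decoded.split()],
    [TOY_EMBEDDINGS[t] for t in reference.split()],
))
```

In this toy case the decoded sentence is a paraphrase of the reference, so only half of its words match exactly and both unigram metrics report 0.5, while the embedding-based score stays close to 1. That gap is the kind of semantic agreement a BERTScore-based metric is meant to capture.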

Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification

Towards Linguistic Neural Representation Learning and Sentence Retrieval from Electroencephalogram Recordings

EEG2TEXT: Open Vocabulary EEG-to-Text Decoding with EEG Pre-Training and Multi-View Transformer

Decoding speech from non-invasive brain recordings

Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods

Enhancing EEG-to-Text Decoding through Transferable Representations from Pre-trained Contrastive EEG-Text Masked Autoencoder

Multimodal Speech Recognition Using EEG and Audio Signals: A Novel Approach for Enhancing ASR Systems

Towards Open-World EEG Decoding via Deep Learning

Sparse Bayesian Learning for End-to-End EEG Decoding

Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

Continuous and discrete decoding of overt speech with electroencephalography

Decoding speech perception from non-invasive brain recordings

Continuous and discrete decoding of overt speech with scalp electroencephalography (EEG)

Brain2Char: A Deep Architecture for Decoding Text from Brain Recordings

Deep Recurrent Encoder: A scalable end-to-end network to model brain signals

Toward Open-World Electroencephalogram Decoding Via Deep Learning: A Comprehensive Survey

Identification of perceived sentences using deep neural networks in EEG

Towards Unified Neural Decoding of Perceived, Spoken and Imagined Speech from EEG Signals

Geometric neural network based on phase space for BCI-EEG decoding

NeuSpeech: Decode Neural signal as Speech