Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

Wei-Bang Jiang,Li-Ming Zhao,Bao-Liang Lu

2024-05-29

Abstract:The current electroencephalogram (EEG) based deep learning models are typically designed for specific datasets and applications in brain-computer interaction (BCI), limiting the scale of the models and thus diminishing their perceptual capabilities and generalizability. Recently, Large Language Models (LLMs) have achieved unprecedented success in text processing, prompting us to explore the capabilities of Large EEG Models (LEMs). We hope that LEMs can break through the limitations of different task types of EEG datasets, and obtain universal perceptual capabilities of EEG signals through unsupervised pre-training. Then the models can be fine-tuned for different downstream tasks. However, compared to text data, the volume of EEG datasets is generally small and the format varies widely. For example, there can be mismatched numbers of electrodes, unequal length data samples, varied task designs, and low signal-to-noise ratio. To overcome these challenges, we propose a unified foundation model for EEG called Large Brain Model (LaBraM). LaBraM enables cross-dataset learning by segmenting the EEG signals into EEG channel patches. Vector-quantized neural spectrum prediction is used to train a semantically rich neural tokenizer that encodes continuous raw EEG channel patches into compact neural codes. We then pre-train neural Transformers by predicting the original neural codes for the masked EEG channel patches. The LaBraMs were pre-trained on about 2,500 hours of various types of EEG signals from around 20 datasets and validated on multiple different types of downstream tasks. Experiments on abnormal detection, event type classification, emotion recognition, and gait prediction show that our LaBraM outperforms all compared SOTA methods in their respective fields. Our code is available at <a class="link-external link-https" href="https://github.com/935963004/LaBraM" rel="external noopener nofollow">this https URL</a>.

Machine Learning

What problem does this paper attempt to address?

The paper aims to address the limitations of current EEG-based deep learning models in Brain-Computer Interface (BCI) applications. Specifically, existing EEG deep learning models are typically designed for specific datasets and tasks, which limits the scalability of the models, thereby weakening their perceptual and generalization capabilities. To overcome these limitations, the paper proposes a unified foundational model named "Large Brain Model" (LaBraM), which acquires general EEG signal perception capabilities through unsupervised pre-training on a large amount of unlabeled EEG data, and then fine-tunes for different downstream tasks. The main contributions of the paper include: 1. **Large-scale EEG pre-training**: A large-scale neural Transformer model was collected and pre-trained on over 2500 hours of diverse EEG data, marking the first use of such a large and diverse dataset for EEG pre-training. 2. **Compatibility with various EEG configurations**: LaBraM can handle EEG signals with different numbers of channels and time lengths, allowing a single pre-trained model to adapt to different configurations of various downstream datasets. 3. **Effective EEG representation learning**: The neural Transformer effectively captures the temporal and spatial features of EEG signals with different channels and lengths, making it suitable for a wide range of downstream tasks, and defines a compact, versatile, and meaningful EEG signal representation codebook. 4. **Comprehensive downstream experiments**: In four representative BCI downstream tasks, LaBraM significantly outperforms all existing best methods and explores the relationship between model size and performance by extending the pre-training data volume.

Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Exploiting Asymmetric EEG Signals with EFD in Deep Learning Domain for Robust BCI

Hidden States in LLMs Improve EEG Representation Learning and Visual Decoding

BENDR: Using Transformers and a Contrastive Self-Supervised Learning Task to Learn From Massive Amounts of EEG Data

Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

LLM4Brain: Training a Large Language Model for Brain Video Understanding

An Extended Computer-Aided Diagnosis System for Multidomain EEG Classification

MBrain: A Multi-channel Self-Supervised Learning Framework for Brain Signals

EEG-GPT: Exploring Capabilities of Large Language Models for EEG Classification and Interpretation

Deep Representation Learning for Open Vocabulary Electroencephalography-to-Text Decoding

EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

Large Transformers are Better EEG Learners

EEGNet: a compact convolutional neural network for EEG-based brain–computer interfaces

LMDA-Net:A lightweight multi-dimensional attention network for general EEG-based brain-computer interfaces and interpretability

Sparse Bayesian Learning for End-to-End EEG Decoding

A Method for the Spatial Interpolation of EEG Signals Based on the Bidirectional Long Short-Term Memory Network

EEGNet: A Compact Convolutional Network for EEG-based Brain-Computer Interfaces

BrainLM: A foundation model for brain activity recordings

Aggregating Intrinsic Information to Enhance BCI Performance through Federated Learning

LMDA-Net:A lightweight multi-dimensional attention network for general EEG-based brain-computer interface paradigms and interpretability