Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

Wei-Bang Jiang, Li-Ming Zhao, Bao-Liang Lu
2024-05-29
Abstract: The current electroencephalogram (EEG) based deep learning models are typically designed for specific datasets and applications in brain-computer interaction (BCI), limiting the scale of the models and thus diminishing their perceptual capabilities and generalizability. Recently, Large Language Models (LLMs) have achieved unprecedented success in text processing, prompting us to explore the capabilities of Large EEG Models (LEMs). We hope that LEMs can break through the limitations of different task types of EEG datasets, and obtain universal perceptual capabilities of EEG signals through unsupervised pre-training. Then the models can be fine-tuned for different downstream tasks. However, compared to text data, the volume of EEG datasets is generally small and the format varies widely. For example, there can be mismatched numbers of electrodes, unequal length data samples, varied task designs, and low signal-to-noise ratio. To overcome these challenges, we propose a unified foundation model for EEG called Large Brain Model (LaBraM). LaBraM enables cross-dataset learning by segmenting the EEG signals into EEG channel patches. Vector-quantized neural spectrum prediction is used to train a semantically rich neural tokenizer that encodes continuous raw EEG channel patches into compact neural codes. We then pre-train neural Transformers by predicting the original neural codes for the masked EEG channel patches. The LaBraMs were pre-trained on about 2,500 hours of various types of EEG signals from around 20 datasets and validated on multiple different types of downstream tasks. Experiments on abnormal detection, event type classification, emotion recognition, and gait prediction show that our LaBraM outperforms all compared SOTA methods in their respective fields. Our code is available at https://github.com/935963004/LaBraM.
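The channel-patch idea from the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The function names, the 200-sample patch length, and the recording shapes are assumptions chosen for illustration. The point is that recordings with different electrode counts and durations all reduce to sequences of fixed-length patches, each tagged with its channel and time index.

```python
import numpy as np


def segment_into_channel_patches(eeg: np.ndarray, patch_len: int) -> np.ndarray:
    """Split a (channels, samples) recording into (channels, n_patches, patch_len)."""
    n_channels, n_samples = eeg.shape
    n_patches = n_samples // patch_len  # an incomplete tail patch is dropped
    if n_patches == 0:
        raise ValueError("recording is shorter than one patch")
    trimmed = eeg[:, : n_patches * patch_len]
    return trimmed.reshape(n_channels, n_patches, patch_len)


def flatten_patches(patches: np.ndarray):
    """Flatten patches into a token sequence plus a (channel, time) index per token."""
    n_channels, n_patches, patch_len = patches.shape
    tokens = patches.reshape(n_channels * n_patches, patch_len)
    channel_idx = np.repeat(np.arange(n_channels), n_patches)
    time_idx = np.tile(np.arange(n_patches), n_channels)
    return tokens, channel_idx, time_idx


# Two hypothetical recordings with different montages and durations.
rng = np.random.default_rng(0)
rec_a = rng.standard_normal((19, 800))   # 19 electrodes, 800 samples
rec_b = rng.standard_normal((62, 1000))  # 62 electrodes, 1000 samples

PATCH_LEN = 200  # assumed patch length, for illustration only
tokens_a, ch_a, t_a = flatten_patches(segment_into_channel_patches(rec_a, PATCH_LEN))
tokens_b, ch_b, t_b = flatten_patches(segment_into_channel_patches(rec_b, PATCH_LEN))
print(tokens_a.shape, tokens_b.shape)
```

Both recordings end up as sequences of patches with the same width, so one model can consume either. The channel and time indices preserve the information needed for spatial and temporal position embeddings.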
Machine Learning
What problem does this paper attempt to address?
The paper addresses the limitations of current EEG-based deep learning models in brain-computer interface (BCI) applications. Existing models are typically designed for specific datasets and tasks, which limits their scale and, in turn, weakens their perceptual and generalization capabilities. To overcome these limitations, the paper proposes a unified foundation model, the Large Brain Model (LaBraM), which acquires general EEG perception capabilities through unsupervised pre-training on a large amount of unlabeled EEG data and is then fine-tuned for different downstream tasks. The main contributions of the paper are:

1. **Large-scale EEG pre-training**: About 2,500 hours of diverse EEG data from around 20 datasets were collected and used to pre-train a large-scale neural Transformer model, which the paper presents as the first use of such a large and diverse dataset for EEG pre-training.
2. **Compatibility with various EEG configurations**: LaBraM handles EEG signals with different numbers of channels and different time lengths, so a single pre-trained model can adapt to the configurations of various downstream datasets.
3. **Effective EEG representation learning**: The neural Transformer captures the temporal and spatial features of EEG signals with varying channels and lengths, making it suitable for a wide range of downstream tasks. The paper also defines a neural codebook that gives a compact, versatile, and meaningful representation of EEG signals.
4. **Comprehensive downstream experiments**: On four representative BCI downstream tasks (abnormal detection, event type classification, emotion recognition, and gait prediction), LaBraM outperforms all compared state-of-the-art methods. The paper also examines how performance changes with model size as the volume of pre-training data is extended.
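The codebook and masked-prediction ideas can be sketched with a toy example. It is a simplified illustration under stated assumptions, not the paper's tokenizer. In the paper the patch embeddings come from a learned encoder trained with vector-quantized spectrum prediction, whereas here the codebook, the embeddings, and the 0.5 mask ratio are hand-picked hypothetical values. The sketch shows only the two mechanical steps: mapping each patch embedding to its nearest code, and selecting the masked positions whose codes a Transformer would be trained to predict.

```python
import numpy as np


def quantize(embeddings: np.ndarray, codebook: np.ndarray) -> np.ndarray:
    """Return, for each embedding, the index of the nearest codebook entry (L2)."""
    # Broadcast (n, 1, d) against (1, k, d) to get an (n, k) distance matrix.
    diffs = embeddings[:, None, :] - codebook[None, :, :]
    dists = np.sum(diffs ** 2, axis=-1)
    return np.argmin(dists, axis=1)


def random_mask(n_tokens: int, mask_ratio: float, rng: np.random.Generator) -> np.ndarray:
    """Return a boolean mask with round(n_tokens * mask_ratio) positions set to True."""
    n_masked = int(round(n_tokens * mask_ratio))
    mask = np.zeros(n_tokens, dtype=bool)
    mask[rng.choice(n_tokens, size=n_masked, replace=False)] = True
    return mask


# Toy codebook with four 2-D codes (hypothetical values).
codebook = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])

# Toy patch embeddings standing in for the output of a learned encoder.
embeddings = np.array(
    [[0.1, 0.9], [0.9, 0.2], [0.05, -0.1], [1.2, 0.8], [0.8, 0.1], [0.2, 1.1]]
)

codes = quantize(embeddings, codebook)  # discrete neural codes, one per patch
rng = np.random.default_rng(0)
mask = random_mask(len(codes), mask_ratio=0.5, rng=rng)  # assumed mask ratio
targets = codes[mask]  # codes the model would have to predict at masked positions
print(codes.tolist(), int(mask.sum()))
```

Predicting discrete codes turns pre-training into a classification problem over the codebook, rather than a regression onto noisy raw signals.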