Neuro-GPT: Towards A Foundation Model for EEG

Wenhui Cui,Woojae Jeong,Philipp Thölke,Takfarinas Medani,Karim Jerbi,Anand A. Joshi,Richard M. Leahy

2024-03-02

Abstract:To handle the scarcity and heterogeneity of electroencephalography (EEG) data for Brain-Computer Interface (BCI) tasks, and to harness the power of large publicly available data sets, we propose Neuro-GPT, a foundation model consisting of an EEG encoder and a GPT model. The foundation model is pre-trained on a large-scale data set using a self-supervised task that learns how to reconstruct masked EEG segments. We then fine-tune the model on a Motor Imagery Classification task to validate its performance in a low-data regime (9 subjects). Our experiments demonstrate that applying a foundation model can significantly improve classification performance compared to a model trained from scratch, which provides evidence for the generalizability of the foundation model and its ability to address challenges of data scarcity and heterogeneity in EEG. The code is publicly available at <a class="link-external link-http" href="http://github.com/wenhui0206/NeuroGPT" rel="external noopener nofollow">this http URL</a>.

Machine Learning,Signal Processing

What problem does this paper attempt to address?

The main problems that this paper attempts to solve are the scarcity and heterogeneity of electrophysiological signal (EEG) data in brain - computer interface (BCI) tasks. Specifically: 1. **Data Scarcity**: The scale of EEG data is limited, especially in specific tasks, which makes it difficult for deep - learning models to converge and generalize to unseen test data. 2. **Data Heterogeneity**: Differences between individuals lead to heterogeneity in feature representation, thereby increasing the difficulty of cross - individual generalization. 3. **High Dimensionality and Low Signal - to - Noise Ratio**: EEG data is characterized by high dimensionality and low signal - to - noise ratio, which further hinders the effective training of models. To solve these problems, the authors propose Neuro - GPT, a foundation model, which consists of an EEG encoder and a GPT model. The model is trained and applied through the following steps: - **Pre - training Stage**: The model is pre - trained on large - scale public datasets (such as the TUH EEG dataset) using self - supervised tasks to learn how to reconstruct masked EEG segments. - **Fine - tuning Stage**: The pre - trained model is fine - tuned on downstream tasks (such as motor imagery classification tasks) to verify its performance in the case of small amounts of data. Experimental results show that, compared with models trained from scratch, using pre - trained foundation models can significantly improve classification performance, demonstrating the effectiveness and generalization ability of this model in processing EEG data. Formula part: - The objective function of self - supervised pre - training is defined as the causal reconstruction loss: \[ L=\frac{1}{N - 1}\sum_{i = 2}^{N}\|\hat{Y}_i - H(D_i)\|_2^2 \] where \(\hat{Y}_i = G[M|H(D_{i - 1}), H(D_{i - 2}),\dots, H(D_1)]\), and \(G\) represents the GPT model. Through this method, Neuro - GPT can effectively capture the spatio - temporal features in EEG data and show better performance in downstream tasks.

Neuro-GPT: Towards A Foundation Model for EEG

EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training

Graph-Enhanced EEG Foundation Model

Foundational GPT Model for MEG

EEG-GPT: Exploring Capabilities of Large Language Models for EEG Classification and Interpretation

BrainFounder: Towards Brain Foundation Models for Neuroimage Analysis

EEGGAN-Net: enhancing EEG signal classification through data augmentation

FoME: A Foundation Model for EEG using Adaptive Temporal-Lateral Attention Scaling

Meta Transfer of Self-Supervised Knowledge: Foundation Model in Action for Post-Traumatic Epilepsy Prediction

Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

GET: A Generative EEG Transformer for Continuous Context-Based Neural Signals

EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

NeuGPT: Unified multi-modal Neural GPT

ERP-WGAN: A Data Augmentation Method for EEG Single-trial Detection

EEG Decoding for Datasets with Heterogenous Electrode Configurations using Transfer Learning Graph Neural Networks

PTGB: Pre-Train Graph Neural Networks for Brain Network Analysis

EEGNet: a compact convolutional neural network for EEG-based brain–computer interfaces

EEGNet: A Compact Convolutional Network for EEG-based Brain-Computer Interfaces

GNN4EEG: A Benchmark and Toolkit for Electroencephalography Classification with Graph Neural Network

A Data Augmentation Method for Motor Imagery EEG Signals Based on DCGAN-GP Network

Toward reliable signals decoding for electroencephalogram: A benchmark study to EEGNeX