Neuro-GPT: Towards A Foundation Model for EEG

Wenhui Cui,Woojae Jeong,Philipp Thölke,Takfarinas Medani,Karim Jerbi,Anand A. Joshi,Richard M. Leahy
2024-03-02
Abstract:To handle the scarcity and heterogeneity of electroencephalography (EEG) data for Brain-Computer Interface (BCI) tasks, and to harness the power of large publicly available data sets, we propose Neuro-GPT, a foundation model consisting of an EEG encoder and a GPT model. The foundation model is pre-trained on a large-scale data set using a self-supervised task that learns how to reconstruct masked EEG segments. We then fine-tune the model on a Motor Imagery Classification task to validate its performance in a low-data regime (9 subjects). Our experiments demonstrate that applying a foundation model can significantly improve classification performance compared to a model trained from scratch, which provides evidence for the generalizability of the foundation model and its ability to address challenges of data scarcity and heterogeneity in EEG. The code is publicly available at <a class="link-external link-http" href="http://github.com/wenhui0206/NeuroGPT" rel="external noopener nofollow">this http URL</a>.
Machine Learning,Signal Processing
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are the scarcity and heterogeneity of electrophysiological signal (EEG) data in brain - computer interface (BCI) tasks. Specifically: 1. **Data Scarcity**: The scale of EEG data is limited, especially in specific tasks, which makes it difficult for deep - learning models to converge and generalize to unseen test data. 2. **Data Heterogeneity**: Differences between individuals lead to heterogeneity in feature representation, thereby increasing the difficulty of cross - individual generalization. 3. **High Dimensionality and Low Signal - to - Noise Ratio**: EEG data is characterized by high dimensionality and low signal - to - noise ratio, which further hinders the effective training of models. To solve these problems, the authors propose Neuro - GPT, a foundation model, which consists of an EEG encoder and a GPT model. The model is trained and applied through the following steps: - **Pre - training Stage**: The model is pre - trained on large - scale public datasets (such as the TUH EEG dataset) using self - supervised tasks to learn how to reconstruct masked EEG segments. - **Fine - tuning Stage**: The pre - trained model is fine - tuned on downstream tasks (such as motor imagery classification tasks) to verify its performance in the case of small amounts of data. Experimental results show that, compared with models trained from scratch, using pre - trained foundation models can significantly improve classification performance, demonstrating the effectiveness and generalization ability of this model in processing EEG data. Formula part: - The objective function of self - supervised pre - training is defined as the causal reconstruction loss: \[ L=\frac{1}{N - 1}\sum_{i = 2}^{N}\|\hat{Y}_i - H(D_i)\|_2^2 \] where \(\hat{Y}_i = G[M|H(D_{i - 1}), H(D_{i - 2}),\dots, H(D_1)]\), and \(G\) represents the GPT model. Through this method, Neuro - GPT can effectively capture the spatio - temporal features in EEG data and show better performance in downstream tasks.