Cascaded Self-supervised Learning for Subject-independent EEG-based Emotion Recognition

Hanqi Wang,Tao Chen,Liang Song
2024-03-07
Abstract:EEG-based Emotion recognition holds significant promise for applications in human-computer interaction, medicine, and neuroscience. While deep learning has shown potential in this field, current approaches usually rely on large-scale high-quality labeled datasets, limiting the performance of deep learning. Self-supervised learning offers a solution by automatically generating labels, but its inter-subject generalizability remains under-explored. For this reason, our interest lies in offering a self-supervised learning paradigm with better inter-subject generalizability. Inspired by recent efforts in combining low-level and high-level tasks in deep learning, we propose a cascaded self-supervised architecture for EEG emotion recognition. Then, we introduce a low-level task, time-to-frequency reconstruction (TFR). This task leverages the inherent time-frequency relationship in EEG signals. Our architecture integrates it with the high-level contrastive learning modules, performing self-supervised learning for EEG-based emotion recognition. Experiment on DEAP and DREAMER datasets demonstrates superior performance of our method over similar works. The outcome results also highlight the indispensability of the TFR task and the robustness of our method to label scarcity, validating the effectiveness of the proposed method.
Signal Processing
What problem does this paper attempt to address?
The paper introduces a cascaded self-supervised learning approach for subject-independent emotion recognition using Electroencephalography (EEG) signals. The main goal is to address the limitations of current deep learning models in EEG-based emotion recognition, particularly the reliance on large-scale labeled datasets and the challenge of generalizing across different subjects due to high inter-subject variability. ### Problem Addressed The key problems the paper seeks to solve are: 1. **Label Dependency**: Deep learning models typically require large amounts of labeled data, which is time-consuming and laborious to collect for EEG signals. Manual labeling is also prone to noise and subjective bias. 2. **Inter-Subject Generalization**: Existing self-supervised learning methods for EEG emotion recognition do not sufficiently address the issue of generalizing across different subjects, limiting their practical applicability. ### Proposed Solution To tackle these issues, the authors propose a cascaded self-supervised learning architecture that combines a low-level task with a high-level task: 1. **Low-Level Task**: Time-to-Frequency Reconstruction (TFR) - This task leverages the inherent time-frequency relationship in EEG signals, forcing the model to learn a Fourier-based transformation. The goal is to capture subject-invariant simple patterns in the raw EEG data, aligning the distribution of data from various subjects. 2. **High-Level Task**: Contrastive Learning