Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition

Xinke Shen,Xianggen Liu,Xin Hu,Dan Zhang,Sen Song
DOI: https://doi.org/10.1109/TAFFC.2022.3164516
2022-04-06
Abstract:EEG signals have been reported to be informative and reliable for emotion recognition in recent years. However, the inter-subject variability of emotion-related EEG signals still poses a great challenge for the practical applications of EEG-based emotion recognition. Inspired by recent neuroscience studies on inter-subject correlation, we proposed a Contrastive Learning method for Inter-Subject Alignment (CLISA) to tackle the cross-subject emotion recognition problem. Contrastive learning was employed to minimize the inter-subject differences by maximizing the similarity in EEG signal representations across subjects when they received the same emotional stimuli in contrast to different ones. Specifically, a convolutional neural network was applied to learn inter-subject aligned spatiotemporal representations from EEG time series in contrastive learning. The aligned representations were subsequently used to extract differential entropy features for emotion classification. CLISA achieved state-of-the-art cross-subject emotion recognition performance on our THU-EP dataset with 80 subjects and the publicly available SEED dataset with 15 subjects. It could generalize to unseen subjects or unseen emotional stimuli in testing. Furthermore, the spatiotemporal representations learned by CLISA could provide insights into the neural mechanisms of human emotion processing.
Human-Computer Interaction,Machine Learning,Signal Processing,Neurons and Cognition
What problem does this paper attempt to address?
The paper attempts to address the issue of how to overcome the variability of EEG signals between different individuals in cross-subject emotion recognition to improve the accuracy and generalization ability of emotion recognition. Specifically, researchers have found that although EEG signals are rich and reliable in emotion recognition, there is significant variability in emotion-related EEG signals between different individuals, which poses a huge challenge for practical applications. The presence of this variability leads to the performance of cross-subject emotion recognition being far lower than that of single-subject recognition. To solve this problem, this paper proposes a contrastive learning-based method called Cross-Subject Aligned Contrastive Learning (CLISA). This method reduces the differences between different subjects by maximizing the similarity between EEG signal representations under the same emotional stimuli while minimizing the similarity between signal representations under different emotional stimuli. Specifically, CLISA uses Convolutional Neural Networks (CNN) to learn cross-subject aligned spatio-temporal representations from EEG time series and further extracts differential entropy features for emotion classification. Through this method, CLISA not only improves the performance of cross-subject emotion recognition but also generalizes to unseen subjects or unseen emotional stimuli during testing. Additionally, the spatio-temporal representations learned by CLISA can provide insights into the neural mechanisms of human emotion processing.