Multimodal Emotion Distribution Learning.

Xiuyi Jia,Xiaoxia Shen
DOI: https://doi.org/10.1007/s12559-021-09927-5
IF: 4.89
2021-01-01
Cognitive Computation
Abstract:Background Emotion recognition is an interesting and challenging problem and has attracted much attention in recent years. To more accurately express emotions, emotion distribution learning (EDL) introduces the emotion description degree to form an emotion distribution at a fine granularity, which is used to describe the fusion of multiple basic emotions at different levels. Challenge Existing EDL research has shown a strong representation ability on emotion recognition, but all studies are based on unimodal information, meaning the results may be one-sided. Method As the first pioneering investigation of multimodal emotion distribution learning, we present a corresponding learning method named MEDL. First, for each modality, we learn an emotion distribution and obtain the corresponding label correlation matrix. Second, we constrain the consistency of label correlation matrices between different modalities to utilize modal complementarity. Finally, the final emotion distribution is achieved based on a simple decision fusion strategy. Results and Conclusions The experimental results demonstrate that our proposal performs better than some state-of-the-art multimodal emotion recognition methods and unimodal emotion distribution learning methods.
What problem does this paper attempt to address?