Multimodal Sentiment Analysis Missing Modality Reconstruction Network Based on Shared-Specific Features

Yong Qin,Xuwen Qin,Yu Guo,Ziliang Ren,Lei Chen
DOI: https://doi.org/10.1109/ICSMD60522.2023.10490667
2023-11-02
Abstract:In multimodal sentiment analysis, heterogeneity between modalities makes inconsistent modal distributions a challenge. Especially in the case of incomplete features of certain modalities, the differences between modalities may interfere with the accurate prediction of sentiment categories. To address this problem, this paper proposes a missing modal reconstruction network (SSF-MMRN) for multimodal sentiment analysis based on sharing specific features. Firstly, a CMD distance-constrained training strategy is used to learn inter-modal consistency features. Second, based on the consistency features, a reconstruction module is proposed to generate missing modal features, check the semantic consistency of the recovered modalities with the original available modalities, and introduce inconsistencies into multiple models for better decision-making once they exist. Extensive experiments on the IEMOCAP benchmark dataset show that our proposed model effectively mitigates the modality gap during missing modality prediction and significantly improves the emotion recognition performance.
Computer Science
What problem does this paper attempt to address?