Multi-View Interactive Representations for Multimodal Sentiment Analysis

Zemin Tang, Qi Xiao, Yunchuan Qin, Xu Zhou, Joey Tianyi Zhou, Kenli Li
DOI: https://doi.org/10.1109/tce.2024.3357480
2024-01-01
IEEE Transactions on Consumer Electronics
Abstract: Multimodal Sentiment Analysis (MSA) technology, prevalent in consumer applications and mobile edge computing (MEC), enables sentiment examination through user data collected by smart devices. Although representation learning is a focus in MSA, current methods often prioritize recognition performance through modality interaction and fusion. As a result, they struggle to capture multi-view sentiment cues across different interaction states, which limits the expressiveness of multimodal sentiment representations. This paper develops an MSA framework, MVIR, that learns multi-view interactive representations in diverse interaction states. Multiple carefully designed sentiment tasks and a self-supervised label generation algorithm (SSLGM) enable a comprehensive understanding of multi-view sentiment tendencies. The dual-view attention weighted fusion (DVAWF) module is designed to facilitate inter-modality information exchange in different interaction states. Extensive experiments on three MSA datasets affirm the efficacy and superiority of MVIR, showing that it captures sentiment information from multimodal data across various interaction states.
telecommunications; engineering, electrical & electronic
What problem does this paper attempt to address?