Sentiment Analysis Based on Text Information Enhancement and Multimodal Feature Fusion

Zijun Liu,Li Cai,Wenjie Yang,Junhui Liu
DOI: https://doi.org/10.1016/j.patcog.2024.110847
IF: 8
2024-01-01
Pattern Recognition
Abstract:Rapid advancements in multimedia technology have created explosive growth in sentiment data generated across various social media platforms. While previous research on sentiment analysis has shifted from analyzing single data types to incorporating multimodal data, current studies face certain limitations. These include overlooking the impact of redundant information within feature sequences of each modality, failing to account for the complementarity between modality data, and neglecting the varying significance of different modalities in conveying sentiments. This paper introduces a sentiment analysis framework designed for text information enhancement and multimodal feature fusion. The text modality is central to this framework, around which an attention mechanism augments emotional correlations between modalities. An expanded sentiment lexicon refines the representation of multimodal features, thus capturing emotional information more accurately. Experimental evaluations conducted on two standard datasets, CMU-MOSI and CMU-MOSEI, show that the accuracy of the proposed method in multimodal emotion recognition tasks reaches 85.7% and 85.8% respectively, at 1.6% and 1.8% higher than the baseline methods. Thus, it demonstrates robust regression and classification performance.
What problem does this paper attempt to address?