Multi-Feature Fusion Multi-Modal Sentiment Analysis Model Based on Cross-Attention Mechanism

Yanxian Tan, Zhengjun Pan, Lianfen Zhao
DOI: https://doi.org/10.1109/ICCCBDA61447.2024.10569861
2024-04-25
Abstract: To address insufficient intra-modal feature extraction and insufficient fusion of inter-modal interaction information in current multimodal sentiment analysis, a multi-feature fusion multimodal sentiment analysis model based on a cross-attention mechanism is proposed. The model first uses subnetworks and a self-attention mechanism to extract the important features of each modality (text, audio, and video). It then computes the correlation between modalities through a cross-modal cross-attention mechanism to achieve the interaction and fusion of multimodal information. Next, a soft attention mechanism assigns attention weights to the features of each modality. Finally, the modality features are concatenated to produce the final sentiment classification result. Experimental results on the public datasets CH-SIMS and CMU-MOSEI show that, compared with the benchmark model, this model achieves some improvement in two-class accuracy, three-class accuracy, and F1 score.
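The four stages named in the abstract (self-attention within each modality, cross-attention between modalities, soft attention weighting, concatenation and classification) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no layer details, so the sketch assumes plain scaled dot-product attention without learned projections, mean pooling over time, residual addition of the cross-attended features, and externally supplied weights. The names `attention`, `fuse`, `w_soft`, and `w_out` are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def attention(query, key, value):
    # Scaled dot-product attention.
    # query: (Tq, d); key and value: (Tk, d); returns (Tq, d).
    scores = query @ key.T / np.sqrt(query.shape[-1])
    return softmax(scores, axis=-1) @ value


def fuse(text, audio, video, w_soft, w_out):
    # Each modality input is a (T_m, d) sequence sharing the feature size d.
    feats = {"text": text, "audio": audio, "video": video}

    # 1) Intra-modal self-attention (assumed form: query = key = value).
    feats = {m: attention(x, x, x) for m, x in feats.items()}

    # 2) Cross-modal cross-attention: each modality queries the other two.
    #    Residual sum and mean pooling over time are assumptions.
    crossed = {}
    for m, x in feats.items():
        others = [attention(x, y, y) for n, y in feats.items() if n != m]
        crossed[m] = (x + sum(others)).mean(axis=0)  # (d,)

    # 3) Soft attention: one scalar weight per modality, normalized to sum to 1.
    stacked = np.stack([crossed[m] for m in ("text", "audio", "video")])  # (3, d)
    weights = softmax(stacked @ w_soft)  # (3,)
    weighted = stacked * weights[:, None]

    # 4) Concatenate the weighted modality features and classify.
    fused = weighted.reshape(-1)  # (3 * d,)
    probs = softmax(fused @ w_out)  # (num_classes,)
    return weights, probs
```

With random inputs of different sequence lengths, for example 5 text steps, 7 audio steps, and 4 video steps at `d = 8`, `fuse` returns three modality weights and a class distribution that each sum to 1. A trained model would learn `w_soft`, `w_out`, and the attention projections omitted here.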
Computer Science