Multichannel Multimodal Emotion Analysis of Cross-Modal Feedback Interactions Based on Knowledge Graph

Shaohua Dong,Xiaochao Fan,Xinchun Ma

DOI: https://doi.org/10.1007/s11063-024-11641-w

IF: 2.565

2024-05-30

Neural Processing Letters

Abstract:Multimodal sentiment analysis is a downstream branch task of sentiment analysis with high attention at present. Previous work in multimodal sentiment analysis have focused on the representation and fusion of modalities, capturing the underlying semantic relationships between modalities by considering contextual information. While this approach is feasible for simple contextual comments, more complex comments require the integration of external knowledge to obtain more accurate sentiment information. However, incorporating external knowledge into sentiment analysis to enhance information complementarity has not been thoroughly investigated. To address this, we propose a multichannel cross-modal feedback interaction model that incorporates the knowledge graph into multimodal sentiment analysis. Our proposed model consists of two main components: the cross-modal feedback recurrent interaction module and the external knowledge module for capturing latent information. The cross-modal interaction employs a self-feedback mechanism during network training, extracting feature representations of each modality and using these representations to mask sensory inputs, allowing the model to perform feedback-based feature masking. The external knowledge graph captures potential semantic information representations in the textual data through knowledge graph embedding. Finally, a global feature fusion module is employed for multichannel multimodal information integration. On two publicly available datasets, our method demonstrates good performance in terms of accuracy and F1 scores, compared to state-of-the-art models and several baselines.

computer science, artificial intelligence

What problem does this paper attempt to address?

### Problems the Paper Aims to Solve This paper aims to address several key issues in multimodal sentiment analysis: 1. **Capturing Semantic Information in Complex Contexts**: Existing methods perform well in simple contextual reviews but require the integration of external knowledge to obtain more accurate sentiment information when dealing with complex reviews. Currently, research on incorporating external knowledge into sentiment analysis to enhance information complementarity is still insufficient. 2. **Complementarity of Cross-Modal Information**: A multi-channel cross-modal feedback interaction model based on knowledge graph (MMKGE) is proposed. By introducing a knowledge graph to capture potential semantic information and utilizing a cross-modal self-feedback mechanism, the quality of feature representation is improved. 3. **Model Efficiency and Lightweight Design**: By combining cross-modal information completion and external knowledge dual channels, the semantic features are enhanced while keeping the model simple, achieving a more efficient and lightweight model design. In summary, this paper is mainly dedicated to improving the accuracy and robustness of multimodal sentiment analysis by introducing knowledge graphs and improved cross-modal interaction mechanisms, especially in capturing implicit semantic information in complex contexts.

Multichannel Multimodal Emotion Analysis of Cross-Modal Feedback Interactions Based on Knowledge Graph

Multi-Channel Attentive Graph Convolutional Network with Sentiment Fusion for Multimodal Sentiment Analysis

Multimodal Sentiment Analysis Using Multi-tensor Fusion Network with Cross-modal Modeling

Multimodal Knowledge-enhanced Interactive Network with Mixed Contrastive Learning for Emotion Recognition in Conversation

Multimodal Sentiment Analysis Based on Cross-Modal Attention and Gated Cyclic Hierarchical Fusion Networks

Cross-Modal Sentiment Sensing with Visual-Augmented Representation and Diverse Decision Fusion

Multimodal sentiment analysis based on cross-instance graph neural networks

EffMulti: Efficiently Modeling Complex Multimodal Interactions for Emotion Analysis

Multimodal Sentiment Analysis with Missing Modality: A Knowledge-Transfer Approach

Multi-Feature Fusion Multi-Modal Sentiment Analysis Model Based on Cross-Attention Mechanism

DGFN Multimodal Emotion Analysis Model Based on Dynamic Graph Fusion Network

Multimodal Affective Analysis Using Hierarchical Attention Strategy with Word-Level Alignment

Multimodal Sentiment Analysis of Graphic Texts Based on Multicategorical Relative Fusion

SKEAFN: Sentiment Knowledge Enhanced Attention Fusion Network for multimodal sentiment analysis

Affective Interaction: Attentive Representation Learning for Multi-Modal Sentiment Classification

Multimodal Sentiment Analysis Based on a Cross-Modal Multihead Attention Mechanism

Target and Source Modality Co-Reinforcement for Emotion Understanding from Asynchronous Multimodal Sequences.

Context-Dependent Multimodal Sentiment Analysis Based on a Complex Attention Mechanism

InterMulti:Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis

A cross-model hierarchical interactive fusion network for end-to-end multimodal aspect-based sentiment analysis