Abstract:The recent booming of artificial intelligence (AI) applications, e.g., affective robots, human-machine interfaces, autonomous vehicles, etc., has produced a great number of multi-modal records of human communication. Such data often carry latent subjective users’ attitudes and opinions, which provides a practical and feasible path to realize the connection between human emotion and intelligence services. Sentiment and emotion analysis of multi-modal records is of great value to improve the intelligent level of affective services. However, how to find an optimal manner to learn people’s sentiment and emotion representations has been a difficult problem, since both of them involve subtle mind activity. To solve this problem, a lot of approaches have been published, but most of which are insufficient to mine sentiment and emotion, since they have treated sentiment analysis and emotion recognition as two separate tasks. The interaction between them has been neglected, which limits the efficiency of sentiment and emotion representation learning. In this work, emotion is seen as the external expression of sentiment, while sentiment is the essential nature of emotion. We thus argue that they are strongly related to each other where one’s judgment helps the decision of the other. The key challenges are multi-modal fused representation and the interaction between sentiment and emotion. To solve such issues, we design an external knowledge enhanced multi-task representation learning network, termed KAMT. The major elements contain two attention mechanisms, which are inter-modal and inter-task attentions and an external knowledge augmentation layer. The external knowledge augmentation layer is used to extract the vector of the participant’s gender, age, occupation and that of overall color or shape. The main use of inter-modal attention is to capture effective multi-modal fused features. Inter-task attention is designed to model the correlation between sentiment analysis and emotion classification. We perform experiments on three widely used datasets, and the experimental performance proves the effectiveness of the KAMT model.

Sentiment Classification in Customer Service Dialogue with Topic-Aware Multi-Task Learning

Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling

Prior-Bert and Multi-Task Learning for Target-Aspect-Sentiment Joint Detection

SBTM: A Joint Sentiment and Behaviour Topic Model for Online Course Discussion Forums

Emotion and sentiment analysis for intelligent customer service conversation using a multi-task ensemble framework

Multi-modal Sentiment and Emotion Joint Analysis with a Deep Attentive Multi-task Learning Model

Text-Centric Multimodal Contrastive Learning for Sentiment Analysis

Aspect-Level Sentiment Analysis of Customer Reviews Based on Neural Multi-task Learning

Sentiment analysis from Customer-generated online videos on product review using topic modeling and Multi-attention BLSTM

Multi-Task Learning Model Based on Multi-Scale CNN and LSTM for Sentiment Classification

A Deep Learning System for Sentiment Analysis of Service Calls

CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection

Jointly Discovering Fine-grained and Coarse-grained Sentiments Via Topic Modeling.

TDAM: a Topic-Dependent Attention Model for Sentiment Analysis

Multi-turn dialogue comprehension from a topic-aware perspective

Affective Interaction: Attentive Representation Learning for Multi-Modal Sentiment Classification

Chinese Dialogue Analysis Using Multi-Task Learning Framework

Sentiment interpretability analysis on Chinese texts employing multi-task and knowledge base

A Bi-directional Multi-hop Inference Model for Joint Dialog Sentiment Classification and Act Recognition