Abstract:<p>Multimodal sentiment analysis is one of the most attractive interdisciplinary research topics in artificial intelligence (AI). Different from other classification issues, multimodal sentiment analysis of human is a much finer classification problem. However, most current work accept all multimodalities as the input together and then output final results at one time after fusion and decision processes. Rare models try to divide their models into more than one fusion modules with different fusion strategies for better adaption of different tasks. Additionally, most recent multimodal sentiment analysis methods pay great focuses on binary classification, but the accuracy of multi-classification still remains difficult to improve. Inspired by the emotional processing procedure in cognitive science, both binary and multi-classification abilities are improved in our method by dividing the complicated problem into smaller issues which are easier to be handled. In this paper, we propose a Hierarchal Attention-BiLSTM(Bidirectional Long-Short Term Memory) model based on Cognitive Brain limbic system (HALCB). HALCB splits the multimodal sentiment analysis into two modules responsible for two tasks, the binary classification and the multi-classification. The former module divides the input items into two categories by recognizing their polarity and then sends them to the latter module separately. In this module, Hash algorithm is utilized to improve the retrieve accuracy and speed. Correspondingly, the latter module contains a positive sub-net dedicated for positive inputs and a negative sub-nets dedicated for negative inputs. Each of these binary module and two sub-nets in multi-classification module possesses different fusion strategy and decision layer for matching its respective function. We also add a random forest at the final link to collect outputs from all modules and fuse them at the decision-level at last. Experiments are conducted on three datasets and compare the results with baselines on both binary classification and multi-classification tasks. Our experimental results surpass the state-of-the-art multimodal sentiment analysis methods on both binary and multi-classification by a big margin.</p>

On the Performance of Blanking Nonlinearity in Real-Valued OFDM-Based PLC

Multimodal Affective Analysis Using Hierarchical Attention Strategy with Word-Level Alignment

Multimodal Utterance-level Affect Analysis using Visual, Audio and Text Features

A cross modal hierarchical fusion multimodal sentiment analysis method based on multi-task learning

Multimodal Sentiment Analysis in Realistic Environments Based on Cross-Modal Hierarchical Fusion Network

A cognitive brain model for multimodal sentiment analysis based on attention neural networks

A Multimodal Sentiment Analysis Approach Based on a Joint Chained Interactive Attention Mechanism

Heterogeneous Hierarchical Fusion Network for Multimodal Sentiment Analysis in Real-World Environments

Multimodal Sentiment Analysis Based on Cross-Modal Attention and Gated Cyclic Hierarchical Fusion Networks

Video Sentiment Analysis with Bimodal Information-augmented Multi-Head Attention

Multimodal Affective Computing Based on Weighted Linear Fusion

InterMulti:Multi-view Multimodal Interactions with Text-dominated Hierarchical High-order Fusion for Emotion Analysis

Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-Decoder

Multimodal Sentiment Analysis Based on a Cross-Modal Multihead Attention Mechanism

Multimodal Sentiment Analysis Using Multi-tensor Fusion Network with Cross-modal Modeling

Multimodal Emotion Recognition Based on Cascaded Multichannel and Hierarchical Fusion

Cross-Attention is Not Enough: Incongruity-Aware Dynamic Hierarchical Fusion for Multimodal Affect Recognition

Multimodal sentiment analysis based on multiple attention

Multimodal Sentiment Analysis Based on Composite Hierarchical Fusion

Multimodal Sentiment Analysis using Hierarchical Fusion with Context Modeling

Context-Dependent Multimodal Sentiment Analysis Based on a Complex Attention Mechanism