Joint Visual-Textual Sentiment Analysis Based on Cross-Modality Attention Mechanism.

Xuelin Zhu, Biwei Cao, Shuai Xu, Bo Liu, Jiuxin Cao
DOI: https://doi.org/10.1007/978-3-030-05710-7_22
2019-01-01
Abstract: Recently, many researchers have focused on joint visual-textual sentiment analysis, since it can better extract user sentiments toward events or topics. In this paper, we propose that visual and textual information should differ in their contribution to sentiment analysis. Our model learns a robust joint visual-textual representation by incorporating a cross-modality attention mechanism and semantic embedding learning based on a bidirectional recurrent neural network. Experimental results show that our model outperforms existing state-of-the-art models in sentiment analysis on real datasets. In addition, we investigate different variants of the proposed model and analyze the effects of semantic embedding learning and the cross-modality attention mechanism, in order to provide deeper insight into how these two techniques help the learning of a joint visual-textual sentiment classifier.