ModalNet: an aspect-level sentiment classification model by exploring multimodal data with fusion discriminant attentional network

Zhe Zhang,Zhu Wang,Xiaona Li,Nannan Liu,Bin Guo,Zhiwen Yu
DOI: https://doi.org/10.1007/s11280-021-00955-7
2021-09-20
World Wide Web
Abstract:Aspect-level sentiment classification aims to identify sentiment polarity over each aspect of a sentence. In the past, such analysis tasks mainly relied on text data. Nowadays, due to the popularization of smart devices and Internet services, people are generating more abundant data, including text, image, video, et al. Multimodal data from the same post (e.g., a tweet) usually has certain correlation. For example, image data might has an auxiliary effect on the text data, and reasonable processing of such multimodal data can help obtain much richer information for sentiment analysis. To this end, we propose an aspect-level sentiment classification model by exploring multimodal data with fusion discriminant attentional network. Specifically, we first leverage two memory networks for mining the intra-modality information of text and image, and then design a discriminant matrix to supervise the fusion of inter-modality information. Experimental results demonstrate the effectiveness of the proposed model.
What problem does this paper attempt to address?