Target-oriented Sentiment Classification with Sequential Cross-modal Semantic Graph

Yufeng Huang, Zhuo Chen, Jiaoyan Chen, Jeff Z. Pan, Zhen Yao, Wen Zhang
DOI: https://doi.org/10.1007/978-3-031-44216-2_48
2023-01-01
Abstract: Multi-modal aspect-based sentiment classification (MABSC) is the task of classifying the sentiment of a target entity mentioned in a sentence, using an accompanying image as additional context. Previous methods fail to account for the fine-grained semantic association between the image and the text, which limits their ability to identify fine-grained image aspects and opinions. To address these limitations, this paper proposes SeqCSG, which enhances the encoder-decoder sentiment classification framework with sequential cross-modal semantic graphs. SeqCSG uses image captions and scene graphs to extract both global and local fine-grained image information, and treats them, together with the tokens of the tweet, as elements of the cross-modal semantic graph. The graph is represented as a sequence, with a multi-modal adjacency matrix indicating the relationships between elements. Experiments show that the approach outperforms existing methods and achieves state-of-the-art performance on two standard datasets. Further analysis shows that, given the target, the model can implicitly learn the correlation between fine-grained information in the image and in the text.
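To make the idea of a graph "represented as a sequence with a multi-modal adjacency matrix" more concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not specify how elements are ordered or which pairs are connected, so the function name `build_sequence_and_adjacency` and all three connection rules below are assumptions chosen purely for illustration: tweet tokens, caption tokens, and scene-graph triples are flattened into one list, and a 0/1 matrix records which elements are allowed to relate to each other.

```python
from typing import List, Tuple


def build_sequence_and_adjacency(
    text_tokens: List[str],
    caption_tokens: List[str],
    triples: List[Tuple[str, str, str]],
) -> Tuple[List[str], List[List[int]]]:
    """Flatten text, caption, and scene-graph triples into one sequence and
    return a symmetric 0/1 adjacency matrix over it (illustrative rules only)."""
    elements = list(text_tokens) + list(caption_tokens)
    n_flat = len(elements)  # number of text + caption elements

    # Append each (subject, relation, object) triple and remember its span.
    triple_spans = []
    for subj, rel, obj in triples:
        start = len(elements)
        elements.extend([subj, rel, obj])
        triple_spans.append((start, start + 3))

    n = len(elements)
    adj = [[0] * n for _ in range(n)]

    def connect(indices):
        # Fully connect a group of positions, including self-loops.
        for i in indices:
            for j in indices:
                adj[i][j] = 1

    # Assumed rule 1: text and caption tokens are fully connected.
    connect(range(n_flat))

    # Assumed rule 2: the three elements of a triple are connected to each other.
    for start, end in triple_spans:
        connect(range(start, end))

    # Assumed rule 3: a triple element is linked to any text or caption token
    # with the same surface form (case-insensitive).
    for start, end in triple_spans:
        for i in range(start, end):
            for j in range(n_flat):
                if elements[i].lower() == elements[j].lower():
                    adj[i][j] = adj[j][i] = 1

    return elements, adj


# Hypothetical toy input: a tweet, an image caption, and one scene-graph triple.
elements, adj = build_sequence_and_adjacency(
    ["great", "night", "with", "the", "dog"],
    ["a", "dog", "on", "grass"],
    [("dog", "on", "grass")],
)
```

In a Transformer-style encoder-decoder, a matrix like `adj` would typically serve as an attention mask, so that each element attends only to the elements it is connected to. Whether SeqCSG applies its matrix in exactly this way is not stated in the abstract.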