A Multi-dimension and Multi-granularity Feature Fusion Method for Chinese Microblog Sentiment Classification

Yanxia Chen,Tingting Wei,Leyao Qu,Xiaoyue Liu
DOI: https://doi.org/10.1145/3639479.3639490
2023-12-27
Abstract:Chinese microblog comments exhibit a wide range of expressions, including brief terms and internet slang. Traditional single-feature models may not capturing this diversity comprehensively and in effectively discerning the usage of sentiment words, language structure, and contextual information. Consequently, they tend to exhibit suboptimal performance, particularly when dealing with informally expressed datasets in this domain. This study introduces a novel approach for Chinese microblog sentiment classification that leverages multi-dimensional (common text features and sentiment features) and multi-granularity (word-level and character-level) sentiment features. To achieve this, we first employ word2vec to train separate word-level and character-level embedding vectors. And then, we introduce LSTM (Long Short-Term Memory) neural networks to capture sentence-level representations. Additionally, self-attention mechanisms are incorporated to assess the importance of sentiment words at both word and character levels, finally integrating them into the LSTM network to capture the sentimental features that incorporate contextual semantics. We conduct experiments on two different datasets. Our experimental results reveal that our method performs comparably to competing approaches on high-quality datasets. However, on lower-quality datasets, it outperforms other methods, demonstrating the robustness of our proposed approach. Our innovative method provides an effective solution to address the challenges of Chinese microblog sentiment classification, with wide-ranging potential for practical applications.
Computer Science
What problem does this paper attempt to address?