Sina Microblog Sentiment Classification Based on Distributed Representation of Documents

Yuting Yang, Mingyang Wang, Xianyun Tian, Pengyu Li
DOI: https://doi.org/10.3969/j.issn.1002-1965.2016.02.027
2016-01-01
Abstract: [Purpose/Significance] Sina Microblog produces a large volume of text every day, and sentiment classification of this data is valuable for analyzing and monitoring public opinion. Mining the textual characteristics and sentiment information of posts is crucial for microblog sentiment classification research. [Method/Process] A sentiment classification method based on distributed representation of documents is proposed. The representation takes into account the context, semantics, word order, and syntactic structure of Chinese text, and maps microblog texts to vectors in a higher-dimensional space; an SVM classifier is then used to determine the sentiment polarity of each text. [Result/Conclusion] The method achieves a classification accuracy of 90.46%, outperforming the other methods it was compared with.
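The two-stage pipeline described in the abstract (learn a fixed-length vector per document, then classify the vectors with an SVM) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not name the representation algorithm, so the sketch assumes a PV-DBOW-style paragraph vector trained with negative sampling, and it swaps in a linear SVM trained by subgradient descent on the hinge loss for whatever SVM tool the paper used. The function names, hyperparameters, and the four pre-tokenized toy documents are all hypothetical; real Chinese microblog text would first need word segmentation.

```python
import math
import random


def sigmoid(x):
    # Clamp the input so math.exp cannot overflow.
    x = max(-30.0, min(30.0, x))
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


def train_doc_vectors(docs, dim=8, epochs=50, lr=0.05, negatives=3, seed=0):
    """PV-DBOW-style sketch (an assumption, not the paper's stated method):
    each document vector learns to score its own words above random ones."""
    rng = random.Random(seed)
    vocab = sorted({w for words in docs for w in words})
    index = {w: i for i, w in enumerate(vocab)}
    doc_vecs = [[(rng.random() - 0.5) / dim for _ in range(dim)] for _ in docs]
    word_out = [[0.0] * dim for _ in vocab]
    for _epoch in range(epochs):
        for d, words in enumerate(docs):
            dv = doc_vecs[d]
            for w in words:
                pos = index[w]
                targets = [(pos, 1.0)]  # the observed word is the positive target
                for _n in range(negatives):
                    neg = rng.randrange(len(vocab))
                    if neg != pos:
                        targets.append((neg, 0.0))  # random word as negative target
                grad = [0.0] * dim
                for t, label in targets:
                    wv = word_out[t]
                    g = lr * (label - sigmoid(dot(dv, wv)))
                    for k in range(dim):
                        grad[k] += g * wv[k]  # accumulate document-vector update
                        wv[k] += g * dv[k]    # update output word vector
                for k in range(dim):
                    dv[k] += grad[k]
    return doc_vecs


def train_linear_svm(X, y, epochs=100, lr=0.1, reg=0.01, seed=0):
    """Linear SVM via subgradient descent on the L2-regularised hinge loss.
    Labels must be +1 (positive sentiment) or -1 (negative sentiment)."""
    rng = random.Random(seed)
    w, b = [0.0] * len(X[0]), 0.0
    order = list(range(len(X)))
    for _epoch in range(epochs):
        rng.shuffle(order)
        for i in order:
            margin = y[i] * (dot(w, X[i]) + b)
            w = [wk - lr * reg * wk for wk in w]  # regularisation step
            if margin < 1:  # hinge loss is active for this example
                w = [wk + lr * y[i] * xk for wk, xk in zip(w, X[i])]
                b += lr * y[i]
    return w, b


def predict(w, b, x):
    return 1 if dot(w, x) + b >= 0 else -1


# Hypothetical, pre-tokenized toy posts; not data from the paper.
docs = [
    ["great", "movie", "love", "it"],
    ["love", "this", "happy", "day"],
    ["terrible", "service", "hate", "it"],
    ["hate", "this", "awful", "day"],
]
labels = [1, 1, -1, -1]

vectors = train_doc_vectors(docs)
w, b = train_linear_svm(vectors, labels)
predictions = [predict(w, b, v) for v in vectors]
```

In this sketch the classifier is only applied to the training documents. Classifying an unseen post would additionally require inferring a vector for it against the frozen word vectors, a step omitted here for brevity.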