Emotional multi-source correlation model for chinese micro-blog sentiment analysis

Lingxiao LI,Shaozi LI,Donglin CAO
DOI: https://doi.org/10.11992/tis.201605019
2016-01-01
Abstract:With the explosion of social media information, sentiment analysis of public opinion is attracting more and more attention. Compared with traditional text, the Sina micro?blog contains a variety of emotional sources, in?cluding sentiment words, emoticons, pictures, etc. To solve the problem of the poor timeliness of lexicons in Chi?nese social short messages and to utilize the correlation between different emotional sources, an emotional multi?source correlation model (EMCM) is proposed to carry out sentiment analysis on a micro?blog. In particular, it takes advantage of the correlation between sentiment words and emoticons. It imports the multi?sources and correla?tion probabilities, and then builds a correlation model between the two emotional sources, emotional words and emoticons, based on a voting model using sentimental words. Experimental results show that this model achieved an accuracy of 85.3% in 6 171 micro?blogs, higher than either the traditional method based on voting (83.4%) or the SVM method based on similar multi?features (82.9%).
What problem does this paper attempt to address?