Twitter Sentiment Classification with Sentimental Feature Vector

Shun-ming YI, Hao YI, Guo-dong ZHOU
2016-01-01
Abstract: Sentiment classification of public media content is a fundamental task in analyzing public sentiment. A classic approach to sentiment classification is the word-frequency model, which exploits word frequencies in the same way as general text classification. However, the relationship between word frequency and sentiment information is not as close as expected. This paper presents a different approach to sentiment classification that uses a sentimental feature vector instead of a word-frequency feature vector. First, each tweet is preprocessed: its words are cleaned, lemmatized, and POS-tagged. Second, using a sentiment dictionary, each word is assigned scores for positive and negative sentiment, which yields the sentimental feature vector for each tweet. Third, the sentiment of tweets is classified by training models with different algorithms, such as Multinomial Naïve Bayes (MNB) and Support Vector Machine (SVM). Empirical studies show that the sentimental feature vector is beneficial for Twitter sentiment classification.
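The three steps in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the lexicon `TOY_LEXICON`, its scores, the function names `preprocess` and `sentiment_features`, and the choice of four features (sum and maximum of the positive and negative scores) are all assumptions made for illustration. Lemmatization and POS tagging are omitted to keep the example dependency-free, and the abstract does not specify the sentiment dictionary or the exact layout of the feature vector.

```python
import re

# Hypothetical toy lexicon: word -> (positive score, negative score).
# A real system would load a full sentiment dictionary instead.
TOY_LEXICON = {
    "good": (0.75, 0.0),
    "love": (0.9, 0.0),
    "happy": (0.8, 0.0),
    "bad": (0.0, 0.7),
    "hate": (0.0, 0.9),
    "slow": (0.0, 0.4),
}


def preprocess(tweet):
    """Lowercase the tweet, drop URLs, @mentions and '#' signs, and tokenize."""
    text = tweet.lower()
    text = re.sub(r"https?://\S+|@\w+", " ", text)
    text = text.replace("#", " ")
    return re.findall(r"[a-z']+", text)


def sentiment_features(tweet, lexicon=TOY_LEXICON):
    """Map a tweet to [sum_pos, sum_neg, max_pos, max_neg] (assumed layout)."""
    tokens = preprocess(tweet)
    # Words missing from the lexicon contribute nothing to the vector.
    scores = [lexicon[t] for t in tokens if t in lexicon]
    pos = [p for p, _ in scores]
    neg = [n for _, n in scores]
    return [
        sum(pos, 0.0),
        sum(neg, 0.0),
        max(pos, default=0.0),
        max(neg, default=0.0),
    ]


if __name__ == "__main__":
    for tweet in ["I love this phone, so good! @bob", "I hate slow wifi #fail"]:
        print(tweet, "->", sentiment_features(tweet))
```

Every feature in this sketch is non-negative, so the vectors could be passed to a Multinomial Naïve Bayes classifier as well as to an SVM; the training step itself is left out here.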