Sentiment Classification Model Based on Word Embedding and CNN

Cai Huiping, Wang Lidan, Duan Shukai
DOI: https://doi.org/10.3969/j.issn.1001-3695.2016.10.005
2016-01-01
Abstract: This paper proposes a method for sentiment classification that integrates word embeddings with a convolutional neural network (CNN). First, the method trains a skip-gram model to generate an embedding for each word in the dataset. It then combines the embeddings of the words in each training sample into a two-dimensional feature matrix, which serves as the input to the CNN. During each training iteration, the entries of the feature matrix are also updated as part of the model parameters. Second, the paper proposes a CNN structure composed mainly of convolution kernels of three different sizes, which automatically extracts a variety of local abstract features. Compared with traditional machine learning algorithms, the proposed word embedding and CNN based sentiment classification model improves classification accuracy by 5.04%.
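The pipeline described in the abstract can be illustrated with a minimal pure-Python sketch of the forward pass: a sentence becomes a matrix of word embeddings, and kernels of three different heights slide over it to produce local features. This is not the authors' implementation. The kernel heights (3, 4, 5), the ReLU activation, the max-over-time pooling, the embedding dimension, the toy vocabulary, and all function names are assumptions made for illustration, and random vectors stand in for trained skip-gram embeddings.

```python
import random

def build_feature_matrix(tokens, embeddings):
    """Stack the embedding of each token into a (sentence_len x dim) matrix."""
    return [embeddings[t] for t in tokens]

def conv_max_pool(matrix, kernel, bias=0.0):
    """Slide one kernel (h rows, full embedding width) down the matrix,
    apply ReLU, and keep the largest activation (assumed pooling choice)."""
    h = len(kernel)
    activations = []
    for start in range(len(matrix) - h + 1):
        total = bias
        for i in range(h):
            row, weights = matrix[start + i], kernel[i]
            total += sum(x * w for x, w in zip(row, weights))
        activations.append(max(0.0, total))  # ReLU
    return max(activations)

def extract_features(matrix, kernels_by_size):
    """Concatenate the pooled output of every kernel of every size."""
    features = []
    for size in sorted(kernels_by_size):
        for kernel in kernels_by_size[size]:
            features.append(conv_max_pool(matrix, kernel))
    return features

rng = random.Random(0)
DIM = 8                    # assumed embedding dimension
KERNEL_SIZES = (3, 4, 5)   # assumed; the abstract only says "three different sizes"
FILTERS_PER_SIZE = 2       # assumed

sentence = "this movie was really very good".split()

# Random stand-ins for skip-gram embeddings (hypothetical values).
embeddings = {w: [rng.uniform(-0.5, 0.5) for _ in range(DIM)] for w in sentence}

# One list of randomly initialised kernels per kernel height.
kernels_by_size = {
    h: [[[rng.uniform(-0.1, 0.1) for _ in range(DIM)] for _ in range(h)]
        for _ in range(FILTERS_PER_SIZE)]
    for h in KERNEL_SIZES
}

matrix = build_feature_matrix(sentence, embeddings)
features = extract_features(matrix, kernels_by_size)
print(len(features), "pooled features")
```

In a trainable version, the resulting feature vector would feed a classification layer, and gradients would flow back into both the kernels and the embedding entries, matching the abstract's note that the feature matrix is updated as part of the model parameters.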