Sentiment Analysis Using Averaged Weighted Word Vector Features

Ali Erkan,Tunga Gungor
2023-10-16
Abstract:People use the world wide web heavily to share their experience with entities such as products, services, or travel destinations. Texts that provide online feedback in the form of reviews and comments are essential to make consumer decisions. These comments create a valuable source that may be used to measure satisfaction related to products or services. Sentiment analysis is the task of identifying opinions expressed in such text fragments. In this work, we develop two methods that combine different types of word vectors to learn and estimate polarity of reviews. We develop average review vectors from word vectors and add weights to this review vectors using word frequencies in positive and negative sensitivity-tagged reviews. We applied the methods to several datasets from different domains that are used as standard benchmarks for sentiment analysis. We ensemble the techniques with each other and existing methods, and we make a comparison with the approaches in the literature. The results show that the performances of our approaches outperform the state-of-the-art success rates.
Computation and Language,Information Retrieval
What problem does this paper attempt to address?
The paper aims to address the problem of sentiment analysis, specifically targeting the identification and classification of sentiment polarity (positive, negative, or neutral) from text reviews. The authors propose two new methods to learn and estimate the sentiment polarity of reviews by combining different types of word vectors. These methods include: 1. **Weighted Average Review Vector (WARV)**: Using word frequency to weight average word vectors in positive and negative labeled reviews. 2. **Convolutional Neural Network (CNN)**: Directly using word vectors as features for sentiment classification. The authors applied these methods to datasets from multiple different domains and compared them with existing sentiment analysis techniques. Experimental results show that their proposed methods outperform some of the existing advanced techniques in terms of performance. Additionally, the authors further improved accuracy through ensemble learning, achieving an accuracy of 95.032% on the IMDB movie review dataset, approaching the results of studies using the bidirectional transformer BERT embedding method.