Abstract: Online reviews are becoming increasingly important for decision-making. Consumers often consult online reviews before making a purchase. Marketers also acknowledge their importance and use them to improve product success. However, the massive volume of online review data, together with its unstructured nature, makes it difficult to draw conclusions quickly. In this paper, we propose a novel framework for gauging the ratings of online reviews using machine learning techniques. This framework uses a combination of text pre-processing and feature extraction methods. Here, we investigate four different aspects of the new framework. First, we assess the performance of single and ensemble classifiers in predicting sentiment (positive or negative), initially on a specific dataset (Yelp) and subsequently on two other datasets (Amazon's product reviews and a movie review dataset). Second, using the best identified classifiers, we improve the accuracy with which neutral polarity can be predicted, an ability largely overlooked in the literature. Third, we further improve the performance of these classifiers by testing different pre-processing and feature extraction methods. Finally, we measure how well our deep learning approach performs on the same task compared to the best previously identified classifiers. Our extensive testing shows that the linear-kernel support vector machine, logistic regression, and multilayer perceptron are the three best single classifiers in terms of accuracy, precision, recall, and F-measure. Their performance can be further improved by using them as base classifiers for ensemble models. We also observe that several text pre-processing techniques (negation word identification, word elongation correction, and part-of-speech lemmatisation, combined with term frequency and word n-gram features) can increase accuracy.
In addition, we demonstrate that general sentiment lexicons such as SentiWordNet 3.0 and SenticNet 4 can be used to generate features with good results, although deep learning models can perform equally well. Experiments with different datasets confirm that our framework provides consistent outcomes. In particular, we have focused on improving the accuracy of neutral sentiment prediction, and we conclude by showing how this can be achieved without sacrificing the accuracy of positive or negative ratings.
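
As an illustration only, the pre-processing and feature extraction steps named in the abstract (word elongation correction, negation word identification, and term frequency over word n-grams) might be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the framework's actual implementation: the function names, the short negation list, the `NOT_` prefix convention, and the rule of collapsing three or more repeated letters to two are all hypothetical choices made for the example. Part-of-speech lemmatisation is omitted.

```python
import re
from collections import Counter

# Assumption: a deliberately small negation list, for illustration only.
NEGATION_WORDS = {"not", "no", "never", "cannot"}
# Assumption: negation scope ends at the next clause-level punctuation mark.
CLAUSE_END = {".", ",", ";", "!", "?"}


def correct_elongation(text):
    """Collapse runs of three or more identical letters to two (crude heuristic)."""
    return re.sub(r"([a-z])\1{2,}", r"\1\1", text)


def mark_negation(tokens):
    """Prefix tokens that follow a negation word with NOT_ until the clause ends.

    Punctuation tokens are used only to close the negation scope and are dropped.
    """
    marked, negating = [], False
    for tok in tokens:
        if tok in CLAUSE_END:
            negating = False
            continue
        if tok in NEGATION_WORDS or tok.endswith("n't"):
            marked.append(tok)
            negating = True
            continue
        marked.append("NOT_" + tok if negating else tok)
    return marked


def ngram_term_frequencies(tokens, max_n=2):
    """Count raw term frequencies for all word n-grams with 1 <= n <= max_n."""
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def extract_features(review):
    """Lower-case, fix elongation, tokenise, mark negation, then count n-grams."""
    text = correct_elongation(review.lower())
    tokens = re.findall(r"[a-z']+|[.,;!?]", text)
    return ngram_term_frequencies(mark_negation(tokens))


features = extract_features("The food was not goooood, but the staff was nice.")
```

In this sketch the elongated word is normalised before negation marking, so the review contributes a `NOT_good` feature rather than a plain `good` one. The resulting counts could then be passed to any of the classifiers discussed above.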

BERT-Based Meta-Learning Approach with Looking Back for Sentiment Analysis of Literary Book Reviews

Sentiment Analysis of Modern Chinese Literature Based on Deep Learning

Using Machine Learning to Predict the Sentiment of Online Reviews: A New Framework for Comparative Analysis

Aspect-Based Sentiment Analysis for User Reviews

An Empirical Study of Unsupervised Sentiment Classification of Chinese Reviews

Experimental Study on Sentiment Classification of Chinese Review Using Machine Learning Techniques

Sentiment analysis of movie reviews based on deep learning

Weakly-Supervised Deep Learning for Customer Review Sentiment Classification

Are Your Comments Positive? A Self-Distillation Contrastive Learning Method for Analyzing Online Public Opinion

Performance evaluation of Reddit Comments using Machine Learning and Natural Language Processing methods in Sentiment Analysis

An Al-BERT-Bi-GRU-LDA algorithm for negative sentiment analysis on Bilibili comments

Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework

A sentiment analysis model for car review texts based on adversarial training and whole word mask BERT

Sentiment Analysis of Peer Review Texts for Scholarly Papers

More than a Feeling: Accuracy and Application of Sentiment Analysis

Leveraging deep learning with sentiment analysis for Online Book reviews polarity classification model

Research on MOOC Reviews Oriented Sentiment Analysis by Awareness of Emotional Distinctions

Editorial in the September 2006 issue of Pain Management Nursing.

Tourist attraction reviews based on deep learning Sentiment Analysis System

Proposing sentiment analysis model based on BERT and XLNet for movie reviews

Educational Big Data Analytics Using Sentiment Analysis for Student Requirement Analysis on Courses