Abstract:Sentiment analysis (SA) is a technique that lets people in different fields such as business, economy, research, government, and politics to know about people's opinions, which greatly affects the process of decision-making. SA techniques are classified into: lexicon-based techniques, machine learning techniques, and a hybrid between both approaches. Each approach has its limitations and drawbacks, the machine learning approach depends on manual feature extraction, lexicon-based approach relies on sentiment lexicons that are usually unscalable, unreliable, and manually annotated by human experts. Nowadays, word-embedding techniques have been commonly used in SA classification. Currently, Word2Vec and GloVe are some of the most accurate and usable word embedding techniques, which can transform words into meaningful semantic vectors. However, these techniques ignore sentiment information of texts and require a huge corpus of texts for training and generating accurate vectors, which are used as inputs of deep learning models. In this paper, we propose an enhanced ensemble classifier framework. Our framework is based on our previously published lexicon-based method, bag-of-words, and pre-trained word embedding, first the sentence is preprocessed by removing stop-words, POS tagging, stemming and lemmatization, shortening exaggerated word. Second, the processed sentence is passed to three modules, our previous lexicon-based method (Sum Votes), bag-of-words module and semantic module (Word2Vec and Glove) and produced feature vectors. Finally, the previous features vectors are fed into 11 different classifiers. The proposed framework is tested and evaluated over four datasets with five different lexicons, the experiment results show that our proposed model outperforms the previous lexicon based and the machine learning methods individually.

Bayesian Estimation‐based Sentiment Word Embedding Model for Sentiment Analysis

Learning Sentiment-Inherent Word Embedding for Word-Level and Sentence-Level Sentiment Analysis

Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification.

Sentiment-Aware Word Embedding for Emotion Classification

BERT- and BiLSTM-Based Sentiment Analysis of Online Chinese Buzzwords

Sentiment Embeddings with Applications to Sentiment Analysis

A Text Sentiment Classification Model Using Double Word Embedding Methods

Learning Bilingual Embedding Model for Cross-Language Sentiment Classification

A Word2vec Model For Sentiment Analysis Of Weibo

Extraction New Sentiment Words in Weibo Based on Relative Branch Entropy

Cross-Domain Sentiment Encoding through Stochastic Word Embedding

Word Embedding Composition for Data Imbalances in Sentiment and Emotion Classification

More than Bags of Words: Sentiment Analysis with Word Embeddings

Senti-BSAS: A BERT-based Classification Model with Sentiment Calculating for Happiness Research.

Multi-Emotion Category Improving Embedding for Sentiment Classification.

Learning Word Representations for Sentiment Analysis.

An Improved BERT and Syntactic Dependency Representation Model for Sentiment Analysis

Exploiting BERT for End-to-End Aspect-based Sentiment Analysis

Learning Bilingual Sentiment-Specific Word Embeddings without Cross-Lingual Supervision

An Enhanced Sentiment Analysis Framework Based on Pre-Trained Word Embedding

Sentiment Analysis of Chinese Words Using Word Embedding and Sentiment Morpheme Matching.