Abstract:Sentiment analysis (SA) is a technique that lets people in different fields such as business, economy, research, government, and politics to know about people's opinions, which greatly affects the process of decision-making. SA techniques are classified into: lexicon-based techniques, machine learning techniques, and a hybrid between both approaches. Each approach has its limitations and drawbacks, the machine learning approach depends on manual feature extraction, lexicon-based approach relies on sentiment lexicons that are usually unscalable, unreliable, and manually annotated by human experts. Nowadays, word-embedding techniques have been commonly used in SA classification. Currently, Word2Vec and GloVe are some of the most accurate and usable word embedding techniques, which can transform words into meaningful semantic vectors. However, these techniques ignore sentiment information of texts and require a huge corpus of texts for training and generating accurate vectors, which are used as inputs of deep learning models. In this paper, we propose an enhanced ensemble classifier framework. Our framework is based on our previously published lexicon-based method, bag-of-words, and pre-trained word embedding, first the sentence is preprocessed by removing stop-words, POS tagging, stemming and lemmatization, shortening exaggerated word. Second, the processed sentence is passed to three modules, our previous lexicon-based method (Sum Votes), bag-of-words module and semantic module (Word2Vec and Glove) and produced feature vectors. Finally, the previous features vectors are fed into 11 different classifiers. The proposed framework is tested and evaluated over four datasets with five different lexicons, the experiment results show that our proposed model outperforms the previous lexicon based and the machine learning methods individually.

Refining Word Embeddings Using Intensity Scores for Sentiment Analysis

Refined Global Word Embeddings Based on Sentiment Concept for Sentiment Analysis

Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors

Improving Twitter Sentiment Classification via Multi-Level Sentiment-Enriched Word Embeddings

Improvement of sentiment analysis via re-evaluation of objective words in SenticNet for hotel reviews

Sentiment Lexicon Enhanced Neural Sentiment Classification

Word Embedding Composition for Data Imbalances in Sentiment and Emotion Classification

Disentangling Latent Emotions of Word Embeddings on Complex Emotional Narratives

Context-aware Embedding for Targeted Aspect-based Sentiment Analysis

Revisit Word Embeddings with Semantic Lexicons for Modeling Lexical Contrast

SentiVec: Learning Sentiment-Context Vector via Kernel Optimization Function for Sentiment Analysis

Towards Fine-grained Text Sentiment Transfer

A method of constructing a fine-grained sentiment lexicon for the humanities computing of classical chinese poetry

SentiLR: Linguistic Knowledge Enhanced Language Representation for Sentiment Analysis

A Fuzzy Computing Model for Identifying Polarity of Chinese Sentiment Words

Context-aware Sentiment Word Identification: Sentiword2vec.

Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias

An Improved BERT and Syntactic Dependency Representation Model for Sentiment Analysis

Improving Word Embeddings for Antonym Detection Using Thesauri and SentiWordNet.

An Enhanced Sentiment Analysis Framework Based on Pre-Trained Word Embedding

Improving the Accuracy of Pre-trained Word Embeddings for Sentiment Analysis