Abstract:Binary sentiment analysis uses sentiment dictionaries, TF-IDF, word2vec, and BERT to convert text documents such as product and movie reviews into vectors. Dimensionality reduction by feature selection can effectively reduce the complexity of sentiment analysis. Existing feature selection methods put all samples together and ignore the difference in the feature representation between different categories. For binary sentiment analysis, there are some reviews with uncertain sentiment polarity, three-way decision divides samples into positive (POS) region, negative (NEG) region, and uncertain region (UNC). The model based on the three-way decision is beneficial to process the UNC and improve the effect of binary sentiment analysis. However, how to obtain the optimal feature representation in certain regions respectively to process the uncertain samples is a challenge. In this paper, a classified feature representation three-way decision model is proposed to obtain the optimal feature representation of the positive and negative domains for sentiment analysis. In the positive domain and the negative domain, m- and n-layer feature representations are obtained. The optimal layer with the best performance is selected as the optimal feature representation. The POS region and the NEG region in the testing set are processed by the optimal feature representation, the UNC region is processed by the original feature representation. Experiments on IMDB and Amazon show that the performance of our proposed method in terms of classification accuracy in sentiment analysis is significantly higher than that of the chi-square, principal component analysis, and mutual information methods.

Feature subsumption for sentiment classification in multiple languages

Sentiment Classification for Chinese Reviews Based on Key Substring Features

Exploiting effective features for chinese sentiment classification

Towards a Universal Sentiment Classifier in Multiple Languages

Learning Bilingual Embedding Model for Cross-Language Sentiment Classification

A Multi-dimension and Multi-granularity Feature Fusion Method for Chinese Microblog Sentiment Classification

Parallel Sentiment Polarity Classification Method with Substring Feature Reduction

Cross-Lingual Propagation for Deep Sentiment Analysis

Experimental Study on Sentiment Classification of Chinese Review Using Machine Learning Techniques

Sentiment Classification in Chinese Microblogs: Lexicon-based and Learning-based Approaches

SASICM A Multi-Task Benchmark For Subtext Recognition

A method of constructing a fine-grained sentiment lexicon for the humanities computing of classical chinese poetry

SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment

Cross-Lingual Mixture Model for Sentiment Classification.

A Novel Sentiment Polarity Detection Framework for Chinese

Co-training for Cross-Lingual Sentiment Classification

A classified feature representation three-way decision model for sentiment analysis

A Comparative Study of Cross-Lingual Sentiment Classification

Research on Sentiment Dictionary Based on Sentiment Analysis in News Domain.

Bilingual co-training for sentiment classification of chinese product reviews

Sentiment interpretability analysis on Chinese texts employing multi-task and knowledge base