Modeling Sentiment and Aspect Using Syntax: A Topic Model Approach

Chunping Li,Meng Xia,Raymond Y.K. Lau
DOI: https://doi.org/10.12792/icisip2017.036
2017-01-01
Abstract:In this paper, based on Latent Dirichlet Allocation (LDA), we propose a novel probabilistic modeling framework, which aims to reveal the latent aspects and sentiments of reviews simultaneously. Unlike other topic models which only consider the words appearing in online reviews, we consider Part-of-Speech (POS) tags in our model. Since users may use different types of words to express different meanings, we have proposed two Tag Sentiment Aspect models (TSA) to integrate syntactical information into the review mining models. We have applied the proposed models to two datasets, electronic product reviews and movie reviews, and evaluated the results in terms of sentiment aspect extraction and sentiment polarity classification. Our study shows that the proposed models not only achieve promising results on sentiment classification, but also effectively extract different latent sentiment aspects. Moreover, the proposed TSA models are fully unsupervised, and they do not need any manually labeled reviews for training. To incorporate priors, only the lists of positive and negative words are required. Moreover, the proposed TSA models are effective across different domain.
What problem does this paper attempt to address?