Sentiment Lossless Summarization

Xiaodong Li,Pangjing Wu,Chenxin Zou,Haoran Xie,Fu Lee Wang
DOI: https://doi.org/10.1016/j.knosys.2021.107170
2021-09-01
Abstract:<p>The aim of automatic text summarization (ATS) is to extract representative texts from documents and keep major points of the extracted texts consistent with the original documents. However, most existing studies ignore sentimental information loss in the summarization process, which leads to sentiment loss summarization. To address the sentiment loss issue during summarization, we introduce a sentiment compensation mechanism into document summarization and propose a graph-based extractive summarization approach named Sentiment Lossless Summarization (SLS). SLS first creates a graph representation for a document to obtain the importance score (i.e., literal indicator) of each sentence. Second, sentiment dictionaries are leveraged to analyze the sentence sentiments. Third, during each summarization iteration, the sentences with the lowest scores are iteratively removed, and the sentiment compensation weights of the remaining sentences are updated. With the help of sentiment compensation during the summarization process, sentiment consistencies between candidate summaries and the original documents are maintained. Intrinsic evaluations conducted on the DUC2001, DUC2002, DUC2004, and Multi-News datasets demonstrate that our approach outperforms baselines and state-of-the-art summarization methods in terms of Recall-Oriented Understudy for Gisting Evaluation (ROUGE) scores. Additionally, to further evaluate SLS performance in sentiment retention, extrinsic evaluations are introduced, and summary quality in terms of sentiment loss is evaluated by measuring the prediction accuracy for sentiment polarities of either movie (IMDb dataset) or product (Amazon dataset) review summaries. The experimental results demonstrate that our approach can improve prediction accuracy by at most 6% compared to the baseline.</p>
computer science, artificial intelligence
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the issue of sentiment information loss in the process of Automatic Text Summarization (ATS). Specifically: 1. **Sentiment Consistency**: Most existing automatic text summarization methods ignore sentiment information during the summarization process, leading to sentiment inconsistency between the generated summary and the original document. This sentiment loss is particularly noticeable in sentiment-rich documents, such as movie reviews or product evaluations. 2. **Proposed Method**: To tackle this challenge, the authors introduce a sentiment compensation mechanism and propose a graph-based extractive summarization method called Sentiment Lossless Summarization (SLS). The SLS method is implemented through the following steps: - First, the document is represented as a weighted directed graph to obtain the importance score of each sentence (i.e., literal indicator). - Second, a sentiment lexicon is used to analyze the sentiment of each sentence. - In each summarization iteration, the sentence with the lowest score is removed, and the sentiment compensation weights of the remaining sentences are updated. - Through the sentiment compensation mechanism, the sentiment consistency between the candidate summary and the original document is maintained during the summarization process. 3. **Experimental Validation**: Intrinsic evaluations on the DUC2001, DUC2002, DUC2004, and Multi-News datasets indicate that the SLS method outperforms baseline methods and state-of-the-art summarization methods in terms of ROUGE scores. Additionally, to further assess the SLS method's ability to retain sentiment, an extrinsic evaluation task was introduced to measure the sentiment polarity prediction accuracy of summaries for IMDB movie reviews and Amazon product reviews. Experimental results show that the SLS method improves sentiment polarity classification accuracy by up to 6% compared to baseline methods. In summary, the main goal of this paper is to maintain sentiment consistency during the automatic text summarization process and to validate its effectiveness through experiments on multiple datasets.