BERT-Based Meta-Learning Approach with Looking Back for Sentiment Analysis of Literary Book Reviews

Hui Bao,Kai He,Xuemeng Yin,Xuanyu Li,Xinrui Bao,Haichuan Zhang,Jialun Wu,Zeyu Gao
DOI: https://doi.org/10.1007/978-3-030-88483-3_18
2021-01-01
Abstract:Sentiment analysis of literary book reviews is crucial for authors to understand the voice of readers and for humanities researchers to investigate the reader reception of the literary book. Constructing an automated sentiment analysis system can help us analyze literary reviews in a big-data way, achieving a complete view of the full understanding. The main challenge in developing such a system is the lack of large-scale datasets with precise annotations. Some related researches collect the star rates from social networks and convert them into corresponding sentiment annotations for data labeling. Such automatically generated annotations can greatly reduce the human labeling efforts but with two defects. The first is that such annotations are filling with a vast amount of noise. For example, different people have significantly different perceptions for sentiment markers (positive, neutral, and negative) of a five-star comment system. Besides, literary book reviews contain more positive emotions than neutral and negative. Namely, this task is also challenged by imbalanced data. This paper introduces an automatically generated sentiment analysis dataset for Chinese literary book reviews with parts of manual verification, containing 187 literary books with 109,286 reviews. Furthermore, we propose a novel meta-learning approach with Looking Back mechanism to build a robust BERT-based sentiment classification model. Specifically, we design an extra meta-model for learning a sample weighting function from the historical sample information, which mitigates the influence of noisy labels and class imbalance problems. The parameters of this meta-model are finely updated with the whole training process by the guidance of a small amount of unbiased meta data. The results show the F1 score raise of 13.51% with the Looking Back mechanism, showing that our method can significantly promote sentiment analysis performance with noisy labels and class imbalance data.
What problem does this paper attempt to address?