Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks

Heng Wang,Wenqian Zhang,Yuyang Bai,Zhaoxuan Tan,Shangbin Feng,Qinghua Zheng,Minnan Luo
2023-10-26
Abstract:Online movie review platforms are providing crowdsourced feedback for the film industry and the general public, while spoiler reviews greatly compromise user experience. Although preliminary research efforts were made to automatically identify spoilers, they merely focus on the review content itself, while robust spoiler detection requires putting the review into the context of facts and knowledge regarding movies, user behavior on film review platforms, and more. In light of these challenges, we first curate a large-scale network-based spoiler detection dataset LCS and a comprehensive and up-to-date movie knowledge base UKM. We then propose MVSD, a novel Multi-View Spoiler Detection framework that takes into account the external knowledge about movies and user activities on movie review platforms. Specifically, MVSD constructs three interconnecting heterogeneous information networks to model diverse data sources and their multi-view attributes, while we design and employ a novel heterogeneous graph neural network architecture for spoiler detection as node-level classification. Extensive experiments demonstrate that MVSD advances the state-of-the-art on two spoiler detection datasets, while the introduction of external knowledge and user interactions help ground robust spoiler detection. Our data and code are available at <a class="link-external link-https" href="https://github.com/Arthur-Heng/Spoiler-Detection" rel="external noopener nofollow">this https URL</a>
Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the issue of spoiler detection in movie reviews. Existing spoiler detection models mainly focus on the content of the review text itself. The method proposed in this paper goes beyond this limitation by improving the accuracy of spoiler detection through the combination of external movie knowledge and user interaction behavior on movie review platforms. Specifically, the paper makes the following contributions: 1. **Constructing a Large-Scale Dataset**: The paper first creates a large-scale spoiler detection dataset, LCS, which contains over 1.8 million reviews, nearly 500,000 actor information, and 15 types of metadata. Compared to existing datasets, it is larger in scale and more timely updated. 2. **Building a Movie Knowledge Base**: The paper also constructs a comprehensive movie knowledge base, UKM, which covers important information about modern movies. Compared with existing movie knowledge bases, it shows a larger scale and richer information. 3. **Proposing a Multi-View Spoiler Detection Framework (MVSD)**: The paper proposes a multi-view spoiler detection framework, MVSD, based on graph neural networks. This framework can comprehensively consider external movie knowledge and user interaction networks by constructing three interconnected heterogeneous information networks to model different data sources and their multi-view features. 4. **Experimental Validation**: Extensive experiments show that MVSD significantly outperforms baseline models on two spoiler detection datasets, improving the F1 score by at least 2.01 and 3.22, demonstrating the importance of incorporating external knowledge and user interactions for achieving accurate and reliable spoiler prediction. In summary, the paper greatly advances the development of spoiler detection research through resource construction and methodological innovation.