Hierarchical and Incremental Structural Entropy Minimization for Unsupervised Social Event Detection

Yuwei Cao,Hao Peng,Zhengtao Yu,Philip S. Yu
2023-12-19
Abstract:As a trending approach for social event detection, graph neural network (GNN)-based methods enable a fusion of natural language semantics and the complex social network structural information, thus showing SOTA performance. However, GNN-based methods can miss useful message correlations. Moreover, they require manual labeling for training and predetermining the number of events for prediction. In this work, we address social event detection via graph structural entropy (SE) minimization. While keeping the merits of the GNN-based methods, the proposed framework, HISEvent, constructs more informative message graphs, is unsupervised, and does not require the number of events given a priori. Specifically, we incrementally explore the graph neighborhoods using 1-dimensional (1D) SE minimization to supplement the existing message graph with edges between semantically related messages. We then detect events from the message graph by hierarchically minimizing 2-dimensional (2D) SE. Our proposed 1D and 2D SE minimization algorithms are customized for social event detection and effectively tackle the efficiency problem of the existing SE minimization algorithms. Extensive experiments show that HISEvent consistently outperforms GNN-based methods and achieves the new SOTA for social event detection under both closed- and open-set settings while being efficient and robust.
Social and Information Networks,Machine Learning
What problem does this paper attempt to address?
The paper aims to address several key issues in social media event detection: 1. **Problems with existing methods**: Although methods based on Graph Neural Networks (GNN) perform well, they have the following limitations: - They only link messages with completely identical attributes, ignoring the associations between semantically similar but differently attributed messages. - They require manually labeled data for training and need to pre-determine the number of events during prediction. 2. **Research objectives**: The paper proposes a new framework called HISEvent to address the above issues by minimizing Graph Structural Entropy (SE). Specifically, the goals of this method include: - Constructing a more informative message graph that includes associations based on common attributes as well as those based on semantic similarity. - Proposing an unsupervised method to detect social events without the need for manually labeled data or pre-setting the number of events. - Designing efficient algorithms to handle large and complex message graphs, addressing the inefficiency of existing SE minimization algorithms. In summary, this research aims to improve the limitations of existing GNN methods by introducing an information-theoretic perspective and SE minimization techniques to build a more comprehensive and accurate social event detection system.