A Microblog Hot Topic Detection Algorithm Based on Discrete Particle Swarm Optimization

Huifang Ma, Yugang Ji, Xiaohong Li, Runan Zhou
DOI: https://doi.org/10.1007/978-3-319-42911-3_23
2016-01-01
Abstract: Traditional hot topic detection algorithms do not perform optimally on microblogs because of inherent flaws in three areas: constructing a short-text representation model, running the core algorithm on a large corpus in a short time, and evaluating the algorithm's quality during hot topic detection. In this paper, a novel method for detecting hot topics in microblogs is presented. The approach uses a probabilistic correlation-based representation measure to ensure a dense, low-dimensional microblog representation matrix. In addition, we treat clustering as an optimization problem and introduce discrete particle swarm optimization (DPSO) to simplify the clustering process used to detect topics. Furthermore, a clustering quality evaluation criterion is adopted as the optimization objective function for topic detection, so the algorithm's quality can be evaluated after each iteration. Experimental results on corpora containing more than 148,000 tweets show that our algorithm is an effective hot topic detection method for microblogs.
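The abstract does not state the particle update rules or the exact quality criterion, so the following is only a minimal sketch of the general idea of clustering as discrete optimization. It assumes each particle is a vector of cluster labels, that a position update copies labels from the personal best or global best with fixed probabilities (plus a small random mutation), and that the objective is the negative within-cluster squared distance. The names `cohesion` and `dpso_cluster` and all parameter values are hypothetical and are not taken from the paper.

```python
import random


def cohesion(points, labels, k):
    """Assumed quality criterion: negative within-cluster squared distance.

    Higher is better. This stands in for the paper's clustering quality
    criterion, which the abstract does not specify.
    """
    total = 0.0
    for c in range(k):
        members = [p for p, l in zip(points, labels) if l == c]
        if not members:
            continue  # empty clusters contribute nothing
        dim = len(members[0])
        centroid = [sum(m[d] for m in members) / len(members) for d in range(dim)]
        total += sum(
            sum((m[d] - centroid[d]) ** 2 for d in range(dim)) for m in members
        )
    return -total


def dpso_cluster(points, k, n_particles=20, n_iters=60,
                 p_personal=0.3, p_global=0.3, p_mutate=0.05, seed=0):
    """Illustrative discrete PSO: each particle is one label per point."""
    rng = random.Random(seed)
    n = len(points)

    # Random initial label assignments.
    swarm = [[rng.randrange(k) for _ in range(n)] for _ in range(n_particles)]
    pbest = [s[:] for s in swarm]
    pbest_fit = [cohesion(points, s, k) for s in swarm]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]

    for _ in range(n_iters):
        for i, particle in enumerate(swarm):
            # Assumed discrete update: per position, copy from the personal
            # best, copy from the global best, mutate, or keep the label.
            for j in range(n):
                r = rng.random()
                if r < p_personal:
                    particle[j] = pbest[i][j]
                elif r < p_personal + p_global:
                    particle[j] = gbest[j]
                elif r < p_personal + p_global + p_mutate:
                    particle[j] = rng.randrange(k)

            # The objective is evaluated after every move, so the quality
            # of the current clustering is known at each iteration.
            fit = cohesion(points, particle, k)
            if fit > pbest_fit[i]:
                pbest[i], pbest_fit[i] = particle[:], fit
                if fit > gbest_fit:
                    gbest, gbest_fit = particle[:], fit

    return gbest, gbest_fit
```

In this sketch, calling `dpso_cluster(vectors, k=5)` on a list of equal-length numeric tuples returns the best label vector found and its objective value. Because the objective doubles as the quality measure, the best score can never decrease from one iteration to the next, which mirrors the abstract's point about evaluating quality after each iteration.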