Hot topic detection in Chinese web forum using statistics approach

Xiaoyu Li,GuanZhong Dai,Shuang Lai,Hang Dai
DOI: https://doi.org/10.1109/ICSPCC.2011.6061621
2011-01-01
Abstract:In this paper we propose a statistics approach for hot topic detection in Chinese web forum. In order to solve the fundamental obstacles of Chinese web data mining, such as new words, nonstandard syntax and Chinese word segmentation, we present the longest common segmented consecutive subsequence (LCSCS) and other techniques. The algorithm can run even without prior knowledge. Our experiments show the satisfying results both in performance and quality. © 2011 IEEE.
What problem does this paper attempt to address?