An Efficient Algorithm of Hot Events Detection in Text Streams

Junliang Bai,Jun Guo,Guang Chen,Weiran Xu,Gang Du
DOI: https://doi.org/10.1109/CyberC.2010.65
2010-01-01
Abstract:Hot events detection in text streams has drawn increasing attention in recent sequential data mining works. Different from traditional TDT task which find all the real events' cluster, hot events detection only identify hot events concerned by public. This paper proposes a novel approach to identify those events based on burst terms, terms co-occurrence and generative probabilistic model. Experiments with huge text stream sets crawled from WWW suggest that our algorithm can work on-line and identify hot events effectively and efficiently.
What problem does this paper attempt to address?