An Online Detection and Tracking Method for Bursty Topics

XUE Feng,ZHOU Yadong,GAO Feng,LIU Ji,ZHAO Junzhou,DANG Qi
2011-01-01
Abstract:Text representation in text mining plays an important role,but the traditional vector space model based on TF-IDF is a static statistical model and is not flexible for bursty topic detection and tracking since it could not model the bursty dynamic text flow(such as news text flow,blog text flow,etc.) effectively.A new model called dynamic bursty vector space model is proposed to model text flow,and to detect and track bursty topics based on bursty feature detection.The proposed dynamic model has several characteristics in contrast to the traditional static model: 1) The model generates features dynamically in feature selection process;2) A unified representation of the text and topics is given;3) The model gives more weights to temporal bursty features.The experiments of bursty topic detection and tracking demonstrate that the dynamic bursty vector space model could be able to get higher precision and recall.
What problem does this paper attempt to address?