Outburst Topic Detection for Web Forums

CHEN You,YANG Sen
DOI: https://doi.org/10.3969/j.issn.1003-0077.2010.03.005
2010-01-01
Abstract:Web forum has become an important resource on the Web due to its rich information contributed by millions of Internet users every day.Consequently,the outburst topic detection becomes a fundamental task in Search Engine and Web Mining systems.Most existing topic detection and tracking(TDT) methods deal with the news stories,which are proved not suitable for extracting topics in casual,oral and informal languageon the noisy Web formus.This paper presents a noise-filtered model to extract the outburst topics from web forums using terms and participations of users.The proposed model employs not only content similarity,but also user participation information.Experiments on ShuiMu community demonstrate the efficiency of the proposed model: not only extracting the outburst topics which are better organized for search and visualization but also discovering communities corresponding to these topics.
What problem does this paper attempt to address?