Efficient Keywords Clustering Method for Topic Detection

YANG Pan,GUI Xiaolin,TIAN Feng,WANG Gang
2012-01-01
Abstract:An improved term-committee-based event identification algorithm is presented to meet the requirements of efficiency and accuracy in public opinion monitor system,where the original event identification algorithm can not be applied due to its lower efficiency.While the similarity between the clusters is calculated,the weight is taken into consideration simultaneously.Referencing the examples from normal curve,an evaluation algorithm is proposed to help choosing cluster with a proper term number,thus the improved algorithm only needs clustering once.The experiments indicate the operating efficiency for the required accuracy.
What problem does this paper attempt to address?