Improving Short Text Clustering Performance with Keyword Expansion.

Jun Wang,Yiming Zhou,Lin Li,Biyun Hu,Xia Hu
DOI: https://doi.org/10.1007/978-3-642-01216-7_31
2009-01-01
Abstract:Most of traditional text clustering methods are based on bag of words representation, which ignore the important information on semantic relationship between key terms. To overcome this problem, researchers have recently proposed several new methods for improving short text clustering accuracy based on enriching short text representation. However, the computational costs of these methods based on expanding words appeared in short texts are usually time-consuming. In this paper, we improve previous work by enriching short text representation with keyword expansion. Empirical results show that the proposed method can greatly save time without sacrificing clustering accuracy.
What problem does this paper attempt to address?