Document Clustering Description Based on Combination Strategy

Chengzhi Zhang
DOI: https://doi.org/10.1109/icicic.2009.178
2009-01-01
Abstract:Document clustering description is a problem of labeling the clustered results of document collection clustering. It can help users determine whether one of the clusters is relevant to users' information require. Therefore, labeling a clustered set of documents is an important and challenging work in document clustering applications. The DCF (description comes first) method can generate document clustering description. For the clustering description base on DCF is generate before document clustering, there is 'semantic interval' between clustering description and cluster central vector. So, it contradicts to the intuition of 'first clustering, second description', and decreases the readability of clustering description. A method based on combination strategy, i.e. combination of the DCF and DCL (description comes last) is proposed to solve the problem of the weak readability of clustering description in this paper. Experimental results show that the method is effective, and the method is used to describe the search result clustering.
What problem does this paper attempt to address?