Document Clustering Using Sample Weighting

Chengzhi Zhang, Xinning Su, Dongmin Zhou
2007-01-01
Abstract: Clustering algorithms based on sample weighting have attracted attention recently. In this paper, a novel sample-weighting clustering algorithm is presented, based on the K-Means and fuzzy C-Means algorithms. The algorithm uses academic documents as the clustering objects. The PageRank value of each document is calculated from the citation relationships among the documents and is used as that document's weight in the algorithm. Experiments show that the proposed algorithm effectively improves the performance of document clustering.
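The abstract does not give the update rules, so the following is only a minimal sketch of how PageRank scores might serve as sample weights in a K-Means-style algorithm, not the authors' formulation. It assumes that the weights enter through a weighted mean in the centroid update, that PageRank is computed by plain power iteration with a damping factor of 0.85, and that documents are already represented as numeric feature vectors. The function names, the toy citation matrix, and the toy feature vectors are all hypothetical. The fuzzy C-Means variant is not covered.

```python
import numpy as np

def pagerank(adj, damping=0.85, iters=100, tol=1e-10):
    """Power-iteration PageRank; adj[i, j] = 1 means document i cites document j."""
    n = adj.shape[0]
    out_deg = adj.sum(axis=1)
    # Row-normalise; documents that cite nothing spread their rank uniformly.
    trans = np.where(out_deg[:, None] > 0,
                     adj / np.maximum(out_deg, 1)[:, None],
                     1.0 / n)
    rank = np.full(n, 1.0 / n)
    for _ in range(iters):
        new_rank = (1 - damping) / n + damping * (trans.T @ rank)
        converged = np.abs(new_rank - rank).sum() < tol
        rank = new_rank
        if converged:
            break
    return rank / rank.sum()

def weighted_kmeans(X, weights, k, iters=50, seed=0):
    """K-Means in which each centroid is the weighted mean of its members
    (an assumed way of applying sample weights, not taken from the paper)."""
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    labels = np.zeros(len(X), dtype=int)
    for _ in range(iters):
        # Squared Euclidean distance from every sample to every centroid.
        dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        new_centroids = centroids.copy()
        for c in range(k):
            mask = labels == c
            if mask.any():
                new_centroids[c] = np.average(X[mask], axis=0, weights=weights[mask])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids

# Hypothetical toy data: six documents in two groups; documents 0 and 3 are
# each cited by the other two documents in their group.
X = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]], dtype=float)
adj = np.zeros((6, 6))
adj[1, 0] = adj[2, 0] = 1
adj[4, 3] = adj[5, 3] = 1

ranks = pagerank(adj)
labels, centroids = weighted_kmeans(X, ranks, k=2)
print(ranks)
print(labels)
print(centroids)
```

In this sketch, a heavily cited document pulls its cluster's centroid toward itself, which is one plausible reading of "used as the weight in the algorithm"; other choices, such as weighting the distance term, would be equally consistent with the abstract.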