An adapted algorithm of choosing initial values for k-means document clustering

Yuanchao Liu,Xiaolong Wang,Bingquan Liu
DOI: https://doi.org/10.3321/j.issn:1002-0470.2006.01.003
2006-01-01
Abstract:In this paper, a novel algorithm of choosing initial values for k-means document clustering is proposed, which is based on an adapted minimum maximum principle. Firstly similarity matrix is constructed, and then an adapted minimum maximum principle is used to select both the initial seeds and the value of k. The experiment results show that the value of k found by this method is very near to the true value.
What problem does this paper attempt to address?