Analyze Blog′s Simularity Passing the New Clustering Algorithm Called Increase K-Means

WU Hai-hua,LI Shao-zi,LIN Da-zhen,KE Xiao,CAO Dong-lin
DOI: https://doi.org/10.3321/j.issn:0438-0479.2009.02.010
2009-01-01
Abstract:In view of existing situations that K-Means must assign the breeds in advance and Affinity Propagation must endure high Computational Complexity,we presented a new clustering algorithm called Increase K-Means.We applied this new approach to the analysis of the Blog′s content similarity,and served the need of Community finding and Topic tracking better.Experiments showed that our new approach approximated to the K-Means in the running time and got close to the Affinity Propagation in the accuracy,just saying this,our new approach was suited to deal with the large-scale Web text better.
What problem does this paper attempt to address?