K-Means Clustering Center Optimization Method Based on Random Perturbation

Lilei SUN,Jin? QIN
DOI: https://doi.org/10.15958/j.cnki.gdxbzrb.2016.04.18
2016-01-01
Abstract:For the shortcomings of K-Means algorithm that it is sensitive to initial value and easily plunge into lo-cal optimum, a randomized clustering center optimization algorithm was proposed. First of all, recalculating the clustering center after each iteration, searching points within a certain neighborhood area outward of the center, choosing the clustering center with probability from the points found in neighborhood area, this point is called the physical center. Then, the selected physical center moving to the cluster center with a certain distance at a cer-tain speed, the calculating location is the new clustering center; Finally, dividing the data set according to the Euclidean distance. The improved algorithm changes the clustering center by probability perturbation method, and enlarges its center to a certain area to search the optimal solution, so the improved algorithm can greatly a-voids the K-Means algorithm falling into local optimum; and even if the calculation process is trapped in local optimum, the optimized algorithm can also jump out with a certain probability through searching the optimal re-gion.
What problem does this paper attempt to address?