Brief Survey of K-Means Clustering Algorithms

Hui Ming Liu,Jin Grong Lu
DOI: https://doi.org/10.4028/www.scientific.net/amm.740.624
2015-01-01
Applied Mechanics and Materials
Abstract:K-means is one of the most widely used algorithms for clustering. Ease of implementation, efficiency, simplicity, and empirical success are the main reasons for its popularity. In actual application, there are some defects in traditional k-means, for example, the value of K need to be specified ahead, initial clustering center is a random choice and so on; this influences the performance of the K-means. In order to overcome these obstacles, many variants of K-means algorithm have appeared. We provide a brief overview of k-means, point out existing problems; summarize major improvements in the determination of clusters number, the initialization of the cluster, the similarity measurement, the sensitivity of noise and outliers and so on. Further study directions of K-means are pointed at last.
What problem does this paper attempt to address?