Algorithm for Initialization of K-Means Clustering Center Based on Optimized-Division

张健沛,杨悦,杨静,张泽宝
2009-01-01
Abstract: When clustering with the traditional K-Means algorithm, it is difficult to determine the number of clusters k, and selecting the initial cluster centers randomly reduces the algorithm's accuracy and efficiency. An algorithm for initializing K-Means cluster centers based on optimized division is proposed. The new algorithm uses a histogram method to divide the data sample space in an optimized way and identifies initial cluster centers that follow the natural structure of the data space. Experimental results demonstrate that the number of K-Means iterations decreases clearly, and that both the accuracy of the clustering results and the efficiency of the algorithm are improved.
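The abstract does not give the details of the division procedure, so the following is only a minimal sketch of the general idea of histogram-based initialization, not the authors' algorithm. It assumes a single feature axis is histogrammed, that a simple local-peak test marks dense regions, and that the mean of the points in each peak bin serves as an initial center. The function name `histogram_initial_centers` and the parameters `bins` and `axis` are hypothetical.

```python
from statistics import mean


def histogram_initial_centers(points, bins=10, axis=0):
    """Pick initial K-Means centers from histogram peaks along one axis.

    Illustrative assumption, not the paper's procedure: each local peak
    of the histogram is taken as one cluster, so the number of peaks
    also serves as an estimate of k.
    """
    values = [p[axis] for p in points]
    lo, hi = min(values), max(values)
    if lo == hi:
        # All points share one value on this axis: a single center.
        return [tuple(mean(col) for col in zip(*points))]

    # Assign every point to an equal-width bin along the chosen axis.
    width = (hi - lo) / bins
    buckets = [[] for _ in range(bins)]
    for p in points:
        idx = min(int((p[axis] - lo) / width), bins - 1)
        buckets[idx].append(p)
    counts = [len(b) for b in buckets]

    centers = []
    for i, c in enumerate(counts):
        left = counts[i - 1] if i > 0 else 0
        right = counts[i + 1] if i < bins - 1 else 0
        # A bin counts as a peak if it is non-empty, strictly higher than
        # its left neighbor, and at least as high as its right neighbor.
        if c > 0 and c > left and c >= right:
            centers.append(tuple(mean(col) for col in zip(*buckets[i])))
    return centers


if __name__ == "__main__":
    sample = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.2),
              (9.0, 9.0), (9.2, 8.8), (8.8, 9.2)]
    print(histogram_initial_centers(sample, bins=4))
```

The returned centers could then be passed to any K-Means implementation in place of random seeds. The bin count strongly affects how many peaks are found, so in practice it would need tuning.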