Improved K-means Clustering Algorithm Based Density and Sample Size

ZHAO Da-wei,Xiao Zhou-fang
2008-01-01
Abstract:Studying and improving the K-means algorithm, it can search the proper value k automatically, and try the best to find the isolated points. Firstly, The algorithm calculates the most probable number of clustering. Then, finding the sample circle, dividing the cicle, swatches are distributed to their shares by their positions. Followed, the improved algorithm puts clustering into practice. Lastly, some small classes are combined by adopting DBSCAN algorithm. The source code of the improved algorithm is put into the open platform "weak" which is developed by University of Waikato, New Zealand to test the performance of the improved algorithm. It has compared with the original K-means algorithm in many datas, which proves that it precedes the original K-means algorithm in quality and stability.
What problem does this paper attempt to address?