A Novel Grid-Based K-Means Combined With Self-Organizing Map

Jin Ding,Yong-Zai Lu,Jian Chu
2013-01-01
Abstract:Clustering plays an important role in data analysis. The classic k-means method is not well scaled to large datasets and its performance is sensitive to the initial seeds. In this paper we present the development of a novel hybrid k-means which proceeds on the result of the self- organizing map (SOM) training and initializes its seeds based on the trained SOM grid. Theoretical analysis suggests this method is time-efficient to the massive datasets. And in addition, the numerical results show the grid-based seed initialization has faster convergence characteristics and competitive accuracy on both computer-generated and real-world test cases compared to its randomized seed initialization counterpart.
What problem does this paper attempt to address?