K-Centers Algorithm for Clustering Mixed Type Data

Wei-Dong Zhao, Wei-Hui Dai, Chun-Bin Tang
DOI: https://doi.org/10.1007/978-3-540-71701-0_129
2007-01-01
Abstract: The K-modes and K-prototypes algorithms both update centroids with a frequency-based method that keeps only the attribute value with the highest frequency and neglects all other values, which reduces the accuracy of the clustering results. To solve this problem, the K-centers clustering algorithm is proposed for mixed-type data. As an extension of the K-prototypes algorithm, hard and fuzzy K-centers algorithms are presented, together with a new centroid update method that accounts for the effect of attribute values with different frequencies on clustering accuracy. Experiments on many UCI machine-learning databases show that the K-centers algorithm clusters categorical and mixed-type data more efficiently and effectively than the K-modes and K-prototypes algorithms.
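The limitation described in the abstract can be illustrated with a minimal Python sketch. It contrasts the standard mode-based centroid used by K-modes with a hypothetical centroid that keeps the relative frequency of every attribute value. The second variant is only an assumption made for illustration; the abstract does not specify the K-centers update rule, so the function names, the toy data, and the frequency-based distance below should not be read as the authors' method.

```python
from collections import Counter


def mode_centroid(rows):
    """K-modes style update: keep only the most frequent value per attribute."""
    return [Counter(col).most_common(1)[0][0] for col in zip(*rows)]


def frequency_centroid(rows):
    """Hypothetical alternative: keep the relative frequency of every value."""
    n = len(rows)
    return [{v: c / n for v, c in Counter(col).items()} for col in zip(*rows)]


def mismatch_to_mode(row, centroid):
    """Simple matching distance: count attributes that differ from the mode."""
    return sum(1 for v, m in zip(row, centroid) if v != m)


def distance_to_frequency(row, centroid):
    """Assumed distance: 1 minus the frequency of the row's value, per attribute."""
    return sum(1.0 - freqs.get(v, 0.0) for v, freqs in zip(row, centroid))


# Toy cluster with two categorical attributes (colour, size).
cluster = [
    ("red", "small"),
    ("red", "large"),
    ("blue", "large"),
    ("red", "large"),
    ("blue", "small"),
]

mode = mode_centroid(cluster)
freq = frequency_centroid(cluster)

seen = ("blue", "small")     # both values occur in the cluster
unseen = ("green", "small")  # "green" never occurs in the cluster

# The mode centroid cannot tell these two objects apart.
print(mismatch_to_mode(seen, mode), mismatch_to_mode(unseen, mode))
# The frequency centroid places the object with known values closer.
print(distance_to_frequency(seen, freq), distance_to_frequency(unseen, freq))
```

In this toy case both objects differ from the mode in every attribute, so the mode-based distance treats them identically, while the frequency-based representation still credits "blue" for appearing in two of the five cluster members.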