A Neighborhood-Based K-medoids Clustering Algorithm

谢娟英,郭文娟,谢维信
DOI: https://doi.org/10.3969/j.issn.1672-4291.2012.04.005
2012-01-01
Abstract:A new K-medoids algorithm is proposed based on the neighborhood of samples in a dataset. This algorithm defines the radius of the neighborhood and the related neighborhood for each sample according to the distribution of samples in a dataset, and selects the samples that not only lie in higher density area, but also are far away from each other as initial seeds for K- medoids algorithm to overcome the potential disadvantage of the fast K-medoids algorithm to select the samples in the same cluster as initial seeds for different clusters. The proposed K- medoids algorithm is tested on some well-known data sets from UCI machine learning repository and on some synthetic datasets with noisy samples. The experimental results demonstrate that the proposed algorithm achieves the excellent clustering result in short time and is not sensible to noisy data. It outperforms the traditional K-medoids algorithm(Partitioning Around Medoids, PAM) and the fast K-medoids algorithm of Park's.
What problem does this paper attempt to address?