A Fast Spectral Clustering Method Based on Growing Vector Quantization for Large Data Sets

Xiujun Wang,Xiao Zheng,Feng Qin,Baohua Zhao
DOI: https://doi.org/10.1007/978-3-642-53917-6_3
2013-01-01
Abstract:Spectral clustering is a flexible clustering algorithm that can produce high-quality clusters on small scale data sets, but it is limited applicable to large scale data sets because it needs On 3 computational operations to process a data set of n data points[1]. Based on the minimization of the increment of distortion, we tackle this problem by developing a novel efficient growing vector quantization method to preprocess a large scale data set, which can compress the original data set into a small set of representative data points in one scan of the original data set. Then we apply spectral clustering algorithm to the small set. Experiments on real data sets show that our method provides fast and accurate clustering results.
What problem does this paper attempt to address?