Parallelization of Spectral Clustering Algorithm on Multi-Core Processors and GPGPU

Jing Zheng,Wenguang Chen,Yurong Chen,Yimin Zhang,Ying Zhao,Weimin Zheng
DOI: https://doi.org/10.1109/apcsac.2008.4625449
2008-01-01
Abstract:Spectral clustering is a widely-used algorithm in the field of information retrieval, data mining, machine learning and many others. It can help to cluster a large number of data into several categories without requiring any additional information about the dataset or the categories, so that people can find information by categories easily. In this paper, we parallelize the algorithm proposed by Andrew Y. Ng, Michael I. Jordan and Yair Weiss. We provide two versions of implementation: one is parallelized in OpenMP; the other is programmed in the NVIDIA CUDA (compute unified device architecture), which is the environment provided by NVIDIA to program on its CUDA-Enabled GPGPUs (general-purpose graphic processing unit). We can achieve about three times speedup in OpenMP and around ten times speedup using CUDA in our experiments.
What problem does this paper attempt to address?