Distributed Density Peaks Clustering Revisited

Jing Lu, Yuhai Zhao, Kian-Lee Tan, Zhengkui Wang
DOI: https://doi.org/10.1109/tkde.2020.3034611
IF: 9.235
2020-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: Density Peaks (DP) Clustering organizes data into clusters by finding peaks in dense regions. This involves computing density (ρ) and distance (δ) of every point, and the time complexity is O(N²), where N is the number of data points. In this paper, we propose a fast distributed density peaks clustering algorithm, FDDP, based on the z-value index. We also propose FC, an efficient algorithm that employs a forward computing strategy to calculate ρ linearly, and CB, which uses a caching and efficient searching strategy to compute δ. Our experimental results show that FDDP outperforms the state-of-the-art algorithms significantly.
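To make the O(N²) cost concrete, here is a minimal sketch of the baseline (non-distributed) computation of ρ and δ that the abstract describes. It is not FDDP, FC, or CB. The function name `density_peaks_naive`, the cutoff parameter `d_c`, and the tie-handling rule are assumptions made for illustration and are not taken from the paper.

```python
import math


def density_peaks_naive(points, d_c):
    """Baseline rho/delta computation for Density Peaks clustering.

    Assumptions (not from the paper): rho is a cutoff count with radius d_c,
    and any point with no strictly denser neighbour gets delta equal to its
    largest distance to another point.
    """
    n = len(points)
    # All pairwise Euclidean distances: this is the O(N^2) step.
    dist = [[math.dist(p, q) for q in points] for p in points]

    # rho[i]: number of other points closer than d_c to point i.
    rho = [
        sum(1 for j in range(n) if j != i and dist[i][j] < d_c)
        for i in range(n)
    ]

    # delta[i]: distance to the nearest point with strictly higher density.
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        delta.append(min(higher) if higher else max(dist[i]))
    return rho, delta


points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11)]
rho, delta = density_peaks_naive(points, d_c=1.5)
```

In this toy input, the three points near the origin are the densest, and the two points near (10, 10) receive a large δ because their nearest denser point lies in the other group. Both nested loops touch every pair of points, which is the quadratic cost the abstract refers to.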