DEMOS: Clustering by Pruning a Density-Boosting Cluster Tree of Density Mounts

Junyi Guan, Sheng Li, Xiaojun Chen, Xiongxiong He, Jiajia Chen
DOI: https://doi.org/10.1109/tkde.2023.3266451
IF: 9.235
2023-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: Most existing clustering algorithms require the number of clusters to be preset and often fail to capture complex shapes. Herein, we propose DEMOS (DEnsity MOuntains Separation), a clustering algorithm that prunes a density-boosting cluster tree of density mounts. A cluster is assumed to be a density-connected area that contains one or more density mounts (i.e., single-peak clusters) and has a relatively large disconnectivity from density-connected areas of higher density. Based on this assumption, DEMOS can easily detect the number of clusters and robustly reconstruct their complex shapes. It first converts the dataset into a peak graph, in which each density peak represents a density mount. A multi-valley-link-based connectivity estimation method is embedded in this step to efficiently estimate the connectivity between density peaks while the peak graph is built. Then, by applying a new linkage metric designed from our assumption, DEMOS merges the density mounts into a reasonably density-boosting cluster tree. After robustly detecting centers in a clarity-enhancing decision graph (i.e., a two-dimensional plot for detecting centers), DEMOS prunes the cluster tree into the final clusters. Experimental results on both synthetic and real datasets demonstrate the effectiveness of DEMOS and its applicability to large-scale data clustering.
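To make the idea of a decision graph more concrete, the following is a minimal sketch of the generic density-peak version of such a plot, not the DEMOS method itself. It omits the peak graph, the multi-valley-link connectivity estimate, the linkage metric, and the tree pruning described in the abstract. The function names `decision_graph` and `pick_centers`, the Gaussian-kernel density, the `bandwidth` parameter, and the product score used to rank candidates are all assumptions made for illustration.

```python
import numpy as np

def decision_graph(points, bandwidth):
    """Return (rho, delta, parent) for a generic density-peak decision graph.

    rho[i]    : Gaussian-kernel density of point i (an assumed estimator).
    delta[i]  : distance from i to its nearest point of higher density.
    parent[i] : index of that higher-density point, or -1 for the densest point.
    """
    points = np.asarray(points, dtype=float)
    # Pairwise Euclidean distances (O(n^2) memory; fine for a small sketch).
    diff = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # Subtract 1 to remove each point's contribution to its own density.
    rho = np.exp(-(dist / bandwidth) ** 2).sum(axis=1) - 1.0

    n = len(points)
    delta = np.empty(n)
    parent = np.full(n, -1)
    order = np.argsort(-rho, kind="stable")  # densest point first
    for rank, i in enumerate(order):
        if rank == 0:
            # The global density peak has no higher-density neighbour;
            # by convention it receives the largest distance.
            delta[i] = dist[i].max()
            continue
        higher = order[:rank]
        j = higher[np.argmin(dist[i, higher])]
        delta[i] = dist[i, j]
        parent[i] = j
    return rho, delta, parent

def pick_centers(rho, delta, k):
    """Pick the k points with the largest rho * delta (an assumed score)."""
    return np.argsort(-(rho * delta))[:k]
```

Plotting `rho` against `delta` gives the two-dimensional plot: points that are both dense and far from any denser point stand out as candidate centers. According to the abstract, DEMOS instead detects centers in a clarity-enhancing variant of this graph and then prunes a cluster tree, so the sketch only illustrates the underlying quantities.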