Improving Density Peak Clustering on Multi-Dimensional Time Series: Rediscover and Subdivide

Huina Wang,Bo Liu,Huaipu Zhao,Guangzhi Qu
DOI: https://doi.org/10.1007/s10115-024-02272-7
IF: 2.7
2024-01-01
Knowledge and Information Systems
Abstract:The density peak clustering (DPC) algorithm identifies patterns in high-dimensional data and obtains robust outcomes across diverse data types with minimal hyperparameters. However, DPC may produce inaccurate pattern sizes in multi-dimensional datasets and exhibit poor performance in recognizing similar patterns. To solve these issues, we propose the rediscover and subdivide density peak clustering algorithm (RSDPC), which follows three key strategies. The first strategy, rediscover, iteratively uncovers prominent patterns within the existing data. The second strategy, subdivide, partitions patterns into several similar subclasses. The third strategy, re-sort, rectifies errors from the preceding steps by incorporating critical distance and nearest distance considerations. The experimental results show that RSDPC is feasible and effective in synthetic and practical datasets compared with state-of-the-art works.
What problem does this paper attempt to address?