Improved Density-Based Algorithm for Robust Clustering

HOU Yue-xian, SHI Lu-kui, HE Pi-lian, ZHANG Ying
DOI: https://doi.org/10.3969/j.issn.0493-2137.2005.12.012
2005-01-01
Abstract: A formal criterion for verifying the validity of sequential clustering algorithms is presented. Based on this criterion, DBSCANJZPoll, a variant of the DBSCAN clustering algorithm, is proposed. DBSCANJZPoll runs an order-dependent clustering sub-procedure a few times, each with a random processing order, and determines the final clustering result by means of a simple statistical principle derived from the formal criterion. Simulation results show that, on noisy data, the clustering precision of DBSCANJZPoll is superior to that of DBSCAN. Moreover, theoretical analysis indicates that, for high-dimensional data, the time complexity of DBSCANJZPoll approaches O(N) asymptotically, which is better than that of other algorithms in the DBSCAN family. Simulations on low-dimensional data also show superior time efficiency for DBSCANJZPoll.
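The abstract does not specify the order-dependent sub-procedure or the statistical principle, so the following Python sketch only illustrates the general "run several times in random order, then poll" idea. Everything in it is an assumption made for illustration: the sub-procedure is a simple leader clustering (not DBSCAN and not the authors' method), the poll is a majority vote over pairwise co-assignments, and the names leader_clustering, poll_clustering, eps, runs, and seed are hypothetical.

```python
import random
from itertools import combinations
from math import dist


def leader_clustering(points, order, eps):
    """Order-dependent pass (stand-in sub-procedure, an assumption).

    Each point joins the first existing leader within eps,
    otherwise it becomes a new leader.
    """
    leaders = []                      # indices of leader points
    labels = [None] * len(points)
    for i in order:
        for k, lead in enumerate(leaders):
            if dist(points[i], points[lead]) <= eps:
                labels[i] = k
                break
        else:
            leaders.append(i)
            labels[i] = len(leaders) - 1
    return labels


def poll_clustering(points, eps, runs=15, seed=0):
    """Run the order-dependent pass `runs` times and poll the results.

    Assumed polling rule: two points end up together if they shared
    a cluster in a strict majority of the runs.
    """
    rng = random.Random(seed)
    n = len(points)
    votes = {pair: 0 for pair in combinations(range(n), 2)}
    for _ in range(runs):
        order = list(range(n))
        rng.shuffle(order)            # random processing order
        labels = leader_clustering(points, order, eps)
        for i, j in votes:
            if labels[i] == labels[j]:
                votes[(i, j)] += 1

    # Merge majority-linked pairs with a small union-find.
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (i, j), v in votes.items():
        if 2 * v > runs:
            parent[find(i)] = find(j)

    # Relabel components as 0, 1, 2, ... in order of first appearance.
    roots = {}
    return [roots.setdefault(find(i), len(roots)) for i in range(n)]


if __name__ == "__main__":
    pts = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
           (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
    print(poll_clustering(pts, eps=0.5, runs=9, seed=1))
```

Note that this sketch counts votes over all point pairs and is therefore quadratic in N; it does not reproduce the near-linear time complexity claimed in the abstract.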