EvoS&R: Evolving Multiple Seeds and Radii For Varying Density Data Clustering

Jun-Xian Chen,Yue-Jiao Gong,Wei-Neng Chen,Jun Zhang
DOI: https://doi.org/10.1109/tkde.2023.3312760
IF: 9.235
2023-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Density clustering has shown advantages over other types of clustering methods for processing arbitrarily shaped datasets. In recent years, extensive research efforts has been made on the improvements of DBSCAN or the algorithms incorporating the concept of density peaks. However, these previous studies remain the problems of being sensitive to the parameter settings, and some of them will stuck in weak results when encountering the situations of varying-density distributions. To overcome these issues, we propose an evolution framework named EvoS&R that evolves multiple seeds and the corresponding radii for varying-density data clustering. Compared with the traditional methods, EvoS&R handles the parameter tuning and multi-density fitting problems in an integrated and straightforward manner. Note that, however, the underlying task in EvoS&R is a mixed-variable optimization problem that is challenging in nature. We specifically design a hybrid encoding differential evolution algorithm with novel encoding, mutation, etc., to solve the optimization problem efficiently. Extensive experiments on density-based datasets shows that our algorithm outperforms the other state-of-the-arts in most cases, which validates the effectiveness of the proposed method.
computer science, information systems, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?