A Parameterless Clustering Algorithm Based on Strict Neighborhood Graph

Weiyi Zhang, Juntao Yang, Rujuan Zhang, Lijun Yang
DOI: https://doi.org/10.1109/isctis58954.2023.10213093
2023-01-01
Abstract: Clustering is one of the key methods of data mining and has broad application prospects in artificial intelligence and other fields. Most popular clustering algorithms require a parameter K to be set manually in order to divide the data into categories. This paper presents a robust nonparametric clustering algorithm based on the Strict Neighborhood Graph. The algorithm uses the natural neighborhood algorithm and an optimized density formula to preprocess noisy data sets, which lets it handle noise and outliers better and improves the accuracy of the clustering results. Compared with traditional clustering algorithms such as K-means and DBSCAN, the natural-neighborhood-based algorithm removes the step of manually setting K, which significantly reduces the influence of human factors on results in experiments and applications. Combining it with the noise-point preprocessing formula yields higher accuracy. Validity and comparison experiments show that the algorithm clusters better without a K value being set and handles noisy data sets better. Because it needs no parameter tuning, the algorithm is simpler and more adaptable than traditional algorithms, and it performs better across different types of data sets. The algorithm can therefore complete clustering tasks efficiently in many scenarios, and its advantages will become more pronounced as data scale and complexity grow.
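The abstract does not spell out how the natural neighborhood step avoids a user-chosen K, so the following is only a minimal sketch of a natural-neighbor search as it is commonly described in the literature, not the authors' implementation. The function name `natural_neighbors`, the brute-force distance sort, the stopping rule, and the use of mutual neighbors as graph edges are all assumptions made for illustration; the paper's Strict Neighborhood Graph and density formula may differ.

```python
import math

def natural_neighbors(points):
    """Hypothetical helper: grow the neighborhood size r until it stabilizes.

    Returns (r, mutual), where mutual[i] is the set of points that are
    mutual r-nearest neighbors of point i.
    """
    n = len(points)
    # For each point, every other point sorted by distance (brute force).
    order = [
        sorted((j for j in range(n) if j != i),
               key=lambda j, i=i: math.dist(points[i], points[j]))
        for i in range(n)
    ]
    reverse_count = [0] * n  # how many points list i among their r nearest
    prev_orphans = None
    r = 0
    while r < n - 1:
        r += 1
        # Each point "votes" for its r-th nearest neighbor.
        for i in range(n):
            reverse_count[order[i][r - 1]] += 1
        orphans = sum(1 for c in reverse_count if c == 0)
        # Assumed stopping rule: no orphans left, or orphan count stalls.
        if orphans == 0 or orphans == prev_orphans:
            break
        prev_orphans = orphans
    knn = [set(order[i][:r]) for i in range(n)]
    # Keep only mutual relations: j is in i's list and i is in j's list.
    mutual = [{j for j in knn[i] if i in knn[j]} for i in range(n)]
    return r, mutual

# Two well-separated 2x2 squares of points.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11)]
r, mutual = natural_neighbors(points)
```

In this sketch the neighborhood size is derived from the data rather than supplied by the user, which is the property the abstract emphasizes. Points left with no or very few reverse neighbors when the loop stops are natural candidates for noise handling, though how the paper scores them is not stated here.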