Abstract:Clustering analysis is frequently used in data mining, image processing, artificial intelligence, and so on. Traditional approaches heavily rely on manually configured parameters, of which the initial selection exerts a profound influence on the clustering outcomes. In addition, they usually only consider the relationship between two individual samples when calculating distances, neglecting the overall structure of the dataset, which can negatively affect clustering performance. At the same time, many contemporary algorithms are tailored to specific datasets, posing challenges in achieving optimal clustering performance for intricate, noisy datasets. To address these limitations, we propose an Adaptive Gravitational Clustering Algorithm Integrated with Noise Detection called GCIND. Inspired by the law of gravitation, GCIND takes into account the natural neighborhood structure of the entire dataset, adaptively computing the gravitation between data points by leveraging shared neighbors and Euclidean distance relationships. Our algorithm initially identifies and eliminates outliers or edge points in the dataset. It subsequently uses gravitation to autonomously cluster the remaining core data. Finally, the removed data are reallocated to their respective clusters. GCIND has four notable advantages: (1) it uses gravitation to build the neighborhood graph, reflecting the overall dataset structure; (2) it demonstrates stronger robustness in handling noisy datasets; (3) it uses adaptive gravitational neighborhood graph clustering, removing manual parameter tuning; (4) it adapts to complex manifold-structured datasets, offering broad applicability. Experiments have shown that GCIND, without requiring any parameter settings, demonstrates slightly better performance than the algorithms compared in the study, especially when dealing with complex manifold datasets.

NonPC: Non-parametric Clustering Algorithm with Adaptive Noise Detecting.

A robust clustering method with noise identification based on directed K-nearest neighbor graph

A Novel Graph-Based Clustering Method Using Noise Cutting

A Novel Density Peaks Clustering Algorithm Based on K Nearest Neighbors with Adaptive Merging Strategy

An Effective Nonparametric Clustering Algorithm Based on Statistical Features of Neighborhood

Graph Distance and Adaptive K-Nearest Neighbors Selection-Based Density Peak Clustering

DPC-DNG: Graph-based Label Propagation of K-Nearest Higher-Density Neighbors for Density Peaks Clustering

Non-parameter Clustering Algorithm Based on Chain Propagation and Natural Neighbor

Adaptive Gravitational Clustering Algorithm Integrated with Noise Detection

ANN-DPC: Density peak clustering by finding the adaptive nearest neighbors

A Parameterless Clustering Algorithm Based on Strict Neighborhood Graph

Noises Cutting and Natural Neighbors Spectral Clustering Based on Coupling P System

Clustering by Detecting Density Peaks and Assigning Points by Similarity-First Search Based on Weighted K-Nearest Neighbors Graph

A Hierarchical Clustering Algorithm Based on Noise Removal

Fast Clustering Using Adaptive Density Peak Detection

WC-KNNG-PC: Watershed clustering based on k-nearest-neighbor graph and Pauta Criterion

Adaptive K Near Neighbor Clustering Algorithm for Data with Non-spherical-shape Distribution

Intrusion Detection Based on Adaptive Polyclonal Clustering

Nonlinear Subspace Clustering Via Adaptive Graph Regularized Autoencoder.

An Adaptive Anomaly Detection Based on Hierarchical Clustering

Adaptive Density Peaks Clustering Based on K-Nearest Neighbor and Gini Coefficient