Abstract: This paper introduces a novel family of outlier detection algorithms based on Cluster Catch Digraphs (CCDs), tailored to the challenges of high dimensionality and varying cluster shapes, which degrade the performance of most traditional outlier detection methods. We propose the Uniformity-Based CCD with Mutual Catch Graph (U-MCCD), the Uniformity- and Neighbor-Based CCD with Mutual Catch Graph (UN-MCCD), and their shape-adaptive variants (SU-MCCD and SUN-MCCD), which are designed to detect outliers in high-dimensional data sets with arbitrary cluster shapes. We present the advantages and shortcomings of these algorithms and explain the motivation for defining each one. Through comprehensive Monte Carlo simulations, we assess their performance and demonstrate their robustness and effectiveness across various settings and contamination levels. We also illustrate the use of our algorithms on several real-life data sets. The U-MCCD algorithm efficiently identifies outliers while maintaining high true negative rates, and the SU-MCCD algorithm shows substantial improvement in handling non-uniform clusters. Additionally, the UN-MCCD and SUN-MCCD algorithms address the limitations of existing methods in high-dimensional spaces by using Nearest Neighbor Distances (NND) for clustering and outlier detection. Our results indicate that these algorithms offer substantial improvements in the accuracy and adaptability of outlier detection, providing a valuable tool for a range of real-world applications.

Keywords: Outlier detection, Graph-based clustering, Cluster catch digraphs, $k$-nearest-neighborhood, Mutual catch graphs, Nearest neighbor distance.

An Outlier Detection Technique Based on Spectral Clustering

Outlier Cluster Formation in Spectral Clustering

A New Outlier Detection Algorithm Based on Fast Density Peak Clustering Outlier Factor

Research on Energy Spectrum Anomaly Detection Method for State Control Radiation Environmental Monitoring Based on LOF Algorithm

Parallel spectral clustering algorithm

Detecting outliers by clustering algorithms

A method for outlier detection based on cluster analysis and visual expert criteria

A Statistical Information-Based Clustering Approach in Distance Space

An Improved K-Means Clustering Algorithm Based on Spectral Method

A neighborhood weighted-based method for the detection of outliers

Outlier Detection with Cluster Catch Digraphs

Data-driven cluster analysis method: a novel outliers detection method in multivariate data

A Unified Framework for Representation-Based Subspace Clustering of Out-of-Sample and Large-Scale Data

Spatial-temporal trajectory anomaly detection based on an improved spectral clustering algorithm

A fast MST-inspired kNN-based outlier detection method

MSD-Kmeans: A Novel Algorithm for Efficient Detection of Global and Local Outliers

MSD-Kmeans: A Hybrid Algorithm for Efficient Detection of Global and Local Outliers

Efficient and Robust KPI Outlier Detection for Large-Scale Datacenters

An Outlier Detection Method Based On Symmetry and Curvature Threshold

A Robust Spectral Clustering Algorithm for Sub-Gaussian Mixture Models with Outliers

A Fast Density Peak Clustering Method for Power Data Security Detection Based on Local Outlier Factors