Abstract:Outlier detection is of vital importance in data mining tasks, with numerous applications, including video surveillance and credit card fraud detection. Quite a few outlier detection algorithms have been developed and have received considerable attention, and most existing methods are classified as distance-based algorithms and density-based algorithms. However, both of these approaches have some flaws. The former has difficulty detecting local outliers, and the latter cannot handle low-density pattern problems. Moreover, outlier detection algorithms are sensitive to parameter settings. This paper proposes a simple and efficient outlier detection approach (called ADD) based on the average divergence difference of data objects; in this method there is no need to artificially define the number of neighbors of objects k to solve the above issues. In this algorithm, two new measures, called the divergence factor (DF) and the average divergence difference (LADD), are developed based on the skewed distribution characteristics of data objects and their natural neighbors, thus improving the accuracy of local outlier detection from an innovative research perspective. These factors are presented as external and internal characterization factors because the former characterizes the skew distribution characteristics and compactness relationship of data objects and the latter represents the difference in the skew distribution characteristics of data objects in a neighborhood. Then, we set an appropriate threshold to distinguish whether a data point is an outlier, which eliminates the interference of the Top-N problem. Finally, the final experimental results show that the ADD algorithm achieves an overall improvement in local outlier detection, especially in the detection of outliers in some datasets with complex distributions and in low-density areas, compared to that achieved by state-of-the-art algorithms.

A minimum spanning tree-inspired clustering-based outlier detection technique

A fast MST-inspired kNN-based outlier detection method

MS2OD: Outlier Detection Using Minimum Spanning Tree and Medoid Selection

Detecting outliers by clustering algorithms

A New Outlier Detection Algorithm Based on Fast Density Peak Clustering Outlier Factor

Outlier detection using iterative adaptive mini-minimum spanning tree generation with applications on medical data

Outlier Detection Algorithm Based on Reachable Neighbor

A neighborhood weighted-based method for the detection of outliers

Outlier detection algorithm based on k-nearest neighbors-local outlier factor

A local search algorithm for k-means with outliers

Clustering With Outlier Removal

Info-Detection: An Information-Theoretic Approach To Detect Outlier

Data-driven cluster analysis method: a novel outliers detection method in multivariate data

On Saving Outliers for Better Clustering over Noisy Data.

ADD: a new average divergence difference-based outlier detection method with skewed distribution of data objects

A Scalable Algorithm for Detecting Community Outliers in Social Networks.

Outliers Detection Is Not So Hard: Approximation Algorithms for Robust Clustering Problems Using Local Search Techniques

A method for outlier detection based on cluster analysis and visual expert criteria

Outlier edge detection using random graph generation models and applications

Outlier Detection with Cluster Catch Digraphs

Mean-shift outlier detection and filtering