A New Outlier Detection Algorithm Based on Manifold Learning

Zhigang Tang,Jun Yang,Bingru Yang
DOI: https://doi.org/10.1109/ccdc.2010.5499017
2010-01-01
Abstract:Detecting outliers in a large set of data objects is a major data mining task aiming at finding different mechanisms responsible for different groups of objects in a data set. All existing approaches, however, are based on an assessment of distances (sometimes indirectly by assuming certain distributions) in the full-dimensional Euclidean data space. In high-dimensional data, these approaches are bound to deteriorate due to the notorious “curse of dimensionality”. In this paper, we propose a novel approach named MLOD (Manifold Learning -Based Outlier Detection), This way, the effects of the “curse of dimensionality” are alleviated compared to purely distance-based approaches. A main advantage of our new approach is that our method does not rely on any parameter selection influencing the quality of the achieved ranking. Empirical studies conducted on both real and synthetic data sets show that significant improvements in detection rate and false alarm rate are achieved using the proposed framework.
What problem does this paper attempt to address?