Robust Local Outlier Detection

Haizhou Du,Shengjie Zhao,Daqiang Zhang
DOI: https://doi.org/10.1109/icdmw.2015.114
2015-01-01
Abstract:With the rapid expansion of data scale, big data mining and analysis has attracted increasing attention. Outlier detection as an important task of data mining is widely used in many applications. However, conventional outlier detection methods have difficulty handling large-scale datasets. In addition, most of them typically can only identify global outliers and are over sensitive to parameters variation. In this paper, we propose a robust method for robust local outlier detection with statistical parameters, which incorporates the clustering based ideas in dealing with big data. Firstly, This method find some density peaks of dataset by 3s standard. Secondly each remaining data object in the dataset is assigned to the same cluster as its nearest neighbor of higher density. Finally, we use Chebyshevs inequality and density peak reachability to identify local outliers of each group. The experimental results demonstrate the efficiency and accuracy of the proposed method in identifying both global and local outliers, Moreover, the method also proved more robust analysis than typical outlier detection methods, such as LOF and DBSCAN.
What problem does this paper attempt to address?