Improved Method for Noise Detection by DBSCAN and Angle Based Outlier Factor in High Dimensional Datasets

Sarita Tripathy,Laxman Sahoo
DOI: https://doi.org/10.1007/978-981-13-8715-9_27
2019-08-02
Abstract:Various data mining methods are used to detect outliers from different databases. It is essential to detect outliers in different kinds of real time application areas, such as the health care sector and insurance, marketing, banks and finance et c. The proposed method in this paper is a new outlier detection method from a high dimensional data sets which combines the angle based outlier detection (ABOD) with the classical density based clustering method DBSCAN. The algorithm consists of three stages in which the first stage consists of applying of the PCA on the data set which will result in a subset of attributes, to this subset of attributes the dbscan algorithm is applied which results in detection of a set of outliers. In the third and final stage ABOD is applied to the set of outliers. Experimental analysis conducted state that the result improves the detection accuracy and decreases the number of false positives.
What problem does this paper attempt to address?