Algorithm for Outlier Detection in Large Dataset Based on Weighted KNN

WANG Qian,YANG Zheng-kuan
DOI: https://doi.org/10.3969/j.issn.1002-137X.2011.10.040
2011-01-01
Computer Science
Abstract: Traditional KNN is a well-established distance-based algorithm for outlier detection in large datasets. However, it uses only the distance to the kth nearest neighbor as the outlier criterion, which is inaccurate under certain conditions. This paper presents a weighted KNN outlier detection algorithm for large datasets. The algorithm introduces a weight factor for each point, defined as the average distance from the point to its k nearest neighbors. Outliers are the points with the largest distance to their kth nearest neighbor and, where that distance is the same, the largest weight. The algorithm improves the accuracy of outlier detection. Experimental results show that the algorithm is feasible in comparison with traditional KNN.
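
The ranking rule described in the abstract can be sketched in a few lines of Python. This is a minimal brute-force illustration and not the authors' implementation: the function names `knn_outlier_scores` and `top_outliers`, the choice of Euclidean distance, and the reading of "under the same condition" as a tie-break on equal kth-neighbor distance are all assumptions, and the sketch does not attempt any of the optimizations a large dataset would require.

```python
import math


def knn_outlier_scores(points, k):
    """Return a (kth_distance, weight) pair for every point.

    kth_distance: distance to the k-th nearest neighbor (the traditional criterion).
    weight: average distance to the k nearest neighbors (the weight factor).
    Brute force, O(n^2 log n); Euclidean distance is an assumption.
    """
    if not 1 <= k < len(points):
        raise ValueError("k must satisfy 1 <= k < number of points")
    scores = []
    for i, p in enumerate(points):
        # Distances from p to every other point, nearest first.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        nearest = dists[:k]
        scores.append((nearest[-1], sum(nearest) / k))
    return scores


def top_outliers(points, k, n):
    """Return the indices of the n strongest outliers.

    Points are ranked by kth_distance, largest first; ties are broken
    by the larger weight (assumed reading of the abstract).
    """
    scores = knn_outlier_scores(points, k)
    order = sorted(range(len(points)), key=lambda i: scores[i], reverse=True)
    return order[:n]


if __name__ == "__main__":
    # Hypothetical 1-D data: the points at 0 and 6 share the same
    # kth-neighbor distance (4) but have different weights (2.5 and 3).
    data = [(0,), (1,), (4,), (6,), (10,)]
    print(top_outliers(data, k=2, n=3))
```

In this toy data the points at 0 and 6 are indistinguishable under the kth-neighbor distance alone, so the weight decides their relative order, which is the situation the weight factor is meant to handle.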