Outlier Detecting Algorithm Based on Clustering and Local Information

ZHANG Qiang,WANG Chun-xia,ZHAO Jian,WU Long-ju,LI Jing-yong
DOI: https://doi.org/10.13413/j.cnki.jdxblxb.2012.06.012
2012-01-01
Abstract:Most existing outlier detection algorithms ignore local information of data sets,they are of low accuracy.We adopted a two-phase algorithm based on k-means clustering algorithm,defined a new local stray factor as the standard to judge whether data objects are outliers.We also improved the process of detecting outliers and solved the above problem.Experiments show that our algorithm overcomes the shortcomings of existing methods,ensure the algorithm has linear time complexity and is able to find outliers in data sets more accurately and effectively.
What problem does this paper attempt to address?