Research Progress on Outlier Mining

WANG Hong-ding, TONG Yun-hai, TAN Shao-hua, TANG Shi-wei, YANG Dong-qing
DOI: https://doi.org/10.3969/j.issn.1673-4785.2006.01.011
2006-01-01
Abstract: An outlier is a data point that differs significantly from the others in a data set. One person’s noise can be another person’s signal, so outlier mining has attracted growing interest in information science as fields such as data quality, fraud detection, intrusion detection, fault diagnosis, and military reconnaissance have received wide attention. This paper surveys outlier mining, from its basic concepts to the principal research problems and underlying techniques, including the origins of outliers, definitions of an outlier, and a comparison of popular outlier mining methods. It also summarizes the current state of the art of these techniques and discusses future research topics and the challenges facing outlier mining.