Research on Subspace Characteristic of High Dimension Outlier Dataset

Jin Yifu, Zhu Qingsheng, Zou Xianlin
DOI: https://doi.org/10.3321/j.issn:1002-8331.2006.09.046
2006-01-01
Abstract: This paper discusses efficient methods for explaining and analyzing outliers. To describe the outlying features of a high-dimensional dataset quantitatively, it defines a degree of outlying contribution based on attribute reduction in rough set theory. It puts forward outlying partition and outlying reduction, together with a method for analyzing the key attribute subspace of outliers, then presents an algorithm for outlying reduction and analyzes its complexity. Experimental results show that the approach can be used to identify the origin of outliers and to improve understanding of the whole dataset, and that the proposed algorithm is scalable and efficient.