A Spectral Clustering-Based Dataset Structure Analysis and OutlierDetection Progress

lin hai,zhu qingsheng
DOI: https://doi.org/10.1007/978-1-4471-2386-6_91
2012-01-01
Abstract:A dataset structure analysis and outlier detection progress is proposed in the paper. The progress is designed to process those datasets, which their records’ data can be accessed but their data space structuresare not known. The proposed progress is on the basis of spectral clustering algorithm, which the number of clusters of the dataset is needed. But if the number is not given, the proposed progress first apply some certain clustering algorithm which does not need the number to cluster the dataset approximately to get a approximation of the number of clusters. Then the approximation is used to get the boundary of the number of clusters. The third step is to assign different index to each value within the to obtain the optimized result of the clustering and the number of clusters. After that the LOF algorithm is applied to find those records, which have the largest possibility to be outliers.
What problem does this paper attempt to address?