A K-MEANS CLUSTERING ALGORITHM WITH NOISE PROCESSING

Jin Lu,Yuefei Guo
DOI: https://doi.org/10.3969/j.issn.1000-386x.2015.10.063
2015-01-01
Abstract:K-means clustering as classical clustering algorithm is sensitive to noise.In practical applications,the data usually contain many noises and this makes it difficult to obtain a good clustering result.This paper proposes a K-means clustering algorithm with noise pro-cessing.The algorithm divides original space to several regions dynamically,and calculates the weighted similarity matrix of sample and each regional centroid using correlated regional density and uses it as the input of K-means algorithm.The matrix effectively describes the distribu-tion information of data and at the same time realises the dimensionality reduction of features so that the clustering tasks with noise data can be processed more effectively.The proposed algorithm is more suitable for the situation of complex data distribution.Experimental result proves the effectiveness of the algorithm.
What problem does this paper attempt to address?