K -means Algorithm Based on Outliers Detection

Yong-lin LENG,Qing-chen ZHANG,Liang ZHAO,Fu-yu LU
DOI: https://doi.org/10.13831/j.cnki.issn.1673-0569.2014.01.014
2014-01-01
Abstract:K-means algorithm is widely used in real life for its simple and rapid characteristics .However , traditional K-means algorithm is affected by outliers , leading to the instability of the clustering results and low accuracy of the clustering .For this problem , the paper proposes a novel K -means algorithm based on outliers detection .The presented algorithm firstly detects outliers from the given dataset , which can avoid selecting outli-ers as the initial seed .After clustering all the objects which are not outliers , the algorithm allocates every outlier to the corresponding cluster according to distance between the outlier and different clusters .The presented algo-rithm reduces the impact of outliers on traditional K -means algorithm and improves the clustering accuracy .For the given number of categories of the clusters and in the standard UCI data sets ,the experimental results indicate that the algorithm is effective , reduces the influence of outlier on the K -means algorithm , improving the accura-cy and stability of the cluster .
What problem does this paper attempt to address?