xk-split:A Split Clustering Algorithm Bases on k-medoids

Yi-fei CHEN,Hui-qun YU
DOI: https://doi.org/10.14135/j.cnki.1006-3080.2017.06.015
2017-01-01
Abstract:In recent years,the scale of internet data has explosive growth,which makes big data analysis become a hot topic.However,it is difficult to directly utilize the collected data,so a certain degree of pretreatment had to be made in order to improve the quality of big data.In this work,the data set will be gradually divided into smaller subsets by using the split iterative process,which can effectively avoid the limitation of traditional clustering algorithm and reduce the time complexity.In addition,by thresholdbased noise data filtering,the dirty data can be eliminated during the iterative process so as to enhance the tolerance of the clustering algorithm to the dirty data.
What problem does this paper attempt to address?