Approximate backbone guided heuristic clustering algorithm for uncertain data

Ping Jin,Yu Zong,Shichao Qu,Yan Hu,Yuan Tian
DOI: https://doi.org/10.13232/j.cnki.jnju.2015.01.027
2015-01-01
Abstract:As an extension of traditional data mining,uncertain data clustering gets wide interest of researchers. Uncertain data clustering is equivalent to a combinatorial optimization problem,and researchers usually solve it by using heuristic algorithms.As we knew,the existence heuristic clustering algorithms for uncertain data,such as UK-Means and UK-Medoids,are easy to be understood and to implemented,but these algorithms also have the initialization sensitivity problems which affect the clustering quality severely.In this paper,we propose an approximate backbone based heuristic clustering algorithm APPGCU (Approximate backbone guided heuristic clustering algorithm for uncertain data).In this algorithm,we first do P times sampling on the original dataset,and run UK-Medoids on the P sampled datasets to get P sub-optimal solutions;then get the approximate backbone from the P sub-optimal solutions,and extract the initialization cluster center;eventually,re-run UK-Medoids with the initialization and get the solution by a heuristic search.The experiments on synthetic and standard UCI uncertain datasets demonstrate that APPGCU gets better clustering results than existing heuristic clustering algorithms,and improves the quality of clustering results.
What problem does this paper attempt to address?