High Dimensional Uncertain Data Efficient Clustering Algorithm

Jian HU, Shu-bin SU, Yi-min MAO
2014-01-01
Abstract: The curse of dimensionality, noisy data, and input parameters that depend heavily on domain knowledge are all challenging problems in uncertain data clustering. To address these problems, the HDUDEC (High Dimensional Uncertain Data Efficient Clustering) algorithm, based on a similarity measure and the idea of agglomerative hierarchical clustering, is proposed. The algorithm uses a metric function that accurately expresses the similarity between high-dimensional uncertain objects to compute the similarity between objects, and then performs cluster analysis from the bottom up based on a similarity threshold. Experiments show that the new algorithm filters noisy data effectively and efficiently obtains clusters of arbitrary shape from uncertain data with little prior knowledge.
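The general idea of threshold-based bottom-up clustering can be illustrated with a minimal sketch. It is not the HDUDEC algorithm itself, since the abstract does not give the metric function or the merging rule. The sketch assumes that each uncertain object is a list of equally likely sample points, that similarity is 1 / (1 + mean pairwise Euclidean distance), and that merging is single-linkage; the names `similarity`, `threshold_agglomerative`, `sim_threshold`, and `min_size` are hypothetical.

```python
import math
from itertools import combinations


def similarity(a, b):
    """Assumed similarity between two uncertain objects (not the paper's metric).

    Each object is a list of equally likely sample points (tuples of floats).
    The mean pairwise Euclidean distance d is mapped to 1 / (1 + d), so
    identical point clouds score close to 1 and distant ones close to 0.
    """
    total = sum(math.dist(p, q) for p in a for q in b)
    return 1.0 / (1.0 + total / (len(a) * len(b)))


def threshold_agglomerative(objects, sim_threshold, min_size=2):
    """Single-linkage clustering cut at a similarity threshold.

    Merging every pair whose similarity reaches sim_threshold yields the same
    groups as running single-linkage agglomeration bottom-up and stopping at
    that threshold. Groups smaller than min_size are reported as noise.
    """
    parent = list(range(len(objects)))

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(objects)), 2):
        if similarity(objects[i], objects[j]) >= sim_threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(objects)):
        groups.setdefault(find(i), []).append(i)

    clusters = [g for g in groups.values() if len(g) >= min_size]
    noise = [i for g in groups.values() if len(g) < min_size for i in g]
    return clusters, noise
```

Calling `threshold_agglomerative(objects, sim_threshold=0.5)` returns index lists for the clusters and for the objects treated as noise. Single linkage is chosen here because chains of similar neighbours can follow arbitrary shapes; in this sketch the only inputs are the similarity threshold and the minimum cluster size.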