Research on High Dimensional Clustering Algorithm Based on Similarity Measure

HUANG Si-da, CHEN Qi-mai
DOI: https://doi.org/10.3969/j.issn.1000-386x.2009.09.032
2009-01-01
Abstract: To address the difficulty of defining a similarity measure for high-dimensional data, this paper designs a new high-dimensional clustering algorithm. The algorithm is based on a new similarity measure function that expresses the degree of similarity among high-dimensional data more accurately. It proceeds in two steps: first, it applies the similarity measure function to each pair of high-dimensional data points to obtain a similarity matrix; second, it performs cluster analysis on that matrix using a bottom-up method. Experiments show that the algorithm improves the accuracy and effectiveness of cluster analysis and is not affected by outliers. It is also insensitive to the input order of the data.
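The two steps described in the abstract (build a similarity matrix, then cluster bottom-up) can be illustrated with a minimal sketch. The abstract does not give the paper's similarity measure function, so `similarity` below is a placeholder chosen only for illustration; the average-linkage merge rule, the `threshold` stopping parameter, and all function names are likewise assumptions, not the authors' method.

```python
from itertools import combinations


def similarity(x, y):
    """Placeholder measure (an assumption, not the paper's function):
    mean over dimensions of 1 / (1 + |x_i - y_i|), which lies in (0, 1]."""
    return sum(1.0 / (1.0 + abs(a - b)) for a, b in zip(x, y)) / len(x)


def similarity_matrix(points):
    """Step 1: pairwise similarity of every pair of points."""
    n = len(points)
    sim = [[1.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        sim[i][j] = sim[j][i] = similarity(points[i], points[j])
    return sim


def average_link(sim, a, b):
    """Mean similarity between members of clusters a and b (assumed linkage)."""
    return sum(sim[i][j] for i in a for j in b) / (len(a) * len(b))


def bottom_up_cluster(points, threshold):
    """Step 2: start with singleton clusters and repeatedly merge the most
    similar pair until no pair reaches the (hypothetical) threshold."""
    sim = similarity_matrix(points)
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > 1:
        best_score, (p, q) = max(
            (average_link(sim, clusters[p], clusters[q]), (p, q))
            for p, q in combinations(range(len(clusters)), 2)
        )
        if best_score < threshold:
            break  # remaining clusters are too dissimilar to merge
        clusters[p] = clusters[p] + clusters[q]
        del clusters[q]  # safe because p < q
    return [sorted(c) for c in clusters]


if __name__ == "__main__":
    data = [[0, 0, 0], [0.1, 0, 0.1], [5, 5, 5], [5.1, 5, 4.9], [20, -20, 20]]
    print(bottom_up_cluster(data, threshold=0.5))
```

Because merging depends only on the pairwise similarity values, a sketch of this kind groups points the same way when the input is reordered (barring exact ties), and a point far from all others simply stays in its own cluster. Whether the paper's algorithm obtains its order-insensitivity and outlier robustness in this way is not stated in the abstract.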