Clustering ensemble algorithm for categorical data

LI Tao-ying,CHEN Yan,ZHANG Jin-song,ZHANG Lin
DOI: https://doi.org/10.3969/j.issn.1001-3695.2011.05.021
2011-01-01
Abstract:In order to prevent the inaccuracy and randomness of single clustering algorithm,and error of existing clustering algorithm transferring categorical data into numerical data for clustering,this paper proposed the clustering ensemble for catego-rical data.The algorithm produced clustering memberships by values of categorical data,and then used similarity degree to partition dataset,which reduced the process of clustering by minimizing the objective function.Finally,applied the algorithm into UCI dataset.The results show its efficiency and accuracy are better than existing algorithms,the design and refreshing methods are effective.
What problem does this paper attempt to address?