A New Partition-based Clustering Algorithm For Mixed Data

ZHONG Xian,YU TianBao,XIA HongXia
2017-01-01
Abstract:In practical application field, it is common to see the mixed data containing both the numerical attributes and categorical attributes simultaneously. However, most of the given clustering algorithms can only deal with data in single type, either numerical or categorical. Therefore we present a partition-based clustering algorithm for mixed data. First, the multi-Modes representation means of category centres of categorical data in K-prototypes algorithm is given, and then the Euclidean is expanded to mixed data so as to reflect the dissimilarity between the objects and categories more accurately in same framework. On the basis, a partitioned clustering algorithm for processing mixed data is proposed. Finally, the experimental results on UCI dataset show that the proposed algorithm can effectively improve the clustering performance comparing with the K-prototypes algorithm.
What problem does this paper attempt to address?