A Clustering Algorithm for Mixed Valued Data Based on Aggregate Function

WANG Yu,YANG Li
DOI: https://doi.org/10.3321/j.issn:1000-8608.2006.03.026
2006-01-01
Abstract:Aggregate function which approximates the maximum function,is introduced,and data clustering problem is reformulated as the unconstrained optimization.Firstly,a computing scheme for clustering center is inferred for numeric valued data, applying the first order necessary condition.Secondly,a new distance concept and computing scheme for categorical valued data are presented using decomposition method of categorical valued attributes,and furthermore,a new clustering approach for mixed numeric and categorical valued data is presented.Finally,computing experiment and analysis for Chinese loanwords in English are given by using different centers of clustering.The results show that the aggregate clustering algorithm is superior to the fuzzy k-prototypes algorithm in both computing efficiency and effects.
What problem does this paper attempt to address?