The Research of Qualitative Data Hierarchical Cluster Based on Dummy Variable

Xiaoyu Zhang, Meilin Zhu, Jian Huang, Guohua Chen
DOI: https://doi.org/10.1109/WCICA.2006.1714237
2006-01-01
Abstract: Existing data clustering algorithms measure similarity between objects using distance metrics defined mainly for quantitative data. To address this, the statistical idea of dummy variables is used to transform qualitative data into a sample matrix expressed on a binary scale. Correlations between variables are eliminated to reduce the complexity of the analysis. The dissimilarities and distances among the variables are then computed. An experiment using the proposed method shows that it is simple and effective.
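The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify how redundancy between variables is removed, which distance is used, or which linkage rule is applied. The sketch assumes that one level per variable is dropped (the usual dummy-variable convention), that squared Euclidean distance is taken on the binary codes, and that average linkage is used. It also clusters objects (rows) rather than variables. The function names `dummy_encode`, `sq_euclidean`, and `average_linkage`, and the toy data, are hypothetical.

```python
from itertools import combinations


def dummy_encode(rows):
    """Turn qualitative rows into 0/1 vectors.

    Assumption: the first (alphabetical) level of each variable is dropped,
    so the remaining columns of one variable are not linearly dependent.
    """
    n_vars = len(rows[0])
    levels = [sorted({row[j] for row in rows}) for j in range(n_vars)]
    encoded = []
    for row in rows:
        bits = []
        for j, value in enumerate(row):
            bits.extend(1 if value == level else 0 for level in levels[j][1:])
        encoded.append(bits)
    return encoded


def sq_euclidean(a, b):
    """Squared Euclidean distance; on 0/1 vectors it counts differing bits."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def average_linkage(points, n_clusters):
    """Naive agglomerative clustering: merge the closest pair of clusters
    (by mean pairwise distance) until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]

    def dist(pair):
        a, b = pair
        total = sum(sq_euclidean(points[i], points[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > n_clusters:
        a, b = min(combinations(clusters, 2), key=dist)
        clusters.remove(a)
        clusters.remove(b)
        clusters.append(sorted(a + b))
    return sorted(clusters)


# Toy data (invented for illustration): colour, size, in-stock flag.
rows = [
    ("red", "small", "yes"),
    ("red", "small", "yes"),
    ("blue", "large", "no"),
    ("blue", "large", "no"),
    ("red", "small", "no"),
]
codes = dummy_encode(rows)
groups = average_linkage(codes, 2)
print(codes)
print(groups)
```

One caveat of dropping a level: for a variable with three or more levels, two objects that differ on it are one bit apart when one of them holds the dropped level and two bits apart otherwise. Keeping every level avoids that asymmetry at the cost of redundant columns; the toy data uses only two-level variables, so the issue does not arise here.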