A Clustering Method for Analysis of Data Subject to Pre-Defined Classifications

Yang Liu
DOI: https://doi.org/10.2139/ssrn.3403864
2019-01-01
SSRN Electronic Journal
Abstract:In this paper, we present a methodology to perform clustering and grouping analysis for dataset with classification constraints or definitions. The discussion is demonstrated with a full example based on real data. We start with the observed difference in the CIA and UN subregional definition of European countries, and consider what the impact is from a subregional house price ratio perspective. As documented in this report, we find that the presented approach useful for clustering analysis of the pre-identified subgroups to address subgroup based clustering problems.
What problem does this paper attempt to address?