An Automatic Clustering Algorithm Based on a Competition Model of Probabilistic PCA

Yunxia Li, Jian Cheng Lv, Xiaojie Li
DOI: https://doi.org/10.1007/978-3-319-13359-1_6
2015-01-01
Abstract: A number of mixture models of local Principal Component Analysis (PCA) have been developed to analyze data distributed in space. Most of these models require the user to specify the number of local PCA models, i.e., the number of clusters, before clustering analysis can begin. This is not a reasonable requirement in practical applications. This paper proposes an automatic clustering algorithm that analyzes data based on a competition model of probabilistic PCA. Without the number of clusters being identified in advance, the algorithm automatically evolves to partition a given data set into small clusters according to the empirical rule of the Gaussian distribution. It is shown that the algorithm not only groups data but can also explore the hierarchical structure of a given data set.
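The abstract does not say how the empirical rule of the Gaussian distribution enters the partitioning step. As a minimal sketch of the rule itself, and not of the paper's algorithm, the Python snippet below computes the probability mass within k standard deviations of the mean and applies a simple one-dimensional band test. The function names `gaussian_coverage` and `within_k_sigma`, the one-dimensional setting, and the default threshold k = 3 are all assumptions made for illustration.

```python
import math


def gaussian_coverage(k: float) -> float:
    """Probability that a Gaussian sample lies within k standard deviations of its mean."""
    return math.erf(k / math.sqrt(2.0))


def within_k_sigma(x: float, mean: float, std: float, k: float = 3.0) -> bool:
    """Hypothetical membership test (not from the paper): is x inside the k-sigma band?"""
    return abs(x - mean) <= k * std


# Coverage for k = 1, 2, 3, i.e. the familiar 68-95-99.7 rule.
coverage = {k: round(gaussian_coverage(k), 4) for k in (1, 2, 3)}
print(coverage)
```

Under these assumptions, a point farther than three standard deviations from a cluster's mean would be expected only about 0.27% of the time if it truly came from that Gaussian, which is why such a band is a natural criterion for deciding whether a point belongs to a cluster.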