Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures

Katherine Morris,Paul D. McNicholas
DOI: https://doi.org/10.1016/j.csda.2015.10.008
2016-05-01
Abstract:A method for dimension reduction with clustering, classification, or discriminant analysis is introduced. This mixture model-based approach is based on fitting generalized hyperbolic mixtures on a reduced subspace within the paradigm of model-based clustering, classification, or discriminant analysis. A reduced subspace of the data is derived by considering the extent to which group means and group covariances vary. The members of the subspace arise through linear combinations of the original data, and are ordered by importance via the associated eigenvalues. The observations can be projected onto the subspace, resulting in a set of variables that captures most of the clustering information available. The use of generalized hyperbolic mixtures gives a robust framework capable of dealing with skewed clusters. Although dimension reduction is increasingly in demand across various application areas, many applications are biological and so some of the real data examples are within that sphere. Simulated data are also used for illustration.
statistics & probability,computer science, interdisciplinary applications
What problem does this paper attempt to address?