Choosing models in model-based clustering and discriminant analysis

Christophe Biernacki,Gérard. Govaert
DOI: https://doi.org/10.1080/00949659908811966
IF: 1.225
1999-08-01
Journal of Statistical Computation and Simulation
Abstract:Using an eigenvalue decomposition of variance matrices, Celeux and Govaert (1993) obtained numerous and powerful models for Gaussian model-based clustering and discriminant analysis. Through Monte Carlo simulations, we compare the performances of many classical criteria to select these models: information criteria as AIC, the Bayesian criterion BIC, classification criteria as NEC and cross-validation. In the clustering context, information criteria and BIC outperform the classification criteria. In the discriminant analysis context, cross-validation shows good performance but information criteria and BIC give satisfactory results as well with, by far, less time computing.
statistics & probability,computer science, interdisciplinary applications
What problem does this paper attempt to address?