Parametric and non-parametric unsupervised cluster analysis

Stephen J. Roberts
DOI: https://doi.org/10.1016/s0031-3203(96)00079-9
IF: 8
1997-02-01
Pattern Recognition
Abstract:Much work has been published on methods for assessing the probable number of clusters or structures within unknown data sets. This paper aims to look in more detail at two methods, a broad parametric method, based around the assumption of Gaussian clusters and the other a non-parametric method which utilises methods of scale-space filtering to extract robust structures within a data set. It is shown that, whilst both methods are capable of determining cluster validity for data sets in which clusters tend towards a multivariate Gaussian distribution, the parametric method inevitably fails for clusters which have a non-Gaussian structure whilst the scale-space method is more robust.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?