Deep Clustering Using the Soft Silhouette Score: Towards Compact and Well-Separated Clusters

Georgios Vardakas,Ioannis Papakostas,Aristidis Likas
2024-02-01
Abstract:Unsupervised learning has gained prominence in the big data era, offering a means to extract valuable insights from unlabeled datasets. Deep clustering has emerged as an important unsupervised category, aiming to exploit the non-linear mapping capabilities of neural networks in order to enhance clustering performance. The majority of deep clustering literature focuses on minimizing the inner-cluster variability in some embedded space while keeping the learned representation consistent with the original high-dimensional dataset. In this work, we propose soft silhoutte, a probabilistic formulation of the silhouette coefficient. Soft silhouette rewards compact and distinctly separated clustering solutions like the conventional silhouette coefficient. When optimized within a deep clustering framework, soft silhouette guides the learned representations towards forming compact and well-separated clusters. In addition, we introduce an autoencoder-based deep learning architecture that is suitable for optimizing the soft silhouette objective function. The proposed deep clustering method has been tested and compared with several well-studied deep clustering methods on various benchmark datasets, yielding very satisfactory clustering results.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main goal of this paper is to propose a new deep clustering method aimed at improving the quality of clustering by optimizing a new metric called the "soft silhouette score." Specifically, this method attempts to address the following issues: 1. **Improving Clustering Quality**: Existing deep clustering methods often focus only on reducing intra-cluster variability while neglecting inter-cluster separability. The method proposed in this paper aims to consider both intra-cluster compactness and inter-cluster separability to achieve clearer and better-separated clustering results. 2. **Overcoming Existing Limitations**: Traditional silhouette scores assume hard clustering, which is not suitable for probabilistic clustering results; moreover, they are not differentiable, making them unsuitable for use as loss functions in neural network training. To address these issues, the authors propose the soft silhouette score, which can evaluate probabilistic clustering results and is differentiable, making it suitable as a training objective function. 3. **Developing a New Deep Clustering Framework**: The paper introduces a deep clustering architecture based on autoencoders, which directly learns cluster assignment probabilities from the data and uses the soft silhouette score as part of the loss function to guide the network in learning more compact and well-separated embedding representations. In summary, the goal of this research is to propose a deep learning framework capable of producing high-quality clustering results, particularly in handling high-dimensional data to form clusters that are both compact and well-separated, thereby better reflecting the intrinsic structure of the data.