Abstract:Deep clustering is a crucial task in machine learning and data mining that focuses on acquiring feature representations conducive to clustering. Previous research relies on self-supervised representation learning for general feature representations, such features may not be optimally suited for downstream clustering tasks. In this paper, we introduce MICCF, a framework designed to bridge this gap and enhance clustering performance. MICCF enhances feature representations by combining mutual information at different levels and employs an auxiliary alignment mutual information module for learning clustering-oriented features. To be specific, we propose a dual mutual information constraints module, incorporating minimal mutual information constraints at the feature level and maximal mutual information constraints at the instance level. This reduction in feature redundancy encourages the neural network to extract more discriminative features, while maximization ensures more unbiased and robust representations. To obtain clustering-oriented representations, the auxiliary alignment mutual information module utilizes pseudo-labels to maximize mutual information through a multi-classifier network, aligning features with the clustering task. The main network and the auxiliary one work in synergy to jointly optimize feature representations that are well-suited for the clustering task. We validate the effectiveness of our method through extensive experiments on six benchmark datasets. The results indicate that our method performs well in most scenarios, particularly on fine-grained datasets, where our approach effectively distinguishes subtle differences between closely related categories. Notably, our approach achieved a remarkable accuracy of 96.4% on the ImageNet-10 dataset, surpassing other comparison methods. The code is available at https://github.com/Li-Hyn/MICCF.git .

Information Maximization Clustering Via Multi-View Self-Labelling

Clustering by Maximizing Mutual Information Across Views

Image Clustering Based on Multi-Scale Deep Maximize Mutual Information and Self-Training Algorithm

Self-labelling via simultaneous clustering and representation learning

Multi-View Data Fusion Oriented Clustering via Nuclear Norm Minimization

Multi-View Clustering from the Perspective of Mutual Information

Learning Representations by Maximizing Mutual Information Across Views

Self-representation and matrix factorization based multi-view clustering

Self-supervised Multi-view Clustering Framework with Graph Filtering and Contrast Fusion.

Generalized Information-theoretic Multi-view Clustering

Self-Supervised Discriminative Feature Learning for Deep Multi-View Clustering

Latent Representation Guided Multi-view Clustering

Self-Supervised Information Bottleneck for Deep Multi-View Subspace Clustering

MICCF: A Mutual Information Constrained Clustering Framework for Learning Clustering-Oriented Feature Representations

Efficient and Effective Deep Multi-view Subspace Clustering

Multi-view Self-Paced Learning for Clustering

Self-Supervised Deep Multi-View Subspace Clustering.

Deep Multiview Clustering Via Iteratively Self-Supervised Universal and Specific Space Learning

Towards Generalized Multi-stage Clustering: Multi-view Self-distillation

Integrating Vision-Language Semantic Graphs in Multi-View Clustering

Intrinsic Self-Representation for Multi-View Subspace Clustering