Abstract:Generalized category discovery presents a challenge in a realistic scenario, which requires the model's generalization ability to recognize unlabeled samples from known and unknown categories. This paper revisits the challenge of generalized category discovery through the lens of information maximization (InfoMax) with a probabilistic parametric classifier. Our findings reveal that ensuring independence between known and unknown classes while concurrently assuming a uniform probability distribution across all classes, yields an enlarged margin among known and unknown classes that promotes the model's performance. To achieve the aforementioned independence, we propose a novel InfoMax-based method, Regularized Parametric InfoMax (RPIM), which adopts pseudo labels to supervise unlabeled samples during InfoMax, while proposing a regularization to ensure the quality of the pseudo labels. Additionally, we introduce novel semantic-bias transformation to refine the features from the pre-trained model instead of direct fine-tuning to rescue the computational costs. Extensive experiments on six benchmark datasets validate the effectiveness of our method. RPIM significantly improves the performance regarding unknown classes, surpassing the state-of-the-art method by an average margin of 3.5%.

What problem does this paper attempt to address?

The paper attempts to address the problem of how to improve the model's ability to recognize known and unknown categories in Generalized Category Discovery (GCD), especially when dealing with unlabeled samples. Specifically, the paper focuses on how to ensure the independence between known and unknown categories during model training, while assuming a uniform probability distribution across all categories, to expand the boundary between known and unknown categories and thus enhance model performance. ### Background and Challenges The task of Generalized Category Discovery requires the model to recognize unlabeled samples from both known and unknown categories, which is very challenging in real-world scenarios. Existing methods perform poorly when dealing with long-tailed datasets and fail to adequately consider the independence between known and unknown categories, leading to confusion when the model distinguishes between these two types of samples. ### Main Contributions of the Paper 1. **Revisiting Information Maximization**: The paper revisits the problem of Generalized Category Discovery from the perspective of Information Maximization (InfoMax), revealing that ignoring the independence assumption between known and unknown categories can lead to confusion, especially when using probabilistic parametric classifiers. 2. **Proposing a New Method**: To ensure the independence between known and unknown categories, the paper proposes a new method—Regularized Parametric InfoMax (RPIM). This method uses reliable pseudo-labels as additional supervision to ensure the independence assumption is met during the information maximization process. 3. **Introducing Semantic Bias Transformation**: To further improve probabilistic parametric classifiers and reduce computational costs, the paper introduces a simple yet effective semantic bias transformation to refine the features of pre-trained models without extensive fine-tuning. ### Experimental Results The paper conducts extensive experiments on six benchmark datasets to validate the effectiveness of the proposed method. The experimental results show that RPIM significantly outperforms existing state-of-the-art (SOTA) methods on unknown categories, with an average performance improvement of 3.5%. ### Key Technical Details - **Information Maximization**: By maximizing the mutual information \(I(X;Y;Z)\) between input features \(X\), true labels \(Y\), and predictions \(Z\), the model's performance on both known and unknown categories is ensured. - **Pseudo-Labels**: The Sinkhorn-Knopp algorithm is used to generate soft pseudo-labels, and high-quality pseudo-labels are selected through thresholding. - **Regularization Term**: A regularization term \(L_R\) is introduced to ensure the independence between known and unknown categories by minimizing the entropy of unlabeled samples. - **Semantic Bias Transformation**: A linear layer is used to learn a bias \(b\) and add it to the features \(X\), followed by normalization to maintain the original characteristics of the semantic features. In summary, through theoretical analysis and experiments, the paper demonstrates that ensuring the independence between known and unknown categories can significantly improve the performance of Generalized Category Discovery tasks.

Revisiting Mutual Information Maximization for Generalized Category Discovery

Parametric Information Maximization for Generalized Category Discovery

ImbaGCD: Imbalanced Generalized Category Discovery

Parametric Classification for Generalized Category Discovery: A Baseline Study

Pseudo-supervised contrastive learning with inter-class separability for generalized category discovery

Generalized Categories Discovery for Long-tailed Recognition

Multimodal Generalized Category Discovery

Contextuality Helps Representation Learning for Generalized Category Discovery

Feature Extractor Optimization for Discriminative Representations in Generalized Category Discovery

Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information Estimator

Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery

A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization

Preserving domain private information via mutual information maximization

Person Re-Identification by Enhanced Local Maximal Occurrence Representation and Generalized Similarity Metric Learning

Unknown Sample Selection and Discriminative Classifier Learning for Generalized Category Discovery

Prototypical Classifier with Distribution Consistency Regularization for Generalized Category Discovery: A Strong Baseline

CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery

Genetic InfoMax: Exploring Mutual Information Maximization in High-Dimensional Imaging Genetics Studies

Unleashing the Potential of Model Bias for Generalized Category Discovery

Let’s Start Over: Retraining with Selective Samples for Generalized Category Discovery

Maximizing Mutual Information Across Feature and Topology Views for Learning Graph Representations