Revisiting Mutual Information Maximization for Generalized Category Discovery

Zhaorui Tan, Chengrui Zhang, Xi Yang, Jie Sun, Kaizhu Huang
2024-05-31
Abstract: Generalized category discovery presents a challenge in a realistic scenario, which requires the model's generalization ability to recognize unlabeled samples from known and unknown categories. This paper revisits the challenge of generalized category discovery through the lens of information maximization (InfoMax) with a probabilistic parametric classifier. Our findings reveal that ensuring independence between known and unknown classes, while concurrently assuming a uniform probability distribution across all classes, yields an enlarged margin among known and unknown classes that promotes the model's performance. To achieve the aforementioned independence, we propose a novel InfoMax-based method, Regularized Parametric InfoMax (RPIM), which adopts pseudo labels to supervise unlabeled samples during InfoMax, while proposing a regularization to ensure the quality of the pseudo labels. Additionally, we introduce a novel semantic-bias transformation to refine the features from the pre-trained model instead of direct fine-tuning to reduce the computational costs. Extensive experiments on six benchmark datasets validate the effectiveness of our method. RPIM significantly improves the performance regarding unknown classes, surpassing the state-of-the-art method by an average margin of 3.5%.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses how to improve a model's ability to recognize both known and unknown categories in Generalized Category Discovery (GCD), particularly for unlabeled samples. Specifically, it focuses on ensuring independence between known and unknown categories during training, while assuming a uniform probability distribution across all categories, in order to enlarge the margin between known and unknown categories and thereby improve model performance.

### Background and Challenges

Generalized Category Discovery requires the model to recognize unlabeled samples from both known and unknown categories, which is very challenging in real-world scenarios. Existing methods perform poorly on long-tailed datasets and do not adequately account for the independence between known and unknown categories, so the model confuses the two types of samples.

### Main Contributions of the Paper

1. **Revisiting Information Maximization**: The paper revisits Generalized Category Discovery from the perspective of Information Maximization (InfoMax), showing that ignoring the independence assumption between known and unknown categories can lead to confusion, especially when using probabilistic parametric classifiers.
2. **Proposing a New Method**: To ensure independence between known and unknown categories, the paper proposes Regularized Parametric InfoMax (RPIM). This method uses reliable pseudo-labels as additional supervision so that the independence assumption holds during information maximization.
3. **Introducing Semantic Bias Transformation**: To further improve probabilistic parametric classifiers and reduce computational costs, the paper introduces a simple yet effective semantic bias transformation that refines the features of pre-trained models without extensive fine-tuning.
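
To make the InfoMax perspective above more concrete, the sketch below computes the textbook InfoMax estimate for a parametric classifier: the entropy of the average prediction minus the average per-sample prediction entropy. This is a generic illustration, not the paper's RPIM objective; the function names `entropy` and `infomax_objective` and the toy inputs are assumptions made for the example. The first term is largest when the average prediction is uniform over classes, which is where a uniform class prior enters.

```python
import math


def entropy(p):
    """Shannon entropy (in nats) of one probability vector."""
    return -sum(q * math.log(q) for q in p if q > 0.0)


def infomax_objective(probs):
    """Empirical H(mean prediction) - mean H(prediction).

    probs: list of per-sample class-probability vectors (each sums to 1).
    Hypothetical helper for illustration only.
    """
    n = len(probs)
    k = len(probs[0])
    # Marginal class distribution: average prediction over all samples.
    marginal = [sum(row[j] for row in probs) / n for j in range(k)]
    # Average uncertainty of the individual predictions.
    conditional = sum(entropy(row) for row in probs) / n
    return entropy(marginal) - conditional


# Toy predictions over two classes (made-up values).
confident_balanced = [[1.0, 0.0], [0.0, 1.0]]  # confident, classes used evenly
uncertain = [[0.5, 0.5], [0.5, 0.5]]           # every prediction is a coin flip
collapsed = [[1.0, 0.0], [1.0, 0.0]]           # confident, but one class only

print(infomax_objective(confident_balanced))  # about 0.693 (= ln 2)
print(infomax_objective(uncertain))           # 0 up to rounding
print(infomax_objective(collapsed))           # 0 up to rounding
```

Only predictions that are both confident and spread evenly across classes score highly; uncertain predictions and collapse onto a single class both score zero.
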
### Experimental Results

The paper conducts extensive experiments on six benchmark datasets to validate the effectiveness of the proposed method. The experimental results show that RPIM significantly outperforms existing state-of-the-art (SOTA) methods on unknown categories, with an average performance improvement of 3.5%.

### Key Technical Details

- **Information Maximization**: By maximizing the mutual information \(I(X;Y;Z)\) between input features \(X\), true labels \(Y\), and predictions \(Z\), the model's performance on both known and unknown categories is ensured.
- **Pseudo-Labels**: The Sinkhorn-Knopp algorithm is used to generate soft pseudo-labels, and high-quality pseudo-labels are selected through thresholding.
- **Regularization Term**: A regularization term \(L_R\) is introduced to ensure the independence between known and unknown categories by minimizing the entropy of unlabeled samples.
- **Semantic Bias Transformation**: A linear layer is used to learn a bias \(b\) and add it to the features \(X\), followed by normalization to maintain the original characteristics of the semantic features.

In summary, through theoretical analysis and experiments, the paper demonstrates that ensuring the independence between known and unknown categories can significantly improve the performance of Generalized Category Discovery tasks.
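
The pseudo-labeling step listed under the key technical details (Sinkhorn-Knopp soft labels followed by thresholding) can be sketched roughly as follows. This is a minimal pure-Python version of the commonly used balanced-assignment form of Sinkhorn-Knopp, not the authors' implementation; the temperature `epsilon`, the iteration count, the threshold value, and the helper names `sinkhorn_knopp` and `select_confident` are all assumptions chosen for illustration.

```python
import math


def sinkhorn_knopp(logits, epsilon=0.05, n_iters=3):
    """Turn classifier logits into soft labels with balanced class usage.

    logits: list of per-sample score vectors (n samples, k classes).
    Returns n rows that each sum to 1. Hypothetical helper for illustration.
    """
    n, k = len(logits), len(logits[0])
    top = max(max(row) for row in logits)  # shift scores for numerical stability
    q = [[math.exp((v - top) / epsilon) for v in row] for row in logits]
    for _ in range(n_iters):
        # Class step: rescale so every class receives total mass 1/k.
        col = [sum(q[i][j] for i in range(n)) for j in range(k)]
        q = [[q[i][j] / (col[j] * k) for j in range(k)] for i in range(n)]
        # Sample step: rescale so every sample carries total mass 1/n.
        row = [sum(r) for r in q]
        q = [[v / (row[i] * n) for v in r] for i, r in enumerate(q)]
    # Multiply by n so each row becomes a probability vector.
    return [[v * n for v in r] for r in q]


def select_confident(soft_labels, threshold=0.9):
    """Keep (sample index, class index) pairs whose top soft label passes the threshold."""
    picked = []
    for i, row in enumerate(soft_labels):
        conf = max(row)
        if conf >= threshold:
            picked.append((i, row.index(conf)))
    return picked


# Made-up logits for four unlabeled samples and two classes.
toy_logits = [[0.9, 0.1], [0.8, 0.2], [0.2, 0.8], [0.55, 0.45]]
soft = sinkhorn_knopp(toy_logits)
for i, cls in select_confident(soft):
    print(f"sample {i} -> pseudo-label {cls}")
```

The balancing step pushes the soft labels toward equal class usage, which matches the uniform-distribution assumption mentioned above, while the threshold discards ambiguous samples so that only high-confidence pseudo-labels are used as supervision.
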