Proxy Anchor-based Unsupervised Learning for Continuous Generalized Category Discovery

Hyungmin Kim, Sungho Suh, Daehwan Kim, Daun Jeong, Hansang Cho, Junmo Kim
2023-11-03
Abstract: Recent advances in deep learning have significantly improved the performance of various computer vision applications. However, discovering novel categories in an incremental learning scenario remains a challenging problem due to the lack of prior knowledge about the number and nature of new categories. Existing methods for novel category discovery are limited by their reliance on labeled datasets and prior knowledge about the number of novel categories and the proportion of novel samples in the batch. To address the limitations and more accurately reflect real-world scenarios, in this paper, we propose a novel unsupervised class incremental learning approach for discovering novel categories on unlabeled sets without prior knowledge. The proposed method fine-tunes the feature extractor and proxy anchors on labeled sets, then splits samples into old and novel categories and clusters on the unlabeled dataset. Furthermore, the proxy anchors-based exemplar generates representative category vectors to mitigate catastrophic forgetting. Experimental results demonstrate that our proposed approach outperforms the state-of-the-art methods on fine-grained datasets under real-world scenarios.
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses the problem of discovering new categories in unlabeled data during continual category discovery, without prior knowledge of those categories, while mitigating catastrophic forgetting during incremental learning. Specifically, it proposes a new scenario, Continuous Generalized Category Discovery (CGCD), which drops the assumption made by existing methods that unlabeled data belong only to new categories, making it more applicable to real-world situations. The main contributions of the paper are:

1. **Proposing the new CGCD scenario**: This scenario targets the challenges of discovering new categories in the real world by removing the constraint that unlabeled data belong only to new categories.
2. **Proposing a new unsupervised learning method**: This method performs incremental new category discovery without requiring prior knowledge about the number of new categories or the proportion of new-category samples.
3. **Introducing noisy label learning and deep metric learning**: These techniques split unlabeled data into old and new categories, and metric-based exemplars are used to mitigate catastrophic forgetting.
4. **Outperforming existing methods on various fine-grained datasets**: Experimental results show that the proposed method outperforms state-of-the-art methods in new category discovery and forgetting mitigation without requiring any prior knowledge. Because it applies to unlabeled datasets that jointly contain old and new categories, it is a more realistic and practical solution.

Through these contributions, the paper provides a more realistic framework for addressing the problem of new category discovery in incremental learning.
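
To make the old/novel split described above more concrete, here is a minimal sketch of one way to separate unlabeled embeddings using proxy anchors. It is not the paper's procedure: the paper relies on noisy label learning for this step, whereas the sketch substitutes a plain cosine-similarity threshold. The function names `l2_normalize` and `split_old_novel`, the `threshold` value, and the toy data are all hypothetical and chosen only for illustration.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products equal cosine similarity."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, eps)

def split_old_novel(embeddings, proxies, threshold=0.5):
    """Flag each embedding as old or novel by its similarity to the proxy anchors.

    `threshold` is a hypothetical hyperparameter, not a value from the paper.
    Returns (is_old, nearest_proxy), one entry per embedding.
    """
    # Cosine similarity between every sample and every known-class proxy.
    sims = l2_normalize(embeddings) @ l2_normalize(proxies).T
    nearest_proxy = sims.argmax(axis=1)
    # A sample counts as "old" if it is close enough to at least one proxy.
    is_old = sims.max(axis=1) >= threshold
    return is_old, nearest_proxy

# Toy data: two known classes whose proxies lie along the x and y axes.
proxies = np.array([[1.0, 0.0, 0.0],
                    [0.0, 1.0, 0.0]])
embeddings = np.array([
    [0.9, 0.1, 0.0],   # close to proxy 0
    [0.1, 0.8, 0.1],   # close to proxy 1
    [0.0, 0.1, 1.0],   # far from both proxies
])
is_old, nearest_proxy = split_old_novel(embeddings, proxies)
print(is_old, nearest_proxy)
```

In this toy setup the first two samples are assigned to known classes and the third is flagged as novel. In a full pipeline, the novel samples would then be passed to a clustering step to form candidate new categories.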