Abstract: In this paper, we study a practical yet challenging task, On-the-fly Category Discovery (OCD), which aims to discover, online, the categories of newly arriving stream data that belong to both known and unknown classes, leveraging only the known-category knowledge contained in labeled data. Previous OCD methods employ hash-based techniques, representing old and new categories by hash codes for instance-wise inference. However, directly mapping features into a low-dimensional hash space not only inevitably damages the ability to distinguish classes but also causes a "high sensitivity" issue, especially for fine-grained classes, leading to inferior performance. To address these issues, we propose a novel Prototypical Hash Encoding (PHE) framework consisting of Category-aware Prototype Generation (CPG) and Discriminative Category Encoding (DCE). PHE mitigates the sensitivity of hash codes while preserving the rich discriminative information contained in the high-dimensional feature space, in a two-stage projection fashion. CPG enables the model to fully capture intra-category diversity by representing each category with multiple prototypes. DCE boosts the discriminative ability of the hash codes with the guidance of the generated category prototypes and the constraint of a minimum separation distance. By jointly optimizing CPG and DCE, we demonstrate that these two components are mutually beneficial for effective OCD. Extensive experiments show the significant superiority of PHE over previous methods, e.g., an improvement of +5.3% in ALL ACC averaged over all datasets. Moreover, owing to the interpretable nature of the prototypes, we visually analyze the underlying mechanism by which PHE groups samples into either known or unknown categories. Code is available at https://github.com/HaiyangZheng/PHE.
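
To make the "high sensitivity" issue concrete, the following is a minimal sketch of generic sign-based hash inference, in which every distinct hash code is treated as its own category. It is not the PHE method and is not taken from the linked repository: the names `hash_code` and `OnTheFlyAssigner`, the identity projection, and the toy features are all hypothetical, and reading "sensitivity" as bit flips caused by small feature changes is an assumption.

```python
from typing import Dict, List, Tuple

Code = Tuple[int, ...]


def hash_code(feature: List[float], projection: List[List[float]]) -> Code:
    """Project the feature onto each row of `projection` and keep only the sign bit."""
    return tuple(
        1 if sum(w * x for w, x in zip(row, feature)) > 0 else 0
        for row in projection
    )


class OnTheFlyAssigner:
    """Hypothetical helper: assigns a category id to each streamed sample from its hash code."""

    def __init__(self, projection: List[List[float]]) -> None:
        self.projection = projection
        self.code_to_category: Dict[Code, int] = {}

    def assign(self, feature: List[float]) -> int:
        code = hash_code(feature, self.projection)
        # An unseen code is registered as a newly discovered category.
        if code not in self.code_to_category:
            self.code_to_category[code] = len(self.code_to_category)
        return self.code_to_category[code]


# Toy 2-bit hash with an identity projection (an assumption made for illustration).
assigner = OnTheFlyAssigner([[1.0, 0.0], [0.0, 1.0]])
a = assigner.assign([0.9, 0.01])   # code (1, 1)
b = assigner.assign([0.9, -0.01])  # code (1, 0): a tiny change flips one bit
c = assigner.assign([0.8, 0.02])   # code (1, 1) again
```

In this toy setup the first and second samples are almost identical in feature space, yet they land in different categories because a single bit flips. That fragility is one plausible reading of the sensitivity problem the abstract describes.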

CLOVER: a faster prior-free approach to rare-category detection

Prior-free Rare Category Detection: More Effective and Efficient Solutions

Semisupervised Prior Free Rare Category Detection with Mixed Criteria

Radar: Rare Category Detection Via Computation Of Boundary Degree

Rare Category Detection Algorithm Based on Weighted Boundary Degree

Rare Category Detection Forest

Rare Category Exploration Via Wavelet Analysis: Theory and Applications

Rare Category Exploration with Noisy Labels

Interactive Rare-Category-of-Interest Mining from Large Datasets

Rare Category Exploration

LERI: Local Exploration for Rare-Category Identification

Fast Rare Category Detection Using Nearest Centroid Neighborhood

Privacy preserving and fast decision for novelty detection using support vector data description

Fast-RCM: Fast Tree-Based Unsupervised Rare-Class Mining

Boosting Dense Long-Tailed Object Detection from Data-Centric View

Novel Class Discovery for Ultra-Fine-Grained Visual Categorization

CLOVER: Context-aware Long-term Object Viewpoint- and Environment- Invariant Representation Learning

CELOF: Effective and fast memory efficient local outlier detection in high-dimensional data streams

Prototypical Hash Encoding for On-the-Fly Fine-Grained Category Discovery

Prior-Free Continual Learning with Unlabeled Data in the Wild

Novel Class Discovery for Long-tailed Recognition