Learning Primitive-aware Discriminative Representations for FSL

Jian Yang
DOI: https://doi.org/10.48550/arxiv.2208.09717
2022-01-01
Abstract: Few-shot learning (FSL) aims to learn a classifier that can be easily adapted to recognize novel classes, given only a few labeled examples per class. Limited data keeps this task challenging for deep learning. Recent work has achieved promising classification performance by using the image-level feature from a global average pooling operation to measure the similarity among samples. However, these global features ignore abundant local and structural information that is transferable and consistent between seen and unseen classes. How can humans easily recognize novel classes with only a few samples? Some studies in cognitive science argue that humans can recognize novel classes with learned primitives. Although base and novel classes are non-overlapping, they can share some primitives. We expect to mine both transferable and discriminative representations from base classes and adopt them to recognize novel classes. Concretely, building on the episodic training mechanism, we propose a Primitive Mining and Reasoning Network (PMRN) to learn primitive-aware discriminative representations in an end-to-end manner for metric-based FSL models. We first add a self-supervised auxiliary task in parallel, forcing the model to learn visual patterns corresponding to primitives. To further mine and produce transferable primitive-aware representations, we design an Adaptive Channel Grouping (ACG) module that synthesizes a set of visual primitive features from the object embedding by enhancing informative channel maps while suppressing useless ones. Based on the learned primitive features, a Semantic Correlation Reasoning (SCR) module is proposed to improve the discriminative power of primitives by capturing internal relations among them. Finally, we learn the task-specific importance of primitives and conduct the primitive-level metric based on task-specific attention features. Extensive experiments show that our method achieves state-of-the-art results on six standard benchmarks.
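The contrast the abstract draws between a single global-feature similarity and a weighted primitive-level metric can be illustrated with a small sketch. This is not the PMRN implementation: the fixed equal-size channel split stands in for the learned ACG grouping, the `group_scores` argument stands in for learned task-specific attention, and all function names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0


def global_pool(feature_map):
    """Global average pooling: one scalar per channel.

    feature_map is a list of channels, each a flat list of spatial activations.
    """
    return [sum(channel) / len(channel) for channel in feature_map]


def split_groups(vector, num_groups):
    """Fixed equal-size channel split (assumption: stands in for a learned grouping)."""
    size = len(vector) // num_groups
    return [vector[i * size:(i + 1) * size] for i in range(num_groups)]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def global_similarity(query_map, support_map):
    """Baseline: one cosine similarity over the whole pooled feature."""
    return cosine(global_pool(query_map), global_pool(support_map))


def primitive_similarity(query_map, support_map, num_groups, group_scores):
    """Weighted sum of per-group cosine similarities.

    group_scores are hypothetical importance scores, one per channel group.
    """
    query_groups = split_groups(global_pool(query_map), num_groups)
    support_groups = split_groups(global_pool(support_map), num_groups)
    weights = softmax(group_scores)
    return sum(
        w * cosine(q, s)
        for w, q, s in zip(weights, query_groups, support_groups)
    )
```

With equal group scores the two measures can coincide; raising the score of a group that matches well pulls the primitive-level similarity toward that group's agreement, which is the intuition behind weighting primitives per task.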