Abstract: Learning novel object concepts from limited samples remains a considerable challenge in deep learning. The main directions for improving few-shot learning models include (i) designing a stronger backbone, (ii) designing a powerful (dynamic) meta-classifier, and (iii) using a larger pre-training set obtained by generating or hallucinating additional samples from the small-scale dataset. In this paper, we focus on item (iii) and present a novel meta-hallucination strategy. Presently, most image generators are based on a generative network (e.g., a GAN) that generates new samples from the captured distribution of images. However, such networks require numerous annotated samples for training. In contrast, we propose a novel saliency-based end-to-end meta-hallucinator, where a saliency detector produces foregrounds and backgrounds of support images. These foreground and background images are fed into a two-stream network that hallucinates samples directly in the feature space by mixing foreground and background feature samples. We further propose several novel mixing strategies that improve the quality and diversity of hallucinated feature samples. Moreover, as not all saliency maps are meaningful or of high quality, we introduce a meta-hallucination controller that decides which foreground feature samples should participate in mixing with backgrounds. To our knowledge, we are the first to leverage saliency detection for few-shot learning. Our proposed network achieves state-of-the-art results on publicly available few-shot image classification and anomaly detection benchmarks, and outperforms competing sample mixing strategies such as the so-called Manifold Mixup.
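The feature-space mixing idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's method: the abstract does not specify the mixing strategies or the controller, so the sketch assumes a plain convex combination of one foreground and one background feature vector, and replaces the learned meta-hallucination controller with a hard threshold on a per-foreground quality score. The names `mix_features`, `hallucinate`, `saliency_scores`, `threshold`, and `alpha` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence

Vector = List[float]


def mix_features(fg: Sequence[float], bg: Sequence[float], alpha: float = 0.5) -> Vector:
    """Convex combination of one foreground and one background feature vector.

    ASSUMPTION: a fixed-weight linear mix stands in for the paper's
    (unspecified here) mixing strategies.
    """
    if len(fg) != len(bg):
        raise ValueError("foreground and background features must have the same length")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return [alpha * f + (1.0 - alpha) * b for f, b in zip(fg, bg)]


def hallucinate(
    foregrounds: Sequence[Sequence[float]],
    backgrounds: Sequence[Sequence[float]],
    saliency_scores: Sequence[float],
    threshold: float = 0.5,
    alpha: float = 0.5,
) -> List[Vector]:
    """Mix every accepted foreground with every background.

    ASSUMPTION: a foreground is accepted when its hypothetical quality score
    reaches `threshold`; this hard gate stands in for a learned controller.
    """
    if len(foregrounds) != len(saliency_scores):
        raise ValueError("need exactly one score per foreground")
    kept = [fg for fg, s in zip(foregrounds, saliency_scores) if s >= threshold]
    return [mix_features(fg, bg, alpha) for fg in kept for bg in backgrounds]


# Two foreground features, two background features; the second foreground
# has a low score and is excluded from mixing.
fgs = [[1.0, 0.0], [0.0, 1.0]]
bgs = [[0.0, 0.0], [2.0, 2.0]]
samples = hallucinate(fgs, bgs, saliency_scores=[0.9, 0.1])
print(samples)
```

With these toy inputs, only the first foreground passes the gate, so two hallucinated feature samples are produced, one per background. In a real few-shot pipeline the vectors would come from the two network streams rather than being written by hand.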

Learning to Memorize Feature Hallucination for One-Shot Image Generation

TcGAN: Semantic-Aware and Structure-Preserved GANs with Individual Vision Transformer for Fast Arbitrary One-Shot Image Generation

Memory Matching Networks for One-Shot Image Recognition

Multi-Level Semantic Feature Augmentation for One-Shot Learning

One-shot Learning with Memory-Augmented Neural Networks

Dual-View Data Hallucination with Semantic Relation Guidance for Few-Shot Image Recognition

Discriminative learning of imaginary data for few-shot classification

Learning to Generate with Memory

Saliency-guided meta-hallucinator for few-shot learning

Hallucination Improves the Performance of Unsupervised Visual Representation Learning

Vocabulary-Informed Visual Feature Augmentation for One-Shot Learning

Heterogenous Memory Augmented Neural Networks

Adaptive Forgetting, Drafting and Comprehensive Guiding: Text-to-Image Synthesis with Hierarchical Generative Adversarial Networks

Semantics-Guided Intra-Category Knowledge Transfer for Generalized Zero-Shot Learning

Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond

Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation

NIFF: Alleviating Forgetting in Generalized Few-Shot Object Detection via Neural Instance Feature Forging

Self-Attentive Networks for One-Shot Image Recognition

One-Shot Fine-Grained Instance Retrieval

MS-GAN: Learn to Memorize Scene for Unpaired SAR-to-Optical Image Translation

Adversarial Feature Hallucination Networks for Few-Shot Learning