Abstract:With the tremendous advances made by Convolutional Neural Networks (ConvNets) on object recognition, we can now easily obtain adequately reliable machine-labeled annotations easily from predictions by off-the-shelf ConvNets. In this work, we present an "abstraction memory" based framework for few-shot learning, building upon machinelabeled image annotations. Our method takes large-scale machine-annotated dataset (e.g., OpenImages) as an external memory bank. In the external memory bank, the information is stored in the memory slots in the form of keyvalue, in which image feature is regarded as the key and the label embedding serves as the value. When queried by the few-shot examples, our model selects visually similar data from the external memory bank and writes the useful information obtained from related external data into another memory bank, i.e. abstraction memory. Long Short-Term Memory (LSTM) controllers and attention mechanisms are utilized to guarantee the data written to the abstraction memory correlates with the query example. The abstraction memory concentrates information from the external memory bank to make the few-shot recognition effective. In the experiments, we first confirm that our model can learn to conduct few-shot object recognition on clean humanlabeled data from the ImageNet dataset. Then, we demonstrate that with our model, machine-labeled image annotations are very effective and abundant resources for performing object recognition on novel categories. Experimental results show that our proposed model with machine-labeled annotations achieves great results, with only a 1% difference in accuracy between the machine-labeled annotations and the human-labeled annotations.

Memory Matching Networks for One-Shot Image Recognition

One-shot Learning with Memory-Augmented Neural Networks

Self-Attentive Networks for One-Shot Image Recognition

Learning Meta-class Memory for Few-Shot Semantic Segmentation

Multi-Attention Network For One Shot Learning

Alignment Based Matching Networks for One-Shot Classification and Open-Set Recognition

Memory-Augmented Relation Network for Few-Shot Learning

Compound Memory Networks for Few-Shot Video Classification

Learning to focus: cascaded feature matching network for few-shot image recognition

Few-Shot Object Recognition from Machine-Labeled Web Images.

Learning to Memorize Feature Hallucination for One-Shot Image Generation

Memory transformation networks for weakly supervised visual classification

Memory Segment Matching Network Based Image Geo-Localization.

Joint Neural Networks for One-shot Object Recognition and Detection

Associative Memory Optimized Method on Deep Neural Networks for Image Classification.

Memorizing Complementation Network for Few-Shot Class-Incremental Learning

Few-shot activity recognition with cross-modal memory network

Attention-Augmented Memory Network for Image Multi-Label Classification

MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning

A memory model for image recognition and classification based on convolutional neural network and Bayesian decision

Learning Permutation Invariant Representations using Memory Networks