Abstract:Few-shot object detection (FSOD) identifies objects from extremely few annotated samples. Most existing FSOD methods, recently, apply the two-stage learning paradigm, which transfers the knowledge learned from abundant base classes to assist the few-shot detectors by learning the global features. However, such existing FSOD approaches seldom consider the localization of objects from local to global. Limited by the scarce training data in FSOD, the training samples of novel classes typically capture part of objects, resulting in such FSOD methods being unable to detect the completely unseen object during testing. To tackle this problem, we propose an Extensible Co-Existing Attention (ECEA) module to enable the model to infer the global object according to the local parts. Specifically, we first devise an extensible attention mechanism that starts with a local region and extends attention to co-existing regions that are similar and adjacent to the given local region. We then implement the extensible attention mechanism in different feature scales to progressively discover the full object in various receptive fields. In the training process, the model learns the extensible ability on the base stage with abundant samples and transfers it to the novel stage of continuous extensible learning, which can assist the few-shot model to quickly adapt in extending local regions to co-existing regions. Extensive experiments on the PASCAL VOC and COCO datasets show that our ECEA module can assist the few-shot detector to completely predict the object despite some regions failing to appear in the training samples and achieve the new state-of-the-art compared with existing FSOD methods. Code is released at https://github.com/zhimengXin/ECEA.

Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection

ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection

FSNA: Few-Shot Object Detection via Neighborhood Information Adaption and All Attention

Few-shot Object Detection with Feature Attention Highlight Module in Remote Sensing Images

AFD-Net: Adaptive Fully-Dual Network for Few-Shot Object Detection

Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with Attentive Feature Alignment

Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition

Few-Shot Object Detection Based on Adaptive Attention Mechanism and Large-Margin Softmax

Dma-Net: Decoupled Multi-Scale Attention for Few-Shot Object Detection

Self-attention network for few-shot learning based on nearest-neighbor algorithm

Cross Attention Network for Few-shot Classification

Few-Shot Object Detection With Attention-RPN and Multi-Relation Detector

Fine-Grained Prototypes Distillation for Few-Shot Object Detection

Three-Dimension Attention Mechanism and Self-Supervised Pretext Task for Augmenting Few-Shot Learning

Few-Shot Object Detection of Remote Sensing Images via Two-Stage Fine-Tuning

In defense of local descriptor-based few-shot object detection

Information Extraction Enhancement for Few-Shot Object Detection in Remote Sensing Images

A Comparative Review of Recent Few-Shot Object Detection Algorithms

Object detection based on few-shot learning via instance-level feature correlation and aggregation

Dual-Awareness Attention for Few-Shot Object Detection

Few-Shot Object Detection for Remote Sensing Imagery Using Segmentation Assistance and Triplet Head