Abstract:Despite significant success of deep learning in object detection tasks, the standard training of deep neural networks requires access to a substantial quantity of annotated images across all classes. Data annotation is an arduous and time-consuming endeavor, particularly when dealing with infrequent objects. Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of classic object detection approaches based on deep learning. FSOD methods demonstrate remarkable performance by achieving robust object detection using a significantly smaller amount of training data. A challenge for FSOD is that instances from novel classes that do not belong to the fixed set of training classes appear in the background and the base model may pick them up as potential objects. These objects behave similarly to label noise because they are classified as one of the training dataset classes, leading to FSOD performance degradation. We develop a semi-supervised algorithm to detect and then utilize these unlabeled novel objects as positive samples during the FSOD training stage to improve FSOD performance. Specifically, we develop a hierarchical ternary classification region proposal network (HTRPN) to localize the potential unlabeled novel objects and assign them new objectness labels to distinguish these objects from the base training dataset classes. Our improved hierarchical sampling strategy for the region proposal network (RPN) also boosts the perception ability of the object detection model for large objects. We test our approach and COCO and PASCAL VOC baselines that are commonly used in FSOD literature. Our experimental results indicate that our method is effective and outperforms the existing state-of-the-art (SOTA) FSOD methods. Our implementation is provided as a supplement to support reproducibility of the results.

Few-shot Object Detection Based on Self-Supervised Feature Pyramid Network

Few-Shot Object Detection With Self-Supervising and Cooperative Classifier

Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection.

Few-shot Object Detection Via Message Transfer Mechanism

Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection.

Enhancing Few-Shot Object Detection with Modified Faster R-CNN and Transfer Learning

MPF-Net: multi-projection filtering network for few-shot object detection

Incremental Detection of Remote Sensing Objects with Feature Pyramid and Knowledge Distillation

Small Object Detection Using Deep Feature Pyramid Networks

Object detection based on few-shot learning via instance-level feature correlation and aggregation

Few-Shot Object Detection of Remote Sensing Images via Two-Stage Fine-Tuning

Few-shot Object Detection with Feature Attention Highlight Module in Remote Sensing Images

Few-shot Object Detection in Remote Sensing: Lifting the Curse of Incompletely Annotated Novel Objects

Identification of Novel Classes for Improving Few-Shot Object Detection

Crpn: distinguish novel categories via class-relevant region proposal network for few-shot object detection

Fine-Grained Prototypes Distillation for Few-Shot Object Detection

Few-shot Object Detection via Improved Classification Features

Few Shot Object Detection via a Generalized Feature Extraction Net

Improved Region Proposal Network for Enhanced Few-Shot Object Detection

A New Feature Pyramid Network for Object Detection

Enhanced semantic feature pyramid network for small object detection