Abstract:<p>Small object detection is a highly challenging problem due to the limited resolution and information of small objects. Current state-of-the-art detectors only utilize the appearance feature to locate and classify objects. However, such detectors are prone to failure when detecting small objects, especially in the case of heavy appearance changes and background distractors, in which the appearance feature alone is not sufficient for robust detection. Exploiting context information in the surrounding scene can be highly beneficial in such cases. In this paper, we propose a novel detector, the Internal-External Network (IENet), which uses both the appearance and context information of the object for robust detection. In the proposed approach, small object detection is improved from feature extraction, proposal location, and classification. Specifically, three customized modules are designed, including the Bidirectional Feature Fusion Module (Bi-FFM), Context Reasoning Module (CRM), and Context Feature Augmentation Module (CFAM). Bi-FFM is designed to capture the internal feature of objects by transferring the semantic feature of deeper-level layers to lower-level layers and the detailed feature of lower-level layers to deeper-level layers in neural networks. The proposed approach not only utilizes the hierarchy of convolutional features but also improves its prediction via context relationships. CRM is designed to improve the quality of region proposals by context reasoning that uses easily detected objects to help understand hard ones. Furthermore, CFAM is designed to learn pair-wise relations between region proposals produced by CRM, and such relations are used to produce global feature information associated with the region proposals for accurate classification. Extensive experiments are conducted on the challenging COCO and WIDER FACE datasets to demonstrate the effectiveness of the proposed approach. Experimental results show that the detection performance of small objects is greatly improved over state-of-the-art detectors.</p>

Exploring Context Information for Accurate and Fast Object Detection

Realize your surroundings: Exploiting context information for small object detection

CFENet: an Accurate and Efficient Single-Shot Object Detector for Autonomous Driving.

Object Detection Algorithm Based on Context Information and Self-Attention Mechanism

Comprehensive Feature Enhancement Module For Single-Shot Object Detector

A Rich Feature Fusion Single-Stage Object Detector.

Extend the Shallow Part of Single Shot MultiBox Detector Via Convolutional Neural Network

Dynamic Feature and Context Enhancement Network for Faster Detection of Small Objects

EfficientDet: Scalable and Efficient Object Detection

RefineDetLite: A Lightweight One-stage Object Detection Framework for CPU-only Devices

EfficientFace: an efficient deep network with feature enhancement for accurate face detection

A novel fast combine-and-conquer object detector based on only one-level feature map

Tiny object detection with context enhancement and feature purification

DSSD : Deconvolutional Single Shot Detector

Attention-guided Context Feature Pyramid Network for Object Detection

Disentangle Your Dense Object Detector

Efficient object detector via dynamic prior and dynamic feature fusion

Light-Head R-CNN: In Defense of Two-Stage Object Detector

An Adaptive Attention Fusion Mechanism Convolutional Network for Object Detection in Remote Sensing Images

A Single-shot Object Detector with Feature Aggragation and Enhancement

DetNet: A Backbone network for Object Detection