Abstract: Object detection is a critical technology for the intelligent analysis of images captured by drones. The objects in such images vary widely in scale and can be extremely small. Existing detection methods are built on pyramid hierarchy architectures to extract multi-scale features and provide better feature representations for small objects. Nevertheless, they inevitably dilute the details in low-level features during top-down feature fusion, and they do not consider whether the fused feature fits the objects of specific scales within a layer. Moreover, the pyramid fuses spatial context only implicitly, so the fused features do not receive the fine spatial location information needed for object localization. In this work, we propose an effective boundary-aware network with attention refinement and spatial interaction to tackle these challenges. Specifically, we first present a simple yet highly effective boundary-aware detection head (BAH), which directly guides the learning of object structure semantics in the prediction layer to preserve object-related boundary semantics. Additionally, the attentional feature parallel fusion (AFPF) module provides multi-scale feature encoding in a parallel triple-fusion fashion and adaptively selects features appropriate for objects of particular scales. Furthermore, we design a spatial interactive module (SIM) to preserve fine spatial detail through cross-spatial feature association. Extensive experiments show that the proposed network significantly outperforms state-of-the-art methods, achieving 33.1 mAP and 56.5 AP50 on the VisDrone benchmark and 63.4 mAP and 94 AP50 on the NWPU VHR-10 benchmark. The source code will be released.

Improving Small Object Detection Via Cross-Layer Attention

MLA-Net: Feature Pyramid Network with Multi-Level Local Attention for Object Detection

CA2Det: Cascaded Adaptive Fusion Pyramid Network Based on Attention Mechanism for Small Object Detection

EBiDA-FPN: Enhanced Bi-Directional Attention Feature Pyramid Network for Object Detection

Small Object Detection using Multi-scale Feature Fusion and Attention

An attention-based feature pyramid network for single-stage small object detection

Attention-based Fusion Factor in FPN for Object Detection

Small object detection based on attention mechanism and enhanced network

Composite Backbone Small Object Detection Based on Context and Multi-Scale Information with Attention Mechanism

Small object detection combining attention mechanism and a novel FPN

Pyramid attention object detection network with multi-scale feature fusion

Cross-Layer Attention Network for Small Object Detection in Remote Sensing Imagery

Attentional feature pyramid network for small object detection

Multi-scale Vertical Cross-layer Feature Aggregation and Attention Fusion Network for Object Detection

ALFPN: Adaptive Learning Feature Pyramid Network for Small Object Detection

Small Object Detection in Traffic Scenes Based on Attention Feature Fusion

Boundary-aware Small Object Detection with Attention and Interaction

MultiResolution Attention Extractor for Small Object Detection

Small Object Detection Method Based on Weighted Feature Fusion and CSMA Attention Module

3D Small Object Detection from Cameras and Point Clouds Using Five-Head Attention in a Fusion Method

Object Detection With Extended Attention And Spatial Information