Abstract: The effective use of multi-scale features remains an open problem for object detection. Recently proposed object detectors usually use Feature Pyramid Networks (FPN) to fuse multi-scale features. Because FPN uses a relatively simple feature map fusion approach, semantic information can be lost or misaligned during fusion. Several works have demonstrated that adding a bottom-up structure to a Feature Pyramid Network shortens the information path between the lower layers and the topmost feature, allowing an adequate exchange of semantic information between layers. We further enhance the bottom-up path by proposing a multi-scale residual aggregation Feature Pyramid Network (MSRA-FPN), which uses a unidirectional cross-layer residual module to aggregate features from multiple layers bottom-up, in a triangular structure, into the topmost layer. In addition, we introduce a Residual Squeeze and Excitation Module to mitigate the aliasing effects that occur when features from different layers are aggregated. MSRA-FPN enhances the semantic information of the high-level feature maps, mitigates information decay during feature fusion, and improves the model's ability to detect large objects. Experiments show that MSRA-FPN improves the performance of three baseline models by 0.5–1.9% on the PASCAL VOC dataset and is competitive with other state-of-the-art FPN methods. On the MS COCO dataset, it improves the performance of the baseline model by 0.8% and the baseline model's performance on large object detection by 1.8%. To further validate the effectiveness of MSRA-FPN for large object detection, we constructed the Thangka Figure Dataset and conducted comparative experiments. On this dataset, our method improves the performance of the baseline model by 2.9–4.7% and reaches up to 71.2%.
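The abstract does not describe the internals of the Residual Squeeze and Excitation Module, so the following is only a minimal, dependency-free sketch of one plausible reading: standard squeeze-and-excitation channel gating combined with an identity shortcut. The function name `residual_se`, the weight matrices `w1` and `w2`, and the absence of bias terms are all assumptions made for illustration, not details taken from the paper.

```python
import math


def residual_se(feature, w1, w2):
    """Hypothetical residual squeeze-and-excitation step (not the paper's code).

    feature: list of C channels, each an H x W list of lists of floats.
    w1: (C // r) x C weight matrix of the bottleneck layer (assumed, no bias).
    w2: C x (C // r) weight matrix of the expansion layer (assumed, no bias).
    """
    # Squeeze: global average pooling reduces each channel to one scalar.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature
    ]
    # Excitation, part 1: bottleneck fully connected layer followed by ReLU.
    hidden = [
        max(0.0, sum(w * s for w, s in zip(weights, squeezed)))
        for weights in w1
    ]
    # Excitation, part 2: expansion layer followed by a sigmoid gate per channel.
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(weights, hidden))))
        for weights in w2
    ]
    # Residual re-weighting: out = x + gate * x, so the input is always kept.
    return [
        [[value + gate * value for value in row] for row in channel]
        for channel, gate in zip(feature, gates)
    ]
```

With all-zero weights every gate is sigmoid(0) = 0.5, so each value is scaled by 1.5. The identity shortcut means the gated branch can only add to the input, never suppress it entirely, which is one common motivation for pairing channel attention with a residual connection.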

Multi-Scale Reinforcement Learning Strategy for Object Detection

Multi-Scale Residual Aggregation Feature Pyramid Network for Object Detection

Improving Multiscale Object Detection With Off-Centered Semantics Refinement

MM-FPN: Multi-path and Multi-scale Feature Pyramid Network for Object Detection

Multi-scale redistribution feature pyramid for object detection

MGFPN: Enhancing Multi-Scale Feature for Object Detection

Learning Discriminated Features Based on Feature Pyramid Networks and Attention for Multi-scale Object Detection

Concise Feature Pyramid Region Proposal Network for Multi-Scale Object Detection.

Feature Pyramid Reconfiguration with Consistent Loss for Object Detection

A Multi-Scale Detector Based on Attention Mechanism

Object Detection in Remote Sensing Images Based on Adaptive Multi-Scale Feature Fusion Method

Scale Adaptive Feature Pyramid Networks for 2D Object Detection

Multi-Scale Positive Sample Refinement for Few-Shot Object Detection

Deep Feature Pyramid Reconfiguration for Object Detection

Two Cases of Sinusitis Induced by Immune Checkpoint Inhibition.

Towards Better Object Detection in Scale Variation with Adaptive Feature Selection

Multi-Scale Feature Selective Matching Network for Object Detection

Pyramid attention object detection network with multi-scale feature fusion

FFR-SSD: feature fusion and reconstruction single shot detector for multi-scale object detection

MDFN: Multi-scale deep feature learning network for object detection

SSN: Scale Selection Network for Multi-Scale Object Detection in Remote Sensing Images