Abstract: Object detection has made significant progress in recent years. Most state-of-the-art object detectors owe their success to feature pyramids and carefully designed anchor boxes. However, existing methods for constructing feature pyramids blindly integrate multi-scale representations at every level of the feature hierarchy. Furthermore, these detectors suffer from drawbacks introduced by hand-designed anchors. To mitigate these adverse effects, we propose SAFNet, a semi-anchor-free network with an enhanced feature pyramid for object detection. Specifically, to better construct the feature pyramid, we propose a novel enhanced feature pyramid generation paradigm consisting of two modules: an adaptive feature fusion module (AFFM) and a self-enhanced module (SEM). The paradigm adaptively integrates multi-scale representations in a non-linear way while suppressing redundant semantic information at each pyramid level, yielding a clean and enhanced feature pyramid. In addition, an adaptive anchor generator (AAG) is designed to yield fewer but more suitable anchor boxes for each input image. Benefiting from the enhanced feature pyramid, AAG can generate more accurate anchor boxes while introducing only a few priors. With this semi-anchor-free method, our detector alleviates the drawbacks of hand-designed anchors while retaining the merits of anchor-based methods. Extensive experiments demonstrate the effectiveness of our approach. With the proposed modules, SAFNet significantly boosts detection performance, achieving 2 points and 2.1 points higher Average Precision (AP) than RetinaNet on PASCAL VOC and MS COCO, respectively.
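
The drawback of hand-designed anchors mentioned in the abstract can be illustrated with a minimal sketch. This is not SAFNet's adaptive anchor generator (AAG), whose details the abstract does not give. It only shows the conventional fixed scheme of a few scales and aspect ratios per location, in the style of RetinaNet's nine anchors. The function names `make_anchors` and `iou`, the base size of 32, and the example ground-truth box are all assumptions made for illustration.

```python
import math


def make_anchors(cx, cy, base_size, scales, ratios):
    """Return hand-designed anchors (x1, y1, x2, y2) centred at (cx, cy).

    Each anchor has area (base_size * scale) ** 2 and aspect ratio
    height / width == ratio. Hypothetical helper, for illustration only.
    """
    anchors = []
    for scale in scales:
        side = base_size * scale
        for ratio in ratios:
            w = side / math.sqrt(ratio)
            h = side * math.sqrt(ratio)
            anchors.append((cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2))
    return anchors


def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)


# Three scales times three ratios gives nine anchors per location.
anchors = make_anchors(
    cx=64.0, cy=64.0, base_size=32.0,
    scales=(1.0, 2 ** (1 / 3), 2 ** (2 / 3)),
    ratios=(0.5, 1.0, 2.0),
)

# An assumed, very elongated ground-truth box (8 wide, 88 tall) at the same centre.
gt_box = (60.0, 20.0, 68.0, 108.0)
best_iou = max(iou(anchor, gt_box) for anchor in anchors)
print(f"{len(anchors)} anchors, best IoU with ground truth: {best_iou:.3f}")
```

In this sketch none of the nine fixed anchors reaches an IoU of 0.5 with the elongated box, a threshold commonly used to assign positive anchors. An object like this would get no well-matched anchor unless the scales and ratios were retuned by hand, which is the kind of limitation that motivates generating anchors adaptively per image.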

Putting the Anchors Efficiently: Geometric Constrained Pedestrian Detection.

Towards Accurate Dense Pedestrian Detection Via Occlusion-Prediction Aware Label Assignment and Hierarchical-Nms.

See Extensively While Focusing on the Core Area for Pedestrian Detection.

Pedestrian As Points: an Improved Anchor-Free Method for Center-Based Pedestrian Detection.

MetaAnchor: Learning to Detect Objects with Customized Anchors.

Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots

LGADet: Light-weight Anchor-free Multispectral Pedestrian Detection with Mixed Local and Global Attention

Adaptive Anchor Box Mechanism to Improve the Accuracy in the Object Detection System

An Anchor-Free 3D Object Detection Approach Based on Hierarchical Pillars

A Robust Anchor-based Method for Multi-Camera Pedestrian Localization

One-Stage Anchor-Free 3D Vehicle Detection from LiDAR Sensors

An Anchor-Free Dual-Branch Approach for Real-Time Metro Passenger Detection

Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

CenterPoint-SE: A Single-Stage Anchor-Free 3-D Object Detection Algorithm With Spatial Awareness Enhancement

Anchor Box Optimization for Object Detection

Learning from Noisy Anchors for One-Stage Object Detection.

Dynamic Anchor Learning for Arbitrary-Oriented Object Detection

Detecting Text in Scene and Traffic Guide Panels With Attention Anchor Mechanism

L4Net: an Anchor‐free Generic Object Detector with Attention Mechanism for Autonomous Driving

PosNeg-Balanced Anchors with Aligned Features for Single-Shot Object Detection

SAFNet: A Semi-Anchor-Free Network with Enhanced Feature Pyramid for Object Detection.