Abstract: Accurate and fast 3D object detection from point clouds is a key task in autonomous driving. Existing one-stage 3D object detection methods can achieve real-time performance, but they are dominated by anchor-based detectors, which are inefficient and require additional post-processing. In this paper, we eliminate anchors and model an object as a single point: the center point of its bounding box. Building on this representation, we propose CenterNet3D, an anchor-free network for 3D object detection. CenterNet3D uses keypoint estimation to find center points and directly regresses 3D bounding boxes. However, because of the inherent sparsity of point clouds, 3D object center points are likely to lie in empty space, which makes it difficult to estimate accurate boundaries. To solve this issue, we propose an extra corner attention module that forces the CNN backbone to pay more attention to object boundaries. In addition, because one-stage detectors suffer from a mismatch between the predicted bounding boxes and their classification confidences, we develop an efficient keypoint-sensitive warping operation that aligns the confidences with the predicted bounding boxes. CenterNet3D requires no non-maximum suppression, which makes it simpler and more efficient. We evaluate CenterNet3D on the widely used KITTI dataset and the more challenging nuScenes dataset. Our method outperforms all state-of-the-art anchor-based one-stage methods and performs comparably to two-stage methods. It runs at an inference speed of 20 FPS and achieves the best speed and accuracy trade-off. Our source code will be released at https://github.com/wangguojun2018/CenterNet3d.
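To illustrate how a center-point detector can avoid non-maximum suppression, the sketch below extracts object centers as local maxima of a 2D heatmap, in the style commonly used by keypoint-based detectors. It is a minimal illustration, not the released CenterNet3D code: the function name `extract_center_peaks`, the 3x3 neighbourhood, and the parameters `k` and `threshold` are assumptions chosen for the example.

```python
import numpy as np

def extract_center_peaks(heatmap, k=10, threshold=0.3):
    """Return up to k (row, col, score) local maxima of a 2D heatmap.

    Hypothetical helper for illustration; the 3x3 window and the
    threshold are assumed values, not taken from the paper.
    """
    heatmap = np.asarray(heatmap, dtype=float)
    h, w = heatmap.shape
    # Pad with -inf so border cells are compared only with real neighbours.
    padded = np.pad(heatmap, 1, mode="constant", constant_values=-np.inf)
    # Stack the nine shifted views that form each cell's 3x3 neighbourhood.
    neighbourhood = np.stack([
        padded[dr:dr + h, dc:dc + w]
        for dr in range(3)
        for dc in range(3)
    ])
    local_max = neighbourhood.max(axis=0)
    # A cell is a peak if it equals its neighbourhood maximum
    # and its score reaches the threshold.
    is_peak = (heatmap == local_max) & (heatmap >= threshold)
    rows, cols = np.nonzero(is_peak)
    scores = heatmap[rows, cols]
    # Keep the k highest-scoring peaks, best first.
    order = np.argsort(-scores)[:k]
    return [(int(rows[i]), int(cols[i]), float(scores[i])) for i in order]

# Toy bird's-eye-view heatmap with two objects and one suppressed neighbour.
heatmap = np.zeros((5, 5))
heatmap[1, 1] = 0.9   # first object center
heatmap[1, 2] = 0.5   # adjacent weaker response, not a local maximum
heatmap[3, 4] = 0.6   # second object center
peaks = extract_center_peaks(heatmap)
```

Because each object is represented by a single peak, adjacent weaker responses are discarded by the local-maximum test itself, so no pairwise box overlap computation is needed afterwards.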

Center3D: Center-based Monocular 3D Object Detection with Joint Depth Understanding

Leveraging Front and Side Cues for Occlusion Handling in Monocular 3D Object Detection

Depth Dynamic Center Difference Convolutions for Monocular 3D Object Detection

An Algorithm on Monocular 3D Object Detection Based on Depth Estimation

Densely Constrained Depth Estimator for Monocular 3D Object Detection

Depth Is All You Need for Monocular 3D Detection

Depth-assisted joint detection network for monocular 3D object detection

Boosting Monocular 3D Object Detection with Object-Centric Auxiliary Depth Supervision

MonoCD: Monocular 3D Object Detection with Complementary Depths

ABC: Aligning Binary Centers for Single-Stage Monocular 3D Object Detection

Center-Based 3D Object Detection and Tracking

CenterNet3D: An Anchor Free Object Detector for Point Cloud

Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation

Learning Depth-Guided Convolutions for Monocular 3D Object Detection

Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection

Depth-Enhancement Network for Monocular 3D object detection

CenterNet3D: An Anchor free Object Detector for Autonomous Driving

Monocular 3D Detection for Autonomous Vehicles by Cascaded Geometric Constraints and Depurated Using 3D Results

CenterLoc3D: Monocular 3D Vehicle Localization Network for Roadside Surveillance Cameras

DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection

Monocular 3D object detection via estimation of paired keypoints for autonomous driving