Abstract:Detecting accurate 3D bounding boxes of the object from point clouds is a major task in autonomous driving perception. At present, the anchor-based or anchor-free models that use LiDAR point clouds for 3D object detection use the center assigner strategy to infer the 3D bounding boxes. However, in the real-world scene, due to the occlusions and the effective detection range of the LiDAR system, only part of the object surface can be covered by the collected point clouds, and there are no measured 3D points corresponding to the physical object center. Obtaining the object by aggregating the incomplete surface point clouds will bring a loss of accuracy in direction and dimension estimation. To address this problem, we propose a corner-guided anchor-free single-stage 3D object detection model (CG-SSD). Firstly, the point clouds within a single frame are assigned to regular 3D grids. 3D sparse convolution backbone network composed of residual layers and sub-manifold sparse convolutional layers are used to construct bird’s eye view (BEV) features for further deeper feature mining by a lite U-shaped network; Secondly, a novel corner-guided auxiliary module (CGAM) with adaptive corner classification algorithm is proposed to incorporate corner supervision signals into the neural network. CGAM is explicitly designed and trained to estimate locations of partially visible and invisible corners to obtain a more accurate object feature representation, especially for small or partial occluded objects; Finally, the deep features from both the backbone networks and CGAM module are concatenated and fed into the head module to predict the classification and 3D bounding boxes of the objects in the scene. The experiments demonstrate that CG-SSD achieves the state-of-art performance on the ONCE benchmark for supervised 3D object detection using single frame point cloud data, with 62.77% mAP. Additionally, the experiments on ONCE and Waymo Open Dataset show that CGAM can be extended to most anchor-based models which use the BEV feature to detect objects, as a plug-in and bring +1.17%∼+14.23% AP improvement. The code is available at https://github.com/mrqrs/CG-SSD.

You Only Label Once: 3D Box Adaptation from Point Cloud to Image with Semi-Supervised Learning

SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud

Lidar Point Cloud Guided Monocular 3D Object Detection

ADeLA: Automatic Dense Labeling with Attention for Viewpoint Shift in Semantic Segmentation

3D Point Cloud Labeling Tool for Driving Automatically

Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

CG-SSD: Corner Guided Single Stage 3D Object Detection from LiDAR Point Cloud

Box2Seg: Learning Semantics of 3D Point Clouds with Box-Level Supervision

Visual Boundary-Guided Pseudo-Labeling for Weakly Supervised 3D Point Cloud Segmentation in Indoor Environments

Towards A Weakly Supervised Framework for 3D Point Cloud Object Detection and Annotation

Autolabeling 3D Objects With Differentiable Rendering of SDF Shape Priors

Multi-Sensor 3D Object Box Refinement for Autonomous Driving

CG-SSD: Corner Guided Single Stage 3D Object Detection from LiDAR Point Cloud

Eliminating Spatial Ambiguity for Weakly Supervised 3D Object Detection Without Spatial Labels

Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection

Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection

ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels Only

Back to Reality: Learning Data-Efficient 3D Object Detector with Shape Guidance.

Reinforcing Lidar-Based 3d Object Detection With Rgb And 3d Information

SC3D: Label-Efficient Outdoor 3D Object Detection via Single Click Annotation

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene