Abstract:3D object detection is one of the most important tasks in 3D vision perceptual system of autonomous vehicles. In this paper, we propose a novel two stage 3D object detection method aimed at get the optimal solution of object location in 3D space based on regressing two additional 3D object properties by a deep convolutional neural network and combined with cascaded geometric constraints between the 2D and 3D boxes. First, we modify the existing 3D properties regressing network by adding two additional components, viewpoints classification and the center projection of the 3D bounding box s bottom face. Second, we use the predicted center projection combined with similar triangle constraint to acquire an initial 3D bounding box by a closed-form solution. Then, the location predicted by previous step is used as the initial value of the over-determined equations constructed by 2D and 3D boxes fitting constraint with the configuration determined with the classified viewpoint. Finally, we use the recovered physical world information by the 3D detections to filter out the false detection and false alarm in 2D detections. We compare our method with the state-of-the-arts on the KITTI dataset show that although conceptually simple, our method outperforms more complex and computational expensive methods not only by improving the overall precision of 3D detections, but also increasing the orientation estimation precision. Furthermore our method can deal with the truncated objects to some extent and remove the false alarm and false detections in both 2D and 3D detections.

KPV3D: Enhancing Multiview 3-D Vehicle Detection With 2-D Keypoint Priors

A Multi-view 3D Vehicle Detection Method Based On Novel 3D Proposal Generation Method

Accelerating Point-Voxel Representation of 3-D Object Detection for Automatic Driving

3D Vehicle Detection Using Cheap LiDAR and Camera Sensors.

Multi-view 3D Object Detection Network for Autonomous Driving

Enhance the 3D Object Detection With 2D Prior

3D Bounding Box Estimation for Autonomous Vehicles by Cascaded Geometric Constraints and Depurated 2D Detections Using 3D Results

3M3D: Multi-view, Multi-path, Multi-representation for 3D Object Detection

F-PVNet: Frustum-Level 3-D Object Detection on Point–Voxel Feature Representation for Autonomous Driving

MP-Mono: Monocular 3D Detection Using Multiple Priors for Autonomous Driving

3D Detection for Occluded Vehicles From Point Clouds

Image Guidance Based 3D Vehicle Detection in Traffic Scene.

DVPE: Divided View Position Embedding for Multi-View 3D Object Detection

VKP-P3D: Real-Time Monocular Pseudo 3D Object Detection Based on Visible Key Points and Camera Geometry

A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation

X-View: Non-Egocentric Multi-View 3D Object Detector.

MSPV3D: Multi-Scale Point-Voxels 3D Object Detection Net

Unleash the Potential of Image Branch for Cross-modal 3D Object Detection

Object as Query: Equipping Any 2D Object Detector with 3D Detection Ability

Monocular 3D object detection via estimation of paired keypoints for autonomous driving