Abstract:Two-stage detectors have gained much popularity in 3D object detection. Most two-stage 3D detectors utilize grid points, voxel grids, or sampled keypoints for RoI feature extraction in the second stage. Such methods, however, are inefficient in handling unevenly distributed and sparse outdoor points. This paper solves this problem in three aspects. 1) Dynamic Point Aggregation. We propose the patch search to quickly search points in a local region for each 3D proposal. The dynamic farthest voxel sampling is then applied to evenly sample the points. Especially, the voxel size varies along the distance to accommodate the uneven distribution of points. 2) RoI-graph Pooling. We build local graphs on the sampled points to better model contextual information and mine point relations through iterative message passing. 3) Visual Features Augmentation. We introduce a simple yet effective fusion strategy to compensate for sparse LiDAR points with limited semantic cues. Based on these modules, we construct our Graph R-CNN as the second stage, which can be applied to existing one-stage detectors to consistently improve the detection performance. Extensive experiments show that Graph R-CNN outperforms the state-of-the-art 3D detection models by a large margin on both the KITTI and Waymo Open Dataset. And we rank first place on the KITTI BEV car detection leaderboard. Code will be available at \url{<a class="link-external link-https" href="https://github.com/Nightmare-n/GraphRCNN" rel="external noopener nofollow">this https URL</a>}.

Path Aggregation One-Stage Anchor Free 3D Object Detection

AGO-Net: Association-Guided 3D Point Cloud Object Detection Network

Sparse Embedded Convolution Based Dual Feature Aggregation 3D Object Detection Network

Multiclass objects detection algorithm using DarkNet-53 and DenseNet for intelligent vehicles

A Multi-view 3D Vehicle Detection Method Based On Novel 3D Proposal Generation Method

Monocular 3-D Vehicle Detection Using a Cascade Network for Autonomous Driving

Structure Aware Single-Stage 3D Object Detection From Point Cloud

An Anchor-Free 3D Object Detection Approach Based on Hierarchical Pillars

PLOT: a 3D Point Cloud Object Detection Network for Autonomous Driving.

HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds

AFDet: Anchor Free One Stage 3D Object Detection.

Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph

Multi-View 3D Object Detection Network for Autonomous Driving

Transformation-Equivariant 3D Object Detection for Autonomous Driving

Keypoint-Aware Single-Stage 3D Object Detector for Autonomous Driving

One-Stage Anchor-Free 3D Vehicle Detection from LiDAR Sensors

Ground-aware Monocular 3D Object Detection for Autonomous Driving

F-PVNet: Frustum-Level 3-D Object Detection on Point–Voxel Feature Representation for Autonomous Driving

AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection

3D-DFM: Anchor-Free Multimodal 3-D Object Detection With Dynamic Fusion Module for Autonomous Driving