Abstract:3D object detection plays a pivotal role in autonomous driving. Although single-stage detectors excel in speed, they often fall short in accuracy. We have identified two main issues. First, there is a significant discrepancy in prediction accuracy across different Intersection over Union (IoU) thresholds, indicating the presence of localization errors within the model. Second, traditional point-based detection models rely heavily on 1×1 convolution operations at the Set Abstraction layer, neglecting the relationship between adjacent points. To address these issues, we present the Magnification Transformation Single-Stage Detector (MT-SSD), featuring an innovative magnification Linear Transformation Module. This module applies a linear transformation to the original point cloud, sampling radius, and object labels, magnifying the error between model predictions and true values. During inference, an inverse linear transformation is applied to the detections to achieve accurate object localization. Moreover, MT-SSD introduces the Contextual Set Abstraction (CSA) layer, incorporating 1×N convolutions within the Set Abstraction layer to achieve more thorough aggregation of features among neighboring points. Our comprehensive evaluations on various autonomous driving datasets validate MT-SSD's superior performance and efficiency. Particularly noteworthy is its achievement on the Waymo Open Dataset, where MT-SSD establishes new benchmarks in single-stage 3D object detection, setting a series of state-of-the-art records. The code is available at <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/qifeng22/MT-SSD</uri> .

Ret3D: Rethinking Object Relations for Efficient 3D Object Detection in Driving Scenes

MT-SSD: Single-Stage 3D Object Detector Based on Magnification Transformation

Real-Time And Robust 3D Object Detection with Roadside LiDARs

Stereo RGB and Deeper LIDAR-Based Network for 3D Object Detection in Autonomous Driving.

Cross-Modal 3D Object Detection and Tracking for Auto-Driving

6DoF-3D: Efficient and accurate 3D object detection using six degrees-of-freedom for autonomous driving

DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds

RT3D: Real-Time 3-D Vehicle Detection in LiDAR Point Cloud for Autonomous Driving

DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries

Complementary Features With Reasonable Receptive Field For Road Scene 3d Object Detection

InfraDet3D: Multi-Modal 3D Object Detection based on Roadside Infrastructure Camera and LiDAR Sensors

DALDet: Depth-Aware Learning Based Object Detection for Autonomous Driving

Stereo RGB and Deeper LIDAR Based Network for 3D Object Detection

Sparse4D v3: Advancing End-to-End 3D Detection and Tracking

R2Det: Redemption from Range-view for Accurate 3D Object Detection

Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences

Better Monocular 3D Detectors with LiDAR from the Past

RTM3D: Real-Time Monocular 3D Detection from Object Keypoints for Autonomous Driving

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection

Enhanced 3D object detection for autonomous driving: A spatial-temporal alignment approach in Bird's Eye View scenarios

Optimizing Anchor-based Detectors for Autonomous Driving Scenes