Abstract:Automotive radar systems have evolved to provide not only range, azimuth and Doppler velocity, but also elevation data. This additional dimension allows for the representation of 4D radar as a 3D point cloud. As a result, existing deep learning methods for 3D object detection, which were initially developed for LiDAR data, are often applied to these radar point clouds. However, this neglects the special characteristics of 4D radar data, such as the extreme sparsity and the optimal utilization of velocity information. To address these gaps in the state-of-the-art, we present RadarPillars, a pillar-based object detection network.
By decomposing radial velocity data, introducing PillarAttention for efficient feature extraction, and studying layer scaling to accommodate radar sparsity, RadarPillars significantly outperform state-of-the-art detection results on the View-of-Delft dataset. Importantly, this comes at a significantly reduced parameter count, surpassing existing methods in terms of efficiency and enabling real-time performance on edge devices.
What problem does this paper attempt to address?
### What problems does this paper attempt to solve?
This paper aims to solve the unique challenges of 4D radar point - cloud data in object detection, especially the problem that existing deep - learning methods fail to fully utilize the characteristics of 4D radar data. Specifically:
1. **Extreme Sparseness**: Compared with LiDAR data, 4D radar data is very sparse. For example, in the View - of - Delft dataset, the 4D radar scan has an average of only 216 points, while the LiDAR scan in the same field of view contains 21,344 points. This sparseness makes traditional LiDAR - based detection methods ineffective when applied to 4D radar.
2. **Optimal Utilization of Velocity Information**: 4D radar not only provides spatial position information (range, azimuth, elevation), but also provides radial velocity (Doppler velocity). However, existing methods have not fully explored or adapted to these unique features, especially how to effectively use velocity information to improve detection performance.
3. **Real - time and Efficiency**: Many existing improvement methods, although they improve the detection performance, often at the cost of increasing computational complexity and memory requirements, resulting in the inability to achieve real - time processing, especially on edge devices.
To this end, the author proposes RadarPillars, a new 3D object detection network based on pillar representation, aiming to solve the above problems in the following ways:
- **Enhanced Utilization of Velocity Information**: By decomposing radial velocity data and extracting additional features, the network performance is significantly improved.
- **Adaptation to Radar Sparseness**: Use the pillar representation method for efficient real - time processing, and introduce the PillarAttention mechanism, treating each pillar as a token to maintain efficiency and real - time performance.
- **Network Scaling Optimization**: By uniformly scaling the network structure, reduce the number of parameters, improve the runtime efficiency, and maintain high performance.
These innovations make RadarPillars perform significantly better than existing methods on the View - of - Delft dataset, especially achieving higher detection accuracy and faster inference speed with a significant reduction in the number of parameters.