Sparse Embedded Convolution Based Dual Feature Aggregation 3D Object Detection Network

Hai-Sheng Li,Yan-Ling Lu
DOI: https://doi.org/10.1007/s11063-024-11506-2
IF: 2.565
2024-02-14
Neural Processing Letters
Abstract:The algorithm design of compatible detection speed and accuracy based on LiDAR point clouds is a challenging issue in various practical applications of 3D object detection, including the field of autonomous driving. This paper designs a single-stage object detection algorithm that is lightweight and compatible with detection speed and accuracy for the above issue. To achieve these objectives, we propose a framework for a 3D object detection algorithm using a single-stage detection network as the backbone network. Firstly, we design a dual feature extraction module to reduce the occurrence of vehicle miss and error detection problems. Then, we use a multi-scale feature fusion scheme to fuse feature information with different scales. Furthermore, we design a data enhancement scheme suitable for this network architecture. Experimental results in the KITTI dataset show that the proposed method achieves improvement ratios of 38.5% for the detection speed and 2.88% 13.65% in terms of the average precision of vehicle detection compared to the existing algorithm based on single-stage object detection (SECOND).
computer science, artificial intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve compatibility between detection speed and accuracy in 3D object detection based on LiDAR point clouds. Specifically, the author points out that in fields such as autonomous driving, the design of 3D object detection algorithms faces the challenge of how to improve the detection speed while maintaining high detection accuracy. Although existing methods have made certain progress in single - stage or two - stage detectors, there are still problems such as high computational cost and difficulty in achieving both detection accuracy and speed. For this reason, this paper proposes a lightweight single - stage 3D object detection algorithm, aiming to simultaneously improve the detection speed and accuracy by designing a dual - feature extraction module, a multi - scale feature fusion scheme and a new data augmentation scheme, so as to solve the above challenges. Experimental results show that this method has a significant improvement in detection speed and average precision of vehicle detection on the KITTI dataset compared with existing algorithms (such as SECOND).