HP3D-V2V: High-Precision 3D Object Detection Vehicle-to-Vehicle Cooperative Perception Algorithm

Hongmei Chen, Haifeng Wang, Zilong Liu, Dongbing Gu, Wen Ye
DOI: https://doi.org/10.3390/s24072170
IF: 3.9
2024-03-29
Sensors
Abstract: Cooperative perception in the field of connected autonomous vehicles (CAVs) aims to overcome the inherent limitations of single-vehicle perception systems, including long-range occlusion, low resolution, and susceptibility to weather interference. In this regard, we propose a high-precision 3D object detection V2V cooperative perception algorithm. The algorithm uses a voxel grid-based statistical filter to denoise point cloud data and obtain clean, reliable input. In addition, we design a feature extraction network based on the fusion of voxels and PointPillars and encode its output to generate BEV features, which addresses the lack of spatial feature interaction in the PointPillars approach and enhances the semantic information of the extracted features. Max pooling is used to reduce the dimensionality and generate pseudo-images, thereby avoiding complex 3D convolutional computation. To facilitate effective feature fusion, we design a feature-level cross-vehicle feature fusion module. Experimental validation is conducted on the OPV2V dataset to assess cooperative perception performance and compare it with existing mainstream cooperative perception algorithms. Ablation experiments are also carried out to confirm the contributions of this approach. Experimental results show that our architecture is lightweight while achieving a higher average precision (AP) than other existing models.
Engineering, Electrical & Electronic; Chemistry, Analytical; Instruments & Instrumentation
What problem does this paper attempt to address?
The paper addresses high-precision 3D object detection in vehicle-to-vehicle (V2V) cooperative perception. Single-vehicle perception systems suffer from long-range occlusion, low resolution, and weather interference, so the authors propose an algorithm called HP3D-V2V that overcomes these limitations by sharing perception among vehicles. The algorithm first applies a voxel grid-based statistical filter to denoise the point cloud. It then uses a feature extraction network that fuses voxels with PointPillars, which compensates for the limited spatial feature interaction of PointPillars and enriches the semantic information of the extracted features. Max pooling reduces the dimensionality and produces pseudo-images, avoiding complex 3D convolutions. A feature-level cross-vehicle fusion module then merges feature information from multiple vehicles. The algorithm is validated on the OPV2V dataset, where the authors report a higher average precision (AP) than existing mainstream cooperative perception algorithms while the model remains lightweight. Ablation experiments confirm the contribution of each component. Overall, the paper aims to improve the accuracy and robustness of 3D object detection in autonomous driving through information sharing and fusion among vehicles.
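The denoising and pseudo-image steps described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The summary does not give the filter's exact criterion, so a simple voxel-occupancy threshold stands in for the voxel grid-based statistical filter, and the per-point features are assumed to be already computed. The function names `voxel_occupancy_filter` and `pillars_to_pseudo_image`, and the parameters `voxel_size`, `min_points`, and `pillar_size`, are hypothetical.

```python
import numpy as np


def voxel_occupancy_filter(points, voxel_size=0.5, min_points=2):
    """Drop points that lie in sparsely occupied voxels.

    Assumption: a plain occupancy threshold replaces the paper's
    statistical criterion, which is not specified in the summary.
    points: (N, 3+) array whose first three columns are x, y, z.
    """
    # Integer voxel index of every point along x, y, z.
    idx = np.floor(points[:, :3] / voxel_size).astype(np.int64)
    # Number of points sharing each occupied voxel.
    _, inverse, counts = np.unique(
        idx, axis=0, return_inverse=True, return_counts=True
    )
    inverse = inverse.reshape(-1)  # guard against NumPy-version shape differences
    return points[counts[inverse] >= min_points]


def pillars_to_pseudo_image(points, features, x_range, y_range, pillar_size):
    """Max-pool per-point features over vertical pillars into a (C, H, W) BEV image.

    points: (N, 3+) array; features: (N, C) array of per-point features.
    Empty pillars are filled with zeros.
    """
    width = int(round((x_range[1] - x_range[0]) / pillar_size))
    height = int(round((y_range[1] - y_range[0]) / pillar_size))
    col = np.floor((points[:, 0] - x_range[0]) / pillar_size).astype(np.int64)
    row = np.floor((points[:, 1] - y_range[0]) / pillar_size).astype(np.int64)
    # Keep only points that fall inside the BEV grid.
    inside = (col >= 0) & (col < width) & (row >= 0) & (row < height)
    flat = row[inside] * width + col[inside]
    feats = features[inside]

    channels = features.shape[1]
    image = np.full((channels, height * width), -np.inf)
    for c in range(channels):
        # Unbuffered scatter-max: each pillar keeps its largest feature value.
        np.maximum.at(image[c], flat, feats[:, c])
    image[image == -np.inf] = 0.0  # pillars that received no points
    return image.reshape(channels, height, width)
```

Because the height axis is collapsed by the max, the result is a 2D multi-channel image that ordinary 2D convolutions can process, which is why this step avoids 3D convolution.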