3D Dynamic Multi-target Detection Algorithm Based on Cross-view Feature Fusion

Feng Zhou, Chongben Tao, Zhen Gao, Zufeng Zhang, Sifa Zheng, Yuan Zhu
DOI: https://doi.org/10.1109/tai.2023.3342104
2023-01-01
IEEE Transactions on Artificial Intelligence
Abstract: In autonomous driving, current single-modal algorithms suffer from data degradation and insufficiently rich features, so they cannot perform dynamic multi-target detection effectively. To address this, a 3D dynamic multi-target detection algorithm based on cross-view feature fusion is proposed. The algorithm uses a two-stage parallel fusion framework whose first stage extracts point cloud and image features simultaneously. A LiDAR-camera feature mapping module is designed to establish point-wise correspondence between the two kinds of data. A feature weighted fusion module then estimates the weight of each point in the point cloud features and the image features. In the second stage, a keypoint-based feature extraction module enriches the features by integrating the multi-scale features and image features from the first stage, which improves detection accuracy. The proposed algorithm was compared with other state-of-the-art (SOTA) methods on the KITTI, Waymo, and nuScenes datasets. The results showed that detection accuracy for vehicle targets reached 93.03%. A module ablation study and accuracy tests on a self-made dataset showed that the proposed algorithm is robust, portable, and generalizes well, in addition to achieving high detection accuracy.
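The abstract does not specify how the mapping and weighted fusion modules are built, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a pinhole-style 3 x 4 projection matrix, nearest-neighbour feature lookup, point and image features of equal width, and a single sigmoid gate per point. All function names, parameters, and values (`project_points`, `sample_image_features`, `weighted_fusion`, `w_gate`, `b_gate`) are hypothetical.

```python
import numpy as np

def project_points(points_xyz, proj):
    """Project N x 3 points to pixel coordinates with a 3 x 4 matrix (assumed)."""
    n = points_xyz.shape[0]
    homo = np.hstack([points_xyz, np.ones((n, 1))])  # N x 4 homogeneous points
    cam = homo @ proj.T                              # N x 3
    depth = cam[:, 2]
    in_front = depth > 1e-6                          # points behind the camera are invalid
    uv = np.zeros((n, 2))
    uv[in_front] = cam[in_front, :2] / depth[in_front, None]
    return uv, in_front

def sample_image_features(feat_map, uv, valid):
    """Nearest-neighbour lookup in an H x W x C image feature map, one row per point."""
    h, w, c = feat_map.shape
    cols = np.rint(uv[:, 0]).astype(int)
    rows = np.rint(uv[:, 1]).astype(int)
    inside = valid & (cols >= 0) & (cols < w) & (rows >= 0) & (rows < h)
    out = np.zeros((uv.shape[0], c))
    out[inside] = feat_map[rows[inside], cols[inside]]
    return out, inside

def weighted_fusion(point_feat, img_feat, inside, w_gate, b_gate):
    """Blend both feature sources with a per-point scalar gate in (0, 1)."""
    joint = np.concatenate([point_feat, img_feat], axis=1)   # N x 2C
    gate = 1.0 / (1.0 + np.exp(-(joint @ w_gate + b_gate)))  # N
    gate = np.where(inside, gate, 1.0)  # no image evidence: keep the point feature
    fused = gate[:, None] * point_feat + (1.0 - gate[:, None]) * img_feat
    return fused, gate

# Toy inputs (made up for illustration only).
rng = np.random.default_rng(0)
points = np.array([[0.0, 0.0, 5.0], [1.0, 0.5, 10.0], [0.0, 0.0, -2.0]])
proj = np.array([[100.0, 0.0, 32.0, 0.0],
                 [0.0, 100.0, 24.0, 0.0],
                 [0.0, 0.0, 1.0, 0.0]])
feat_map = rng.normal(size=(48, 64, 4))    # H x W x C image features
point_feat = rng.normal(size=(3, 4))       # N x C point features

uv, in_front = project_points(points, proj)
img_feat, inside = sample_image_features(feat_map, uv, in_front)
fused, gate = weighted_fusion(point_feat, img_feat, inside, rng.normal(size=8), 0.0)
```

In this sketch the third point lies behind the camera, so it receives no image feature and its fused feature equals its point feature. In a trained network the gate parameters would be learned rather than drawn at random.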