RGB-LiDAR fusion for accurate 2D and 3D object detection
Morteza Mousa-Pasandi,Tianran Liu,Yahya Massoud,Robert Laganière,Mousa-Pasandi, Morteza,Liu, Tianran,Massoud, Yahya,Laganière, Robert
DOI: https://doi.org/10.1007/s00138-023-01435-w
IF: 2.983
2023-08-21
Machine Vision and Applications
Abstract:Effective detection of road objects in diverse environmental conditions is a critical requirement for autonomous driving systems. Multi-modal sensor fusion is a promising approach for improving perception, as it enables the combination of information from multiple sensor streams in order to optimize the integration of their respective data. Fusion operators are employed within fully convolutional architectures to combine features derived from different modalities. In this research, we present a framework that utilizes early fusion mechanisms to train and evaluate 2D object detection algorithms. Our evaluation shows that sensor fusion outperforms RGB-only detection methods, yielding a boost of +15.07% for car detection, +10.81% for pedestrian detection, and +19.86% for cyclist detection. In our comparative study, we evaluated three arithmetic-based fusion operators and two learnable fusion operators. Furthermore, we conducted a performance comparison between early- and mid-level fusion techniques and investigated the effects of early fusion on state-of-the-art 3D object detectors. Lastly, we provide a comprehensive analysis of the computational complexity of our proposed framework, along with an ablation study.
computer science, cybernetics, artificial intelligence,engineering, electrical & electronic