Real-Time Volumetric Perception for Unmanned Surface Vehicles Through Fusion of Radar and Camera

Hu Xu,Xiaomin Zhang,Ju He,Yang Yu,Yuwei Cheng
DOI: https://doi.org/10.1109/tim.2024.3381690
IF: 5.6
2024-04-05
IEEE Transactions on Instrumentation and Measurement
Abstract:In recent years, unmanned surface vehicles (USVs) have played an increasingly important role in various applications. Due to the expansion of USV application scenes from common marine areas to inland waters with complex environments, environmental perception has become an essential requirement for autonomous navigation systems of USVs. Traditional perception methods utilize either light detection and ranging (LiDAR) or radar to construct volumetric maps for environmental perception. To improve the accuracy of perception systems and reduce deployment costs, this article proposes a novel radar and camera fusion volumetric map network named FVMNet for real-time volumetric perception. FVMNet is based on a novel radar and image fusion architecture and comprises four modules: 1) the radar and image encoders can extract different features; 2) only using in training stage without extra valid time costs, auxiliary segmentation head advances the image encoder; 3) to eliminate the representation difference between image features and radar features, the BEV spatial transformer module transfers image feature representations from the perspective view to BEV space; and 4) the fusion segmentation head predicts the volumetric perception results. Compared to other baseline methods that use a single modality, FVMNet achieves state-of-the-art accuracy in the public USVInland dataset and our collected wharf dataset. We conducted comprehensive ablation experiments to validate the efficacy of the designed modules. Moreover, the proposed method demonstrates generalization in zero-shot real-world scenarios and robustness under extreme weather conditions.
engineering, electrical & electronic,instruments & instrumentation
What problem does this paper attempt to address?