“Where Does the Devil Lie?”: Multimodal Multitask Collaborative Revision Network for Trusted Road Segmentation

Guoguang Hua, Dalian Zheng, Shishun Tian, Wenbin Zou, Shenglan Liu, Xia Li
DOI: https://doi.org/10.1109/TMM.2024.3406146
IF: 7.3
2024-01-01
IEEE Transactions on Multimedia
Abstract: Road segmentation is an essential component of navigation systems. Despite recent advancements in road segmentation, segmentation failures remain inevitable. For safety-critical tasks such as navigation, knowing when and where road segmentation fails is crucial. In this paper, we propose a novel trusted road segmentation architecture, namely the Multimodal Multitask Collaborative Revision Network (M2CRN), to improve the trustworthiness of road segmentation. Our approach incorporates two strategies to predict and rectify segmentation errors. First, a joint learning framework is devised to generate road segmentation results while estimating failure segmentation masks. Second, the road segmentation branch is equipped with an Uncertainty-Aware Revision Module (UARM), which eliminates errors in road segmentation. Additionally, we suppress the response of error regions in the road segmentation branch with an innovative design called Adaptive Soft Error Suppression (ASES). To validate our methods, extensive experiments are conducted on three benchmark road segmentation datasets. The results demonstrate significant performance improvements at a real-time inference speed of 33.3 FPS, reaffirming the soundness of our revision model.