Class-Guide Deformable Alignment Fusion for Real-Time Semantic Segmentation

Liangcheng Qin, Zhichao Sha, Shilin Zhou, Wei Du, Yulan Guo, Yaoyuan Zeng
DOI: https://doi.org/10.1145/3604078.3604105
2023-01-01
Abstract: Feature aggregation has an important impact on semantic segmentation, especially on the segmentation of object boundaries. Recent advances in feature aggregation techniques have driven great progress in semantic segmentation. However, effectively combining high-level semantic features with low-level spatial features remains a challenge. In this work, we propose the Class-Guide Deformable Alignment Fusion network (CGDAFNet) for real-time semantic segmentation. Instead of simply concatenating or adding cross-level features directly, we propose a class-guide deformable alignment fusion module (CGDAFM) that handles the misalignment between cross-level features and thereby fuses them effectively. A grouped double normalization residual module (GDNRM) is proposed to improve segmentation performance at a small computational cost. Experimental results on the CamVid and Cityscapes datasets verify the effectiveness of our method. Our CGDAFNet achieves an mIoU of 78.8% on the Cityscapes dataset with an input resolution of 1024×2048, running at 39 FPS on a single RTX 3080.
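The mIoU figure quoted in the abstract is the mean, over classes, of intersection-over-union between predicted and ground-truth labels. The sketch below is a minimal illustration of that standard metric on flat label lists, not the authors' evaluation code. The function name `mean_iou` and the toy labels are assumptions made for illustration. Benchmark evaluation typically accumulates the confusion matrix over the whole dataset and skips ignored labels, which this sketch omits.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target.

    pred and target are equal-length sequences of integer class ids.
    (Hypothetical helper for illustration; not from the paper.)
    """
    # confusion[t][p] counts pixels of ground-truth class t predicted as p
    confusion = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        confusion[t][p] += 1

    ious = []
    for c in range(num_classes):
        tp = confusion[c][c]
        fn = sum(confusion[c]) - tp                    # class c pixels missed
        fp = sum(row[c] for row in confusion) - tp     # pixels wrongly labelled c
        union = tp + fp + fn
        if union > 0:                                  # skip absent classes
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes and six "pixels"
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
print(mean_iou(pred, target, 3))
```

In the toy example the per-class IoUs are 1/3, 2/3 and 1/2, so the mean is 0.5.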