CCAFFMNet: Dual-spectral semantic segmentation network with channel-coordinate attention feature fusion module

Shi Yi,Junjie Li,Xi Liu,Xuesong Yuan
DOI: https://doi.org/10.1016/j.neucom.2021.11.056
IF: 6
2022-01-01
Neurocomputing
Abstract:Dual-spectral (RGB-thermal) semantic segmentation is a fundamental task for visual perception of autonomous driving in harsh imaging environments (such as darkness, rain, and fog). In recent years, the encoder-decoder dual-spectral semantic segmentation networks have achieved satisfactory results. However, existing networks pay little attention to the feature-fusion strategy of infrared and RGB features at each feature-fusion stage, which limits the performance of semantic segmentation. This study proposes a novel encoder-decoder-based dual-spectral semantic segmentation network. Channel-coordinate attention feature-fusion modules (CCAFFMs) are designed and inserted into each feature-fusion stage to obtain the channel and spatial correlations between infrared and RGB features. Thus, fused feature maps are refined in this way. A down-up-connected decoder with skip connections is designed to restore the resolution of the feature map and ensure that it contains more object details and sharper boundary contours. Furthermore, we manually annotate and augment the RoadScene dataset to construct the RoadScene-seg dataset. In this way, dual-spectral semantic segmentation can be extended to diverse autonomous driving environments. The results of extensive experiments on the MF and RoadScene-seg datasets prove the superiority of the proposed network over state-of-the-art methods. (C) 2021 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?