FAFNet: Fully aligned fusion network for RGBD semantic segmentation based on hierarchical semantic flows

Jiazhou Chen,Yangfan Zhan,Yanghui Xu,Xiang Pan
DOI: https://doi.org/10.1049/ipr2.12614
IF: 2.3
2022-08-30
IET Image Processing
Abstract:Depth maps are acquirable and irreplaceable geometric information that significantly enhances traditional color images. RGB and Depth (RGBD) images have been widely used in various image analysis applications, but they are still very limited due to challenges from different modalities and misalignment between color and depth. In this paper, a Fully Aligned Fusion Network (FAFNet) for RGBD semantic segmentation is presented. To improve cross‐modality fusion, a new RGBD fusion block is proposed, features from color images and depth maps are first fused by an attention cross fusion module and then aligned by a semantic flow. A multi‐layer structure is also designed to hierarchically utilize the RGBD fusion block, which not only eases issues of low resolution and noises for depth maps but also reduces the loss of semantic features in the upsampling process. Quantitative and qualitative evaluations on both the NYU‐Depth V2 and the SUN RGB‐D dataset demonstrate that the FAFNet model outperforms state‐of‐the‐art RGBD semantic segmentation methods.
computer science, artificial intelligence,engineering, electrical & electronic,imaging science & photographic technology
What problem does this paper attempt to address?