Self-Enhanced Feature Fusion for RGB-D Semantic Segmentation

Pengcheng Xiang, Baochen Yao, Zefeng Jiang, Chengbin Peng
DOI: https://doi.org/10.1109/lsp.2024.3475352
2024-11-09
IEEE Signal Processing Letters
Abstract: Effectively fusing depth and RGB information to fully leverage their complementary strengths is essential for advancing RGB-D semantic segmentation. However, when fusing depth with RGB information, traditional methods often overlook noise in the depth data, presuming it to be highly accurate. To resolve this issue, we propose a self-enhanced feature fusion network (SEFnet) for RGB-D semantic segmentation. It comprises three main steps. First, RGB and depth embeddings from the initial layers of the network are fused. Second, the fused features are enhanced by pure RGB embeddings and progressively guided by semantic edge labels to suppress irrelevant features. Finally, the enhanced features are combined with high-level RGB features and fed into a normalizing flow decoder to obtain segmentation results. Experimental results demonstrate that the proposed approach provides accurate predictions, outperforming state-of-the-art methods on benchmark datasets.
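The data flow of the three steps can be illustrated with a minimal sketch. The abstract does not specify the fusion operators, so the element-wise sum, the sigmoid gate, and the concatenation below are placeholder assumptions, and all function names are hypothetical. Semantic edge supervision and the normalizing flow decoder are omitted. This is not the authors' implementation.

```python
import math


def fuse_early(rgb_low, depth_low):
    """Step 1 (assumed operator): element-wise sum of low-level RGB and depth features."""
    if len(rgb_low) != len(depth_low):
        raise ValueError("RGB and depth features must have the same length")
    return [r + d for r, d in zip(rgb_low, depth_low)]


def enhance_with_rgb(fused, rgb_low):
    """Step 2 (assumed operator): gate the fused features with a sigmoid of the RGB features.

    The gate stands in for 'enhancement by pure RGB embeddings'; the edge-label
    guidance described in the abstract is a training signal and is not modeled here.
    """
    gates = [1.0 / (1.0 + math.exp(-r)) for r in rgb_low]
    return [f * g for f, g in zip(fused, gates)]


def combine_with_high_level(enhanced, rgb_high):
    """Step 3 (assumed operator): concatenate enhanced features with high-level RGB features."""
    return enhanced + rgb_high


def sef_forward(rgb_low, depth_low, rgb_high):
    """Chain the three steps; the result would be passed to a decoder (not shown)."""
    fused = fuse_early(rgb_low, depth_low)
    enhanced = enhance_with_rgb(fused, rgb_low)
    return combine_with_high_level(enhanced, rgb_high)


if __name__ == "__main__":
    # Toy feature vectors; real features would be multi-channel feature maps.
    print(sef_forward([0.0, 0.0], [2.0, 4.0], [1.0]))
```

In this toy setting the RGB-derived gate scales each fused value, which is one simple way RGB evidence could down-weight unreliable depth contributions.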
engineering, electrical & electronic