Dual Attention Based Multi-scale Feature Fusion Network for Indoor RGBD Semantic Segmentation.

Zhongwei Hua,Lizhe Qi,Daming Du,Wenxuan Jiang,Yunquan Sun
DOI: https://doi.org/10.1109/icpr56361.2022.9956246
2022-01-01
Abstract:RGBD semantic segmentation combined with color image information and depth information can effectively alleviate the problems of low classification accuracy and difficulty in accurately dividing edges between different semantic regions in indoor scenes caused by complex backgrounds, uneven lighting, similar object textures, spatial overlap, and occlusion. To fully fuse the color features with the spatial position and hierarchical information of objects, this paper proposes a multi-scale network model based on the dual attention mechanism (channel attention and spatial attention)(DAMFNet), which effectively integrates color texture features and spatial structure features, and further improves the semantic segmentation performance of indoor objects. We evaluate the proposed network model on the common indoor dataset SUNRGBD and achieve state-of-the-art results. In addition, this paper also demonstrates the excellent segmentation accuracy and effect of the proposed network model on self-built indoor datasets and in real-world application scenarios.
What problem does this paper attempt to address?