Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation

Shang-Wei Hung,Shao-Yuan Lo,Hsueh-Ming Hang
DOI: https://doi.org/10.48550/arXiv.1809.09077
2019-05-20
Abstract:Semantic segmentation has made encouraging progress due to the success of deep convolutional networks in recent years. Meanwhile, depth sensors become prevalent nowadays, so depth maps can be acquired more easily. However, there are few studies that focus on the RGB-D semantic segmentation task. Exploiting the depth information effectiveness to improve performance is a challenge. In this paper, we propose a novel solution named LDFNet, which incorporates Luminance, Depth and Color information by a fusion-based network. It includes a sub-network to process depth maps and employs luminance images to assist the depth information in processes. LDFNet outperforms the other state-of-art systems on the Cityscapes dataset, and its inference speed is faster than most of the existing networks. The experimental results show the effectiveness of the proposed multi-modal fusion network and its potential for practical applications.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?