Image Semantic Segmentation Based on Encoder-Decoder Network

Xiaopin Zhao,Weibin Liu,Weiwei Xing
DOI: https://doi.org/10.1145/3398329.3398357
2020-01-01
Abstract:Semantic segmentation is an extremely important task in computer vision. At present, the related methods have achieved high performance. Nevertheless, Semantic segmentation still faces the challenge of localization accuracy due to DCNN invariance and existence of objects at multi-scale. In order to improve the accuracy of segmentation, this paper proposes a U-SEM encoder-decoder network. Firstly, in the encoding stage, it down-samples through the ResNet. Secondly, in the decoding stage, in order to filter and utilize the useful features, the SE-Mobile Block is proposed and fused to the network. The SE block adopts the idea of attention mechanism to focus on useful features and ignore those redundant features. Mobile blocks use deep separable convolutions to replace traditional convolutions, speeding up operations and reducing parameters. Finally, it adopts the skip structure where the feature information of different scales are merged to produce accurate and detailed segmentation. Experimental results show that the proposed network achieves good performance on multiple datasets which reaches the accuracy of 78.4% mIOU on PASCAL VOC 2012 and 75.7% mIOU on Cityscapes dataset.
What problem does this paper attempt to address?