DSNet:Multi-resolution Dense Encoder and Stack Decoder Network for Aerial Image Segmentation

Yanwen Chong,Congchong Nie,Yulong Tao,Shaoming Pan
DOI: https://doi.org/10.1109/CAC48633.2019.8996431
2019-01-01
Abstract:Semantic segmentation in high resolution aerial image is faced with a challenge caused by ubiquitous fine-structure objects. Traditional encoder-decoder structure losses some detail information during the process of down-sampling, which is harmful to the location of fine-structure objects. In this work, we present a multi-resolution dense encoder and stack decoder network to deal with this problem. On the one hand, the dense encoder embeds shallow detailed feature into deep semantic feature through proposed information-reserved down-sampling method called CE-Pooling. On the other hand, the stack decoder gradually enhances the detailed feature through iterative attention fusion. Extensive experiments on several benchmark datasets have been conducted, which shows that our method is superior than the state-of-the-art approaches.
What problem does this paper attempt to address?