ECNet: an Efficient and Context-Aware Network for Street Scene Parsing.

Bin Jiang, Wenxuan Tu, Chao Yang, Yi Xiao
DOI: https://doi.org/10.1109/paap.2018.00042
2018-01-01
Abstract: Semantic segmentation for traffic scene parsing needs to be not only precise but also efficient enough for downstream use in self-driving systems. Most existing approaches employ a heavyweight structure as the base network and a multi-scale module to enhance context information, which often suffer from inefficient modeling and missing contextual information, respectively. We propose an Efficient and Context-Aware Network (ECNet) based on DenseNet121 and a novel multi-scale mechanism. First, we stack dense blocks with hybrid dilation rates to extract feature maps, which is much more efficient in parameter computation and model storage. Second, we present an Enhanced Atrous Spatial Pyramid Pooling (EASPP) module that enlarges the receptive field and obtains more discriminative features than Atrous Spatial Pyramid Pooling (ASPP). Additionally, our Decomposed Residual Block (DRB) module adopts decomposed convolution layers to boost spatial information without adding much computational burden, which helps narrow the gap between semantic level and spatial resolution. Experimental results on the Cityscapes and CamVid datasets show that our method is not only efficient but also more accurate for street scene understanding.
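The two efficiency arguments in the abstract can be illustrated with simple arithmetic. The sketch below is not the authors' code: the dilation rates (1, 2, 5), the channel width (64), and the reading of "decomposed convolution" as a k×k kernel factorized into k×1 and 1×k kernels are all assumptions made for illustration. It uses the standard formula for the effective size of a dilated kernel, k + (k - 1)(d - 1), and counts weights with biases ignored.

```python
from typing import Iterable, Tuple


def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Spatial extent covered by a dilated (atrous) kernel along one axis."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def receptive_field(layers: Iterable[Tuple[int, int]]) -> int:
    """Receptive field of stacked stride-1 convolutions.

    `layers` is a sequence of (kernel_size, dilation) pairs.
    """
    rf = 1
    for kernel_size, dilation in layers:
        rf += effective_kernel_size(kernel_size, dilation) - 1
    return rf


def conv_params(kernel_h: int, kernel_w: int, channels: int) -> int:
    """Weights in a convolution with equal input/output channels (no bias)."""
    return kernel_h * kernel_w * channels * channels


def decomposed_params(kernel_size: int, channels: int) -> int:
    """Weights when a k x k kernel is replaced by k x 1 followed by 1 x k.

    Assumption: this is what "decomposed convolution" refers to here.
    """
    return (conv_params(kernel_size, 1, channels)
            + conv_params(1, kernel_size, channels))


if __name__ == "__main__":
    # Hypothetical hybrid dilation rates; the abstract does not list them.
    hybrid = [(3, 1), (3, 2), (3, 5)]
    plain = [(3, 1), (3, 1), (3, 1)]
    print("receptive field, hybrid rates:", receptive_field(hybrid))
    print("receptive field, no dilation: ", receptive_field(plain))

    # Hypothetical channel width of 64.
    print("3x3 weights:       ", conv_params(3, 3, 64))
    print("3x1 + 1x3 weights: ", decomposed_params(3, 64))
```

Under these assumptions, three dilated 3×3 layers cover a much larger window than three undilated ones at the same parameter cost, and the factorized pair uses two thirds of the weights of a full 3×3 convolution.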