Hybrid Dilated Convolution Network Using Attentive Kernels for Real-Time Semantic Segmentation

Jiankai He,Bin Jiang,Chao Yang,Wenxuan Tu
DOI: https://doi.org/10.1007/978-3-030-60633-6_11
2020-01-01
Abstract:Though current semantic segmentation methods achieve high accuracy, most of them suffer from low speed, massive memory usage, and high computation complexity. To avoid these problems, we propose a light-weight network called Hybrid Dilated Convolution Network (HDCNet). HDCNet mainly consists of the Hybrid Scale-Aligned Block (HSAB) and the Attentive Depthwise Separable Block (ADSB). The HSAB adopts multiple small kernel convolutions with small-scale dilation rates to extract local information and applies several large kernel convolutions with large-scale dilation rates to encode global information, respectively. We further explore the best option to match the kernel size to the dilation scale. The ADSB is designed to decrease redundant parameters and enhance the critical information by depthwise separable convolution and mixed convolution kernels. In this way, ADSB and HSAB jointly encode multi-scale context information to improve model performance. Thereafter, we combine integrated local information with global information to generate final prediction results. Extensive experiments on Cityscape dataset have demonstrated that the proposed method reaches a better trade-off between accuracy and efficiency compared with other start-of-the-art methods. In particular, HDCNet obtains 72.82% MIoU with only 2.02M and 16.8 GFLOPs.
What problem does this paper attempt to address?