Cross Aggregation Network for Semantic Segmentation

Minghua Zhao, Yuxing Zhi, Shuangli Du, Xinhong Hei, Jing Hu, Cheng Shi, Peng Li
DOI: https://doi.org/10.2139/ssrn.4125814
2022-01-01
Abstract: To achieve more accurate prediction, advanced semantic segmentation methods increasingly rely on context modeling. Images of real scenes usually contain objects and content at multiple scales. The way features propagate through a convolutional network is therefore very important for capturing multi-scale context and obtaining accurate segmentation. This paper proposes a novel pattern of information-flow aggregation to enrich feature expression, called the Cross Aggregation Module (CRA). In CRA, flows of different scales are transmitted to the context aggregation module (CAM) through a parallel-cross connection, generating a more informative feature map that aggregates multi-scale features and captures long-range context information. We further add low-level features at a later stage of the encoder to enhance feature interaction. Based on these developments, we build a Context Cross Aggregation Network (CRANet), which employs an asymmetric decoder to restore the scale of the predicted feature map. The proposed CRANet is evaluated on two challenging datasets, Cityscapes and CamVid. The experiments show that the proposed network achieves competitive performance.