Multi-branch Input Structure for Pyramid Scene Parsing Network

Chuanya Wang,Jin Chen
DOI: https://doi.org/10.1007/978-981-32-9686-2_80
2019-01-01
Abstract:Deep convolutional neural networks have been widely researched and have made outstanding achievements in the field of target recognition and image segmentation in recent years. In this paper, we propose a multi-branch input method for aggregating feature information at the early convolutional layer. At present, the popular neural network model uses single branch down-sampling to obtain feature maps, while we built four-branches structure with different dimension channels to segment the original image. Then, in the fusion unit, four feature maps are fused point by point and transmitted to the next convolution layer. Experiments show that the multi-branch input structure mentioned above can improves the system performance and saves the training time. In the experiment, we use the ADE20K dataset and the Cityscapes dataset, both of which are considered as high quality semantically segmented datasets.
What problem does this paper attempt to address?