Scene Classification Via Learning a Multi-Branch Convolutional Network.

Xiaoyong Bian,Chunfang Chen,Yang Chen,Xiongjun Fei,Jingshan Tang
DOI: https://doi.org/10.1109/smc.2019.8914533
2019-01-01
Abstract:Efficient and accurate classification of high-resolution scene remains a challenge of within-class diversity and between-class similarity due to rich image variations in viewpoint, object pose, spatial resolution and background. To address these issues, we propose a multi-branch convolutional neural network (MB-CNN), which focuses on tackling the problem of learning the appropriate representation of a high-resolution scene that is rich enough to discriminate between different semantic classes. First, pyramid scene parsing network (PSPNet) with minor modification is introduced to gather global object information. Then, an attention net is proposed to highlight transformation invariance and key regions for attention feature extraction. Finally, above two branches are fused with original input branch to learn consistently semantic class information and generate powerful predictions. Our approach achieves better performance favorably against state-of-the-arts on two publicly available scene datasets.
What problem does this paper attempt to address?