Adaptive local recalibration network for scene recognition

Jiale Wang, Lian Zou, Cien Fan, Hao Jiang, Liqiong Chen, Mofan Cheng, Hu Yu, Yifeng Liu
DOI: https://doi.org/10.1007/s10489-023-04963-0
IF: 5.3
2023-01-01
Applied Intelligence
Abstract: Scene recognition is a computer vision task that categorizes scenes from photographs. In this paper, we introduce the Adaptive Local Recalibration Network (ALR-Net), a novel scene recognition method based on convolutional neural networks (CNNs). Compared with object classification images, scene classification images have a more dispersed distribution of information. To address this issue, we propose an attention mechanism that locates the discriminative regions for scene recognition. Along with standard data augmentation, we use these regions to guide two additional data augmentation approaches, adaptive cropping and adaptive hiding, in order to capture local information more efficiently and specifically. Attention maps are also used to adaptively recalibrate scene feature maps so that discriminative regions receive more attention than others. In addition, we introduce a scene distribution label for each image, which is used to assist the training of attention maps. Extensive experiments on two scene recognition benchmarks verify the proposed model's effectiveness: MIT67 (88.37%) and SUN397 (74.24%).
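The abstract names three attention-driven operations (adaptive cropping, adaptive hiding, and feature recalibration) without giving their exact formulation. The following is only a minimal sketch of how such operations could look, not the authors' implementation. It assumes a single 2D attention map at the same resolution as the image or feature map, a bounding box obtained by thresholding at a fraction of the peak attention, and elementwise multiplication for recalibration. The function names, the `ratio` threshold, and the `fill` value are all hypothetical.

```python
from typing import List, Tuple

Grid = List[List[float]]  # a 2D map stored as a list of rows


def attention_bbox(attn: Grid, ratio: float = 0.5) -> Tuple[int, int, int, int]:
    """Inclusive (top, left, bottom, right) box around cells whose attention
    is at least `ratio` times the map's peak (assumed thresholding rule)."""
    thresh = ratio * max(max(row) for row in attn)
    rows = [i for i, row in enumerate(attn) if any(v >= thresh for v in row)]
    cols = [j for j in range(len(attn[0])) if any(row[j] >= thresh for row in attn)]
    return rows[0], cols[0], rows[-1], cols[-1]


def adaptive_crop(image: Grid, attn: Grid, ratio: float = 0.5) -> Grid:
    """Keep only the high-attention region, so a model sees local detail."""
    t, l, b, r = attention_bbox(attn, ratio)
    return [row[l:r + 1] for row in image[t:b + 1]]


def adaptive_hide(image: Grid, attn: Grid, ratio: float = 0.5, fill: float = 0.0) -> Grid:
    """Mask the high-attention region, so a model must use other regions."""
    t, l, b, r = attention_bbox(attn, ratio)
    return [
        [fill if t <= i <= b and l <= j <= r else v for j, v in enumerate(row)]
        for i, row in enumerate(image)
    ]


def recalibrate(features: Grid, attn: Grid) -> Grid:
    """Reweight each feature cell by its attention value (assumed elementwise)."""
    return [[f * a for f, a in zip(frow, arow)] for frow, arow in zip(features, attn)]


# Toy 4x4 example: attention peaks in the central 2x2 block.
attn = [
    [0.1, 0.1, 0.1, 0.1],
    [0.1, 0.9, 0.8, 0.1],
    [0.1, 0.7, 1.0, 0.1],
    [0.1, 0.1, 0.1, 0.1],
]
image = [[float(4 * i + j + 1) for j in range(4)] for i in range(4)]

cropped = adaptive_crop(image, attn)   # central 2x2 block of the image
hidden = adaptive_hide(image, attn)    # same block replaced by the fill value
weighted = recalibrate(image, attn)    # every cell scaled by its attention
```

In this sketch, cropping and hiding act as complementary augmentations: the first zooms in on the region the attention map marks as discriminative, while the second removes it so that other parts of the scene must carry the prediction.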