Attention Pyramid Module for Scene Recognition

Zhinan Qiao, Xiaohui Yuan, Chengyuan Zhuang, Abolfazl Meyarian
DOI: https://doi.org/10.1109/icpr48806.2021.9412235
2021-01-10
Abstract: The unrestricted open vocabulary and diverse content of scenery images pose significant challenges for scene recognition. However, most deep learning architectures and attention methods are developed on general-purpose datasets and overlook the characteristics of scene data. In this paper, we exploit the Attention Pyramid Module (APM) to tackle the predicament of scene recognition. Our method streamlines the multi-scale scene recognition pipeline, learns comprehensive scene features at various scales and locations, addresses the interdependency among scales, and further assists feature re-calibration as well as the aggregation process. APM is extremely lightweight and can be plugged into existing network architectures in a parameter-efficient manner. By integrating APM into ResNet-50, we improve top-1 accuracy by 3.54% on the benchmark dataset. Our comprehensive experiments demonstrate that APM achieves much improved performance compared with state-of-the-art attention methods while using a significantly smaller computation budget. Source code is available at https://github.com/ZN-Qiao/APM.