A Simple and Light-Weight Attention Module for Convolutional Neural Networks

Jongchan Park, Sanghyun Woo, Joon-Young Lee, In So Kweon
DOI: https://doi.org/10.1007/s11263-019-01283-0
IF: 13.369
2020-01-28
International Journal of Computer Vision
Abstract: Many aspects of deep neural networks, such as depth, width, or cardinality, have been studied to strengthen their representational power. In this work, we study the effect of attention in convolutional neural networks and present our idea in a simple self-contained module, called the Bottleneck Attention Module (BAM). Given an intermediate feature map, BAM efficiently produces an attention map along two factorized axes, channel and spatial, with negligible overhead. BAM is placed at the bottlenecks of various models, where the downsampling of feature maps occurs, and is jointly trained in an end-to-end manner. Ablation studies and extensive experiments are conducted on CIFAR-100/ImageNet classification, VOC2007/MS-COCO detection, super resolution, and scene parsing with various architectures, including mobile-oriented networks. BAM shows consistent improvements across all experiments, demonstrating its wide applicability. The code and models are available at https://github.com/Jongchan/attentionmodule.
computer science, artificial intelligence
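The abstract describes an attention map built along two factorized axes, channel and spatial. The sketch below illustrates that idea in plain Python on a nested-list feature map of shape C x H x W. It is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`channel_attention`, `spatial_attention`, `bam_sketch`) and the weight arguments are hypothetical, batch normalization is omitted, the spatial branch is reduced to a single 1x1 projection, and the two branches are assumed to be summed, passed through a sigmoid, and applied residually. The released code at the URL above is the reference for the actual module.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(feat, w1, w2):
    """Return one logit per channel for a C x H x W nested-list feature map."""
    # Global average pooling: one scalar per channel.
    pooled = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feat]
    # Small two-layer MLP (assumed shapes: w1 is hidden x C, w2 is C x hidden).
    hidden = [max(0.0, sum(w * p for w, p in zip(row, pooled))) for row in w1]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]


def spatial_attention(feat, proj):
    """Return an H x W map of logits via a 1x1 projection across channels.

    Simplification: a single per-pixel weighted sum stands in for the
    module's convolutional spatial branch.
    """
    height, width = len(feat[0]), len(feat[0][0])
    return [
        [sum(proj[c] * feat[c][i][j] for c in range(len(feat))) for j in range(width)]
        for i in range(height)
    ]


def bam_sketch(feat, w1, w2, proj):
    """Combine both branches: out = feat * (1 + sigmoid(Mc + Ms))."""
    mc = channel_attention(feat, w1, w2)   # broadcast over H and W
    ms = spatial_attention(feat, proj)     # broadcast over C
    return [
        [
            [x * (1.0 + sigmoid(mc[c] + ms[i][j])) for j, x in enumerate(row)]
            for i, row in enumerate(ch)
        ]
        for c, ch in enumerate(feat)
    ]


if __name__ == "__main__":
    # Toy input with C=2, H=W=2 and hand-picked illustrative weights.
    feat = [[[1.0, 2.0], [3.0, 4.0]], [[0.5, 0.5], [0.5, 0.5]]]
    out = bam_sketch(feat, w1=[[0.1, -0.2]], w2=[[0.3], [-0.4]], proj=[0.05, 0.1])
    print(out)
```

Because the attention values lie strictly between 0 and 1, the residual form scales each activation by a factor between 1 and 2, so the module can emphasize features without zeroing out the original signal.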