Spatial Decomposition and Aggregation for Attention in Convolutional Neural Networks

Meng Zhu, Weidong Min, Hongyue Xiang, Cheng Zha, Zheng Huang, Longfei Li, Qiyan Fu
DOI: https://doi.org/10.1142/s0218001423520195
IF: 1.261
2024-01-01
International Journal of Pattern Recognition and Artificial Intelligence
Abstract: Channel attention has been shown to improve the performance of deep convolutional neural networks efficiently. It adaptively recalibrates the importance of each channel, determining what to attend to. However, channel attention encodes only inter-channel information and neglects positional information, which is crucial in determining where to attend. To address this issue, we propose a novel channel-spatial attention method named Spatial-Decomposition-Aggregation Attention (SDAA). First, a high-axis spatial direction is decomposed into multiple low-axis spatial directions. Then, a shared transformation sub-unit establishes attention in each low-axis spatial direction. Next, all the low-axis attention masks are aggregated into a high-axis attention mask. Finally, the generated high-axis attention mask is fused into the input features, thus enhancing them. Essentially, our method is a divide-and-conquer process. Experimental results demonstrate that our SDAA method outperforms existing channel-spatial attention methods.
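The four steps in the abstract (decompose, transform, aggregate, fuse) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the pooling operator, the form of the shared transformation sub-unit, or the aggregation rule. The sketch assumes a 2D (height, width) space decomposed into two 1D directions by average pooling, a hypothetical shared scalar affine map followed by a sigmoid, and aggregation by outer product. The names `sdaa_sketch`, `shared_transform`, `weight`, and `bias` are illustrative only.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def shared_transform(descriptor, weight, bias):
    # Assumed stand-in for the shared transformation sub-unit:
    # the same affine map + sigmoid is applied to every 1D descriptor.
    return [sigmoid(weight * v + bias) for v in descriptor]


def sdaa_sketch(features, weight=1.0, bias=0.0):
    """Apply a decompose/aggregate attention mask to C x H x W nested lists."""
    enhanced = []
    for channel in features:
        h, w = len(channel), len(channel[0])

        # 1. Decompose: pool the 2D map into two 1D descriptors
        #    (average pooling is an assumption).
        desc_h = [sum(row) / w for row in channel]
        desc_w = [sum(channel[i][j] for i in range(h)) / h for j in range(w)]

        # 2. Shared transform: one set of parameters for both directions.
        att_h = shared_transform(desc_h, weight, bias)
        att_w = shared_transform(desc_w, weight, bias)

        # 3. Aggregate: combine the 1D masks into an H x W mask
        #    (outer product is an assumption).
        mask = [[att_h[i] * att_w[j] for j in range(w)] for i in range(h)]

        # 4. Fuse: re-weight the input features with the mask.
        enhanced.append(
            [[channel[i][j] * mask[i][j] for j in range(w)] for i in range(h)]
        )
    return enhanced


if __name__ == "__main__":
    x = [[[4.0, 8.0], [12.0, 16.0]]]  # one channel, 2 x 2
    print(sdaa_sketch(x, weight=0.0, bias=0.0))
```

With `weight=0.0` and `bias=0.0`, every 1D attention value is 0.5, so the aggregated mask is uniformly 0.25. Non-zero parameters make the mask depend on position, which is the point of encoding positional information.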