BA-Net: Bridge Attention in Deep Neural Networks

Ronghui Zhang,Runzong Zou,Yue Zhao,Zirui Zhang,Junzhou Chen,Yue Cao,Chuan Hu,Houbing Song
2024-10-11
Abstract:Attention mechanisms, particularly channel attention, have become highly influential in numerous computer vision tasks. Despite their effectiveness, many existing methods primarily focus on optimizing performance through complex attention modules applied at individual convolutional layers, often overlooking the synergistic interactions that can occur across multiple layers. In response to this gap, we introduce bridge attention, a novel approach designed to facilitate more effective integration and information flow between different convolutional layers. Our work extends the original bridge attention model (BAv1) by introducing an adaptive selection operator, which reduces information redundancy and optimizes the overall information exchange. This enhancement results in the development of BAv2, which achieves substantial performance improvements in the ImageNet classification task, obtaining Top-1 accuracies of 80.49% and 81.75% when using ResNet50 and ResNet101 as backbone networks, respectively. These results surpass the retrained baselines by 1.61% and 0.77%, respectively. Furthermore, BAv2 outperforms other existing channel attention techniques, such as the classical SENet101, exceeding its retrained performance by 0.52% Additionally, integrating BAv2 into advanced convolutional networks and vision transformers has led to significant gains in performance across a wide range of computer vision tasks, underscoring its broad applicability.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the limitations of existing attention mechanisms in convolutional neural networks (ConvNets). In particular, the channel attention mechanism mainly focuses on optimizing performance in a single convolutional layer, while ignoring the synergy across multiple convolutional layers. Specifically: 1. **Limitations of Existing Methods**: - Most existing channel attention mechanisms mainly focus on optimizing the performance of a single convolutional layer through complex attention modules, while ignoring the information flow and synergy between different convolutional layers. - Although this method can improve performance, it often increases the complexity and computational cost of the model, making the model more complex and computationally intensive. 2. **Core of the Problem**: - How to more effectively integrate and transfer information between different convolutional layers to improve the performance of the channel attention mechanism while maintaining the lightweight and high - efficiency of the model. - Existing methods usually only extract features from adjacent convolutional layers, lacking effective utilization of the features of the previous layers, resulting in low correlation between the generated attention weights and the previous layers and insufficient adaptability. 3. **Proposed Solution**: - To solve the above problems, the paper introduces a new bridging attention mechanism (Bridge Attention), especially the BAv2 module. By bridging operations, the features of the previous layers are combined with the features of the current layer, thereby enhancing the information flow and synergy. - The BAv2 module dynamically selects and fuses features from different convolutional layers by introducing an adaptive selection operator, reducing information redundancy and optimizing the overall information exchange. 4. **Specific Improvements**: - **Bridging Operation**: Through the bridging operation, the features of the previous layers are fused with the features of the current layer to generate more representative channel weights. - **Adaptive Selection Operator**: Through the adaptive selection operator, the feature weights from different layers are dynamically adjusted, further improving the effect of attention generation. 5. **Experimental Verification**: - The paper verifies the superior performance of the BAv2 module in the ImageNet classification task and other computer vision tasks through a large number of experiments, proving its wide applicability and effectiveness in various deep neural network architectures. In summary, this paper aims to solve the limitations of existing channel attention mechanisms in multi - layer convolutional neural networks by introducing the bridging attention mechanism, especially in the BAv2 module, so as to achieve a more efficient and lightweight attention mechanism design.