Spatial Group and Cross-Channel Attention: Make Smaller Models More Effective, Focus on High-Level Semantic Features
Ze-chen Zheng,Chao Fan,Miao Wang,Cong-qian Wang,Xue-lei He,Xiao-wei He
DOI: https://doi.org/10.1007/978-981-97-5615-5_6
2024-01-01
Abstract:In the realm of lightweight network models, Due to the limited representation of the model, it is often unable to capture enough feature information for image classification tasks, resulting in performance degradation. Attention mechanisms can effectively improve the expressiveness of models, but most attention modules in recent studies are designed to be complex to achieve better performance. We expect to learn high-level semantic features with little model complexity, the paper introduces the Spatial Group & Cross-channel Attention (SGCA) module. The SGCA module strategically increases only the number of parameters that can be safely overlooked, concurrently delivering significant performance improvements to lightweight network models. Our experiments validate the efficacy of the SGCA module through interpretability assessments, image classification evaluations, and ablation experiments. Despite being trained under category supervision, the SGCA module adeptly captures active regions encompassing various high-order semantic features. These features include focusing features and contour features of the object. When integrated into existing lightweight backbone networks, the SGCA module demonstrates a remarkable enhancement in task performance. Specifically, when applied to ResNet18, MobileNetv2, MobileNetv3, EfficientNetv1, and EfficientNetv2, the SGCA module yields improvements in task performance ranging from 0.3% to 8.93%. https://github.com/Zhengzech enzzc/Spatial-Group-Cross-channel-Attention-SGCA-