RGAM: A refined global attention mechanism for medical image segmentation

Gangjun Ning,Pingping Liu,Chuangye Dai,Mingsi Sun,Qiuzhan Zhou,Qingliang Li
DOI: https://doi.org/10.1049/cvi2.12323
IF: 1.484
2024-12-01
IET Computer Vision
Abstract:In this paper, the authors propose a refined global attention module (RGAM) to address known shortcomings of existing attention mechanisms. To validate the effectiveness and generality of the proposed method, the authors conducted extensive experiments on four distinct medical image segmentation datasets. The results demonstrate the effectiveness of RGAM in achieving state‐of‐the‐art performance compared to existing methods. Attention mechanisms are popular techniques in computer vision that mimic the ability of the human visual system to analyse complex scenes, enhancing the performance of convolutional neural networks (CNN). In this paper, the authors propose a refined global attention module (RGAM) to address known shortcomings of existing attention mechanisms: (1) Traditional channel attention mechanisms are not refined enough when concentrating features, which may lead to overlooking important information. (2) The 1‐dimensional attention map generated by traditional spatial attention mechanisms make it difficult to accurately summarise the weights of all channels in the original feature map at the same position. The RGAM is composed of two parts: refined channel attention and refined spatial attention. In the channel attention part, the authors used multiple weight‐shared dilated convolutions with varying dilation rates to perceive features with different receptive fields at the feature compression stage. The authors also combined dilated convolutions with depth‐wise convolution to reduce the number of parameters. In the spatial attention part, the authors grouped the feature maps and calculated the attention for each group independently, allowing for a more accurate assessment of each spatial position's importance. Specifically, the authors calculated the attention weights separately for the width and height directions, similar to SENet, to obtain more refined attention weights. To validate the effectiveness and generality of the proposed method, the authors conducted extensive experiments on four distinct medical image segmentation datasets. The results demonstrate the effectiveness of RGAM in achieving state‐of‐the‐art performance compared to existing methods.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?