Abstract:BackgroundU-Net and its variations have achieved remarkable performances in medical image segmentation. However, they have two limitations. First, the shallow layer feature of the encoder always contains background noise. Second, semantic gaps exist between the features of the encoder and the decoder. Skip-connections directly connect the encoder to the decoder, which will lead to the fusion of semantically dissimilar feature maps. PurposeTo overcome these two limitations, this paper proposes a novel medical image segmentation algorithm, called feature-guided attention network, which consists of U-Net, the cross-level attention filtering module (CAFM), and the attention-guided upsampling module (AUM). MethodsIn the proposed method, the AUM and the CAFM were introduced into the U-Net, where the AUM learns to filter the background noise in the low-level feature map of the encoder and the CAFM tries to eliminate the semantic gap between the encoder and the decoder. Specifically, the AUM adopts a top-down pathway to use the high-level feature map so as to filter the background noise in the low-level feature map of the encoder. The AUM uses the encoder features to guide the upsampling of the corresponding decoder features, thus eliminating the semantic gap between them. Four medical image segmentation tasks, including coronary atherosclerotic plaque segmentation (Dataset A), retinal vessel segmentation (Dataset B), skin lesion segmentation (Dataset C), and multiclass retinal edema lesions segmentation (Dataset D), were used to validate the proposed method. ResultsFor Dataset A, the proposed method achieved higher Intersection over Union (IoU) (67.91 +/- 3.82%$67.91\pm 3.82\%$), dice (79.39 +/- 3.37%$79.39\pm 3.37\%$), accuracy (98.39 +/- 0.34%$98.39\pm 0.34\%$), and sensitivity (85.10 +/- 3.74%$85.10\pm 3.74\%$) than the previous best method: CA-Net. For Dataset B, the proposed method achieved higher sensitivity (83.50%) and accuracy (97.55%) than the previous best method: SCS-Net. For Dataset C, the proposed method had highest IoU (83.47 +/- 0.41%$83.47\pm 0.41\%$) and dice (90.81 +/- 0.34%$90.81\pm 0.34\%$) than those of all compared previous methods. For Dataset D, the proposed method had highest dice (average: 81.53%; retina edema area [REA]: 83.78%; pigment epithelial detachment [PED] 77.13%), sensitivity (REA: 89.01%; SRF: 85.50%), specificity (REA: 99.35%; PED: 100.00), and accuracy (98.73%) among all compared previous networks. In addition, the number of parameters of the proposed method was 2.43 M, which is less than CA-Net (3.21 M) and CPF-Net (3.07 M). ConclusionsThe proposed method demonstrated state-of-the-art performance, outperforming other top-notch medical image segmentation algorithms. The CAFM filtered the background noise in the low-level feature map of the encoder, while the AUM eliminated the semantic gap between the encoder and the decoder. Furthermore, the proposed method was of high computational efficiency.

Attention Based Multi-Scale Nested Network for Biomedical Image Segmentation

DmADs-Net: Dense multiscale attention and depth-supervised network for medical image segmentation

A Multi-Scale Context Aware Attention Model for Medical Image Segmentation

DCA: Densely Cross-scale Attention Network for Anatomically-plausible Medical Image Segmentation.

MSAANet: Multi-scale Axial Attention Network for Medical Image Segmentation.

CMM-Net: Contextual multi-scale multi-level network for efficient biomedical image segmentation

Attention-modulated Multi-Branch Convolutional Neural Networks for Neonatal Brain Tissue Segmentation

A novel medical image segmentation approach by using multi-branch segmentation network based on local and global information synchronous learning

Multi-level Feature Attention Network for Medical Image Segmentation

ConvMedSegNet:A multi-receptive field depthwise convolutional neural network for medical image segmentation

HAD-Net: an Attention U-based Network with Hyper-Scale Shifted Aggregating and Max-Diagonal Sampling for Medical Image Segmentation

BEA-Net: Body and Edge Aware Network with Multi-Scale Short-Term Concatenation for Medical Image Segmentation.

AMCNet: Attention-Based Multiscale Convolutional Network for DCM MRI Segmentation

Attention Mechanism Trained with Small Datasets for Biomedical Image Segmentation

Feature-guided Attention Network for Medical Image Segmentation.

MADR-Net: multi-level attention dilated residual neural network for segmentation of medical images

SA-Net: A scale-attention network for medical image segmentation

MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention

w-Net: Dual Supervised Medical Image Segmentation Model with Multi-Dimensional Attention and Cascade Multi-Scale Convolution

NMNet: Learning Multi-level Semantic Information from Scale Extension Domain for Improved Medical Image Segmentation

W-Net: Dual Supervised Medical Image Segmentation Model with Multi-Dimensional Attention and Cascade Multi-Scale Convolution