Abstract:BackgroundU-Net and its variations have achieved remarkable performances in medical image segmentation. However, they have two limitations. First, the shallow layer feature of the encoder always contains background noise. Second, semantic gaps exist between the features of the encoder and the decoder. Skip-connections directly connect the encoder to the decoder, which will lead to the fusion of semantically dissimilar feature maps. PurposeTo overcome these two limitations, this paper proposes a novel medical image segmentation algorithm, called feature-guided attention network, which consists of U-Net, the cross-level attention filtering module (CAFM), and the attention-guided upsampling module (AUM). MethodsIn the proposed method, the AUM and the CAFM were introduced into the U-Net, where the AUM learns to filter the background noise in the low-level feature map of the encoder and the CAFM tries to eliminate the semantic gap between the encoder and the decoder. Specifically, the AUM adopts a top-down pathway to use the high-level feature map so as to filter the background noise in the low-level feature map of the encoder. The AUM uses the encoder features to guide the upsampling of the corresponding decoder features, thus eliminating the semantic gap between them. Four medical image segmentation tasks, including coronary atherosclerotic plaque segmentation (Dataset A), retinal vessel segmentation (Dataset B), skin lesion segmentation (Dataset C), and multiclass retinal edema lesions segmentation (Dataset D), were used to validate the proposed method. ResultsFor Dataset A, the proposed method achieved higher Intersection over Union (IoU) (67.91 +/- 3.82%$67.91\pm 3.82\%$), dice (79.39 +/- 3.37%$79.39\pm 3.37\%$), accuracy (98.39 +/- 0.34%$98.39\pm 0.34\%$), and sensitivity (85.10 +/- 3.74%$85.10\pm 3.74\%$) than the previous best method: CA-Net. For Dataset B, the proposed method achieved higher sensitivity (83.50%) and accuracy (97.55%) than the previous best method: SCS-Net. For Dataset C, the proposed method had highest IoU (83.47 +/- 0.41%$83.47\pm 0.41\%$) and dice (90.81 +/- 0.34%$90.81\pm 0.34\%$) than those of all compared previous methods. For Dataset D, the proposed method had highest dice (average: 81.53%; retina edema area [REA]: 83.78%; pigment epithelial detachment [PED] 77.13%), sensitivity (REA: 89.01%; SRF: 85.50%), specificity (REA: 99.35%; PED: 100.00), and accuracy (98.73%) among all compared previous networks. In addition, the number of parameters of the proposed method was 2.43 M, which is less than CA-Net (3.21 M) and CPF-Net (3.07 M). ConclusionsThe proposed method demonstrated state-of-the-art performance, outperforming other top-notch medical image segmentation algorithms. The CAFM filtered the background noise in the low-level feature map of the encoder, while the AUM eliminated the semantic gap between the encoder and the decoder. Furthermore, the proposed method was of high computational efficiency.

A hybrid approach for improving U-Net variants in medical image segmentation

Hybrid dilation and attention residual U-Net for medical image segmentation

Performance Analysis of UNet and Variants for Medical Image Segmentation

A multi-attention and depthwise separable convolution network for medical image segmentation

A Medical Image Segmentation Algorithm Based on Spatial and Channel Attention Mechanisms and Data Augmentation

U-Net and its variants for Medical Image Segmentation : A short review

U-Net in Medical Image Segmentation: A Review of Its Applications Across Modalities

Residual-Attention UNet++: A Nested Residual-Attention U-Net for Medical Image Segmentation

Edge-Boosted U-Net For 2d Medical Image Segmentation

DoubleU-NetPlus: A Novel Attention and Context Guided Dual U-Net with Multi-Scale Residual Feature Fusion Network for Semantic Segmentation of Medical Images

U-Net-Based Medical Image Segmentation

Feature-guided Attention Network for Medical Image Segmentation.

LHU-Net: A Light Hybrid U-Net for Cost-Efficient, High-Performance Volumetric Medical Image Segmentation

U-Net and its variants for medical image segmentation: theory and applications

Context-Based Deep Residual Learning for Medical Image Segmentation

DCSAU-Net: A deeper and more compact split-attention U-Net for medical image segmentation

A Multi-Scale Context Aware Attention Model for Medical Image Segmentation

Attention U-Net Based on Bi-ConvLSTM and Its Optimization for Smart Healthcare

Multimodal Biomedical Image Segmentation using Multi-Dimensional U-Convolutional Neural Network

U-Net and Its Variants for Medical Image Segmentation: A Review of Theory and Applications

UNet based on dynamic convolution decomposition and triplet attention