U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation

Yaopeng Peng,Milan Sonka,Danny Z. Chen
2024-03-31
Abstract:In this paper, we introduce U-Net v2, a new robust and efficient U-Net variant for medical image segmentation. It aims to augment the infusion of semantic information into low-level features while simultaneously refining high-level features with finer details. For an input image, we begin by extracting multi-level features with a deep neural network encoder. Next, we enhance the feature map of each level by infusing semantic information from higher-level features and integrating finer details from lower-level features through Hadamard product. Our novel skip connections empower features of all the levels with enriched semantic characteristics and intricate details. The improved features are subsequently transmitted to the decoder for further processing and segmentation. Our method can be seamlessly integrated into any Encoder-Decoder network. We evaluate our method on several public medical image segmentation datasets for skin lesion segmentation and polyp segmentation, and the experimental results demonstrate the segmentation accuracy of our new method over state-of-the-art methods, while preserving memory and computational efficiency. Code is available at: <a class="link-external link-https" href="https://github.com/yaoppeng/U-Net_v2" rel="external noopener nofollow">this https URL</a>
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address issues in medical image segmentation, particularly in tasks of skin lesion segmentation and polyp segmentation. Specifically, the paper proposes U-Net v2, an improved version of the U-Net architecture, which enhances the semantic information of low-level features and refines the details of high-level features by rethinking and improving the skip connection mechanism. Although the traditional U-Net performs well in medical image segmentation, it has shortcomings in fusing low-level and high-level features. U-Net v2 introduces a new skip connection module (SDI module) that explicitly injects the semantic information of high-level features and the detail information of low-level features into each level of feature maps using Hadamard product, thereby improving segmentation performance. Experimental results show that on multiple public datasets, U-Net v2 significantly improves metrics such as Dice Similarity Coefficient (DSC) and Intersection over Union (IoU) compared to existing methods, while maintaining low floating-point operations (FLOPs) and GPU memory consumption, demonstrating high computational efficiency. Additionally, the paper conducts ablation studies to verify the effectiveness of the SDI module.