Abstract:Medical image segmentation is critical for efficient diagnosis of diseases and treatment planning. In recent years, convolutional neural networks (CNN)-based methods, particularly U-Net and its variants, have achieved remarkable results on medical image segmentation tasks. However, they do not always work consistently on images with complex structures and large variations in regions of interest (ROI). This could be due to the fixed geometric structure of the receptive fields used for feature extraction and repetitive down-sampling operations that lead to information loss. To overcome these problems, the standard U-Net architecture is modified in this work by replacing the convolution block with a dilated convolution block to extract multi-scale context features with varying sizes of receptive fields, and adding a dilated inception block between the encoder and decoder paths to alleviate the problem of information recession and the semantic gap between features. Furthermore, the input of each dilated convolution block is added to the output through a squeeze and excitation unit, which alleviates the vanishing gradient problem and improves overall feature representation by re-weighting the channel-wise feature responses. The original inception block is modified by reducing the size of the spatial filter and introducing dilated convolution to obtain a larger receptive field. The proposed network was validated on three challenging medical image segmentation tasks with varying size ROIs: lung segmentation on chest X-ray (CXR) images, skin lesion segmentation on dermoscopy images and nucleus segmentation on microscopy cell images. Improved performance compared to state-of-the-art techniques demonstrates the effectiveness and generalisability of the proposed Dilated Convolution and Inception blocks-based U-Net (DCI-UNet).

w-Net: Dual Supervised Medical Image Segmentation Model with Multi-Dimensional Attention and Cascade Multi-Scale Convolution

W-Net: Dual Supervised Medical Image Segmentation Model with Multi-Dimensional Attention and Cascade Multi-Scale Convolution

[Multi-scale medical image segmentation based on pixel encoding and spatial attention mechanism]

ω-net: Dual supervised medical image segmentation with multi-dimensional self-attention and diversely-connected multi-scale convolution

A multi-attention and depthwise separable convolution network for medical image segmentation

DMAGNet: Dual‐path Multi‐scale Attention Guided Network for Medical Image Segmentation

SA-Net: A scale-attention network for medical image segmentation

CM-SegNet: A Deep Learning-Based Automatic Segmentation Approach for Medical Images by Combining Convolution and Multilayer Perceptron

2-D general network based on channel-space attention for medical image segmentation

DmADs-Net: Dense multiscale attention and depth-supervised network for medical image segmentation

Attention Based Multi-Scale Nested Network for Biomedical Image Segmentation

BEA-Net: Body and Edge Aware Network with Multi-Scale Short-Term Concatenation for Medical Image Segmentation.

Few-Shot Medical Image Segmentation with Large Kernel Attention

Multi-level Feature Attention Network for Medical Image Segmentation

A Multi-Scale Context Aware Attention Model for Medical Image Segmentation

LM-Net: A light-weight and multi-scale network for medical image segmentation

MSDEnet: Multi-scale detail enhanced network based on human visual system for medical image segmentation

EFCNet: Every Feature Counts for Small Medical Object Segmentation

An attention mechanism and multi-feature fusion network for medical image segmentation

A Multi-Scaled Receptive Field Learning Approach for Medical Image Segmentation

kCBAC-Net: Deeply Supervised Complete Bipartite Networks with Asymmetric Convolutions for Medical Image Segmentation