Abstract:Segmentation is an important task in a wide range of computer vision applications, including medical image analysis. Recent years have seen an increase in the complexity of medical image segmentation approaches based on sophisticated convolutional neural network architectures. This progress has led to incremental enhancements in performance on widely recognised benchmark datasets. However, most of the existing approaches are computationally demanding, which limits their practical applicability. This paper presents an expand-squeeze dual multiscale residual network (ESDMR-Net), which is a full y convolutional network that is particularly well-suited for resource-constrained computing hardware such as mobile devices. ESDMR-Net focusses on extracting multiscale features, enabling the learning of contextual dependencies among semantically distinct features. The ESDMR-Net architecture allows dual-stream information flow within encoder–decoder pairs. The expansion operation (depthwise separable convolution) makes all of the rich features with multiscale information available to the squeeze operation (bottleneck layer), which then extracts the necessary information for the segmentation task. The Expand-Squeeze (ES) block helps the network pay more attention to under-represented classes, which contributes to improved segmentation accuracy. To enhance the flow of information across multiple resolutions or scales, we integrated dual multiscale residual (DMR) blocks into the skip connection. This integration enables the decoder to access features from various levels of abstraction, ultimately resulting in more comprehensive feature representations. We present experiments on seven datasets from five distinct examples of applications: segmentation of retinal vessels ( 2× ), skin lesions ( 2× ), digestive tract polyps, lung regions, and cells. Our model demonstrates strong performance, with an F1 score of 0.8287%, 0.8211%, 0.9034%, 0.9451%, 0.9543%, 0.9840%, and 0.8424% on the DRIVE, CHASE, ISIC2017, ISIC2016, CVC-ClinicDB, MC and MoNuSeg datasets, respectively. Remarkably, our model achieves these results despite having significantly fewer trainable parameters, with a reduction of two or even three orders of magnitude.

Scale-wise Discriminative Region Learning for Medical Image Segmentation

MsVRL: Self-Supervised Multiscale Visual Representation Learning Via Cross-Level Consistency for Medical Image Segmentation

ScaleFormer: Revisiting the Transformer-based Backbones from a Scale-wise Perspective for Medical Image Segmentation.

SLViT: Scale-Wise Language-Guided Vision Transformer for Referring Image Segmentation.

MMViT-Seg: A Lightweight Transformer and CNN Fusion Network for COVID-19 Segmentation.

ScaleNet: Rethinking Feature Interaction from a Scale-Wise Perspective for Medical Image Segmentation.

SA-Net: A scale-attention network for medical image segmentation

MS-TCNet: An effective Transformer–CNN combined network using multi-scale feature learning for 3D medical image segmentation

LM-Net: A light-weight and multi-scale network for medical image segmentation

RSKD: Enhanced medical image segmentation via multi-layer, rank-sensitive knowledge distillation in Vision Transformer models

CascadeMedSeg: integrating pyramid vision transformer with multi-scale fusion for precise medical image segmentation

A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer

MSR-UNet: enhancing multi-scale and long-range dependencies in medical image segmentation

A More Design-Flexible Medical Transformer for Volumetric Image Segmentation.

[Multi-scale medical image segmentation based on pixel encoding and spatial attention mechanism]

GCFormer: Multi-scale Feature Plays a Crucial Role in Medical Images Segmentation

MSDEnet: Multi-scale detail enhanced network based on human visual system for medical image segmentation

ShiftTransUNet: An Efficient Deep Learning Model for Medical Image Segmentation Using ShiftViT Framework

W-Net: Dual Supervised Medical Image Segmentation Model with Multi-Dimensional Attention and Cascade Multi-Scale Convolution

An Effective Dual-Scale Hybrid Encoder Network for Medical Image Segmentation

ESDMR-Net: A lightweight network with expand-squeeze and dual multiscale residual connections for medical image segmentation