Abstract:Objectives: Accurate extraction of regions of interest (ROI) with variable shapes and scales is one of the primary challenges in medical image segmentation. Current U-based networks mostly aggregate multi-stage encoding outputs as an improved multi-scale skip connection. Although this design has been proven to provide scale diversity and contextual integrity, there remain several intuitive limits: (i) the encoding outputs are resampled to the same size simply, which destruct the fine-grained information. The advantages of utilization of multiple scales are insufficient. (ii) Certain redundant information proportional to the feature dimension size is introduced and causes multi-stage interference. And (iii) the precision of information delivery relies on the up-sampling and down-sampling layers, but guidance on maintaining consistency in feature locations and trends between them is lacking. Methods: To improve these situations, this paper proposed a U-based CNN network named HAD-Net, by assembling a new hyper-scale shifted aggregating module (HSAM) paradigm and progressive reusing attention (PRA) for skip connections, as well as employing a novel pair of dual-branch parameter-free sampling layers, i.e. max-diagonal pooling (MDP) and max-diagonal un-pooling (MDUP). That is, the aggregating scheme additionally combines five subregions with certain offsets in the shallower stage. Since the lower scale-down ratios of subregions enrich scales and fine-grain context. Then, the attention scheme contains a partial-to-global channel attention (PGCA) and a multi-scale reusing spatial attention (MRSA), it builds reusing connections internally and adjusts the focus on more useful dimensions. Finally, MDP and MDUP are explored in pairs to improve texture delivery and feature consistency, enhancing information retention and avoiding positional confusion. Results: Compared to state-of-the-art networks, HAD-Net has achieved comparable and even better performances with Dice of 90.13%, 81.51%, and 75.43% for each class on BraTS20, 89.59% Dice and 98.56% AUC on Kvasir-SEG, as well as 82.17% Dice and 98.05% AUC on DRIVE. Conclusions: The scheme of HSAM+PRA+MDP+MDUP has been proven to be a remarkable improvement and leaves room for further research.

HEA-Net: Attention and MLP Hybrid Encoder Architecture for Medical Image Segmentation

Mixed Transformer U-Net for Medical Image Segmentation

Transformer and group parallel axial attention co-encoder for medical image segmentation

Dual-attention transformer-based hybrid network for multi-modal medical image segmentation

MHA-Net: A Multibranch Hybrid Attention Network for Medical Image Segmentation

MAFUNet: Multi-Attention Fusion Network for Medical Image Segmentation

UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation

MEA-Net: multilayer edge attention network for medical image segmentation

TSCA-Net: Transformer based spatial-channel attention segmentation network for medical images

HD 2 A-Net: A novel dual gated attention network using comprehensive hybrid dilated convolutions for medical image segmentation

MDA-Unet: A Multi-Scale Dilated Attention U-Net for Medical Image Segmentation

HMT-Net: Transformer and MLP Hybrid Encoder for Skin Disease Segmentation

U-MLP: MLP-based ultralight refinement network for medical image segmentation

DECTNet: Dual Encoder Network combined convolution and Transformer architecture for medical image segmentation

IEA-Net: Internal and External Dual-Attention Medical Segmentation Network with High-Performance Convolutional Blocks

HAD-Net: an Attention U-based Network with Hyper-Scale Shifted Aggregating and Max-Diagonal Sampling for Medical Image Segmentation

TransDAE: Dual Attention Mechanism in a Hierarchical Transformer for Efficient Medical Image Segmentation

EPT-Net: Edge Perception Transformer for 3D Medical Image Segmentation

DAU-Net: A medical image segmentation network combining the Hadamard product and dual scale attention gate

Sfe-Transunet: A Transformer-Based U-Net With Skipped Features Enhancer For Medical Image Segmentation