Abstract:Objective. In recent years, methods based on U-shaped structure and skip connection have achieved remarkable results in many medical semantic segmentation tasks. However, the information integration capability of this structure is still limited due to the incompatibility of feature maps of encoding and decoding stages at corresponding levels and lack of extraction of valid information in the final stage of encoding. This structural defect is particularly obvious in segmentation tasks with non-obvious, small and blurred-edge targets. Our objective is to design a novel segmentation network to solve the above problems. Approach. The segmentation network named Global Context-Aware Network (GCANet) is mainly designed by inserting a Multi-feature Collaboration Adaptation (MCA) module, a Scale-Aware Mining (SAM) module and an Edge-enhanced Pixel Intensity Mapping (Edge-PIM) into the U-shaped structure. Firstly, the MCA module can integrate information from all encoding stages and then effectively acts on the decoding stages, solving the problem of information loss during downsampling and pooling. Secondly, the SAM module can further mine information from the encoded high-level features to enrich the information passed to the decoding stage. Thirdly, Edge-PIM can further refine the segmentation results by edge enhancement. Main results. We newly collect Magnetic Resonance Imaging of Colorectal Cancer Liver Metastases (MRI-CRLM) dataset in different imaging sequences with non-obvious, small and blurred-edge liver metastases. Our method performs well on the MRI-CRLM dataset and the publicly available ISIC-2018 dataset, outperforming state-of-the-art methods such as CPFNet on multiple metrics after boxplot analysis, indicating that it can perform well on a wide range of medical image segmentation tasks. Significance. The proposed method solves the problem mentioned above and improved segmentation accuracy for non-obvious, small and blurred-edge targets. Meanwhile, the proposed visualization method Edge-PIM can make the edge more prominent, which can assist medical radiologists in their research work well.

Unified semantic model for medical image segmentation

Mixed Transformer U-Net for Medical Image Segmentation

[Multi-scale medical image segmentation based on pixel encoding and spatial attention mechanism]

SeUNet-Trans: A Simple yet Effective UNet-Transformer Model for Medical Image Segmentation

UniSeg: A Prompt-driven Universal Segmentation Model as well as A Strong Representation Learner

VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation

Swin-TransUper: Swin Transformer-based UperNet for medical image segmentation

MSR-UNet: enhancing multi-scale and long-range dependencies in medical image segmentation

DA-TransUNet: Integrating Spatial and Channel Dual Attention with Transformer U-Net for Medical Image Segmentation

DS-TransUNet:Dual Swin Transformer U-Net for Medical Image Segmentation

MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal Model

SAM-UNet:Enhancing Zero-Shot Segmentation of SAM for Universal Medical Images

Collaborative multi-feature extraction and scale-aware semantic information mining for medical image segmentation

MedSeq: Semantic Segmentation for Medical Image Sequences

Linear semantic transformation for semi-supervised medical image segmentation

Sfe-Transunet: A Transformer-Based U-Net With Skipped Features Enhancer For Medical Image Segmentation

DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation

UniverSeg: Universal Medical Image Segmentation

From CNN to Transformer: A Review of Medical Image Segmentation Models

EG-TransUNet: a transformer-based U-Net with enhanced and guided models for biomedical image segmentation

A Hybrid Enhanced Attention Transformer Network for Medical Ultrasound Image Segmentation