SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

Zhaohu Xing,Tian Ye,Yijun Yang,Guang Liu,Lei Zhu

2024-09-15

Abstract:The Transformer architecture has shown a remarkable ability in modeling global relationships. However, it poses a significant computational challenge when processing high-dimensional medical images. This hinders its development and widespread adoption in this task. Mamba, as a State Space Model (SSM), recently emerged as a notable manner for long-range dependencies in sequential modeling, excelling in natural language processing filed with its remarkable memory efficiency and computational speed. Inspired by its success, we introduce SegMamba, a novel 3D medical image \textbf{Seg}mentation \textbf{Mamba} model, designed to effectively capture long-range dependencies within whole volume features at every scale. Our SegMamba, in contrast to Transformer-based methods, excels in whole volume feature modeling from a state space model standpoint, maintaining superior processing speed, even with volume features at a resolution of {$64\times 64\times 64$}. Comprehensive experiments on the BraTS2023 dataset demonstrate the effectiveness and efficiency of our SegMamba. The code for SegMamba is available at: <a class="link-external link-https" href="https://github.com/ge-xing/SegMamba" rel="external noopener nofollow">this https URL</a>

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper attempts to address the issues in 3D medical image segmentation, where existing methods face heavy computational burdens and difficulties in effectively modeling long-range dependencies when handling high-dimensional medical images. Specifically: 1. **Computational Burden**: Although existing Transformer-based methods can extract global information, the quadratic complexity of their self-attention mechanism leads to significant computational overhead, especially when processing high-resolution 3D medical images. 2. **Modeling Long-Range Dependencies**: Traditional CNN methods struggle to effectively model global relationships due to the locality of convolutional layers. While existing Transformer methods can model global information, they are inefficient when dealing with long sequences. To address these issues, the authors introduce SegMamba, a novel 3D medical image segmentation framework that combines Mamba (a state-space model). SegMamba effectively captures long-range dependencies in volumetric data while maintaining efficiency through the design of the Three-way Mamba module (ToM), Gated Spatial Convolution module (GSC), and Feature-level Uncertainty Estimation module (FUE). Additionally, the authors have constructed a new large-scale 3D colorectal cancer segmentation dataset (CRC-500) to support related research and benchmarking. Experimental results demonstrate that SegMamba performs excellently across multiple datasets, exhibiting high segmentation accuracy and inference efficiency.

SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

MambaClinix: Hierarchical Gated Convolution and Mamba-Based U-Net for Enhanced 3D Medical Image Segmentation

HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image Segmentation

Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation

MedSegMamba: 3D CNN-Mamba Hybrid Architecture for Brain Segmentation

Taming Mambas for Voxel Level 3D Medical Image Segmentation

VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation

HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation

SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors

EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation

LPAM: A lightweight medical segmentation network based on Mamba improved by prompt attention

nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model

LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation

SegFormer3D: an Efficient Transformer for 3D Medical Image Segmentation

A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond

VM-UNet: Vision Mamba UNet for Medical Image Segmentation

MedMamba: Vision Mamba for Medical Image Classification

Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images