SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

Zhaohu Xing,Tian Ye,Yijun Yang,Guang Liu,Lei Zhu
2024-09-15
Abstract:The Transformer architecture has shown a remarkable ability in modeling global relationships. However, it poses a significant computational challenge when processing high-dimensional medical images. This hinders its development and widespread adoption in this task. Mamba, as a State Space Model (SSM), recently emerged as a notable manner for long-range dependencies in sequential modeling, excelling in natural language processing filed with its remarkable memory efficiency and computational speed. Inspired by its success, we introduce SegMamba, a novel 3D medical image \textbf{Seg}mentation \textbf{Mamba} model, designed to effectively capture long-range dependencies within whole volume features at every scale. Our SegMamba, in contrast to Transformer-based methods, excels in whole volume feature modeling from a state space model standpoint, maintaining superior processing speed, even with volume features at a resolution of {$64\times 64\times 64$}. Comprehensive experiments on the BraTS2023 dataset demonstrate the effectiveness and efficiency of our SegMamba. The code for SegMamba is available at: <a class="link-external link-https" href="https://github.com/ge-xing/SegMamba" rel="external noopener nofollow">this https URL</a>
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the issues in 3D medical image segmentation, where existing methods face heavy computational burdens and difficulties in effectively modeling long-range dependencies when handling high-dimensional medical images. Specifically: 1. **Computational Burden**: Although existing Transformer-based methods can extract global information, the quadratic complexity of their self-attention mechanism leads to significant computational overhead, especially when processing high-resolution 3D medical images. 2. **Modeling Long-Range Dependencies**: Traditional CNN methods struggle to effectively model global relationships due to the locality of convolutional layers. While existing Transformer methods can model global information, they are inefficient when dealing with long sequences. To address these issues, the authors introduce SegMamba, a novel 3D medical image segmentation framework that combines Mamba (a state-space model). SegMamba effectively captures long-range dependencies in volumetric data while maintaining efficiency through the design of the Three-way Mamba module (ToM), Gated Spatial Convolution module (GSC), and Feature-level Uncertainty Estimation module (FUE). Additionally, the authors have constructed a new large-scale 3D colorectal cancer segmentation dataset (CRC-500) to support related research and benchmarking. Experimental results demonstrate that SegMamba performs excellently across multiple datasets, exhibiting high segmentation accuracy and inference efficiency.