Abstract: Since the advent of deep learning, convolutional neural networks (CNNs) and vision transformers (ViTs) have been extensively studied and widely used in medical image classification. However, the limited ability of CNNs to model long-range dependencies constrains their classification performance. ViTs, in contrast, are hampered by the quadratic computational complexity of their self-attention mechanism, which makes them difficult to deploy in real-world settings with limited computational resources. Recent studies have shown that state space models (SSMs), represented by Mamba, can effectively model long-range dependencies while maintaining linear computational complexity. Inspired by this, we propose MedMamba, the first Vision Mamba for generalized medical image classification. Concretely, we introduce a novel hybrid basic block named SS-Conv-SSM, which combines convolutional layers for extracting local features with the ability of SSMs to capture long-range dependencies, aiming to model medical images from different imaging modalities efficiently. By employing a grouped convolution strategy and a channel-shuffle operation, MedMamba reduces the number of model parameters and the computational burden without sacrificing accuracy. We thoroughly evaluate MedMamba on 16 datasets covering ten imaging modalities and 411,007 images. Experimental results show that MedMamba achieves competitive performance on most tasks compared with state-of-the-art methods. This work aims to explore the potential of Vision Mamba and to establish a new baseline for medical image classification, thereby providing valuable insights for developing more powerful Mamba-based artificial intelligence algorithms and applications in medicine.
The source code and all pre-trained weights of MedMamba are available at https://github.com/YubiaoYue/MedMamba.
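To make the grouped convolution and channel-shuffle ideas mentioned in the abstract concrete, the following is a minimal sketch in plain Python. It is not taken from the MedMamba code base: the function names `grouped_conv_params` and `channel_shuffle` are hypothetical, the parameter count ignores bias terms, and the shuffle follows the commonly used reshape-transpose-flatten formulation, which is assumed here rather than confirmed by the abstract.

```python
def grouped_conv_params(c_in, c_out, kernel_size, groups=1):
    """Weight count of a 2D convolution with square kernels (bias ignored).

    Each of the `groups` groups maps c_in/groups input channels to
    c_out/groups output channels, so the count shrinks by a factor of `groups`.
    """
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    return kernel_size * kernel_size * (c_in // groups) * c_out


def channel_shuffle(channels, groups):
    """Interleave channels across groups.

    Equivalent to reshaping the channel axis to (groups, n // groups),
    transposing, and flattening, so that each output group mixes
    channels that came from every input group.
    """
    n = len(channels)
    if n % groups:
        raise ValueError("number of channels must be divisible by groups")
    per_group = n // groups
    return [channels[g * per_group + i]
            for i in range(per_group)
            for g in range(groups)]


# Illustrative sizes only (assumed, not from the paper).
dense = grouped_conv_params(64, 64, 3, groups=1)
grouped = grouped_conv_params(64, 64, 3, groups=4)
shuffled = channel_shuffle(list(range(6)), groups=2)
```

With these illustrative sizes, the grouped layer uses a quarter of the weights of the dense layer. Without the shuffle, information would stay inside each group across stacked grouped layers, which is the usual motivation for pairing the two operations.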

MedMamba: Vision Mamba for Medical Image Classification

MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Mamba in Vision: A Comprehensive Survey of Techniques and Applications

VMC‐UNet: A Vision Mamba‐CNN U‐Net for Tumor Segmentation in Breast Ultrasound Image

Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation

A Survey on Vision Mamba: Models, Applications and Challenges

Medical Image Classification with a Hybrid SSM Model Based on CNN and Transformer

VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation

BUViTNet: Breast Ultrasound Detection via Vision Transformers

Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining

A Survey on Visual Mamba

Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans

A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond

VMamba: Visual State Space Model

Vision transformer-convolution for breast cancer classification using mammography images: A comparative study

Visual Mamba: A Survey and New Outlooks

HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image Segmentation