From CNN to Transformer: A Review of Medical Image Segmentation Models

Wenjian Yao, Jiajun Bai, Wei Liao, Yuheng Chen, Mengjuan Liu, Yao Xie

2023-08-10

Abstract: Medical image segmentation is an important step in medical image analysis, especially as a crucial prerequisite for efficient disease diagnosis and treatment. The use of deep learning for image segmentation has become a prevalent trend. The widely adopted approach currently is U-Net and its variants. Additionally, with the remarkable success of pre-trained models in natural language processing tasks, transformer-based models like TransUNet have achieved desirable performance on multiple medical image segmentation datasets. In this paper, we conduct a survey of the most representative four medical image segmentation models in recent years. We theoretically analyze the characteristics of these models and quantitatively evaluate their performance on two benchmark datasets (i.e., Tuberculosis Chest X-rays and ovarian tumors). Finally, we discuss the main challenges and future trends in medical image segmentation. Our work can assist researchers in the related field to quickly establish medical segmentation models tailored to specific regions.

Image and Video Processing, Computer Vision and Pattern Recognition, Machine Learning

What problem does this paper attempt to address?

The paper addresses medical image segmentation, focusing on how deep learning techniques can improve segmentation accuracy and efficiency. Specifically, it covers the following aspects:

1. **Importance of Medical Image Segmentation**: Medical image segmentation is a crucial step in medical image analysis and plays a vital role in the efficient diagnosis and treatment of diseases.
2. **Limitations of Existing Methods**: Traditional image segmentation methods rely on manual feature extraction or handcrafted algorithms based on image processing techniques. These methods are limited in efficiency and accuracy when dealing with large volumes of complex medical images.
3. **Application of Deep Learning in Image Segmentation**: In recent years, deep learning-based image segmentation methods have become the mainstream trend. Convolutional Neural Networks (CNNs) and their variants, such as U-Net, are widely adopted. However, CNNs have limited capability in handling long-range dependencies.
4. **Application of Transformer Models**: Following the success of pre-trained models in natural language processing tasks, Transformer-based models have also been introduced into medical image segmentation, showing excellent performance.
5. **Main Contributions of the Paper**: The paper reviews four of the most representative medical image segmentation models of recent years: U-Net, UNet++, TransUNet, and Swin-Unet. It theoretically analyzes the characteristics of these models and quantitatively evaluates their performance on two benchmark datasets (chest X-ray images of tuberculosis and ovarian tumor images).

Additionally, the paper discusses the main challenges and future development trends in medical image segmentation, and it provides all experimental code and detailed model configuration parameters to help researchers in related fields quickly understand and apply these models. In summary, the paper aims to advance medical image segmentation technology through a comprehensive evaluation of current representative models, particularly by using Transformer structures to improve segmentation accuracy and model generalization.
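The summary above does not say which metrics the paper uses for its quantitative evaluation. As an illustration only, the sketch below assumes two overlap measures that are common in segmentation work, the Dice coefficient and Intersection over Union (IoU), computed on binary masks. The function name `dice_and_iou`, the toy masks, and the empty-mask convention are hypothetical and are not taken from the paper.

```python
def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks given as nested lists of 0/1."""
    # Flatten both masks so they can be compared pixel by pixel.
    p = [int(v) for row in pred for v in row]
    t = [int(v) for row in target for v in row]
    if len(p) != len(t):
        raise ValueError("masks must have the same number of pixels")

    intersection = sum(a & b for a, b in zip(p, t))
    pred_area, target_area = sum(p), sum(t)
    union = pred_area + target_area - intersection

    # Assumed convention: two empty masks count as a perfect match.
    if union == 0:
        return 1.0, 1.0

    dice = 2 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Toy 2x3 masks: the prediction and ground truth overlap on 2 of 3 foreground pixels.
pred = [[0, 1, 1],
        [0, 1, 0]]
target = [[0, 1, 0],
          [0, 1, 1]]

dice, iou = dice_and_iou(pred, target)
print(f"Dice = {dice:.3f}, IoU = {iou:.3f}")  # Dice = 0.667, IoU = 0.500
```

In practice the predicted mask would come from thresholding a model's output (for example from U-Net or TransUNet), and the scores would be averaged over a test set; those steps are omitted here.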


TF-Unet: An Automatic Cardiac MRI Image Segmentation Method

Mixed Transformer U-Net for Medical Image Segmentation

Transformers in medical image segmentation: A review

TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Transformers in medical image segmentation: a narrative review

A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer

Big Model and Small Model: Remote Modeling and Local Information Extraction Module for Medical Image Segmentation

EG-TransUNet: a transformer-based U-Net with enhanced and guided models for biomedical image segmentation

Fully Convolutional Network for the Semantic Segmentation of Medical Images: A Survey

SW-UNet: a U-Net fusing sliding window transformer block with CNN for segmentation of lung nodules

Medical Image Segmentation Based on TransUnet and A2B

MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation

SeUNet-Trans: A Simple yet Effective UNet-Transformer Model for Medical Image Segmentation

Next-Gen Medical Imaging: U-Net Evolution and the Rise of Transformers

3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers

Medical Image Segmentation Based on U-Net: A Review

Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Sfe-Transunet: A Transformer-Based U-Net With Skipped Features Enhancer For Medical Image Segmentation

HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image Segmentation

TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers