Abstract:Medical images are information carriers that visually reflect and record the anatomical structure of the human body, and play an important role in clinical diagnosis, teaching and research, etc. Modern medicine has become increasingly inseparable from the intelligent processing of medical images. In recent years, there have been more and more attempts to apply deep learning theory to medical image segmentation tasks, and it is imperative to explore a simple and efficient deep learning algorithm for medical image segmentation. In this paper, we investigate the segmentation of lung nodule images. We address the above-mentioned problems of medical image segmentation algorithms and conduct research on medical image fusion algorithms based on a hybrid channel-space attention mechanism and medical image segmentation algorithms with a hybrid architecture of Convolutional Neural Networks (CNN) and Visual Transformer. To the problem that medical image segmentation algorithms are difficult to capture long-range feature dependencies, this paper proposes a medical image segmentation model SW-UNet based on a hybrid CNN and Vision Transformer (ViT) framework. Self-attention mechanism and sliding window design of Visual Transformer are used to capture global feature associations and break the perceptual field limitation of convolutional operations due to inductive bias. At the same time, a widened self-attentive vector is used to streamline the number of modules and compress the model size so as to fit the characteristics of a small amount of medical data, which makes the model easy to be overfitted. Experiments on the LUNA16 lung nodule image dataset validate the algorithm and show that the proposed network can achieve efficient medical image segmentation on a lightweight scale. In addition, to validate the migratability of the model, we performed additional validation on other tumor datasets with desirable results. Our research addresses the crucial need for improved medical image segmentation algorithms. By introducing the SW-UNet model, which combines CNN and ViT, we successfully capture long-range feature dependencies and break the perceptual field limitations of traditional convolutional operations. This approach not only enhances the efficiency of medical image segmentation but also maintains model scalability and adaptability to small medical datasets. The positive outcomes on various tumor datasets emphasize the potential migratability and broad applicability of our proposed model in the field of medical image analysis.

VSNet: classification of pulmonary nodules in 3D using vision transformer and sequence spatial attention mechanism

Detection of Pulmonary Nodules in CT Images Based on 3D Multi-Scale Retina Vnet

S3TU-Net: Structured Convolution and Superpixel Transformer for Lung Nodule Segmentation

Multimodality Attention-Guided 3-D Detection of Nonsmall Cell Lung Cancer in 18F-FDG PET/CT Images

3D Multi-View Squeeze-and-excitation Convolutional Neural Network for Lung Nodule Classification

An improved V-Net lung nodule segmentation model based on pixel threshold separation and attention mechanism

Multiscale lung nodule segmentation based on 3D coordinate attention and edge enhancement

Attention-guided deep neural network with a multichannel architecture for lung nodule classification

An attention-based deep learning network for lung nodule malignancy discrimination

SSANet—Novel Residual Network for Computer‐Aided Diagnosis of Pulmonary Nodules in Chest Computed Tomography

Unsupervised Contrastive Learning based Transformer for Lung Nodule Detection

TiCNet: Transformer in Convolutional Neural Network for Pulmonary Nodule Detection on CT Images

3D multi-view convolutional neural networks for lung nodule classification

An improved 3-D attention CNN with hybrid loss and feature fusion for pulmonary nodule classification

SW-UNet: a U-Net fusing sliding window transformer block with CNN for segmentation of lung nodules

LSSANet: A Long Short Slice-Aware Network for Pulmonary Nodule Detection

AttentNet: Fully Convolutional 3D Attention for Lung Nodule Detection

Improving lung nodule segmentation in thoracic CT scans through the ensemble of 3D U-Net models

AResNet-ViT: A Hybrid CNN-Transformer Network for Benign and Malignant Breast Nodule Classification in Ultrasound Images

Classification of Benign and Malignant Lung Nodules Based on Residuals and 3D VNet Network

Lung Nodule Texture Detection and Classification Using 3D CNN