Efficient Lung Cancer Image Classification and Segmentation Algorithm Based on an Improved Swin Transformer

Ruina Sun, Yuexin Pang, Wenfa Li
DOI: https://doi.org/10.3390/electronics12041024
IF: 2.9
2023-02-19
Electronics
Abstract: With the advancement of computer technology, transformer models have been applied to the field of computer vision (CV) after their success in natural language processing (NLP). In today's rapidly evolving medical field, radiologists continue to face multiple challenges, such as increased workload and increased diagnostic demands. The accuracy of traditional lung cancer detection methods still needs to be improved, especially in realistic diagnostic scenarios. In this study, we evaluated the performance of the Swin Transformer model in the classification and segmentation of lung cancer. The results showed that the pre-trained Swin-B model achieved a top-1 accuracy of 82.26% in the classification mission, outperforming ViT by 2.529%. In the segmentation mission, the Swin-S model demonstrated improvement over other methods in terms of mean Intersection over Union (mIoU). These results suggest that pre-training can be an effective approach for improving the accuracy of the Swin Transformer model in these tasks.
engineering, electrical & electronic; computer science, information systems; physics, applied
What problem does this paper attempt to address?
The main problem this paper attempts to solve is how to improve the accuracy and efficiency of lung cancer detection, especially in real-life diagnosis scenarios. Specifically, the researchers aim to enhance the performance of lung cancer image classification and segmentation tasks through an improved Swin Transformer model.

### Specific Problem Description

1. **Limitations of Traditional Lung Cancer Detection Methods**:
   - The accuracy of traditional lung cancer detection methods in practical applications needs to be improved.
   - The increased workload of doctors leads to a rise in the misdiagnosis rate.
   - Lung cancer has no obvious symptoms in the early stage, and early diagnosis is difficult to achieve when relying solely on doctors' visual examinations.
2. **Challenges in the Application of Computer Vision Technologies**:
   - Although convolutional neural networks (CNNs) perform well in image detection, their training process is time-consuming and expensive, and problems such as reduced resolution and limited input size affect their performance.
   - Although the self-attention mechanism can solve some of these problems, it cannot effectively learn feature representations in deep networks, which limits further performance improvement.

### Solutions

To solve the above problems, the authors propose an efficient lung cancer image classification and segmentation algorithm based on the improved Swin Transformer. Specifically, it includes:

- **Classification Tasks**:
  - Use the pre-trained Swin-B model for lung cancer image classification, achieving a top-1 accuracy of 82.26%, which is 2.529% higher than ViT.
  - Train the Swin Transformer network under two settings, namely regular training and pre-training, to improve the generalization ability of the model.
- **Segmentation Tasks**:
  - In the segmentation task, the Swin-S model is superior to other methods in terms of the mean intersection-over-union (mIoU).
  - Improve the accuracy of the segmentation task by slicing images and labels in three directions (x, y, z) and programmatically filtering for labels that contain nodules.

### Innovation Points

- **Combination of Classification and Segmentation**: Creatively propose a segmentation method based on an efficient transformer to complete the lung cancer classification task.
- **Application of Pre-training**: Introduce pre-training in the segmentation task to help the model improve the accuracy of the experiment.
- **Data Augmentation**: Augment data through operations such as rotation, translation, and flipping to solve the problem of imbalance between positive and negative samples.

Through these improvements, the researchers hope to achieve higher accuracy and efficiency in lung cancer detection, thereby improving the speed and accuracy of clinical diagnosis.
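
The mean intersection-over-union (mIoU) metric used to compare segmentation models can be illustrated with a short NumPy sketch. This is a generic definition, not the paper's evaluation code: the function name `mean_iou`, the integer class-map inputs, and the choice to skip classes absent from both prediction and ground truth are all assumptions made for illustration. Evaluation toolkits differ in whether they average per image or accumulate counts over the whole dataset.

```python
import numpy as np


def mean_iou(pred, target, num_classes):
    """Average the per-class IoU over classes present in pred or target.

    pred, target: integer arrays of the same shape holding class indices.
    Classes missing from both arrays are skipped (an assumed convention).
    """
    if pred.shape != target.shape:
        raise ValueError("pred and target must have the same shape")

    ious = []
    for cls in range(num_classes):
        pred_mask = pred == cls
        target_mask = target == cls
        union = np.logical_or(pred_mask, target_mask).sum()
        if union == 0:
            continue  # class does not occur at all, so IoU is undefined
        intersection = np.logical_and(pred_mask, target_mask).sum()
        ious.append(intersection / union)

    return float(np.mean(ious)) if ious else float("nan")


# Toy 2x3 example with background (0) and nodule (1) classes.
pred = np.array([[0, 0, 1],
                 [1, 1, 0]])
target = np.array([[0, 1, 1],
                   [1, 1, 0]])
score = mean_iou(pred, target, num_classes=2)
print(f"mIoU = {score:.4f}")
```

In the toy example the background class has IoU 2/3 and the nodule class has IoU 3/4, so the mean is about 0.7083.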
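
The preprocessing step of slicing images and labels along three directions and keeping only slices whose label contains a nodule might look roughly like the sketch below. It is a minimal illustration under stated assumptions, not the authors' pipeline: the function name `slice_with_nodules` is hypothetical, the volume and label are assumed to be 3D NumPy arrays of identical shape, array axes 0, 1, and 2 stand in for the three directions, and any nonzero label voxel is treated as nodule.

```python
import numpy as np


def slice_with_nodules(volume, labels):
    """Slice a 3D volume along each axis, keeping slices that contain nodule labels.

    Returns a list of (axis, index, image_slice, label_slice) tuples.
    Assumption: any nonzero voxel in `labels` marks a nodule.
    """
    if volume.ndim != 3 or volume.shape != labels.shape:
        raise ValueError("volume and labels must be 3D arrays of the same shape")

    kept = []
    for axis in range(3):
        for index in range(volume.shape[axis]):
            label_slice = np.take(labels, index, axis=axis)
            if label_slice.any():  # keep only slices with at least one nodule voxel
                image_slice = np.take(volume, index, axis=axis)
                kept.append((axis, index, image_slice, label_slice))
    return kept


# Toy volume with a single labelled voxel at position (1, 2, 3).
rng = np.random.default_rng(0)
volume = rng.random((4, 5, 6))
labels = np.zeros((4, 5, 6), dtype=np.uint8)
labels[1, 2, 3] = 1

kept = slice_with_nodules(volume, labels)
for axis, index, image_slice, _ in kept:
    print(f"axis={axis} index={index} slice shape={image_slice.shape}")
```

With one labelled voxel, exactly one slice per axis survives the filter; every other slice is discarded because its label is empty. Filtering this way removes slices that carry no positive signal before training.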