An improved V-Net lung nodule segmentation model based on pixel threshold separation and attention mechanism

Xiaopu Ma, Handing Song, Xiao Jia, Zhan Wang
DOI: https://doi.org/10.1038/s41598-024-55178-3
IF: 4.6
2024-02-28
Scientific Reports
Abstract: Accurate labeling of lung nodules in computed tomography (CT) images is crucial in early lung cancer diagnosis and before nodule resection surgery. However, the irregular shape of lung nodules in CT images and the complex lung environment make it much more challenging to segment lung nodules accurately. On this basis, we propose an improved V-Net segmentation method for lung nodules based on pixel threshold separation and an attention mechanism. This method first offers a data augmentation strategy to solve the problem of insufficient samples in 3D medical datasets. In addition, we integrate a feature extraction module based on pixel threshold separation into the model to enhance its feature extraction ability under different thresholds. The model also introduces channel and spatial attention modules so that it pays more attention to important semantic information, improving its generalization ability and accuracy. Experiments show that the Dice similarity coefficients of the improved model on the public datasets LUNA16 and LNDb are 94.9% and 81.1%, respectively, and the sensitivities reach 92.7% and 76.9%, respectively. These results are superior to most existing UNet architecture models and comparable to manual segmentation by medical technologists.
multidisciplinary sciences
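The two metrics quoted in the abstract have standard definitions: the Dice similarity coefficient is 2·TP / (2·TP + FP + FN), and sensitivity is TP / (TP + FN). The sketch below computes both for binary masks flattened to sequences of 0/1 labels. It is only an illustration of the definitions, not the paper's evaluation code; the function names and the convention of returning 1.0 when both masks are empty are assumptions made here.

```python
def confusion_counts(pred, truth):
    """Count true positives, false positives, and false negatives
    over paired binary voxel labels (1 = nodule, 0 = background)."""
    tp = fp = fn = 0
    for p, t in zip(pred, truth):
        if p and t:
            tp += 1
        elif p and not t:
            fp += 1
        elif t and not p:
            fn += 1
    return tp, fp, fn


def dice_coefficient(pred, truth):
    """Dice = 2*TP / (2*TP + FP + FN)."""
    tp, fp, fn = confusion_counts(pred, truth)
    denom = 2 * tp + fp + fn
    # Assumption: two empty masks count as a perfect match.
    return 2 * tp / denom if denom else 1.0


def sensitivity(pred, truth):
    """Sensitivity (recall) = TP / (TP + FN)."""
    tp, _, fn = confusion_counts(pred, truth)
    # Assumption: with no positive voxels in the truth, report 1.0.
    return tp / (tp + fn) if (tp + fn) else 1.0


# Toy example: eight voxels, flattened.
truth = [0, 1, 1, 1, 0, 0, 1, 0]
pred = [0, 1, 1, 0, 0, 1, 1, 0]

print(dice_coefficient(pred, truth))  # 0.75
print(sensitivity(pred, truth))       # 0.75
```

In the toy example there are 3 true positives, 1 false positive, and 1 false negative, so Dice is 6/8 and sensitivity is 3/4.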
What problem does this paper attempt to address?
The paper primarily addresses the precise segmentation of pulmonary nodules in computed tomography (CT) images. Due to the irregular shape of pulmonary nodules and the complex lung environment, accurately segmenting pulmonary nodules is highly challenging. To solve this problem, the authors propose an improved V-Net segmentation method (named Dig-CS-VNet), which is based on pixel threshold separation and attention mechanisms. Specifically, the main contributions of this method include:

1. **Data Augmentation Strategy**: A data augmentation method suitable for 3D medical datasets is proposed. By using slice reverse order and matching plane geometric transformations, the dataset size is increased, addressing the issue of insufficient samples in 3D medical image datasets.
2. **Model Construction**:
   - A feature extraction module based on pixel threshold separation (Dig_Sep) is proposed, enhancing the model's ability to extract hierarchical features at different thresholds.
   - Channel and spatial attention mechanisms are introduced into the model, enabling the model to focus more on important semantic information, thereby improving its generalization ability and accuracy.

Experimental results show that the improved model achieved Dice similarity coefficients of 94.9% and 81.1% on the LUNA16 and LNDb public datasets, respectively, with sensitivities of 92.7% and 76.9%, respectively. This outperforms most existing U-Net architecture models and is comparable to the manual segmentation level of medical technologists.

In summary, this study aims to improve the accuracy of pulmonary nodule segmentation in CT images by proposing an improved V-Net model that combines pixel threshold separation and attention mechanisms.
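The data augmentation idea summarized above (reversing the slice order and applying a matching in-plane geometric transformation) can be sketched roughly as follows. This is one possible reading, not the authors' implementation: it assumes that "matching" means the same planar transform is applied to every slice and to both the image and its mask, and it uses a left-right flip as a stand-in for the planar transform. The function names and the `volume[z][y][x]` nested-list layout are hypothetical.

```python
def reverse_slice_order(volume):
    """Return a copy of the volume with its slices in reverse order
    (a flip along the z-axis). Layout assumed: volume[z][y][x]."""
    return [[row[:] for row in sl] for sl in volume[::-1]]


def flip_left_right(volume):
    """Mirror every slice along the x-axis (a simple in-plane transform)."""
    return [[row[::-1] for row in sl] for sl in volume]


def augment_pair(image, mask):
    """Apply the same transforms to the CT volume and its label mask
    so that voxel-to-label correspondence is preserved."""
    def transform(volume):
        return flip_left_right(reverse_slice_order(volume))
    return transform(image), transform(mask)


# Toy 2x2x2 volume and mask.
image = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]
mask = [[[0, 1], [0, 0]], [[0, 0], [1, 0]]]

aug_image, aug_mask = augment_pair(image, mask)
print(aug_image)  # [[[6, 5], [8, 7]], [[2, 1], [4, 3]]]
print(aug_mask)   # [[[0, 0], [0, 1]], [[1, 0], [0, 0]]]
```

Each transformed pair is a new training sample; the voxels labeled 1 in the original mask (values 2 and 7) are still labeled 1 after augmentation.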