FusionLungNet: Multi-scale Fusion Convolution with Refinement Network for Lung CT Image Segmentation

Sadjad Rezvani,Mansoor Fateh,Yeganeh Jalali,Amirreza Fateh
2024-10-21
Abstract:Early detection of lung cancer is crucial as it increases the chances of successful treatment. Automatic lung image segmentation assists doctors in identifying diseases such as lung cancer, COVID-19, and respiratory disorders. However, lung segmentation is challenging due to overlapping features like vascular and bronchial structures, along with pixel-level fusion of brightness, color, and texture. New lung segmentation methods face difficulties in identifying long-range relationships between image components, reliance on convolution operations that may not capture all critical features, and the complex structures of the lungs. Furthermore, semantic gaps between feature maps can hinder the integration of relevant information, reducing model accuracy. Skip connections can also limit the decoder's access to complete information, resulting in partial information loss during encoding. To overcome these challenges, we propose a hybrid approach using the FusionLungNet network, which has a multi-level structure with key components, including the ResNet-50 encoder, Channel-wise Aggregation Attention (CAA) module, Multi-scale Feature Fusion (MFF) block, self refinement (SR) module, and multiple decoders. The refinement sub-network uses convolutional neural networks for image post-processing to improve quality. Our method employs a combination of loss functions, including SSIM, IOU, and focal loss, to optimize image reconstruction quality. We created and publicly released a new dataset for lung segmentation called LungSegDB, including 1800 CT images from the LIDC-IDRI dataset (dataset version 1) and 700 images from the Chest CT Cancer Images from Kaggle dataset (dataset version 2). Our method achieved an IOU score of 98.04, outperforming existing methods and demonstrating significant improvements in segmentation accuracy. <a class="link-external link-https" href="https://github.com/sadjadrz/FusionLungNet" rel="external noopener nofollow">this https URL</a>
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to address multiple challenges in lung CT image segmentation to improve the accuracy and reliability of early - stage lung cancer detection. Specifically, the paper attempts to solve the following key problems: 1. **Feature overlap and complex structures**: - The feature overlap between lung tissues and other structures (such as blood vessels and bronchi) makes it difficult to accurately divide tissue boundaries. - This complexity reduces the accuracy and reliability of lung segmentation. 2. **Capturing long - distance dependencies**: - Existing lung segmentation methods have difficulty in capturing long - distance dependencies between image components, which limits the model's ability to capture key features. 3. **Limitations of convolutional operations**: - Methods relying on convolutional operations may not be able to capture all important features, especially when dealing with smaller structures. 4. **Semantic gap**: - The semantic gap of feature maps between the encoder and the decoder will hinder the integration of relevant information, thereby reducing the accuracy of the model. 5. **Information loss in skip connections**: - Although skip connections can preserve information to a certain extent, they may still cause partial information loss during the encoding process, affecting the decoder's ability to obtain complete information. 6. **Singularity of loss functions**: - A single loss function may not be able to effectively capture the multifaceted nature of the lung segmentation task, resulting in sub - optimal performance. ### Proposed solutions To solve the above problems, the paper proposes a new network architecture named FusionLungNet, which has the following features: - **ResNet - 50 encoder**: Used to extract important information from the input image. - **Channel - Aggregation - Attention (CAA) module**: Optimizes feature representation, emphasizes key channels in the feature map, and more effectively captures important details in lung CT images. - **Multi - Scale - Feature - Fusion (MFF) module**: Effectively combines features from different sources to improve the quality of the reconstructed image. - **Self - Refinement (SR) module**: Solves the related challenges of the multi - scale - feature - fusion module by optimally passing features to the decoder, and corrects small defects in the original image, enhancing the clarity and contrast of the image. - **Multiple decoders**: Ensure the comprehensive recovery of feature information. - **Hybrid loss function**: Combines the Structural Similarity Index (SSIM), Intersection over Union (IOU), and Focal Loss to optimize the image reconstruction quality, ensure structural integrity, spatial accuracy, and classification balance. In addition, the paper also creates and publicly releases a new lung segmentation dataset, LungSegDB, which contains 1,800 CT images from the LIDC - IDRI dataset and 700 images from the Kaggle Chest CT Cancer Image Dataset. The introduction of these datasets provides rich and diverse data support for model training. ### Summary FusionLungNet successfully addresses multiple challenges in lung CT image segmentation by introducing multi - scale - fusion - convolution and refinement networks, significantly improving the accuracy and robustness of segmentation, and providing strong support for early - stage lung cancer detection.