DeepChestGNN: A Comprehensive Framework for Enhanced Lung Disease Identification through Advanced Graphical Deep Features

Shakil Rana,Md Jabed Hosen,Tasnim Jahan Tonni,Md. Awlad Hossen Rony,Kaniz Fatema,Md. Zahid Hasan,Md. Tanvir Rahman,Risala Tasin Khan,Tony Jan,Md Whaiduzzaman
DOI: https://doi.org/10.3390/s24092830
IF: 3.9
2024-04-30
Sensors
Abstract:Lung diseases are the third-leading cause of mortality in the world. Due to compromised lung function, respiratory difficulties, and physiological complications, lung disease brought on by toxic substances, pollution, infections, or smoking results in millions of deaths every year. Chest X-ray images pose a challenge for classification due to their visual similarity, leading to confusion among radiologists. To imitate those issues, we created an automated system with a large data hub that contains 17 datasets of chest X-ray images for a total of 71,096, and we aim to classify ten different disease classes. For combining various resources, our large datasets contain noise and annotations, class imbalances, data redundancy, etc. We conducted several image pre-processing techniques to eliminate noise and artifacts from images, such as resizing, de-annotation, CLAHE, and filtering. The elastic deformation augmentation technique also generates a balanced dataset. Then, we developed DeepChestGNN, a novel medical image classification model utilizing a deep convolutional neural network (DCNN) to extract 100 significant deep features indicative of various lung diseases. This model, incorporating Batch Normalization, MaxPooling, and Dropout layers, achieved a remarkable 99.74% accuracy in extensive trials. By combining graph neural networks (GNNs) with feedforward layers, the architecture is very flexible when it comes to working with graph data for accurate lung disease classification. This study highlights the significant impact of combining advanced research with clinical application potential in diagnosing lung diseases, providing an optimal framework for precise and efficient disease identification and classification.
engineering, electrical & electronic,instruments & instrumentation,chemistry, analytical
What problem does this paper attempt to address?
The paper aims to address several key issues in lung disease identification. Specifically: 1. **Dataset Integration and Diversity**: Researchers collected 17 chest X-ray datasets from multiple sources and integrated them into a large comprehensive dataset containing 71,096 images, covering 10 different lung disease categories (normal, pleural effusion, pulmonary fibrosis, lung opacity, mass, nodule, COVID-19, pneumonia, pneumothorax, and tuberculosis). 2. **Image Preprocessing**: To improve classification accuracy, researchers employed a series of image preprocessing techniques, including: - Image Scaling: All images were uniformly resized to 224x224 pixels. - Denoising: Total variation denoising (TV denoising) was used to reduce image noise. - Annotation Removal: Text annotations in the images were removed using annotation removal methods. - Contrast Enhancement: Contrast Limited Adaptive Histogram Equalization (CLAHE) was used to enhance image contrast. - Filter Application: The "green fire blue" filter was applied to further enhance image features. 3. **Data Augmentation**: Elastic Deformation technology was used to generate a balanced dataset, capturing the elastic deformation of biological tissues under compression, thereby better simulating actual clinical conditions. 4. **Model Design**: A new deep convolutional neural network (DCNN) architecture was proposed, combining Batch Normalization, MaxPooling, and Dropout layers to extract 100 important deep features. Subsequently, a graph neural network (GNN) model named DeepChestGNN was proposed based on these features, utilizing graph structures to further optimize model performance, achieving efficient and high-accuracy lung disease classification. Through the above methods, researchers significantly improved the diagnostic accuracy of lung diseases, achieving a classification accuracy of 99.74%, providing strong support for clinical diagnosis.