An Efficient Combination of Convolutional Neural Network and LightGBM Algorithm for Lung Cancer Histopathology Classification

Esraa A.-R. Hamed,Mohammed A.-M. Salem,Nagwa L. Badr,Mohamed F. Tolba
DOI: https://doi.org/10.3390/diagnostics13152469
IF: 3.6
2023-07-26
Diagnostics
Abstract:The most dangerous disease in recent decades is lung cancer. The most accurate method of cancer diagnosis, according to research, is through the use of histopathological images that are acquired by a biopsy. Deep learning techniques have achieved success in bioinformatics, particularly medical imaging. In this paper, we present an innovative method for rapidly identifying and classifying histopathology images of lung tissues by combining a newly proposed Convolutional Neural Networks (CNN) model with a few total parameters and the enhanced Light Gradient Boosting Model (LightGBM) classifier. After the images have been pre-processed in this study, the proposed CNN technique is provided for feature extraction. Then, the LightGBM model with multiple threads has been used for lung tissue classification. The simulation result, applied to the LC25000 dataset, demonstrated that the novel technique successfully classifies lung tissue with 99.6% accuracy and sensitivity. Furthermore, the proposed CNN model has achieved the lowest training parameters of only one million parameters, and it has also achieved the shortest processing time of just one second throughout the feature extraction process. When this result is compared with the most recent state-of-the-art approaches, the suggested approach has increased effectiveness in the areas of both disease classification accuracy and processing time.
medicine, general & internal
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to quickly identify and classify pathological images of lung tissues by combining Convolutional Neural Network (CNN) and LightGBM algorithm, so as to improve the accuracy and efficiency of lung cancer diagnosis. Specifically, the paper proposes an innovative method aiming at: 1. **Improving the efficiency of feature extraction**: Design a CNN model with fewer parameters to extract key features in pathological images, reducing the requirements for training time and computing resources. 2. **Improving classification accuracy**: Utilize a multi - threaded LightGBM classifier, combined with the features extracted from CNN, to perform efficient and accurate classification of lung tissues. 3. **Reducing the misdiagnosis rate**: Decrease the occurrence of false negatives, that is, reduce the situation of wrongly judging that a patient is not ill, thereby improving the safety and reliability of diagnosis. The main contributions of the paper are: - Proposing a method for pre - processing the LC25000 dataset to enhance image contrast. - Designing an efficient CNN model with the least training parameters for extracting discriminant feature vectors. - Using the LightGBM model with the fastest computing speed for lung tissue classification, which improves the efficiency and accuracy of disease classification. - Compared with existing machine learning and deep learning techniques, the proposed CNN - LightGBM strategy shows higher effectiveness in feature extraction and classification. Through these methods, the paper aims to provide a fast, accurate and efficient solution for the early diagnosis of lung cancer, thereby improving the treatment effect and survival rate of patients.