COVID‐19 detection from chest X-ray images using transfer learning

Enas M. F. El Houby
DOI: https://doi.org/10.1038/s41598-024-61693-0
IF: 4.6
2024-05-23
Scientific Reports
Abstract:COVID-19 is a kind of coronavirus that appeared in China in the Province of Wuhan in December 2019. The most significant influence of this virus is its very highly contagious characteristic which may lead to death. The standard diagnosis of COVID-19 is based on swabs from the throat and nose, their sensitivity is not high enough and so they are prone to errors. Early diagnosis of COVID-19 disease is important to provide the chance of quick isolation of the suspected cases and to decrease the opportunity of infection in healthy people. In this research, a framework for chest X-ray image classification tasks based on deep learning is proposed to help in early diagnosis of COVID-19. The proposed framework contains two phases which are the pre-processing phase and classification phase which uses pre-trained convolution neural network models based on transfer learning. In the pre-processing phase, different image enhancements have been applied to full and segmented X-ray images to improve the classification performance of the CNN models. Two CNN pre-trained models have been used for classification which are VGG19 and EfficientNetB0. From experimental results, the best model achieved a sensitivity of 0.96, specificity of 0.94, precision of 0.9412, F1 score of 0.9505 and accuracy of 0.95 using enhanced full X-ray images for binary classification of chest X-ray images into COVID-19 or normal with VGG19. The proposed framework is promising and achieved a classification accuracy of 0.935 for 4-class classification.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to make an early diagnosis of COVID - 19 through chest X - ray images in order to improve the accuracy and efficiency of detection. Specifically, the author points out that the traditional PCR tests based on pharyngeal swabs and nasal swabs have low sensitivity and are error - prone. Therefore, a deep - learning - based framework is proposed, which uses transfer - learning techniques to classify chest X - ray images, thus assisting doctors in diagnosing COVID - 19 earlier and more accurately. ### Problem Background 1. **High Infectivity**: COVID - 19 is highly infectious and may lead to serious diseases and even death. 2. **Limitations of Traditional Detection Methods**: Although the traditional PCR test is reliable, its sensitivity is limited and it is error - prone, especially in the early stage when symptoms are not obvious. 3. **Importance of Early Diagnosis**: Early diagnosis can help quickly isolate suspected cases and reduce the risk of infection in healthy people. ### Solution To address the above problems, the author proposes a deep - learning - based framework, which consists of two main stages: 1. **Pre - processing Stage**: - Perform multiple enhancement processes on the original X - ray images and the segmented lung area images to improve the image quality. - Use techniques such as histogram equalization (HE), contrast - limited adaptive histogram equalization (CLAHE), and image inpainting to enhance the images. 2. **Classification Stage**: - Use pre - trained convolutional neural network (CNN) models (such as VGG19 and EfficientNetB0) for classification. - Through transfer - learning techniques, utilize the feature extraction capabilities of these pre - trained models on large - scale data sets to further optimize the classification performance. ### Experimental Results Through experimental verification, this framework has achieved remarkable results in the binary classification (COVID - 19 vs. normal) task. The best model uses the enhanced complete X - ray images and achieves the following performance metrics: - **Sensitivity**: 0.96 - **Specificity**: 0.94 - **Precision**: 0.9412 - **F1 Score**: 0.9505 - **Accuracy**: 0.95 In addition, in the four - class classification task, this framework also performs well, reaching a classification accuracy of 0.935. ### Formula Summary The main evaluation metric formulas involved in the paper are as follows: - **Sensitivity (Sensitivity/Recall)**: \[ \text{Sensitivity}=\frac{TP}{TP + FN} \] - **Precision**: \[ \text{Precision}=\frac{TP}{TP + FP} \] - **Specificity**: \[ \text{Specificity}=\frac{TN}{TN + FP} \] - **Accuracy**: \[ \text{Accuracy}=\frac{TP + TN}{TP + FP + TN + FN} \] - **F1 Score**: \[ \text{F1 Score}=\frac{2\times TP}{2\times TP + FP + FN} \] Among them, \(TP\) represents true positives, \(FP\) represents false positives, \(TN\) represents true negatives, and \(FN\) represents false negatives. Through this method, this research provides an efficient and reliable solution for the early diagnosis of COVID - 19.