Self-Trained Convolutional Neural Network (CNN) for Tuberculosis Diagnosis in Medical Imaging

Karan Sarawagi,Ashutosh Pagrotra,Hardik Dhiman,Navjot Singh
DOI: https://doi.org/10.7759/cureus.63356
2024-06-28
Cureus
Abstract:Background Tuberculosis (TB) is a serious infectious disease that primarily affects the lungs. Despite advancements in the medical industry, TB remains a significant global health challenge. Early and accurate detection of TB is crucial for effective treatment and reducing transmission. This article presents a deep learning approach using convolutional neural networks (CNNs) to improve TB detection in chest X-ray images. Methods For the dataset, we collected 7000 images from Kaggle.com, of which 3500 exhibit tuberculosis evidence and the remaining 3500 are normal. Preprocessing techniques such as wavelet transformation, contrast-limited adaptive histogram equalisation (CLAHE), and gamma correction were applied to enhance the image quality. Random flipping, random rotation, random resizing, and random rescaling were among the techniques employed to increase dataset variability and model robustness. Convolutional, max-pooling, flatten, and dense layers comprised the CNN model architecture. For binary classification, sigmoid activation was utilised in the output layer and rectified linear unit (ReLU) activation in the input and hidden layers. Results The CNN model achieved an accuracy of ~96.57% in detecting TB from chest X-ray images, demonstrating the effectiveness of deep learning, particularly CNNs, in this application. Self-trained CNNs have optimised the results as compared to the transfer learning of various pre-trained models. Conclusion This study shows how well deep learning-in particular, CNNs-performs in the identification of tuberculosis. Subsequent efforts have to give precedence to optimising the model by obtaining more extensive datasets from the local hospitals and localities, which are vulnerable to TB, and stress the possibility of augmenting diagnostic knowledge in medical imaging via machine learning methodologies.
What problem does this paper attempt to address?