Deep Learning Approach for Enhancing Oral Squamous Cell Carcinoma with LIME Explainable AI Technique

Samiha Islam,Muhammad Zawad Mahmud,Shahran Rahman Alve,Md. Mejbah Ullah Chowdhury
2024-11-21
Abstract:The goal of the present study is to analyze an application of deep learning models in order to augment the diagnostic performance of oral squamous cell carcinoma (OSCC) with a longitudinal cohort study using the Histopathological Imaging Database for oral cancer analysis. The dataset consisted of 5192 images (2435 Normal and 2511 OSCC), which were allocated between training, testing, and validation sets with an estimated ratio repartition of about 52% for the OSCC group, and still, our performance measure was validated on a combination set that contains almost equal number of sample in this use case as entire database have been divided into half using stratified splitting technique based again near binary proportion but total distribution was around even. We selected four deep-learning architectures for evaluation in the present study: ResNet101, DenseNet121, VGG16, and EfficientnetB3. EfficientNetB3 was found to be the best, with an accuracy of 98.33% and F1 score (0.9844), and it took remarkably less computing power in comparison with other models. The subsequent one was DenseNet121, with 90.24% accuracy and an F1 score of 90.45%. Moreover, we employed the Local Interpretable Model-agnostic Explanations (LIME) method to clarify why EfficientNetB3 made certain decisions with its predictions to improve the explainability and trustworthiness of results. This work provides evidence for the possible superior diagnosis in OSCC activated from the EfficientNetB3 model with the explanation of AI techniques such as LIME and paves an important groundwork to build on towards clinical usage.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the diagnostic performance of oral squamous cell carcinoma (OSCC). Specifically, the researchers analyzed and enhanced the diagnostic performance of OSCC by applying four different deep - learning models (ResNet101, DenseNet121, VGG16, and EfficientNetB3), and used the LIME (Local Interpretable Model - agnostic Explanations) interpretable AI technology to improve the interpretability and credibility of model predictions. ### Research Background Oral squamous cell carcinoma (OSCC) is a malignant tumor originating from the oral cavity, oropharynx, and hypopharynx, accounting for more than 90% of all oral cancer cases. Early detection is crucial for improving prognosis and survival rate, but early lesions are usually asymptomatic, leading to late - stage diagnosis, increased treatment difficulty, and poor prognosis. Due to the influence of risk factors such as smoking and betel nut chewing in Asia, the incidence of OSCC is relatively high, so early identification and management are particularly important. ### Research Methods 1. **Dataset**: The study used a dataset containing 5,192 images (2,435 normal images and 2,511 OSCC images), which were divided into training set, test set, and validation set. 2. **Data Pre - processing**: All images were cropped to 224 × 224 pixels and converted to JPG format. In addition, image enhancement such as scaling, right - shifting, and flipping was also carried out. 3. **Model Selection and Training**: - **ResNet101**: A convolutional neural network with 101 layers, which solves the vanishing gradient problem through skip connections. - **VGG16**: A convolutional neural network consisting of 13 convolutional layers and 3 fully - connected layers, suitable for image classification. - **DenseNet121**: A densely - connected neural network with 121 layers, which improves the convergence performance through dense connections. - **EfficientNetB3**: Optimizes performance by compoundly scaling depth, width, and resolution, and has high computational efficiency. 4. **Evaluation Metrics**: - **Precision**: \[ \text{Precision} = \frac{\text{TP}}{\text{TP} + \text{FP}} \] - **Recall**: \[ \text{Recall} = \frac{\text{TP}}{\text{TP} + \text{FN}} \] - **F1 - Score**: \[ \text{F1 - Score} = \frac{2\times\text{Precision}\times\text{Recall}}{\text{Precision} + \text{Recall}} \] - **Accuracy**: \[ \text{Accuracy} = \frac{\text{TP} + \text{TN}}{\text{TP} + \text{TN} + \text{FP} + \text{FN}} \] ### Main Contributions 1. **Model Tuning**: By fine - tuning the pre - trained EfficientNetB3 model, a high accuracy (98.33%) and F1 - score (0.9844) for OSCC classification were achieved. 2. **Interpretability AI**: The LIME technology was applied to explain the prediction results of the EfficientNetB3 model, improving the interpretability and credibility of the model. ### Conclusion The research results show that deep - learning methods have significant potential in histopathological image analysis, especially the EfficientNetB3 model performs particularly well in the diagnosis of OSCC. The application of the LIME technology further enhances the interpretability of the model, which helps to improve the reliability and trustworthiness of medical diagnosis. Future research directions include further improving model performance, applying to other cancer types, and combining more interpretable AI technologies.