Melanoma Skin Cancer Identification with Explainability Utilizing Mask Guided Technique

Lahiru Gamage,Uditha Isuranga,Dulani Meedeniya,Senuri De Silva,Pratheepan Yogarajah
DOI: https://doi.org/10.3390/electronics13040680
IF: 2.9
2024-02-07
Electronics
Abstract:Melanoma is a highly prevalent and lethal form of skin cancer, which has a significant impact globally. The chances of recovery for melanoma patients substantially improve with early detection. Currently, deep learning (DL) methods are gaining popularity in assisting with the identification of diseases using medical imaging. The paper introduces a computational model for classifying melanoma skin cancer images using convolutional neural networks (CNNs) and vision transformers (ViT) with the HAM10000 dataset. Both approaches utilize mask-guided techniques, employing a specialized U2-Net segmentation module to generate masks. The CNN-based approach utilizes ResNet50, VGG16, and Xception with transfer learning. The training process is enhanced using a Bayesian hyperparameter tuner. Moreover, this study applies gradient-weighted class activation mapping (Grad-CAM) and Grad-CAM++ to generate heatmaps to explain the classification models. These visual heatmaps elucidate the contribution of each input region to the classification outcome. The CNN-based model approach achieved the highest accuracy at 98.37% in the Xception model with a sensitivity and specificity of 95.92% and 99.01%, respectively. The ViT-based model approach achieved high values for accuracy, sensitivity, and specificity, such as 92.79%, 91.09%, and 93.54%, respectively. Furthermore, the performance of the model was assessed through intersection over union (IOU) and other qualitative evaluations. Finally, we developed the proposed model as a web application that can be used as a support tool for medical practitioners in real-time. The system usability study score of 86.87% is reported, which shows the usefulness of the proposed solution.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address the issue of early detection of melanoma and enhance its credibility by improving the interpretability of the model. Specifically, the paper proposes a computational model based on Convolutional Neural Networks (CNN) and Vision Transformer (ViT) for classifying melanoma images. The main objectives include: 1. **Improving Skin Image Classification Performance**: Using various CNN architectures (such as Xception, ResNet50, VGG16, etc.) for classification and optimizing model performance through hyperparameter tuning. 2. **Introducing Interpretability**: Utilizing Gradient-weighted Class Activation Mapping (Grad-CAM) and Grad-CAM++ to generate heatmaps that explain the classification results. 3. **Application of Segmentation Techniques**: Generating segmentation masks using the U2-Net model to further guide the training of CNN and ViT models, thereby increasing the model's focus on key areas. 4. **Developing Practical Tools**: Developing the proposed model into a web application to serve as an auxiliary tool for medical practitioners in real clinical environments. Through these methods, the paper aims to improve the accuracy and reliability of melanoma detection and enhance the trust of medical professionals by increasing the transparency of the model.