A novel Deeplabv3+ and vision-based transformer model for segmentation and classification of skin lesions
Iqra Ahmad,Javaria Amin,Muhammad IkramUllah Lali,Farhat Abbas,Muhammad Imran Sharif
DOI: https://doi.org/10.1016/j.bspc.2024.106084
IF: 5.1
2024-02-16
Biomedical Signal Processing and Control
Abstract:Skin cancer (SC) is a common disease caused due to ultraviolet radiation. Accurate SC detection is degraded due to some artifacts such as lesion variations in shape, size, color, texture, hairs, poor contrast, brightness, and irregular lesion boundaries. To solve these limitations, a deep learning-based technique is proposed that consists of segmentation and classification of SC. The DeepLabv3+ segmentation model is designed that consist of 9 convolutional neural network blocks. Each block comprises 19 convolution, 18 rectified linear units, and 18 batch normalization layers. The model is evaluated on ISIC-16, 17, 18, and PH2 datasets that provide accuracy of 98.90 %, 98.38 %, 99.45 %, and 100 %, respectively. Another Vision Transformer (ViT) model is developed for the classification of skin lesions (SL). The ViT model performs better than CNN because ViT works as a token while CNN works pixel to pixel. The ViT model consists of eight blocks, each with 17 normalization, 8 multi-head attention, 19 dense, and 19 dropout layers with a 7x7 patch size. The model is evaluated on PH2, ISIC-19, ISIC-20, and HAM10000 datasets that provided an accuracy of 100 %, 96.97 %, 97.73 %, and 100 % respectively. The results are better than existing methods.
engineering, biomedical