ETLoViT: an Acne Diagnose Approach Using Vision-Transformers and Model Ensembling

KRISHNA VENI PALURI,Ashish Gupta
DOI: https://doi.org/10.1088/2631-8695/ad7ad9
IF: 1.7
2024-09-14
Engineering Research Express
Abstract:Acne, a widespread skin condition predominantly affecting teenagers, presents intricate challenges in its diagnosis. Recent Advances in deep learning, machine learning, and image processing methods have made it possible to diagnose acne automatically and effectively. However, achieving higher acne classification accuracy is still one of the concerns with these methods Therefore, this paper introduces a group of trained models based on transfer learning that are applied to Vision Transformer extracted features (ETLoViT). These models are trained using two innovative deep learning methods: the Vision Transformer (ViT) and model ensembling for acne image classification. The ViT approach harnesses the power of the attention module to extract acne features efficiently. These extracted features are subsequently run through different transfer learning models, such as MobileNetV2, VGG16, and EfficientNetB7. The predicted results subsequently combined for classification. The proposed approach is compared with existing deep learning methods, and the results demonstrate that proposed EVLoViT approach consistently performs better, achieving an astounding 96% classification accuracy.
What problem does this paper attempt to address?