Multi-objective optimization of ViT architecture for efficient brain tumor classification

Emrullah Şahin, Durmuş Özdemir, Hasan Temurtaş
DOI: https://doi.org/10.1016/j.bspc.2023.105938
IF: 5.1
2024-01-14
Biomedical Signal Processing and Control
Abstract: This study presents an approach to optimizing the Vision Transformer (ViT) network for brain tumor classification in 2D MRI images using Bayesian Multi-Objective (BMO) optimization. Rather than merely addressing the limitations of the standard ViT model, our objective was to improve its overall efficiency and effectiveness. Applying BMO allowed us to tune the architectural parameters of the ViT network, yielding a model that is twice as fast and four times smaller than the original. The optimized ViT model also performed better, with a 1.48% increase in validation accuracy, a 3.23% rise in F1-score, and a 3.36% improvement in precision. These gains highlight the potential of combining BMO with vision transformer-based models and suggest a promising direction for future research on achieving both high efficiency and high accuracy in complex classification tasks.
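The trade-off described in the abstract (accuracy against speed and model size) can be illustrated with a Pareto-dominance filter, the selection step that multi-objective methods generally rely on. The sketch below is not the authors' BMO procedure and contains no Bayesian surrogate model. The `Candidate` type, the objective names, and all numeric values are hypothetical and chosen only for illustration.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    """A hypothetical ViT configuration with three measured objectives."""
    name: str
    accuracy: float    # higher is better
    latency_ms: float  # lower is better
    params_m: float    # lower is better (millions of parameters)


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is no worse than b on every objective and strictly better on one."""
    no_worse = (a.accuracy >= b.accuracy
                and a.latency_ms <= b.latency_ms
                and a.params_m <= b.params_m)
    strictly_better = (a.accuracy > b.accuracy
                       or a.latency_ms < b.latency_ms
                       or a.params_m < b.params_m)
    return no_worse and strictly_better


def pareto_front(candidates: list[Candidate]) -> list[Candidate]:
    """Keep only candidates that no other candidate dominates."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


# Hypothetical values for illustration only; these are not results from the paper.
candidates = [
    Candidate("baseline", 0.95, 20.0, 86.0),
    Candidate("small", 0.96, 10.0, 21.0),
    Candidate("tiny", 0.93, 6.0, 6.0),
    Candidate("wide", 0.94, 25.0, 120.0),
]

front = pareto_front(candidates)
print([c.name for c in front])
```

In a full multi-objective search, a filter like this would be applied to the configurations evaluated so far, and the optimizer would propose new architectural parameters expected to extend the non-dominated set.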
engineering, biomedical