ViT-CB: Integrating hybrid Vision Transformer and CatBoost to enhanced brain tumor detection with SHAP

Radius Tanone,Li-Hua Li,Shoffan Saifullah
DOI: https://doi.org/10.1016/j.bspc.2024.107027
IF: 5.1
2024-10-26
Biomedical Signal Processing and Control
Abstract:Brain tumor classification is a crucial aspect of diagnosing and treating neurological disorders. Conventional methods often rely on manual feature extraction, which may limit their effectiveness in dealing with complex patterns. In recent years, deep learning techniques have shown promising results in medical image analysis due to their ability to learn discriminative features automatically. However, developing deep learning models for brain tumor classification has posed challenges for medical decision-makers in terms of interpreting these models. This study uses a novel architecture called ViT-CB, which leverages the Vision Transformer deep learning model for feature extraction. The image's dimensionality is then reduced using PCA to enhance the model's performance. The CatBoost algorithm is further employed to improve the model's final brain tumor classification performance. Furthermore, we utilize SHAP-based XAI to interpret the model's result, identifying the features that most influence the final classification. Using two open datasets, the ViT-CB model achieved a sensitivity of 0.9961, a specificity of 0.9891, and an accuracy of 0.99316 on dataset 1. On dataset 2, the model demonstrated a sensitivity of 0.9903, a specificity of 0.9975, and 0.90004 accuracy, highlighting its effectiveness in handling complex medical imaging data. These results make an essential contribution to the classification of brain tumors.
engineering, biomedical
What problem does this paper attempt to address?