Improving brain tumor classification with combined convolutional neural networks and transfer learning

Ramazan İncir,Ferhat Bozkurt
DOI: https://doi.org/10.1016/j.knosys.2024.111981
IF: 8.139
2024-05-26
Knowledge-Based Systems
Abstract:Brain tumors pose a serious threat, causing the deaths of thousands of people worldwide, and can lead to life-threatening consequences when not accurately diagnosed. The classification of brain tumors and the determination of correct treatment strategies are crucial, yet this process encounters various challenges. These tumors originate from different cell types and exhibit diversity in growth rates, histological features, and genetic structures. Some may show similarities at the microscopic level, complicating classification and making diagnosis and treatment challenging. The examination of brain MR images is a widely used method in diagnosing brain tumors. However, occasional misdiagnosis of tumors can lead to ineffective responses to treatments and reduced chances of survival for patients. Traditional machine learning classifiers require manually determined features, which is quite time-consuming. On the other hand, deep learning is highly effective in feature extraction and has recently been widely preferred in classification. In this context, the effectiveness of transfer learning architectures in brain tumor diagnosis was evaluated. Six different transfer learning architectures, including ResNet-50, MobileNet, VGG16, Inception-V3, DenseNet-121, and EfficientNetV2-M, were used in this study. A public MRI dataset was used for model validation and comparison with similar studies. To address the imbalance in the number of images among classes in the dataset, data augmentation techniques such as random rotation were applied during data preprocessing. Experiments revealed that the EfficientNetV2-M model outperformed other architectures with a 98.01% accuracy rate. Additionally, the study aimed to create a new model with more comprehensive feature extraction and generalization ability by combining the advantages of multiple models in different combinations. In this context, combinations of EfficientNetV2-M architecture with Inception-V3 and DenseNet-121 architectures were formed. Through these combinations, a concatenation-based EfficientNetV2-M + Inception-V3 model achieved a 98.41% accuracy value. It was observed that the proposed concatenation-based model outperformed advanced methods in improving medical imaging techniques and patient outcomes.
computer science, artificial intelligence
What problem does this paper attempt to address?