BGF-YOLO: Enhanced YOLOv8 with Multiscale Attentional Feature Fusion for Brain Tumor Detection

Ming Kang,Chee-Ming Ting,Fung Fung Ting,Raphaël C.-W. Phan
DOI: https://doi.org/10.1007/978-3-031-72111-3_4
2024-10-13
Abstract:You Only Look Once (YOLO)-based object detectors have shown remarkable accuracy for automated brain tumor detection. In this paper, we develop a novel BGF-YOLO architecture by incorporating Bi-level routing attention, Generalized feature pyramid networks, and Fourth detecting head into YOLOv8. BGF-YOLO contains an attention mechanism to focus more on important features, and feature pyramid networks to enrich feature representation by merging high-level semantic features with spatial details. Furthermore, we investigate the effect of different attention mechanisms and feature fusions, detection head architectures on brain tumor detection accuracy. Experimental results show that BGF-YOLO gives a 4.7% absolute increase of mAP$_{50}$ compared to YOLOv8x, and achieves state-of-the-art on the brain tumor detection dataset Br35H. The code is available at <a class="link-external link-https" href="https://github.com/mkang315/BGF-YOLO" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition,Signal Processing,Applications
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? The main goal of this paper is to improve the accuracy of brain tumor detection by enhancing the YOLOv8 architecture. Specifically, the authors propose a new model called BGF-YOLO, which integrates the following techniques into the original YOLOv8: 1. **Bi-level Routing Attention (BRA)**: Used for dynamic sparse attention mechanisms, focusing on more important features and reducing feature redundancy. 2. **Generalized Feature Pyramid Networks (GFPN)**: Used for multi-scale feature fusion, enhancing the fusion effect of different levels of features. 3. **Addition of a fourth detection head**: Enriches the scale of anchor boxes and optimizes regression loss to better detect targets of different sizes. With these improvements, BGF-YOLO achieves better performance in brain tumor detection tasks, particularly in terms of Precision, mean Average Precision (mAP 50), and mean Average Precision (mAP 50:95), compared to YOLOv8 and other variants. Experimental results show that on the Br35H dataset, BGF-YOLO has an absolute improvement of 4.7% in mAP 50 over YOLOv8x, making it the current state-of-the-art model on this dataset.