YOLO-HAMFF: A UAV detection model based on the hybrid attention and multi-level feature fusion for the YOLOv8 model.

Yue Zhou, Yutong Jiang, Zhonglin Yang, Yang Yang, Ying Wang
DOI: https://doi.org/10.1145/3655755.3655766
2024-01-01
Abstract: Unmanned Aerial Vehicles (UAVs) have become increasingly prevalent in applications ranging from surveillance and monitoring to infrastructure inspection. Accurate and efficient detection of UAVs is critical for ensuring security and safety in both civilian and military domains. However, detecting UAVs is challenging because the targets are small, move rapidly, and appear against complex image backgrounds. To address these problems, this paper presents a novel detection method that improves UAV detection performance based on the You Only Look Once (YOLO) framework. First, we propose a hybrid attention mechanism, which uses hybrid dilated convolution to recover the receptive field and combines spatial and channel attention mechanisms to improve feature capture under complex image backgrounds. Second, we introduce a multi-level fusion model into the YOLO detection framework, augmenting the detection neck to express multi-scale features so that image details are recovered precisely, particularly for small UAVs. Experimental results show that, compared with the state-of-the-art YOLOv8 model, the proposed method effectively improves UAV detection performance.
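The role of hybrid dilated convolution in widening the receptive field can be illustrated with a small calculation. The sketch below is not the paper's implementation: the abstract does not give kernel sizes or dilation rates, so the 3x3 kernel and the rate sets (1, 1, 1), (2, 2, 2), and (1, 2, 5) are assumptions chosen for illustration, and the helper names `receptive_field` and `covered_offsets` are hypothetical. It compares, along one axis, how wide a stack of stride-1 dilated convolutions sees and how many input pixels inside that span actually contribute.

```python
from itertools import product


def receptive_field(kernel_size, dilations):
    """Span (in input pixels, along one axis) seen by one output pixel
    after stacking stride-1 dilated convolutions."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


def covered_offsets(kernel_size, dilations):
    """Input offsets that actually contribute to one output pixel (1-D view).

    Assumes an odd kernel size, so the taps are symmetric around the centre.
    """
    half = kernel_size // 2
    # Tap positions of each layer, scaled by that layer's dilation rate.
    taps = [[t * d for t in range(-half, half + 1)] for d in dilations]
    # Every combination of one tap per layer reaches one input offset.
    return sorted({sum(combo) for combo in product(*taps)})


# Illustrative rate sets (assumed, not taken from the paper).
for rates in [(1, 1, 1), (2, 2, 2), (1, 2, 5)]:
    rf = receptive_field(3, rates)
    hit = len(covered_offsets(3, rates))
    print(rates, "receptive field:", rf, "pixels sampled:", hit)
```

Under these assumptions, a constant rate of 2 widens the span to 13 pixels but samples only 7 of them, leaving gaps, whereas the mixed rates (1, 2, 5) reach a span of 17 pixels and sample all 17. Mixed rates of this kind are one common way to enlarge the receptive field without skipping pixels, which matters when the target occupies only a few pixels.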