Efficient Detection of UAV Image Based on Self-Attention and Global Feature Fusion

Jing Bai, Haiyang Hu, Chaoxi Su, Yunxia Zhang
DOI: https://doi.org/10.1109/cisat62382.2024.10695271
2024-01-01
Abstract: Unmanned aerial vehicle (UAV) image object detection has garnered significant attention in fields such as intelligent transportation, urban management, and agricultural monitoring. However, it faces key challenges, including deficiencies in multi-scale feature extraction and inaccuracies when processing complex scenes and small targets. To address these issues, we propose a novel UAV image object detection network, named SGF-Net, which is based on self-attention guidance and global feature fusion. First, to optimize feature extraction from a global perspective and enhance target localization precision, we introduce the global feature extraction module (GFEM). This module uses the self-attention mechanism to capture and integrate long-range dependencies within images. Second, we develop a normal distribution-based prior assigner (NDPA) that measures the resemblance between ground truth boxes and priors, thereby improving the precision of target position matching and addressing the inaccurate localization of small targets. Furthermore, we design an attention-guided ROI pooling module (ARPM) that uses a deep fusion strategy over multilevel features to optimize the integration of multi-scale features and improve the quality of feature representation. Finally, experimental results demonstrate the effectiveness of the proposed SGF-Net approach.
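The abstract does not say how NDPA computes the resemblance between ground truth and priors. As an illustration only, the sketch below shows one common way to compare boxes as normal distributions: each box (cx, cy, w, h) is treated as a 2D Gaussian with mean (cx, cy) and covariance diag(w²/4, h²/4), and two boxes are compared through the closed-form 2-Wasserstein distance between those Gaussians. The function names, the scale constant `c`, and the top-k assignment rule are assumptions made for this example, not details taken from the paper.

```python
import math

def gaussian_box_similarity(box_a, box_b, c=12.0):
    """Similarity in (0, 1] between two (cx, cy, w, h) boxes modeled as 2D Gaussians.

    `c` is a hypothetical scale constant; in practice it would be tuned
    to the typical object size of the dataset.
    """
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Squared 2-Wasserstein distance between N(mu_a, diag(wa^2/4, ha^2/4))
    # and N(mu_b, diag(wb^2/4, hb^2/4)).
    w2_sq = ((cxa - cxb) ** 2 + (cya - cyb) ** 2
             + ((wa - wb) / 2) ** 2 + ((ha - hb) / 2) ** 2)
    return math.exp(-math.sqrt(w2_sq) / c)

def assign_priors(gt_boxes, priors, top_k=2, c=12.0):
    """For each ground-truth box, return the indices of the top_k most similar priors."""
    assignments = []
    for gt in gt_boxes:
        ranked = sorted(
            range(len(priors)),
            key=lambda i: gaussian_box_similarity(gt, priors[i], c),
            reverse=True,
        )
        assignments.append(ranked[:top_k])
    return assignments
```

Unlike IoU, this kind of measure stays positive and varies smoothly even when a small ground-truth box and a prior do not overlap at all, which is the usual motivation for distribution-based matching on small targets.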