Abstract:As Unmanned Aerial Vehicle (UAV) remote sensing technology progresses, the utilization of deep learning in UAV imagery object detection has become more prevalent. However, detecting small targets in complex backgrounds and distinguishing dense targets remains a major challenge. To address these issues and improve object detection efficiency, this study proposes an UAV imagery object detection method called YOLO-UAV by optimizing YOLOv5. YOLO-UAV first reconstructs the backbone and feature fusion networks by simplifying the network structure and reducing computational burden. The employment of a Dense_CSPDarknet53 backbone network, fashioned via the incorporation of dense connections, facilitates the extraction of latent image information through the recurrent utilization of features. In the Neck structure, an efficient feature fusion block with structural re-parameterization and ELAN strategies is integrated to effectively reduce interference from complex background noise while extracting more accurate and rich features. In addition, by proposing GS-Decoupled Head, this approach diminishes the parameter count of the decoupled head without compromising accuracy. It also separates classification tasks from regression tasks to lessen the influence of task disparities on prediction bias. To tackle the discrepancy between positive and negative samples in bounding box regression tasks, this study introduces a new loss function, Focal-ECIoU, capable of expediting network convergence and improve model positioning ability. Experimental findings from the public VisDrone2019 dataset indicate that YOLO-UAV outperforms other advanced object detection methods in comprehensive performance. Compared with the baseline model YOLOv5s, YOLO-UAV increased mAP0.5 from 35.1% to 46.7%, while mAP0.5:0.95 increased from 19.1% to 27.4%. For small-scale targets, AP $_{small}$ increased from 10.2% to 17.3%. The experiment proves that YOLO-UAV performs well in improving object detection accuracy and has strong generalization ability, satisfying the practical requirements of UAV imagery object detection tasks.

Residual Spatial Reduced Transformer Based on YOLOv5 for UAV Images Object Detection

YOLO-UAV: Object Detection Method of Unmanned Aerial Vehicle Imagery Based on Efficient Multi-Scale Feature Fusion

Small object detection based on YOLOv8 in UAV perspective

URS-YOLOv5s: Object Detection Algorithm for UAV Remote Sensing Images

A novel small object detection algorithm for UAVs based on YOLOv5

Target Detection Method of UAV Aerial Imagery Based on Improved YOLOv5

Small object detection in UAV image based on improved YOLOv5

An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion

A Modified YOLOv5 for Object Detection in UAV-captured Scenarios

Small Target Detection Algorithm for UAV Based on Improved YOLOv5

Small Object Detection in UAV Images Based on YOLOv8n

Research on Object Detection and Recognition Method for UAV Aerial Images Based on Improved YOLOv5

YOLO-Drone: An Optimized YOLOv8 Network for Tiny UAV Object Detection

ARF-YOLOv8: a novel real-time object detection model for UAV-captured images detection

An Improved YOLOv5 Method for Small Object Detection in UAV Capture Scenes

YOLO-ERF: Lightweight Object Detector for UAV Aerial Images

Lightweight and Efficient Tiny-Object Detection Based on Improved YOLOv8n for UAV Aerial Images

DB-YOLOv5: A UAV Object Detection Model Based on Dual Backbone Network for Security Surveillance

UAV-YOLO: Small Object Detection on Unmanned Aerial Vehicle Perspective

SF-YOLO: RGB-T Fusion Object Detection in UAV Scenes