Abstract:In contemporary society, the pervasive integration of Unmanned Aerial Vehicles (UAVs) in everyday activities is notable. Object detection emerges as a pivotal task within the UAV operational context. However, challenges such as the presence of expansive backgrounds in UAV images, insufficient target pixel resolution, and the prevalence of image interferences contribute to the diminished accuracy observed in existing object detection models tailored for UAV aerial imagery. Conventional strategies employed to enhance accuracy often incur exorbitant computational costs, failing to strike a harmonious balance between precision improvement and computational resource utilization. To address these challenges, this paper introduces an optimized variant of YOLOv8, denoted as OB-YOLO, specifically tailored for UAV aerial photography scenarios. The proposed model exhibits enhanced accuracy while concurrently mitigating parameter and floating-point operation costs. Particularly, the BiFPN concept is incorporated to fortify the feature fusion process, enabling comprehensive consideration and reuse of multi-scale feature fusion within the model. Additionally, the study integrates the full-dimensional dynamic convolution (ODConv) structure to replace the ordinary convolution within residual networks in the C2f module of the backbone network. This augmentation not only enhances the model’s feature extraction capabilities but also significantly reduces both the number of model parameters and computational workload through the parallel implementation of ODConv, coupled with the simultaneous introduction of the multidimensional attention mechanism. Furthermore, InnerIoU is employed for computing Intersection over Union (IoU) loss using auxiliary edges, and MPDIoU is integrated to expedite convergence speed and enhance accuracy. The confluence of these methodologies, enriched by the incorporation of the minimum point distance in MPDIoU, collectively contributes to superior detection performance. The proposed algorithm is systematically compared and evaluated on the extensively utilized VisDrone2019 dataset. The results demonstrate that OB-YOLO surpasses the YOLOv8 baseline model by 4.8% on the VisDrone2019-DET dataset, showcasing improved performance while concurrently reducing both network parameter and floating-point calculations.

OB-YOLO: A UAV Image Detection Model for Reducing Computational Resource Consumption

YOLOv7-P: a lighter and more effective UAV aerial photography object detection algorithm

Drone-YOLO: An Efficient Neural Network Method for Target Detection in Drone Images

Small Object Detection in UAV Images Based on YOLOv8n

Lightweight Object Detection Algorithm for UAV Aerial Imagery

Lightweight and Efficient Tiny-Object Detection Based on Improved YOLOv8n for UAV Aerial Images

Lightweight unmanned aerial vehicle object detection algorithm based on improved YOLOv8

UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios

Efficient YOLOv7-Drone: An Enhanced Object Detection Approach for Drone Aerial Imagery

YOLO-Drone: An Optimized YOLOv8 Network for Tiny UAV Object Detection

YOLOv7-UAV: An Unmanned Aerial Vehicle Image Object Detection Algorithm Based on Improved YOLOv7

SOD-YOLO: Small-Object-Detection Algorithm Based on Improved YOLOv8 for UAV Images

YOLOD: A Target Detection Method for UAV Aerial Imagery

YOLO-ERF: Lightweight Object Detector for UAV Aerial Images

LES-YOLO: Efficient Object Detection Algorithm Used on UAV for Traffic Monitoring

Target Detection Method of UAV Aerial Imagery Based on Improved YOLOv5

YOLO-Q: Drone Aerial Target Detection

Lightweight Low-Altitude UAV Object Detection Based on Improved YOLOv5s

A Modified YOLOv8 Detection Network for UAV Aerial Image Recognition