Abstract:Object detection in unmanned aerial vehicle (UAV) imagery is a meaningful foundation in various research domains. However, UAV imagery poses unique challenges, including large image sizes, small sizes detection objects, dense distribution, overlapping instances, and insufficient lighting impacting the effectiveness of object detection. In this article, we propose Drone-YOLO, a series of multi-scale UAV image object detection algorithms based on the YOLOv8 model, designed to overcome the specific challenges associated with UAV image object detection. To address the issues of large scene sizes and small detection objects, we introduce improvements to the neck component of the YOLOv8 model. Specifically, we employ a three-layer PAFPN structure and incorporate a detection head tailored for small-sized objects using large-scale feature maps, significantly enhancing the algorithm’s capability to detect small-sized targets. Furthermore, we integrate the sandwich-fusion module into each layer of the neck’s up–down branch. This fusion mechanism combines network features with low-level features, providing rich spatial information about the objects at different layer detection heads. We achieve this fusion using depthwise separable evolution, which balances parameter costs and a large receptive field. In the network backbone, we employ RepVGG modules as downsampling layers, enhancing the network’s ability to learn multi-scale features and outperforming traditional convolutional layers. The proposed Drone-YOLO methods have been evaluated in ablation experiments and compared with other state-of-the-art approaches on the VisDrone2019 dataset. The results demonstrate that our Drone-YOLO (large) outperforms other baseline methods in the accuracy of object detection. Compared to YOLOv8, our method achieves a significant improvement in mAP0.5 metrics, with a 13.4% increase on the VisDrone2019-test and a 17.40% increase on the VisDrone2019-val. Additionally, the parameter-efficient Drone-YOLO (tiny) with only 5.25 M parameters performs equivalently or better than the baseline method with 9.66M parameters on the dataset. These experiments validate the effectiveness of the Drone-YOLO methods in the task of object detection in drone imagery.

FE-YOLOv5: Improved YOLOv5 Network for Multi-scale Drone-Captured Scene Detection

An Improved YOLOv5 Method for Small Object Detection in UAV Capture Scenes

FE-YOLOv5: Feature enhancement network based on YOLOv5 for small object detection

SMFF-YOLO: A Scale-Adaptive YOLO Algorithm with Multi-Level Feature Fusion for Object Detection in UAV Scenes

Drone-YOLO: An Efficient Neural Network Method for Target Detection in Drone Images

Small Target Detection Algorithm for UAV Based on Improved YOLOv5

Aerial images object detection method based on cross-scale multi-feature fusion

Target Detection Method of UAV Aerial Imagery Based on Improved YOLOv5

AFE-YOLOv8: A Novel Object Detection Model for Unmanned Aerial Vehicle Scenes with Adaptive Feature Enhancement

UN-YOLOv5s: A UAV-Based Aerial Photography Detection Algorithm

DetailCaptureYOLO: Accurately Detecting Small Targets in UAV Aerial Images

Small Target-YOLOv5: Enhancing the Algorithm for Small Object Detection in Drone Aerial Imagery Based on YOLOv5

A novel small object detection algorithm for UAVs based on YOLOv5

An Aerial Image Detection Algorithm Based on Improved YOLOv5

SF-YOLOv5: Improved YOLOv5 with swin transformer and fusion-concat method for multi-UAV detection

EA‐YOLO: An Efficient and Accurate UAV Image Object Detection Algorithm

OD-YOLO: Robust Small Object Detection Model in Remote Sensing Image with a Novel Multi-Scale Feature Fusion

SL-YOLO: A Stronger and Lighter Drone Target Detection Model

Lightweight unmanned aerial vehicle object detection algorithm based on improved YOLOv8

YOLOv5_mamba: unmanned aerial vehicle object detection based on bidirectional dense feedback network and adaptive gate feature fusion