Abstract: Object detection in unmanned aerial vehicle (UAV) imagery is an important foundation for many research domains. However, UAV imagery poses unique challenges, including large image sizes, small objects, dense distributions, overlapping instances, and insufficient lighting, all of which reduce the effectiveness of object detection. In this article, we propose Drone-YOLO, a series of multi-scale UAV image object detection algorithms based on the YOLOv8 model, designed to overcome these challenges. To address large scene sizes and small objects, we improve the neck component of the YOLOv8 model. Specifically, we employ a three-layer PAFPN structure and incorporate a detection head tailored for small objects that operates on large-scale feature maps, significantly enhancing the algorithm’s capability to detect small targets. Furthermore, we integrate the sandwich-fusion module into each layer of the neck’s up–down branch. This fusion mechanism combines network features with low-level features, providing rich spatial information about the objects to the detection heads at different layers. We achieve this fusion using depthwise separable convolution, which balances parameter cost against a large receptive field. In the network backbone, we employ RepVGG modules as downsampling layers, which enhance the network’s ability to learn multi-scale features and outperform traditional convolutional layers. The proposed Drone-YOLO methods are evaluated in ablation experiments and compared with other state-of-the-art approaches on the VisDrone2019 dataset. The results demonstrate that Drone-YOLO (large) outperforms the other baseline methods in detection accuracy. Compared to YOLOv8, our method achieves a significant improvement in the mAP0.5 metric, with a 13.4% increase on VisDrone2019-test and a 17.40% increase on VisDrone2019-val.
Additionally, the parameter-efficient Drone-YOLO (tiny), with only 5.25 M parameters, performs on par with or better than the baseline method with 9.66 M parameters on the dataset. These experiments validate the effectiveness of the Drone-YOLO methods for object detection in drone imagery.
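
The abstract credits the depthwise separable operation in the sandwich-fusion module with balancing parameter cost against a large receptive field. The following is a minimal sketch of that trade-off, reading the operation as a depthwise separable convolution (an assumption) and counting weights only. The channel counts and kernel sizes are illustrative assumptions, not values reported for Drone-YOLO, and the helper names are hypothetical.

```python
def conv_params(c_in, c_out, k):
    """Weight count of a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in, c_out, k):
    """Weight count of a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    depthwise = c_in * k * k   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Illustrative sizes only; not taken from the paper.
    c_in, c_out = 256, 256
    small_standard = conv_params(c_in, c_out, 3)
    large_standard = conv_params(c_in, c_out, 7)
    large_separable = depthwise_separable_params(c_in, c_out, 7)
    print("standard 3x3:  ", small_standard)
    print("standard 7x7:  ", large_standard)
    print("separable 7x7: ", large_separable)
```

Under these assumed sizes, a 7 x 7 depthwise separable layer needs fewer weights than a standard 3 x 3 layer, which is one way to see how a large receptive field can be obtained at a modest parameter cost.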

Hierarchical Active Learning for Low-Altitude Drone-View Object Detection

DroneNet: Rescue Drone-View Object Detection

Hierarchical alignment network for domain adaptive object detection in aerial images

Active Learning for Single-Stage Object Detection in UAV Images

Efficient YOLOv7-Drone: An Enhanced Object Detection Approach for Drone Aerial Imagery

A Novel Tensor Decomposition-Based Efficient Detector for Low-Altitude Aerial Objects With Knowledge Distillation Scheme

Drone-YOLO: An Efficient Neural Network Method for Target Detection in Drone Images

A New Algorithm for Small Target Detection From the Perspective of Unmanned Aerial Vehicles

MUS-CDB: Mixed Uncertainty Sampling with Class Distribution Balancing for Active Annotation in Aerial Object Detection

Hybrid deep learning for object detection in drone imagery: a new metaheuristic based model

Towards Resolving the Challenge of Long-tail Distribution in UAV Images for Object Detection

A Small Object Detection Method for Drone-Captured Images Based on Improved YOLOv7

HAL3D: Hierarchical Active Learning for Fine-Grained 3D Part Labeling

Hybrid Convolutional-Transformer framework for drone-based few-shot weakly supervised object detection

Finding Nonrigid Tiny Person With Densely Cropped and Local Attention Object Detector Networks in Low-Altitude Aerial Images

Hybrid receptive field network for small object detection on drone view

AIOD-YOLO: an algorithm for object detection in low-altitude aerial images

YOLOD: A Target Detection Method for UAV Aerial Imagery

DTSSNet: Dynamic Training Sample Selection Network for UAV Object Detection

Learnable Cross-Scale Sparse Attention Guided Feature Fusion for UAV Object Detection