HeMoDU: High-Efficiency Multi-Object Detection Algorithm for Unmanned Aerial Vehicles on Urban Roads

Hanyi Shi, Ningzhi Wang, Xinyao Xu, Yue Qian, Lingbin Zeng, Yi Zhu
DOI: https://doi.org/10.3390/s24134045
IF: 3.9
2024-06-21
Sensors
Abstract: Unmanned aerial vehicle (UAV)-based object detection methods are widely used in traffic detection due to their high flexibility and extensive coverage. In recent years, with the increasing complexity of the urban road environment, UAV object detection algorithms based on deep learning have gradually become a research hotspot. However, how to further improve algorithmic efficiency in response to the numerous and rapidly changing road elements, and thus achieve high-speed and accurate road object detection, remains a challenging issue. Given this context, this paper proposes the high-efficiency multi-object detection algorithm for UAVs (HeMoDU). HeMoDU reconstructs a state-of-the-art, deep-learning-based object detection model and optimizes several aspects to improve computational efficiency and detection accuracy. To validate the performance of HeMoDU in urban road environments, this paper uses the public urban road datasets VisDrone2019 and UA-DETRAC for evaluation. The experimental results show that the HeMoDU model effectively improves the speed and accuracy of UAV object detection.
engineering, electrical & electronic; chemistry, analytical; instruments & instrumentation
What problem does this paper attempt to address?
The paper addresses multi-object detection for Unmanned Aerial Vehicles (UAVs) in urban road environments and proposes an efficient multi-object detection algorithm (HeMoDU). Specifically, it tackles the following key issues:

1. **Improving Detection Efficiency and Accuracy**: As urban environments grow more complex, making UAV detection algorithms efficient enough to handle numerous and rapidly changing road elements, while keeping detection fast and accurate, remains a challenge.
2. **Improving Existing Models**: The paper builds on YOLOv8, a current state-of-the-art deep-learning-based object detection model, and adapts it to multi-object detection in urban road environments.
3. **Introducing Innovative Technologies**: To improve detection speed and accuracy, the paper proposes several techniques:
   - Introducing the concept of visual state space from the VMamba model, using a 2D selective scanning mechanism to obtain a global receptive field over the image and thereby extract deeper image features.
   - Using the VoV-GSCSP module from the Slim-Neck model to optimize the backbone network, and employing mixed convolution (GSConv) to fully utilize inter-channel connections, handling combined high-level features at a lower computational cost and enhancing small object detection accuracy.
   - Using the Programmable Gradient Information (PGI) framework to further enhance the model's inference efficiency.

With these methods, the HeMoDU model improves object detection performance while maintaining computational efficiency. The experimental section evaluates HeMoDU on the publicly available urban road datasets VisDrone2019 and UA-DETRAC to verify its effectiveness in multi-object detection in urban road environments.
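
The 2D selective scanning idea mentioned above can be illustrated with a small sketch. It assumes the four scan routes usually described for VMamba (row-wise, column-wise, and the reverse of each) and shows only how a 2D grid is unfolded into 1D sequences and merged back. The selective state-space model that would process each sequence is omitted, and the names `cross_scan` and `cross_merge` are hypothetical, not taken from the HeMoDU paper or its code.

```python
from typing import List, Sequence


def cross_scan(grid: Sequence[Sequence[float]]) -> List[List[float]]:
    """Unfold a 2D grid into four 1D sequences, one per scan route (assumed routes)."""
    height = len(grid)
    width = len(grid[0]) if height else 0
    # Route 1: left-to-right, top-to-bottom.
    row_major = [grid[r][c] for r in range(height) for c in range(width)]
    # Route 2: top-to-bottom, left-to-right.
    col_major = [grid[r][c] for c in range(width) for r in range(height)]
    # Routes 3 and 4: the same two traversals in reverse order.
    return [row_major, col_major, row_major[::-1], col_major[::-1]]


def cross_merge(
    sequences: Sequence[Sequence[float]], height: int, width: int
) -> List[List[float]]:
    """Map the four sequences back onto the grid and sum them per cell."""
    row_major, col_major, row_rev, col_rev = sequences
    # Undo the reversal so indices line up with the forward routes.
    row_back = row_rev[::-1]
    col_back = col_rev[::-1]
    merged = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            merged[r][c] = (
                row_major[r * width + c]
                + col_major[c * height + r]
                + row_back[r * width + c]
                + col_back[c * height + r]
            )
    return merged


if __name__ == "__main__":
    patches = [[1, 2, 3], [4, 5, 6]]
    routes = cross_scan(patches)
    for route in routes:
        print(route)
    # With no per-sequence processing, each cell is simply counted four times.
    print(cross_merge(routes, height=2, width=3))
```

Because every position appears in all four sequences, a sequence model applied along each route can pass information to a cell from every direction, which is the intuition behind the global receptive field described above.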