A Novel Network Framework on Simultaneous Road Segmentation and Vehicle Detection for UAV Aerial Traffic Images

Min Xiao, Wei Min, Congmao Yang, Yongchao Song
DOI: https://doi.org/10.3390/s24113606
IF: 3.9
2024-06-04
Sensors
Abstract: Unmanned Aerial Vehicle (UAV) aerial sensors are an important means of collecting ground image data. Road segmentation and vehicle detection in the drivable areas of UAV aerial images can be applied to road monitoring, traffic flow detection, traffic management, and similar tasks, and can be integrated with intelligent transportation systems to support the work of transportation departments. Existing algorithms realize only a single task and therefore cannot meet complex practical needs, because intelligent transportation requires several tasks to be processed simultaneously. In addition, UAV aerial images feature variable road scenes, a large number of small targets, and dense vehicles, all of which make these tasks difficult. In response to these issues, we propose to implement road segmentation and on-road vehicle detection in the same framework for UAV aerial images, and we conduct experiments on a self-constructed dataset based on the DroneVehicle dataset. For road segmentation, we propose a new algorithm, C-DeepLabV3+. It introduces the coordinate attention (CA) module, which obtains more accurate location information for the segmentation target and makes the target edges more continuous. It also introduces a cascade feature fusion module to prevent the loss of detail information in road segmentation and to obtain better segmentation performance. For vehicle detection, we propose an improved algorithm, S-YOLOv5, which adds the parameter-free lightweight attention module SimAM. Finally, the proposed road segmentation–vehicle detection framework unites the C-DeepLabV3+ and S-YOLOv5 algorithms to carry out the two tasks serially.
The experimental results show that, on the constructed ViDroneVehicle dataset, the C-DeepLabV3+ algorithm achieves an mPA of 98.75% and an mIoU of 97.53%, segmenting the road area more accurately and handling occlusion. The S-YOLOv5 algorithm achieves an mAP of 97.40%, compared with 96.95% for YOLOv5, and effectively reduces the vehicle omission and false detection rates. By comparison, the results of both algorithms are superior to those of multiple state-of-the-art methods. The overall framework proposed in this paper has superior performance and is capable of high-quality, high-precision road segmentation and vehicle detection from UAV aerial images.
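For readers unfamiliar with the segmentation metrics quoted above, the following is a minimal sketch of how mPA (mean pixel accuracy) and mIoU (mean intersection over union) are conventionally computed from a pixel-level confusion matrix. It uses the standard definitions, not the authors' evaluation code; the function names and the toy 2x4 label maps are assumptions made for illustration, and the sketch assumes every class occurs at least once so that no division by zero arises.

```python
import numpy as np

def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predicted classes."""
    idx = y_true.ravel() * num_classes + y_pred.ravel()
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def mean_pixel_accuracy(cm):
    """Average, over classes, of correct pixels / ground-truth pixels of that class."""
    per_class = np.diag(cm) / cm.sum(axis=1)
    return per_class.mean()

def mean_iou(cm):
    """Average, over classes, of intersection / union of prediction and ground truth."""
    tp = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    return (tp / union).mean()

# Toy example (hypothetical data): class 0 = background, class 1 = road.
truth = np.array([[0, 0, 1, 1],
                  [0, 0, 1, 1]])
pred = np.array([[0, 1, 1, 1],
                 [0, 0, 1, 1]])

cm = confusion_matrix(truth, pred, num_classes=2)
print(cm)                       # [[3 1], [0 4]]
print(mean_pixel_accuracy(cm))  # 0.875
print(mean_iou(cm))             # 0.775
```

In the toy case one background pixel is mislabelled as road, so the background class scores 3/4 on both metrics while the road class scores 4/4 pixel accuracy but only 4/5 IoU, which is why mIoU is usually the stricter of the two numbers.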
Engineering, Electrical & Electronic; Instruments & Instrumentation; Chemistry, Analytical
What problem does this paper attempt to address?
### Problems Addressed by the Paper

This paper proposes a new network framework for road segmentation and vehicle detection in UAV (Unmanned Aerial Vehicle) aerial images. Specifically:

1. **Road Segmentation Problem**:
   - Existing road segmentation algorithms suffer from unclear boundaries, inaccurate results, and insensitivity to occlusion in complex scenes.
   - To address this, the authors propose a new road segmentation algorithm, C-DeepLabV3+, which introduces a Coordinate Attention Module to obtain more precise target location information and make the edges of the segmented targets more continuous. The improved algorithm also incorporates a cascade feature fusion module to prevent the loss of detail information during road segmentation, thereby achieving better segmentation performance.
2. **Vehicle Detection Problem**:
   - In UAV aerial images, the large number of small targets and the high target density are difficult for the existing YOLOv5 algorithm to handle.
   - To address this, the authors propose an improved vehicle detection algorithm, S-YOLOv5, which adds a parameter-free lightweight attention module, SimAM, to YOLOv5 to enhance vehicle detection accuracy in complex environments.

By integrating these two improved algorithms, the paper constructs a road segmentation-vehicle detection framework capable of high-quality, high-precision road segmentation and vehicle detection. Experimental results show that this framework outperforms various existing methods on the self-built ViDroneVehicle dataset.
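One way to picture the serial combination of the two stages is sketched below. The text above does not say how the segmentation output is coupled to the detector, so this is only an assumed illustration: a binary road mask (as a segmenter might produce) is used to keep the detected boxes that lie mostly on the road. The function name `on_road_detections`, the `min_overlap` threshold, and the toy mask and boxes are all hypothetical.

```python
import numpy as np

def on_road_detections(road_mask, boxes, min_overlap=0.5):
    """Keep boxes whose area is at least `min_overlap` covered by road pixels.

    road_mask: (H, W) boolean array, True where road is predicted.
    boxes: iterable of (x1, y1, x2, y2) integer pixel coordinates.
    NOTE: this coupling rule is an assumption, not the paper's method.
    """
    kept = []
    for x1, y1, x2, y2 in boxes:
        patch = road_mask[y1:y2, x1:x2]
        # patch.mean() is the fraction of the box covered by road pixels
        if patch.size and patch.mean() >= min_overlap:
            kept.append((x1, y1, x2, y2))
    return kept

# Toy scene (hypothetical): a vertical road strip in columns 3..6.
mask = np.zeros((10, 10), dtype=bool)
mask[:, 3:7] = True

boxes = [
    (3, 1, 6, 4),   # fully on the road
    (0, 0, 2, 2),   # fully off the road
    (6, 5, 10, 8),  # only one of four columns on the road
]

kept = on_road_detections(mask, boxes)
print(kept)  # [(3, 1, 6, 4)]
```

A real pipeline would take the mask from the segmentation network and the boxes from the detector; other couplings, such as cropping the image to the road region before detection, would fit the same serial description.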