Abstract: Unmanned Aerial Vehicle (UAV) aerial sensors are an important means of collecting ground image data. Road segmentation of drivable areas and vehicle detection in UAV aerial images can be applied to road monitoring, traffic flow detection, traffic management, etc. They can also be integrated with intelligent transportation systems to support the related work of transportation departments. Existing algorithms realize only a single task, whereas intelligent transportation requires the simultaneous processing of multiple tasks, so these algorithms cannot meet complex practical needs. Moreover, UAV aerial images are characterized by variable road scenes, a large number of small targets, and dense vehicles, which make the tasks difficult to complete. In response to these issues, we propose to implement road segmentation and on-road vehicle detection in the same framework for UAV aerial images, and we conduct experiments on a self-constructed dataset based on the DroneVehicle dataset. For road segmentation, we propose a new algorithm, C-DeepLabV3+. The new algorithm introduces the coordinate attention (CA) module, which obtains more accurate location information for the segmentation target and makes the target edges more continuous. The improved algorithm also introduces a cascade feature fusion module to prevent the loss of detail information in road segmentation and to obtain better segmentation performance. For vehicle detection, we propose an improved algorithm, S-YOLOv5, which adds the parameter-free lightweight attention module SimAM. Finally, the proposed road segmentation–vehicle detection framework unites the C-DeepLabV3+ and S-YOLOv5 algorithms to carry out the two tasks in series.
The experimental results show that on the constructed ViDroneVehicle dataset, the C-DeepLabV3+ algorithm achieves an mPA of 98.75% and an mIoU of 97.53%, segmenting the road area better and addressing the problem of occlusion. The S-YOLOv5 algorithm achieves an mAP of 97.40%, higher than YOLOv5's 96.95%, and effectively reduces the rates of missed and false vehicle detections. By comparison, the results of both algorithms are superior to those of multiple state-of-the-art methods. The overall framework proposed in this paper has superior performance and is capable of high-quality, high-precision road segmentation and vehicle detection from UAV aerial images.
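
The mPA and mIoU figures quoted above are segmentation metrics that are commonly derived from a pixel-level confusion matrix. The following is a minimal sketch of that common definition, not the authors' evaluation code; the function names, the two-class (background/road) labelling, and the toy masks are assumptions made for illustration.

```python
def confusion_matrix(truth, pred, num_classes):
    """cm[t][p] counts pixels whose true class is t and predicted class is p."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for truth_row, pred_row in zip(truth, pred):
        for t, p in zip(truth_row, pred_row):
            cm[t][p] += 1
    return cm


def mean_pixel_accuracy(cm):
    """Average over classes of (correct pixels of the class / true pixels of the class)."""
    per_class = [cm[i][i] / sum(cm[i]) for i in range(len(cm)) if sum(cm[i]) > 0]
    return sum(per_class) / len(per_class)


def mean_iou(cm):
    """Average over classes of intersection / union of true and predicted regions."""
    k = len(cm)
    ious = []
    for i in range(k):
        intersection = cm[i][i]
        true_pixels = sum(cm[i])                       # row sum
        pred_pixels = sum(cm[j][i] for j in range(k))  # column sum
        union = true_pixels + pred_pixels - intersection
        if union > 0:
            ious.append(intersection / union)
    return sum(ious) / len(ious)


# Toy 2x4 masks (assumed labels: 0 = background, 1 = road).
truth = [[0, 0, 1, 1],
         [0, 1, 1, 1]]
pred = [[0, 0, 1, 1],
        [0, 0, 1, 1]]

cm = confusion_matrix(truth, pred, num_classes=2)
mpa = mean_pixel_accuracy(cm)
miou = mean_iou(cm)
print(f"mPA={mpa:.3f}, mIoU={miou:.3f}")  # prints: mPA=0.900, mIoU=0.775
```

In the toy example one road pixel is mislabelled as background, which lowers the road-class accuracy to 4/5 and both class IoUs below 1.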

What problem does this paper attempt to address?

### Problems Addressed by the Paper

This paper primarily proposes a new network framework for road segmentation and vehicle detection tasks in UAV (Unmanned Aerial Vehicle) aerial images. Specifically:

1. **Road Segmentation Problem**:
   - Existing road segmentation algorithms face issues such as unclear boundaries, inaccurate results, and insensitivity to occlusion when dealing with complex scenes.
   - To address this, the authors propose a new road segmentation algorithm, C-DeepLabV3+, which introduces a Coordinate Attention Module to obtain more precise target location information and make the edges of the segmented targets more continuous. Additionally, the improved algorithm incorporates a cascade feature fusion module to prevent the loss of detailed information during road segmentation, thereby achieving better segmentation performance.
2. **Vehicle Detection Problem**:
   - In UAV aerial images, the large number of small targets and the high target density are difficult for the existing YOLOv5 algorithm to handle.
   - To address this, the authors propose an improved vehicle detection algorithm, S-YOLOv5, which adds a parameter-free lightweight attention module, SimAM, to YOLOv5 to enhance vehicle detection accuracy in complex environments.

By integrating these two improved algorithms, the paper constructs a road segmentation-vehicle detection framework capable of achieving high-quality and high-precision road segmentation and vehicle detection tasks. Experimental results show that this framework outperforms various existing methods on the self-built dataset ViDroneVehicle.
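
To make "parameter-free attention" more concrete, here is a minimal pure-Python sketch of SimAM-style weighting on a single feature-map channel, following the energy-based formulation generally associated with SimAM. It is not the S-YOLOv5 implementation: the summary does not say where the module sits in the network, and the function names, the regularizer value `e_lambda=1e-4`, and the toy channel are assumptions made for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def simam_weights(channel, e_lambda=1e-4):
    """Per-position attention weights for one H x W channel (list of lists).

    Positions that deviate more from the channel mean receive larger weights.
    No learnable parameters are involved; e_lambda is only a small regularizer
    (assumed value).
    """
    h, w = len(channel), len(channel[0])
    n = h * w - 1  # requires at least two positions in the channel
    mean = sum(sum(row) for row in channel) / (h * w)
    sq_dev = [[(x - mean) ** 2 for x in row] for row in channel]
    variance = sum(sum(row) for row in sq_dev) / n
    return [[sigmoid(d / (4 * (variance + e_lambda)) + 0.5) for d in row]
            for row in sq_dev]


def apply_simam(channel, e_lambda=1e-4):
    """Rescale each activation by its attention weight."""
    weights = simam_weights(channel, e_lambda)
    return [[x * a for x, a in zip(x_row, a_row)]
            for x_row, a_row in zip(channel, weights)]


# Toy channel: a single strong activation on a flat background,
# loosely standing in for a small target.
channel = [[1.0, 1.0, 1.0],
           [1.0, 5.0, 1.0],
           [1.0, 1.0, 1.0]]

weights = simam_weights(channel)
refined = apply_simam(channel)
print(weights[1][1] > weights[0][0])  # the outlier position is weighted more
```

Because the weights come from channel statistics alone, a module of this kind adds no trainable parameters, which is consistent with the "parameter-free lightweight" description above.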

A Novel Network Framework on Simultaneous Road Segmentation and Vehicle Detection for UAV Aerial Traffic Images
