Abstract:Efficiently obtaining the up-to-date information in the disaster-stricken area is the key to successful disaster response. Unmanned aerial vehicles (UAVs), workers and cars can collaborate to accomplish sensing tasks, such as data collection, in disaster-stricken areas. In this paper, we explicitly address the route planning for a group of agents, including UAVs, workers, and cars, with the goal of maximizing the task completion rate. We propose MANF-RL-RP, a heterogeneous multi-agent route planning algorithm that incorporates several efficient designs, including global-local dual information processing and a tailored model structure for heterogeneous multi-agent systems. Global-local dual information processing encompasses the extraction and dissemination of spatial features from global information, as well as the partitioning and filtering of local information from individual agents. Regarding the construction of the model structure for heterogeneous multi-agent, we perform the following work. We design the same data structure to represent the states of different agents, prove the Markovian property of the decision-making process of agents to simplify the model structure, and also design a reasonable reward function to train the model. Finally, we conducted detailed experiments based on the rich simulation data. In comparison to the baseline algorithms, namely Greedy-SC-RP and MANF-DNN-RP, MANF-RL-RP has exhibited a significant improvement in terms of task completion rate.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to efficiently obtain the latest information in the disaster - stricken areas during disaster response. Specifically, it studies the path - planning problem of unmanned aerial vehicles (UAVs), workers, and cars in the collaborative completion of sensing tasks (such as data collection) to maximize the task completion rate. ### Problem Background After a disaster occurs, obtaining the latest information in the disaster - stricken areas in a timely manner is crucial for successful rescue operations. However, traditional mobile crowdsourcing sensing (MCS) has limitations in complex and dangerous disaster - stricken environments, mainly in the following aspects: 1. **Environmental Complexity**: The environment in the disaster - stricken area is complex, which restricts the mobility of participants. 2. **Sensing Capacity Requirements**: High - precision and specific sensing capabilities are required, while traditional MCS relies on participants and their mobile devices as basic sensing units, making it difficult to meet these requirements. 3. **Limitations of UAV Autonomous Operations**: - **Safety Hazards in Low - Altitude Environments**: UAVs face many safety issues when performing tasks in low - altitude environments and require precise manipulation. - **Availability of Charging Stations**: Existing research assumes that UAVs can autonomously navigate to charging stations for charging, but in fact, the availability of charging stations is limited, especially in urban environments after disasters. ### Solution To solve the above problems, this paper proposes a multi - agent reinforcement learning algorithm named MANF - RL - RP for the collaborative path - planning of UAVs, workers, and cars. The specific objectives are as follows: 1. **Maximize the Task Completion Rate**: By optimizing path - planning, ensure that as many sensing tasks as possible are completed. 2. **Overcome the Limitations of UAV Autonomous Operations**: - Workers are responsible for precisely operating UAVs at sensing task locations to make up for the deficiencies of UAVs' autonomous operations in low - altitude environments. - Cars are responsible for replacing UAV batteries at designated durable locations to ensure that UAVs have sufficient power to continue performing tasks. 3. **Improve Overall Efficiency**: Through global - local dual - information processing and reasonable model structure design, simplify the model and improve training efficiency. ### Main Contributions 1. **First Research**: This is the first research on the collaborative path - planning problem of UAVs, workers, and cars in disaster response, and it has been proven that this problem is NP - Hard. 2. **Propose a New Algorithm**: Propose a heterogeneous multi - agent path - planning algorithm MANF - RL - RP based on multi - agent reinforcement learning. 3. **Experimental Verification**: Detailed experiments have been carried out with rich simulation data, and the results show that MANF - RL - RP has a significant improvement in the task completion rate compared with baseline algorithms (such as Greedy - SC - RP and MANF - DNN - RP). ### Mathematical Formula Representation To ensure the correctness and readability of the formulas, the following are the key formulas involved in the paper: - **Relationship between UAV Remaining Power and Urgency**: \[ urget_i=\frac{1}{e^{\lfloor uPowt_i / uCsp_i\rfloor}} \] where \(uPowt_i\) represents the remaining power of UAV \(i\) at time \(t\), \(uCsp_i\) represents the energy consumption of UAV \(i\) at each time step, and \(\lfloor x\rfloor\) represents the floor function. - **UAV Power Change**: \[ uPowt_{i + 1}= \begin{cases} 1, & \text{if } uLoct_{i+1}=cLoct_{k + 1}\\ uPowt_i - uCsp_i, & \text{if } uLoct_{i+1}\neq cLoct_{k + 1}\text{ and } uPowt_i\geq uCsp_i\\ uPowt_i, & \text{if } uLoct_{i+1}\neq cLoct_{k + 1}\text{ and } uPowt_i < uCsp_i \end{cases} \] Through these formulas, the paper describes in detail the UAVs, workers...

Collaborative Route Planning of UAVs, Workers and Cars for Crowdsensing in Disaster Response

Collaborative Route Planning of UAVs, Workers, and Cars for Crowdsensing in Disaster Response

Deep Reinforcement Learning Enabled Multi-UAV Scheduling for Disaster Data Collection With Time-Varying Value

Multi-UAVs Collaborative Path Planning in the Cramped Environment

3M-RL: Multi-Resolution, Multi-Agent, Mean-Field Reinforcement Learning for Autonomous UAV Routing

Multi-UAV Coverage Planning with Limited Endurance in Disaster Environment

Multiservice UAVs for Emergency Tasks in Post-disaster Scenarios

Adaptive Resource Allocation for Emergency Communications with Unmanned Aerial Vehicle-Assisted Free Space Optical/Radio Frequency Relay System

Multi-UAV Disaster Environment Coverage Planning with Limited-Endurance

Coordinating Disaster Emergency Response with Heuristic Reinforcement Learning

Scheduling of emergency tasks for multiservice UAVs in post-disaster scenarios

TRACT: Towards Large-Scale Crowdsensing with High-Efficiency Swarm Path Planning.

Unmanned aerial vehicle–human collaboration route planning for intelligent infrastructure inspection

A Path Planning Method for Collaborative Coverage Monitoring in Urban Scenarios

Multi-robot task allocation in disaster response: Addressing dynamic tasks with deadlines and robots with range and payload constraints

Management of a post-disaster emergency scenario through unmanned aerial vehicles: Multi-Depot Multi-Trip Vehicle Routing with Total Completion Time Minimization

Dense Multi-Agent Reinforcement Learning Aided Multi-UAV Information Coverage for Vehicular Networks

Multiobjective Emergency Resource Allocation under the Natural Disaster Chain with Path Planning

Multiple UAVs collaborative traffic monitoring with intention-based communication

Multi-UAVs cooperative task assignment and path planning scheme

Multi-UAV Optimal Mission Assignment and Path Planning for Disaster Rescue Using Adaptive Genetic Algorithm and Improved Artificial Bee Colony Method