Abstract:Harvesting data from distributed Internet of Things (IoT) devices with multiple autonomous unmanned aerial vehicles (UAVs) is a challenging problem requiring flexible path planning methods. We propose a multi-agent reinforcement learning (MARL) approach that, in contrast to previous work, can adapt to profound changes in the scenario parameters defining the data harvesting mission, such as the number of deployed UAVs, number, position and data amount of IoT devices, or the maximum flying time, without the need to perform expensive recomputations or relearn control policies. We formulate the path planning problem for a cooperative, non-communicating, and homogeneous team of UAVs tasked with maximizing collected data from distributed IoT sensor nodes subject to flying time and collision avoidance constraints. The path planning problem is translated into a decentralized partially observable Markov decision process (Dec-POMDP), which we solve through a deep reinforcement learning (DRL) approach, approximating the optimal UAV control policy without prior knowledge of the challenging wireless channel characteristics in dense urban environments. By exploiting a combination of centered global and local map representations of the environment that are fed into convolutional layers of the agents, we show that our proposed network architecture enables the agents to cooperate effectively by carefully dividing the data collection task among themselves, adapt to large complex environments and state spaces, and make movement decisions that balance data collection goals, flight-time efficiency, and navigation constraints. Finally, learning a control policy that generalizes over the scenario parameter space enables us to analyze the influence of individual parameters on collection performance and provide some intuition about system-level benefits.

Multi - Agent Reinforcement Learning for Backscattering Data Collection in Multi-UAV IoT

Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning

Hierarchical Deep Reinforcement Learning for Backscattering Data Collection With Multiple UAVs

High-Sample-Efficient Multiagent Reinforcement Learning for Navigation and Collision Avoidance of UAV Swarms in Multitask Environments

Distributed UAV Swarm for Device-Free Integrated Sensing and Communication Relying on Multi-Agent Reinforcement Learning

Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep Reinforcement Learning Approach

Cooperative Data Collection for UAV-Assisted Maritime IoT Based on Deep Reinforcement Learning

Mean-Field Multi-Agent Reinforcement Learning for UAV Assisted Secure Data Dissemination.

Maximizing UAV Coverage in Maritime Wireless Networks: A Multiagent Reinforcement Learning Approach

Semantics-Aware Multi-UAV Cooperation for Age-Optimal Data Collection: an Adaptive Communication Based MARL Approach.

Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning

Matching combined multi-agent reinforcement learning for uav secure data dissemination

Multi-UAV Cooperative Search in Multi-Layered Aerial Computing Networks: A Multi-Agent Deep Reinforcement Learning Approach

Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Model-aided Federated Reinforcement Learning for Multi-UAV Trajectory Planning in IoT Networks

A Method of Multi-UAV Cooperative Task Assignment Based on Reinforcement Learning

A Deep Reinforcement Learning Approach for Multi-UAV-Assisted Data Collection in Wireless Powered IoT Networks

Reinforcement Learning Assisted Multi-UAV Task Allocation and Path Planning for IIoT

Deep Reinforcement Learning-Driven UAV Data Collection Path Planning: A Study on Minimizing AoI

Multi-UAV Autonomous Path Planning in Reconnaissance Missions Considering Incomplete Information: A Reinforcement Learning Method

Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks