Abstract:The emerging backscatter communication technology is recognized as a promising solution to the battery problem of Internet of Things (IoT) devices. For example, the wireless sensor network with backscatter communication technology can monitor the environment in remote areas without battery maintenance or replacement. Unfortunately, the transmission range of backscatter communication is limited. To tackle this challenge, we propose a multi-UAV-aided data collection scenario where the unmanned aerial vehicle (UAV) can fly close to the backscatter sensor node (BSN) to activate it and then collects the data. We aim to minimize the total flight time of the rechargeable UAVs when the collection mission is finished. During the data collection process, the UAVs can return to the charging station to recharge itself when the energy of UAV is not sufficient to complete the mission. To reduce the complexity of the task, we first use the Gaussian mixture model clustering method to divide the BSNs into multiple clusters. Then we consider the deterministic boundary and ambiguous boundary for the UAV flying regions, respectively. For the deterministic boundary scenario, we propose a single-agent deep option learning (SADOL) algorithm, where each UAV cannot fly beyond the deterministic boundary. For the ambiguous boundary scenario, we propose a multiagent deep option learning (MADOL) algorithm to enable the UAVs to cooperatively learn the ambiguous BSNs assignment. In the simulation, we compare the proposed algorithms with multiagent deep deterministic policy gradient (MADDPG), deep deterministic policy gradient (DDPG), and deep Q-network (DQN) algorithms, which proves the proposed algorithms can achieve better performance.

PASCAL: PopulAtion-Specific Curriculum-based MADRL for collision-free flocking with large-scale fixed-wing UAV swarms

Collision-Avoiding Flocking With Multiple Fixed-Wing UAVs in Obstacle-Cluttered Environments: A Task-Specific Curriculum- Based MADRL Approach

Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion

Oracle-Guided Deep Reinforcement Learning for Large-Scale Multi-UAVs Flocking and Navigation.

Deep Reinforcement Learning for Flocking Motion of Multi-UAV systems: Learn from a Digital Twin

Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance

MW-MADDPG: a meta-learning based decision-making method for collaborative UAV swarm

Programming and Deployment of Autonomous Swarms using Multi-Agent Reinforcement Learning

Flocking of Under-Actuated Unmanned Surface Vehicles via Deep Reinforcement Learning and Model Predictive Path Integral Control

UAV Swarm Confrontation Using Hierarchical Multiagent Reinforcement Learning

UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning

A Reinforcement Learning-based Decentralized Method of Avoiding Multi-UAV Collision in 3-D Airspace

Coordinated Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Swarms in Autonomous Mobile Access Applications

Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking Control

Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning

Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning

Multi-Target Pursuit by a Decentralized Heterogeneous UAV Swarm using Deep Multi-Agent Reinforcement Learning

Game of Drones: Multi-UAV Pursuit-Evasion Game With Online Motion Planning by Deep Reinforcement Learning

PPO-Exp: Keeping Fixed-Wing UAV Formation with Deep Reinforcement Learning

Hierarchical Deep Reinforcement Learning for Backscattering Data Collection With Multiple UAVs

UAV-Enabled Secure Communications by Multi-Agent Deep Reinforcement Learning