Abstract: This paper addresses multi-UAV pursuit-evasion, in which a group of drones cooperates to capture a fast evader in a confined environment with obstacles. Existing heuristic algorithms simplify the pursuit-evasion problem; they often lack expressive coordination strategies and struggle to capture the evader in extreme scenarios, such as when the evader moves at high speed. Reinforcement learning (RL) has also been applied to this problem and has the potential to learn highly cooperative capture strategies. However, RL-based methods are difficult to train in complex 3-dimensional scenarios with diverse task settings because of the vast exploration space, and the dynamics constraints of drones further restrict the ability of RL to acquire high-performance capture strategies. In this work, we introduce a dual curriculum learning framework, named DualCL, which addresses multi-UAV pursuit-evasion in diverse environments and demonstrates zero-shot transfer to unseen scenarios. DualCL comprises two main components: the Intrinsic Parameter Curriculum Proposer, which progressively suggests intrinsic parameters from easy to hard to improve the capture capability of the drones, and the External Environment Generator, which explores unresolved scenarios and generates appropriate training distributions of external environment parameters. Simulation results show that DualCL significantly outperforms baseline methods, achieving a capture rate of over 90% and reducing the capture timestep by at least 27.5% in the training scenarios. It also exhibits the best zero-shot generalization in unseen environments. Moreover, we demonstrate the transferability of our pursuit strategy from simulation to real-world environments. Further details can be found on the project website at https://sites.google.com/view/dualcl.
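
The two components named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class names, the 0.9 advancement threshold, the 0.2 to 0.8 "unresolved" band, and the parameters `num_obstacles` and `evader_speed` are all assumptions chosen only to show how an easy-to-hard intrinsic curriculum and an archive-based environment sampler might fit together.

```python
import random
from dataclasses import dataclass, field


@dataclass
class IntrinsicCurriculum:
    """Hypothetical easy-to-hard schedule over one intrinsic parameter."""
    levels: list              # ordered easy -> hard (assumed values, e.g. a capture radius)
    threshold: float = 0.9    # assumed capture rate required to advance
    index: int = 0

    @property
    def current(self):
        return self.levels[self.index]

    def update(self, capture_rate):
        # Move to the next, harder level once the current one is mastered.
        if capture_rate >= self.threshold and self.index < len(self.levels) - 1:
            self.index += 1
        return self.current


@dataclass
class EnvironmentGenerator:
    """Hypothetical archive of external scenarios that are not yet solved."""
    low: float = 0.2          # assumed lower bound of the "unresolved" band
    high: float = 0.8         # assumed upper bound of the "unresolved" band
    archive: list = field(default_factory=list)

    def record(self, env_params, capture_rate):
        # Keep only scenarios that are neither hopeless nor already solved.
        if self.low <= capture_rate <= self.high:
            self.archive.append(env_params)

    def sample(self, rng, explore_prob=0.3):
        # Explore a fresh random scenario, or replay an unresolved one.
        if not self.archive or rng.random() < explore_prob:
            return {
                "num_obstacles": rng.randint(0, 5),      # assumed parameter
                "evader_speed": rng.uniform(1.0, 3.0),   # assumed parameter
            }
        return rng.choice(self.archive)
```

In a training loop, one would sample an environment from the generator, evaluate the policy under the curriculum's current intrinsic parameter, then pass the measured capture rate to both `record` and `update`. The actual criteria used by DualCL may differ from the thresholds assumed here.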

Collaborative Encirclement of Multiple UAVs Based on Deep Reinforcement Learning

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Cooperative Encirclement Strategy for Multiple Drones Based on ATT-MADDPG

Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning

Deep Reinforcement Learning-Driven Collaborative Rounding-Up for Multiple Unmanned Aerial Vehicles in Obstacle Environments

Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning

Group-Based Deep Reinforcement Learning in Multi-UAV Confrontation

UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning

Multi-Target Pursuit by a Decentralized Heterogeneous UAV Swarm using Deep Multi-Agent Reinforcement Learning

Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning

Cooperative multi-agent target searching: a deep reinforcement learning approach based on parallel hindsight experience replay

A Dual Curriculum Learning Framework for Multi-UAV Pursuit-Evasion in Diverse Environments

A Reinforcement Learning-based Decentralized Method of Avoiding Multi-UAV Collision in 3-D Airspace

Collaborative Coverage Path Planning of UAV Cluster based on Deep Reinforcement Learning

Multi-UAV Collaborative Search and Strike based on Reinforcement Learning

Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning

Collision-Avoiding Flocking With Multiple Fixed-Wing UAVs in Obstacle-Cluttered Environments: A Task-Specific Curriculum-Based MADRL Approach

Deep Reinforcement Learning-based Collaborative Multi-UAV Coverage Path Planning

Cooperative Pursuit with Multiple Pursuers based on Deep Minimax Q-learning

Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments

UAV-Enabled Secure Communications by Multi-Agent Deep Reinforcement Learning