Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature Selection

Jiafan Zhuang,Gaofei Han,Zihao Xia,Boxi Wang,Wenji Li,Dongliang Wang,Zhifeng Hao,Ruichu Cai,Zhun Fan

2024-07-15

Abstract:In unseen and complex outdoor environments, collision avoidance navigation for unmanned aerial vehicle (UAV) swarms presents a challenging problem. It requires UAVs to navigate through various obstacles and complex backgrounds. Existing collision avoidance navigation methods based on deep reinforcement learning show promising performance but suffer from poor generalization abilities, resulting in performance degradation in unseen environments. To address this issue, we investigate the cause of weak generalization ability in DRL and propose a novel causal feature selection module. This module can be integrated into the policy network and effectively filters out non-causal factors in representations, thereby reducing the influence of spurious correlations between non-causal factors and action predictions. Experimental results demonstrate that our proposed method can achieve robust navigation performance and effective collision avoidance especially in scenarios with unseen backgrounds and obstacles, which significantly outperforms existing state-of-the-art algorithms.

Robotics

What problem does this paper attempt to address?

The paper primarily addresses the obstacle avoidance navigation problem for multiple Unmanned Aerial Vehicles (UAVs) in unknown and complex outdoor environments. Specifically, the paper aims to solve the following core issues: 1. **Weak Generalization Ability**: Existing obstacle avoidance navigation methods based on Deep Reinforcement Learning (DRL) exhibit good performance, but their performance significantly degrades in unseen environments, indicating a problem of weak generalization ability. 2. **Influence of Non-causal Factors**: Analysis reveals that current methods may incorrectly establish a relationship between the shape of obstacles and the strategy during the learning process, leading to ineffective obstacle avoidance strategies when encountering unseen obstacles (e.g., cubic obstacles). To address the above issues, the paper proposes the following contributions: - **Causal Feature Selection Module**: A novel Causal Feature Selection (CFS) module is designed, which can be integrated into the policy network to effectively filter out non-causal factors in the representation, reducing their impact on action prediction. - **Experimental Validation**: Experiments conducted in test scenarios with unseen backgrounds and obstacles validate that the proposed method can significantly improve the navigation success rate and obstacle avoidance performance of UAVs in unknown environments, especially showing significant advantages when facing unseen obstacles. In short, the goal of the paper is to enhance the obstacle avoidance capability and generalization ability of DRL-based multi-UAV systems in unknown environments by introducing a causal feature selection mechanism to address the limitations of existing methods.

Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature Selection

Collision Avoidance for Multiple UAVs in Unknown Scenarios with Causal Representation Disentanglement

Research on collision avoidance algorithm of unmanned surface vehicle based on deep reinforcement learning

Integrating human experience in deep reinforcement learning for multi-UAV collision detection and avoidance

A Reinforcement Learning-based Decentralized Method of Avoiding Multi-UAV Collision in 3-D Airspace

Deep-Reinforcement-Learning-Based Collision Avoidance in UAV Environment

Autonomous obstacle avoidance of UAV based on deep reinforcement learning

Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments

3M-RL: Multi-Resolution, Multi-Agent, Mean-Field Reinforcement Learning for Autonomous UAV Routing

Efficient Multi-agent Navigation with Lightweight DRL Policy

Adaptive Collision Avoidance Decisions in Autonomous Ship Encounter Scenarios Through Rule-Guided Vision Supervised Learning

Attention-Based Policy Distillation for UAV Simultaneous Target Tracking and Obstacle Avoidance

UAV Target Following in Complex Occluded Environments with Adaptive Multi-Modal Fusion

EPO-S: A Constrained RL Method to Enhance UAV Safety with Spatial Representation

Adaptive Environment Modeling Based Reinforcement Learning for Collision Avoidance in Complex Scenes

Learning-Based Multi-UAV Flocking Control With Limited Visual Field and Instinctive Repulsion

Collision Avoidance and Navigation for a Quadrotor Swarm Using End-to-end Deep Reinforcement Learning

Multi-UAV autonomous collision avoidance based on PPO-GIC algorithm with CNN-LSTM fusion network

Multi-UAV Navigation for Partially Observable Communication Coverage by Graph Reinforcement Learning

Multi-UAV Autonomous Obstacle Avoidance Based on Reinforcement Learning

NavRL: Learning Safe Flight in Dynamic Environments