Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature Selection

Jiafan Zhuang,Gaofei Han,Zihao Xia,Boxi Wang,Wenji Li,Dongliang Wang,Zhifeng Hao,Ruichu Cai,Zhun Fan
2024-07-15
Abstract:In unseen and complex outdoor environments, collision avoidance navigation for unmanned aerial vehicle (UAV) swarms presents a challenging problem. It requires UAVs to navigate through various obstacles and complex backgrounds. Existing collision avoidance navigation methods based on deep reinforcement learning show promising performance but suffer from poor generalization abilities, resulting in performance degradation in unseen environments. To address this issue, we investigate the cause of weak generalization ability in DRL and propose a novel causal feature selection module. This module can be integrated into the policy network and effectively filters out non-causal factors in representations, thereby reducing the influence of spurious correlations between non-causal factors and action predictions. Experimental results demonstrate that our proposed method can achieve robust navigation performance and effective collision avoidance especially in scenarios with unseen backgrounds and obstacles, which significantly outperforms existing state-of-the-art algorithms.
Robotics
What problem does this paper attempt to address?
The paper primarily addresses the obstacle avoidance navigation problem for multiple Unmanned Aerial Vehicles (UAVs) in unknown and complex outdoor environments. Specifically, the paper aims to solve the following core issues: 1. **Weak Generalization Ability**: Existing obstacle avoidance navigation methods based on Deep Reinforcement Learning (DRL) exhibit good performance, but their performance significantly degrades in unseen environments, indicating a problem of weak generalization ability. 2. **Influence of Non-causal Factors**: Analysis reveals that current methods may incorrectly establish a relationship between the shape of obstacles and the strategy during the learning process, leading to ineffective obstacle avoidance strategies when encountering unseen obstacles (e.g., cubic obstacles). To address the above issues, the paper proposes the following contributions: - **Causal Feature Selection Module**: A novel Causal Feature Selection (CFS) module is designed, which can be integrated into the policy network to effectively filter out non-causal factors in the representation, reducing their impact on action prediction. - **Experimental Validation**: Experiments conducted in test scenarios with unseen backgrounds and obstacles validate that the proposed method can significantly improve the navigation success rate and obstacle avoidance performance of UAVs in unknown environments, especially showing significant advantages when facing unseen obstacles. In short, the goal of the paper is to enhance the obstacle avoidance capability and generalization ability of DRL-based multi-UAV systems in unknown environments by introducing a causal feature selection mechanism to address the limitations of existing methods.