Collision Avoidance for Multiple UAVs in Unknown Scenarios with Causal Representation Disentanglement

Jiafan Zhuang,Zihao Xia,Gaofei Han,Boxi Wang,Wenji Li,Dongliang Wang,Zhifeng Hao,Ruichu Cai,Zhun Fan
2024-07-15
Abstract:Deep reinforcement learning (DRL) has achieved remarkable progress in online path planning tasks for multi-UAV systems. However, existing DRL-based methods often suffer from performance degradation when tackling unseen scenarios, since the non-causal factors in visual representations adversely affect policy learning. To address this issue, we propose a novel representation learning approach, \ie, causal representation disentanglement, which can identify the causal and non-causal factors in representations. After that, we only pass causal factors for subsequent policy learning and thus explicitly eliminate the influence of non-causal factors, which effectively improves the generalization ability of DRL models. Experimental results show that our proposed method can achieve robust navigation performance and effective collision avoidance especially in unseen scenarios, which significantly outperforms existing SOTA algorithms.
Robotics
What problem does this paper attempt to address?
The paper mainly addresses the collision avoidance problem for multi-Unmanned Aerial Vehicle (multi-UAV) systems in unknown scenarios. Specifically, the research focuses on how to improve the generalization ability of Deep Reinforcement Learning (DRL)-based methods in unseen environments. The paper points out that in online path planning tasks, existing DRL methods often experience performance degradation when faced with unseen scenarios. This is because visual representations contain non-causal factors that can adversely affect policy learning. To tackle this challenge, the authors propose a new representation learning method called Causal Representation Disentanglement. This method aims to identify causal and non-causal factors in the representation and only pass the causal factors for subsequent policy learning, thereby explicitly eliminating the influence of non-causal factors. Experimental results show that the proposed method can achieve robust navigation performance and effective collision avoidance, especially in unseen scenarios, with significant performance improvement compared to existing state-of-the-art (SOTA) algorithms. In short, the problem the paper attempts to solve can be summarized as: how to improve DRL-based collision avoidance algorithms for multi-UAV systems to enhance their generalization ability in unknown scenarios.