Multi-Agent Partial Observable Safe Reinforcement Learning for Counter Uncrewed Aerial Systems

Jean-Elie Pierre,Xiang Sun,Rafael Fierro
DOI: https://doi.org/10.1109/access.2023.3298601
IF: 3.9
2023-08-05
IEEE Access
Abstract:The proliferation of small uncrewed aerial systems (UAS) poses many threats to airspace systems and critical infrastructures. In recent years, there has been a growing interest in using multi-agent reinforcement learning (MARL) to counter unwanted UAS systems. However, MARL is unable to generate safety actions to meet predefined constraints, thus hindering its use in real-life applications. In this paper, we formulate the Counter-UAS problem as a multi-agent partially observable Markov decision process (MAPOMDP), and we propose Multi-AGent partial observable deep reiNforcement lEarning for pursuer conTrol optimization (MAGNET) to train a group of UAS in terms of pursuers or agents, to pursue and intercept a faster UAS or evader, which tries to escape from capture while navigating through crowded airspace with several moving non-cooperating interacting entities (NCIEs). In MAGNET, we integrate the Control Barrier Function (CBF) based safety layer into proximal policy optimization (PPO) to provide safety guarantees during the training and testing process. In addition, we incorporate the DeepSet network into MAGNET to handle the time-varying dimension of an agent's observations. We conduct extensive simulations and the results show that MAGNET is able to maintain a collision-free environment at the sacrifice of slight evader capture rate reduction as compared to the baseline implementations.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?