Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning

Huan Wang, Jintao Wang
DOI: https://doi.org/10.1038/s41598-024-54938-5
IF: 4.6
2024-02-24
Scientific Reports
Abstract: In the realm of air combat, autonomous decision-making in regard to Unmanned Aerial Vehicle (UAV) has emerged as a critical force. However, prevailing autonomous decision-making algorithms in this domain predominantly rely on rule-based methods, proving challenging to design and implement optimal solutions in complex multi-UAV combat environments. This paper proposes a novel approach to multi-UAV air combat decision-making utilizing hierarchical reinforcement learning. First, a hierarchical decision-making network is designed based on tactical action types to streamline the complexity of the maneuver decision-making space. Second, the high-quality combat experience gained from training is decomposed, with the aim of augmenting the quantity of valuable experiences and alleviating the intricacies of strategy learning. Finally, the performance of the algorithm is validated using the advanced UAV simulation platform JSBSim. Through comparisons with various baseline algorithms, our experiments demonstrate the superior performance of the proposed method in both even and disadvantaged air combat environments.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper addresses the complexity of decision-making in multi-UAV (Unmanned Aerial Vehicle) air combat. Current autonomous decision-making algorithms rely mostly on rule-based methods, for which optimal solutions are hard to design and implement in complex multi-UAV combat environments. The paper proposes a multi-UAV air combat decision-making approach based on hierarchical reinforcement learning. Its specific contributions are:
1. A hierarchical decision-making network organized by tactical action type, which reduces the complexity of the maneuver decision space.
2. An experience decomposition mechanism that breaks down high-quality combat experience gained during training, increasing the amount of valuable experience and easing policy learning.
3. Validation on the JSBSim UAV simulation platform, where comparisons with several baseline algorithms show the proposed algorithm performing better.
The paper also reviews existing UAV air combat decision-making techniques and hierarchical reinforcement learning techniques, and points out their limitations, such as the poor adaptability of rule-based methods in complex environments. By decomposing a complex decision task into smaller subtasks, hierarchical reinforcement learning improves training efficiency and, in turn, decision-making in the multi-UAV air combat environment. Experimental results show that the proposed method outperforms the baseline algorithms in both even and disadvantaged air combat environments.
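The two ideas summarized above, choosing a tactical action type before choosing a maneuver, and splitting collected experience into smaller pieces, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the tactic names, maneuver names, score dictionaries, and the functions `select_action` and `decompose_episode` are all hypothetical, and splitting transitions by tactic type is only one assumed reading of "experience decomposition".

```python
from collections import defaultdict

# Hypothetical tactical action types and their maneuvers (assumed for
# illustration; the paper's actual action sets are not given in this summary).
TACTICS = {
    "attack": ["pursue", "lead_turn"],
    "defend": ["break_left", "break_right", "dive"],
    "reposition": ["climb", "extend"],
}


def select_action(high_scores, low_scores):
    """Two-level greedy choice: pick a tactic type, then a maneuver within it.

    high_scores maps tactic -> score; low_scores maps tactic -> {maneuver: score}.
    In a learned system these scores would come from trained networks.
    """
    tactic = max(TACTICS, key=lambda t: high_scores[t])
    maneuver = max(TACTICS[tactic], key=lambda m: low_scores[tactic][m])
    return tactic, maneuver


def decompose_episode(episode, buffers):
    """Route each transition of an episode into a per-tactic buffer.

    Assumed interpretation: each low-level policy only sees the transitions
    collected while its own tactic type was active.
    """
    for state, tactic, maneuver, reward in episode:
        buffers[tactic].append((state, maneuver, reward))
    return buffers


# Example scores standing in for network outputs (made-up values).
high = {"attack": 0.2, "defend": 0.7, "reposition": 0.1}
low = {
    "attack": {"pursue": 0.9, "lead_turn": 0.1},
    "defend": {"break_left": 0.3, "break_right": 0.1, "dive": 0.6},
    "reposition": {"climb": 0.5, "extend": 0.4},
}
choice = select_action(high, low)

# A made-up three-step episode: (state, tactic, maneuver, reward).
episode = [
    ("s0", "defend", "dive", 0.0),
    ("s1", "defend", "break_left", 0.1),
    ("s2", "attack", "pursue", 1.0),
]
buffers = decompose_episode(episode, defaultdict(list))
```

With these example values, the high level first narrows the choice to one of three tactic types, and the low level then chooses among at most three maneuvers, rather than scoring all seven maneuvers at once. The per-tactic buffers give each low-level policy its own, smaller learning problem.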