Abstract:Reinforcement learning has been applied to air combat problems in recent years, and the idea of curriculum learning is often used for reinforcement learning, but traditional curriculum learning suffers from the problem of plasticity loss in neural networks. Plasticity loss is the difficulty of learning new knowledge after the network has converged. To this end, we propose a motivational curriculum learning distributed proximal policy optimization (MCLDPPO) algorithm, through which trained agents can significantly outperform the predictive game tree and mainstream reinforcement learning methods. The motivational curriculum learning is designed to help the agent gradually improve its combat ability by observing the agent's unsatisfactory performance and providing appropriate rewards as a guide. Furthermore, a complete tactical maneuver is encapsulated based on the existing air combat knowledge, and through the flexible use of these maneuvers, some tactics beyond human knowledge can be realized. In addition, we designed an interruption mechanism for the agent to increase the frequency of decision-making when the agent faces an emergency. When the number of threats received by the agent changes, the current action is interrupted in order to reacquire observations and make decisions again. Using the interruption mechanism can significantly improve the performance of the agent. To simulate actual air combat better, we use digital twin technology to simulate real air battles and propose a parallel battlefield mechanism that can run multiple simulation environments simultaneously, effectively improving data throughput. The experimental results demonstrate that the agent can fully utilize the situational information to make reasonable decisions and provide tactical adaptation in the air combat, verifying the effectiveness of the algorithmic framework proposed in this paper.

Intelligent Game Strategies in Target-Missile-Defender Engagement Using Curriculum-Based Deep Reinforcement Learning

Maneuvering penetration strategies of ballistic missiles based on deep reinforcement learning

Adversarial Decision-Making for Moving Target Defense: A Multi-Agent Markov Game and Reinforcement Learning Approach

Realizing Midcourse Penetration With Deep Reinforcement Learning

Intelligent Pursuit–Evasion Game Based on Deep Reinforcement Learning for Hypersonic Vehicles

High-dynamic Intelligent Maneuvering Guidance Strategy Via Deep Reinforcement Learning

3D optimal defensive guidance strategy with safe distance

Intelligent maneuver strategy for hypersonic vehicles in three-player pursuit-evasion games via deep reinforcement learning

Resilient Pursuit Evasion Guidance with Feedback Game Strategy

Research on Action Strategies and Simulations of DRL and MCTS-based Intelligent Round Game

Application of Deep Reinforcement Learning to Defense and Intrusion Strategies Using Unmanned Aerial Vehicles in a Versus Game

Optimal strategies design of active target defense differential game

Improving Maneuver Strategy in Air Combat by Alternate Freeze Games with a Deep Reinforcement Learning Algorithm

Real‐time game‐theoretic model predictive control for differential game of target defense

Model Predictive Guidance for Active Aircraft Protection from a Homing Missile

Real‐time receding horizon pursuit and evasion games of missile guidance based on neural network

Mastering air combat game with deep reinforcement learning

Deep Reinforcement Learning‐Based Air Defense Decision‐Making Using Potential Games

Deep Reinforcement Learning for Target Searching in Cognitive Electronic Warfare

Intelligent Maneuver Strategy for a Hypersonic Pursuit-Evasion Game Based on Deep Reinforcement Learning

A method of network attack-defense game and collaborative defense decision-making based on hierarchical multi-agent reinforcement learning