Strategy Generation Based on DDPG with Prioritized Experience Replay for UCAV.

Junsen Lu, Yun-Bo Zhao, Yu Kang, Yuhui Wang, Yimin Deng
DOI: https://doi.org/10.1109/icarm54641.2022.9959220
2022-01-01
Abstract: Unmanned combat aerial vehicles (UCAVs) are playing an increasingly important role in the future military field, but finding an optimal control strategy remains a great challenge because of the high dynamics of the vehicles themselves and the environmental uncertainties of air combat. An air combat decision-making strategy is designed and implemented within a deep deterministic policy gradient (DDPG) framework, and a prioritized experience replay method is introduced into the algorithm to improve training efficiency. Simulation experiments show that, at much reduced training cost, the proposed approach achieves superior air combat performance with fast convergence.
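
The abstract does not say which prioritization scheme the authors use, so the following is only a minimal Python sketch of the commonly used proportional variant of prioritized experience replay, in which a transition is sampled with probability proportional to its priority raised to a power `alpha`. The class name `PrioritizedReplayBuffer`, the hyperparameters `alpha`, `beta`, and `eps`, and their default values are illustrative assumptions and are not taken from the paper.

```python
import random


class PrioritizedReplayBuffer:
    """Generic proportional prioritized replay; a sketch, not the paper's code."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha    # assumed value; 0 gives uniform sampling
        self.eps = eps        # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0          # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions receive the current maximum priority so that
        # they are likely to be sampled before their TD error is known.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4, rng=random):
        # Sampling probability P(i) = p_i^alpha / sum_k p_k^alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        idxs = rng.choices(range(len(self.data)), weights=probs, k=batch_size)
        # Importance-sampling weights (N * P(i))^(-beta), normalized by
        # the largest weight in the batch so that they lie in (0, 1].
        n = len(self.data)
        weights = [(n * probs[i]) ** (-beta) for i in idxs]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        batch = [self.data[i] for i in idxs]
        return batch, idxs, weights

    def update_priorities(self, idxs, td_errors):
        # Priority is the magnitude of the TD error plus a small constant.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a DDPG training loop, such a buffer would typically be used by sampling a batch, scaling each transition's critic loss by its importance-sampling weight, and then passing the critic's temporal-difference errors back through `update_priorities`. Whether the paper follows exactly this pattern is not stated in the abstract.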