Multiagent Reinforcement Learning for Antijamming Game of Frequency-Agile Radar
Jie Geng,Bo Jiu,Kang Li,Yu Zhao,Chao Wang,Hongwei Liu
DOI: https://doi.org/10.1109/lgrs.2024.3382041
IF: 5.343
2024-04-09
IEEE Geoscience and Remote Sensing Letters
Abstract:With the development of jamming systems, the jammer is becoming much smarter and can change its strategy with radar, which forms a competitive game between the radar and the jammer and poses a significant threat to the radar. In this letter, the antijamming game of frequency-agile (FA) radar is investigated, in which the jammer is capable of transmitting jamming signals with multiple frequency channels and different power allocation manners. To characterize the sequential interaction and partial observation of the antijamming game, an extensive-form game (EFG) is introduced to model the confrontation between the radar and the jammer. Subsequently, a novel multiagent reinforcement learning (MARL) algorithm, termed neural fictitious self-play (NFSP)-deep deterministic policy gradient (DDPG), is devised by combining NFSP with DDPG and modifying the supervised learning process of NFSP to solve Nash equilibrium (NE) strategies, which can handle the antijamming games featuring both discrete and continuous action spaces. Finally, simulation results show that the learning strategies acquired through the proposed method approximate NE and outperform the rule-based strategies. The signal-to-interference-pulse-noise ratio (SINR) improvements of radar taking NE strategy are 9.13, 9.12, and 9.15 dB in the worst case in the antijamming game with five frequencies compared to three rule-based strategies.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics