Abstract: Allocating jamming resources when multiple jammers jam multiple radars is difficult because of spatial discretization, many degrees of freedom, numerous model input parameters, complex constraints, and a multi-peaked objective function. To address these challenges, this paper proposes a cooperative jamming resource allocation method based on evolutionary reinforcement learning that uses joint multi-domain information. First, an adversarial scenario model is established that characterizes the interaction between multiple jammers and radars, based on a multi-beam jammer model and a radar detection model. Next, with real-world scenarios in mind, the paper analyzes the constraints and the objective function involved in cooperative jamming resource allocation by multiple jammers. Finally, to account for the impact of spatial, frequency, and energy domain information on jamming resource allocation, matrices representing spatial condition constraints, jamming beam allocation, and jamming power allocation are formulated to characterize the cooperative jamming resource allocation problem. On this basis, the joint allocation of jamming beams and jamming power is optimized under the jamming resource constraints. Simulation experiments show that, compared with the dung beetle optimizer (DBO) algorithm and the particle swarm optimization (PSO) algorithm, the proposed evolutionary reinforcement learning algorithm based on DBO and Q-Learning (DBO-QL) improves the jamming benefit by 3.03% and 6.25% and the optimization success rate by 26.33% and 50.26%, respectively. The proposed hybrid DBO-QL algorithm has a response time of 0.11 s, which is 97.35% and 96.57% lower than the response times of the DBO and PSO algorithms, respectively. The results show that the proposed method has good convergence, stability, and timeliness.

An Environmentally Sensitive Jamming Bandits Using Improved UCB Method

An Electronic Jamming Method Based on a Distributed Information Sharing Mechanism

A Fast Learning Method for Optimal Jamming to Radar in Real-Time Environment

A Radar Anti-Jamming Strategy Based on Game Theory With Temporal Constraints

Performance Analysis of Deep Reinforcement Learning-Based Intelligent Cooperative Jamming Method Confronting Multi-functional Networked Radar

A Dynamic Game Strategy for Radar Screening Pulsewidth Allocation Against Jamming Using Reinforcement Learning

Reinforcement Learning-Based Anti-Jamming in Networked UAV Radar Systems

Jamming Games in Underwater Sensor Networks with Reinforcement Learning

Improving anti-jamming decision-making strategies for cognitive radar via multi-agent deep reinforcement learning

Multiagent Reinforcement Learning for Antijamming Game of Frequency-Agile Radar

Multi-Agent Reinforcement Learning for Anti-jamming Game of Frequency-Agile Radar

Radar and Jammer Intelligent Game under Jamming Power Dynamic Allocation

Frequency Diversity Array Radar and Jammer Intelligent Frequency Domain Power Countermeasures Based on Multi-Agent Reinforcement Learning

Jamming Policy Generation via Heuristic Programming Reinforcement Learning

Enhanced Radar Anti-Jamming With Multi-Agent Reinforcement Learning

Joint Optimization of Jamming Type Selection and Power Control for Countering Multi-function Radar Based on Deep Reinforcement Learning

Avoiding Jammers: A Reinforcement Learning Approach

Joint Optimization of Jamming Type Selection and Power Control for Countering Multifunction Radar Based on Deep Reinforcement Learning

Cooperative Jamming Resource Allocation with Joint Multi-Domain Information Using Evolutionary Reinforcement Learning

An Intelligent Anti-jamming Decision-making Method Based on Deep Reinforcement Learning for Cognitive Radar

Online Emission Policy Selection for Radar Anti-Jamming using Bandit-Optimized Policy Search