Abstract: Existing decision-making methods for radar interference suppression require the real-time participation of many professionals and therefore suffer from slow decisions, unstable results, and limited intelligence. This paper proposes a deep reinforcement learning (DRL)-based decision-making method for radar interference suppression that decides quickly, yields stable and accurate results, and completes decision-making tasks autonomously. To enhance the agent's ability to acquire high-value experiences, the variable greedy algorithm (VGA) is proposed. The VGA replaces the fixed greedy value in the original action strategy with a declining greedy curve that mimics the human learning process, combining ideas from the win or learn fast–policy hill-climbing (WoLF-PHC) algorithm and the Ebbinghaus forgetting curve. To improve the agent's efficiency in using high-value experiences, the double-depth prioritized experience replay (DDPER) algorithm is proposed. The DDPER algorithm replaces uniform random experience replay with prioritized experience replay (PER), sorting and sampling experiences by value using additive (sum) trees to achieve better learning results. The double-depth experience replay system further improves the accuracy and speed of decision-making. Simulation results show that the agent efficiently learns the optimal radar interference suppression method, provided that the interference suppression algorithm library contains algorithms able to handle the interference signals in the current environment. Compared with the PER–double deep Q-Network (PER-DDQN) presented by Zhang, the average accuracy, speed, and stability of decision-making increase by 6.4%, 2.51%, and 102.12%, respectively.
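
The two ideas named in the abstract, a declining greedy curve and tree-based prioritized replay, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the exponential form of `decaying_epsilon`, its parameters (`eps_start`, `eps_end`, `strength`), and the `SumTree` class are assumptions chosen for illustration. The abstract does not give the actual VGA formula or the DDPER data layout.

```python
import math
import random


def decaying_epsilon(step, eps_start=1.0, eps_end=0.05, strength=500.0):
    """Hypothetical exploration schedule shaped like exp(-t / S).

    This mirrors the general shape of an Ebbinghaus-style forgetting
    curve; the VGA's real curve may differ.
    """
    return eps_end + (eps_start - eps_end) * math.exp(-step / strength)


class SumTree:
    """Binary tree: leaves hold priorities, internal nodes hold their sums."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.tree = [0.0] * (2 * capacity - 1)  # internal nodes, then leaves
        self.data = [None] * capacity
        self.write = 0  # next leaf slot to overwrite (ring buffer)

    def total(self):
        # The root stores the sum of all leaf priorities.
        return self.tree[0]

    def add(self, priority, item):
        leaf = self.write + self.capacity - 1
        self.data[self.write] = item
        self.update(leaf, priority)
        self.write = (self.write + 1) % self.capacity

    def update(self, leaf, priority):
        # Set the leaf and propagate the difference up to the root.
        change = priority - self.tree[leaf]
        self.tree[leaf] = priority
        while leaf != 0:
            leaf = (leaf - 1) // 2
            self.tree[leaf] += change

    def get(self, value):
        """Return (leaf index, priority, item) for value in [0, total)."""
        idx = 0
        while 2 * idx + 1 < len(self.tree):  # descend until a leaf
            left = 2 * idx + 1
            if value < self.tree[left]:
                idx = left
            else:
                value -= self.tree[left]
                idx = left + 1
        return idx, self.tree[idx], self.data[idx - self.capacity + 1]

    def sample(self):
        # Items are drawn with probability proportional to their priority.
        return self.get(random.random() * self.total())
```

In this sketch, an experience with a larger priority occupies a wider slice of the interval `[0, total)`, so `sample()` returns it more often, while `update()` lets the agent revise a priority after learning from that experience in time logarithmic in the buffer size.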

Domain Knowledge-Assisted Deep Reinforcement Learning Power Allocation for MIMO Radar Detection

Data-Driven Radar Selection and Power Allocation Method for Target Tracking in Multiple Radar System

Data-Driven Simultaneous Multibeam Power Allocation: When Multiple Targets Tracking Meets Deep Reinforcement Learning

A Cognitive Multi-Carrier Radar for Communication Interference Avoidance Via Deep Reinforcement Learning

Deep Reinforcement Learning Based Decision Making for Radar Jamming Suppression

Deep Reinforcement Learning Control for Radar Detection and Tracking in Congested Spectral Environments

Joint Optimization of Jamming Type Selection and Power Control for Countering Multi-function Radar Based on Deep Reinforcement Learning

Deep Reinforcement Learning Based Radar Parameter Adaptation for Multiple Target Tracking

Weak Target Detection in Massive MIMO Radar via an Improved Reinforcement Learning Approach

A Reinforcement Learning based approach for Multi-target Detection in Massive MIMO radar

Improving anti-jamming decision-making strategies for cognitive radar via multi-agent deep reinforcement learning

A Deep Reinforcement Learning-Based Whittle Index Policy for Multibeam Allocation

Experimental Analysis of Reinforcement Learning Techniques for Spectrum Sharing Radar

Joint Task Offloading and Resource Allocation for Intelligent Reflecting Surface-Aided Integrated Sensing and Communication Systems Using Deep Reinforcement Learning Algorithm

Robust Power Allocation for Resource-Aware Multi-Target Tracking With Colocated MIMO Radars

Antenna Placement Optimization for Distributed MIMO Radar Based on a Reinforcement Learning Algorithm

Receive-Beam Resource Allocation for Multiple Target Tracking with Distributed MIMO Radars

Reinforcement learning-based waveform optimization for MIMO multi-target detection

A Resource Scheduling Algorithm for Multi-Target 3D Imaging in Radar Network Based on Deep Reinforcement Learning