Dynamic Spectrum Anti-Jamming Access With Fast Convergence: A Labeled Deep Reinforcement Learning Approach

Yangyang Li,Yuhua Xu,Guoxin Li,Yuping Gong,Xin Liu,Hao Wang,Wen Li
DOI: https://doi.org/10.1109/tifs.2023.3307950
IF: 7.231
2023-09-09
IEEE Transactions on Information Forensics and Security
Abstract:The primary objective of anti-jamming techniques is to ensure that the transmitted data arrives at the intended receiver without being disturbed or jammed with by any jamming signal or other hostile activities to ensuring the security of the communication system. Deep reinforcement learning (DRL) has been extensively utilized in solving the dynamic spectrum anti-jamming problem. However, most of existing DRL-based algorithms require lots of training time, which fails to adapt the fast-channging jamming environment. Our objective is to find a practical and fast-convergence anti-jamming learning solution. To achieve this, we redesign the DRL algorithm in the following two ways. First, we split the cycle of reinforcement learning into two parts: applying process and training process. Second, we use soft labels instead of rewards which bring more information. We further theoretically show that the information gain can help our proposed algorithm converge faster. Moreover, we also show that our labeled DRL algorithm is better than the idealized DRL-based scheme which can obtain the same information as the soft labels. Simulation results demonstrate that compared with existing DRL-based algorithms, our proposed algorithm reduces the number of iterations by up to 90%.
computer science, theory & methods,engineering, electrical & electronic
What problem does this paper attempt to address?