Lightweight Reinforcement Learning with State Abstraction for Dynamic Spectrum Anti-Jamming Communications

Xin Liu,Ximing Wang,Yuhua Xu,Zhiyong Du,Yifan Xu,Hao Han
DOI: https://doi.org/10.1109/wcnc57260.2024.10571324
2024-01-01
Abstract:This paper studies the anti-jamming channel selection problem in the unmanned aerial vehicle (UAV) communication scenario using machine learning. Recently, deep reinforcement learning (DRL) based anti-jamming approaches have drawn much attention, but most of them require lots of computing resources and power supply for training, which is impractical for the hardware-limited UAVs. What's more, the high complexity of DRL-based algorithms weakens their online learning ability, failing to rapidly adapt to the changing jamming environment. To be applicable to the hardware-limited UAVs, we propose a lightweight reinforcement learning algorithm based on the idea of spectrum state abstraction. We first assign similar spectrum states to clusters using the DRL and clustering algorithms. A state clustering network is deployed in the UAV to convert the large and redundant state space into a small number of state clusters. Based on the clustered states, the UAV uses a simple tabular Q-learning algorithm to online find the optimal anti-jamming policy. The simulation results show that, compared with the conventional DRL approach, the proposed algorithm can efficiently find the optimal anti-jamming policy and fast adapt to the change of jamming pattern in the complicated and dynamic jamming environment.
What problem does this paper attempt to address?