Abstract: The swarm relay and power allocation policy determines the bit error rate and the energy consumption of unmanned aerial vehicles (UAVs). It can be optimized based on the network and jamming model, which is rarely known to the UAVs. In this paper, we propose a multi-agent reinforcement learning (RL)-based UAV swarm communication scheme that optimizes relay selection and power allocation against jamming. Based on the network topology, channel states, previous performance, and observations shared by neighboring UAVs, this scheme formulates the policy distribution to improve policy exploration and applies a policy learning mechanism to stabilize the learning process. Based on transfer learning, the shared swarm experiences are exploited to accelerate the initial learning and improve policy optimization. A deep RL-based scheme is also proposed to mitigate the state quantization error caused by rapidly changing channel states at high swarm speeds and thus further improve the anti-jamming performance. This scheme designs a policy network with four fully connected layers to approximate the policy distribution and uses two further neural networks to estimate the average policy distribution and the expected long-term utility, respectively; these estimates are used to update the policy network and stabilize deep learning. We investigate the computational complexity and derive performance bounds on the bit error rate, the energy consumption, and the utility. Simulation and experimental results verify the performance gain of our proposed schemes over related works.
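To make the policy network described in the abstract more concrete, the following is a minimal sketch of a forward pass through four fully connected layers that outputs a probability distribution over joint (relay, power level) actions. It is an illustration only, not the authors' implementation: the abstract does not specify layer widths, activations, state encoding, or action encoding, so the ReLU/softmax choices, the flat state vector, and every name here (`build_policy_network`, `policy_distribution`, `sample_action`) are assumptions. Training, and the two auxiliary networks for the average policy and the long-term utility, are omitted.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for one fully connected layer (hypothetical init)."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, layer):
    """Affine map W x + b for a single input vector x."""
    weights, biases = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, biases)]


def relu(v):
    return [max(0.0, a) for a in v]


def softmax(v):
    """Numerically stable softmax: subtract the max before exponentiating."""
    m = max(v)
    exps = [math.exp(a - m) for a in v]
    total = sum(exps)
    return [e / total for e in exps]


def build_policy_network(state_dim, hidden_dim, n_actions, seed=0):
    """Four fully connected layers: three hidden layers plus an output layer (assumed sizes)."""
    rng = random.Random(seed)
    sizes = [state_dim, hidden_dim, hidden_dim, hidden_dim, n_actions]
    return [init_layer(sizes[i], sizes[i + 1], rng) for i in range(4)]


def policy_distribution(network, state):
    """Map a state vector to a probability for each joint (relay, power) action."""
    h = state
    for layer in network[:-1]:
        h = relu(dense(h, layer))
    return softmax(dense(h, network[-1]))


def sample_action(probs, n_power_levels, rng):
    """Sample a flat action index and decode it as (relay index, power level index)."""
    idx = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    return divmod(idx, n_power_levels)


if __name__ == "__main__":
    n_relays, n_power_levels = 3, 4  # assumed sizes, for illustration only
    net = build_policy_network(state_dim=6, hidden_dim=8, n_actions=n_relays * n_power_levels)
    state = [0.1, 0.5, -0.2, 0.0, 1.0, 0.3]  # placeholder for channel states, past performance, etc.
    probs = policy_distribution(net, state)
    print(sample_action(probs, n_power_levels, random.Random(0)))
```

Sampling from the full distribution, rather than always taking the most probable action, is one simple way to obtain the policy exploration the abstract refers to; the scheme's actual exploration rule may differ.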

Lightweight Reinforcement Learning with State Abstraction for Dynamic Spectrum Anti-Jamming Communications

Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming

Penalized Reinforcement Learning-Based Energy-Efficient UAV-RIS Assisted Maritime Uplink Communications Against Jamming

UAV Networks Against Multiple Maneuvering Smart Jamming with Knowledge-Based Reinforcement Learning

Flexible Channel Access Against Unknown Dynamic Jamming Attack: A Reinforcement Learning Approach

UAV Anti-Jamming Video Transmissions with QoE Guarantee: A Reinforcement Learning-Based Approach

Intelligent Dynamic Spectrum Anti-Jamming Communications: A Deep Reinforcement Learning Perspective

UAV Communication Against Intelligent Jamming: A Stackelberg Game Approach With Federated Reinforcement Learning

A Learning Approach Towards Secure Cognitive Networks with UAV Relaying and Active Jamming

Deep Learning-Assisted Secure UAV-Relaying Networks with Channel Uncertainties

RIS-Assisted Robust Beamforming for UAV Anti-Jamming and Eavesdropping Communications: A Deep Reinforcement Learning Approach

Matching Combined Multi-Agent Reinforcement Learning for UAV Secure Data Dissemination

Cooperative Multi-UAV Dynamic Anti-Jamming Scheme with Deep Reinforcement Learning

Anti-Jamming Transmission in Softwarization UAV Network: A Federated Deep Reinforcement Learning Approach

Dynamic Spectrum Anti-Jamming with Distributed Learning and Transfer Learning

Deep Reinforcement Learning-Driven Jamming-Enhanced Secure Unmanned Aerial Vehicle Communications

Meta-Reinforcement Learning in Time-Varying UAV Communications: Adaptive Anti-Jamming Channel Selection

DRL-Based Dynamic Channel Access and SCLAR Maximization for Networks Under Jamming

Anti-Jamming Communications Using Spectrum Waterfall: A Deep Reinforcement Learning Approach

Dynamic Channel Allocation for Multi-UAVs: A Deep Reinforcement Learning Approach

Distributed Reinforcement Learning Based Framework for Energy-Efficient UAV Relay Against Jamming