Multi-Agent Low-Bias Reinforcement Learning for Resource Allocation in UAV-Assisted Networks

Shiyang Zhou,Yufan Cheng,Xia Lei
DOI: https://doi.org/10.1109/ICCWorkshops53468.2022.9814506
2022-01-01
Abstract:This paper considers an unmanned aerial vehicle (UAV)-assisted downlink transmission. To maximize the average achievable channel capacity among the ground users, a multi-agent low-bias deep reinforcement learning (MA-LB-DRL) scheme is proposed to solve the joint optimization problem of trajectory design, channel selection and power control. Firstly, considering the NP-hard, non-convex, and mixed-integer features, the joint optimization problem is decoupled into two subproblems, which deal with the integer-action and continuousaction problem respectively. Secondly, an MA-LB-DRL scheme, which consists of a multi-agent double deep Q-network (MADDQN) for channel selection and a multi-agent twin delayed deep deterministic policy gradient (MATD3PG) for trajectory design and power control, is proposed to overcome the overestimation bias via reducing value function approximation error at the expense of tiny complexity. Finally, the simulation results demonstrate that the proposed scheme achieves high channel capacity than the benchmark schemes.
What problem does this paper attempt to address?