Deep Reinforcement Learning Based 3D UAV Trajectory Design and Frequency Band Allocation.

Ruijin Ding, Feifei Gao, Xuemin Sherman Shen
DOI: https://doi.org/10.1109/GLOBECOM42002.2020.9322532
2020-01-01
Abstract: Unmanned aerial vehicle (UAV)-assisted communication is a promising technique for future communication systems. In this paper, a UAV serves as a base station (BS) to provide energy-efficient and fair communication service to ground users (GUs). We first derive the energy consumption model of a quad-rotor UAV as a function of the UAV's 3D movement. We then formulate a problem in which the UAV aims to maximize a defined fair throughput under limited on-board energy through 3D trajectory design and frequency band allocation. The formulated problem is hard to solve because of the GUs' movement and the complicated nonconvex objective function. We therefore propose a deep reinforcement learning (DRL)-based method that recasts the original problem as maximizing the accumulated reward. Simulation results demonstrate that the proposed method outperforms two baselines in terms of fairness and total throughput.