Online Trajectory Optimization for the UAV-Mounted Base Stations

Zhang Guangchi,Yan Yulin,Cui Miao,Chen Wei,Zhang Jing
DOI: https://doi.org/10.11999/JEIT200525
2021-01-01
Abstract:Considering dealing with the problem of random and dynamic communication requests of ground users in a UAV(Unmanned Aerial Vehicle) mounted base station communication system, which can not be tackled by an offline trajectory design scheme, an online trajectory optimization algorithm is proposed for the UAV-mounted base station. In the considered system, a single UAV is utilized as an aerial base station to provide wireless communication service to two ground users. The problem of minimizing the average communication delay of the ground users via optimizing the UAV's trajectory is considered. First, it is shown that the problem can be tasted as a Markov Decision Process (MDP), and then the delay of one single communication is introduced into the action value function. Finally, the Monte Carlo and Q-Learning algorithms from the reinforcement learning technology are respectively adopted to realize the online trajectory optimization. Simulation results show that the proposed algorithm outperforms the "fixed position" and "greedy algorithm" schemes.
What problem does this paper attempt to address?