Path Planning of UAV Base Station Based on Deep Reinforcement Learning

Siming Yang,Zheng Shan,Jiang Cao,Yuan Gao,Yang Guo,Ping Wang,Xiaonan Wang,Jing Wang,Tingting Zhang,Jiayu Guo
DOI: https://doi.org/10.1016/j.procs.2022.04.013
2022-01-01
Procedia Computer Science
Abstract:UAV base station platform has become the current research hotspot of assisting ground base station for wireless coverage.At present, the most important issue is how to make path planning to provide the stable communication guarantee for multiple mobile users. In this article, we model the air-to-ground channel to describe the path loss between the UAV platform and the user and build a simulation environment for training based on the OpenAI-GYM architecture. In addition, this paper proposes a reinforcement learning algorithm based on intrinsic rewards, which uses the mean square error of the state prediction results to quantify the novelty of the state. Algorithms enable agents to efficiently carry out strategy iterations. Experiments results showed that our algorithm has a higher score and takes less time.
What problem does this paper attempt to address?