Path Design for Cellular-Connected UAV with Reinforcement Learning

Yong Zeng,Xiaoli Xu
DOI: https://doi.org/10.1109/globecom38437.2019.9014041
2019-01-01
Abstract:This paper studies the path design problem for cellular-connected unmanned aerial vehicle (UAV), which aims to minimize its mission completion time while maintaining good connectivity with the cellular network. We first argue that the conventional path design approach via formulating and solving optimization problems faces several practical challenges, and then propose a new reinforcement learning-based UAV path design algorithm by applying temporal-difference method to directly learn the state-value function of the corresponding Markov Decision Process. The proposed algorithm is further extended by using linear function approximation with tile coding to deal with large state space. The proposed algorithms only require the raw measured or simulation-generated signal samples as the input and are suitable for both online and offline implementations. Numerical results show that the proposed path designs can successfully avoid the coverage holes of cellular networks even in the complex urban environment.
What problem does this paper attempt to address?