End-to-end UAV Intelligent Training via Deep Reinforcement Learning

Xin Liu,Caizheng Wang,Jun Yang,Qiuquan Guo
DOI: https://doi.org/10.1109/RICAI60863.2023.10489264
2023-12-01
Abstract:In this paper, we propose an end-to-end training method based on deep reinforcement learning for motion control problems in Unmanned-Aerial-Vehicle (UAV) active target tracking and autonomous obstacle avoidance. Proximal policy optimization (PPO) is employed as a deep reinforcement learning method to train artificial neural networks in an end-to-end manner using a continuous reward function. The proposed approach combines visual perception and deep reinforcement learning policy into an end-to-end decision control model. It takes RGB visual images observed by the UAV as input states, and outputs discrete control actions for UAV flight. Environment augmentation techniques and custom reward functions are utilized to enhance training efficiency and generalization capability. The training and validation are conducted in a simulation environment built using the Unreal Engine and Microsoft AirSim. The results of the simulation experiments demonstrate that the proposed approach can achieve autonomous tracking control of maneuvering targets for UAVs with good robustness and generalization. Moreover, for UAV autonomous obstacle avoidance tasks, the trained UAV based on this method can achieve fast and stable visual obstacle avoidance in unknown environments.
Engineering,Computer Science
What problem does this paper attempt to address?