Safe Navigation for UAV-Enabled Data Dissemination by Deep Reinforcement Learning in Unknown Environments

Fei Huang,Guangxia Li,Shiwei Tian,Jin Chen,Guangteng Fan,Jinghui Chang
DOI: https://doi.org/10.23919/jcc.2022.01.015
2022-01-01
China Communications
Abstract:Unmanned aerial vehicles (UAVs) are increasingly considered in safe autonomous navigation systems to explore unknown environments where UAVs are equipped with multiple sensors to perceive the surroundings. However, how to achieve UAV-enabled data dissemination and also ensure safe navigation synchronously is a new challenge. In this paper, our goal is minimizing the whole weighted sum of the UAV's task completion time while satisfying the data transmission task requirement and the UAV's feasible flight region constraints. However, it is unable to be solved via standard optimization methods mainly on account of lacking a tractable and accurate system model in practice. To overcome this tough issue, we propose a new solution approach by utilizing the most advanced dueling double deep Q network (dueling DDQN) with multi-step learning. Specifically, to improve the algorithm, the extra labels are added to the primitive states. Simulation results indicate the validity and performance superiority of the proposed algorithm under different data thresholds compared with two other benchmarks.
What problem does this paper attempt to address?