Option-Based Hierarchical Reinforcement Learning for UAV Multi-Objective Path Planning

Sikuan Zhu,Qingling Wang
DOI: https://doi.org/10.1109/CAC59555.2023.10450655
2023-11-17
Abstract:This paper proposes a deep reinforcement learning approach to solve the multi-objective path planning problem for UAV material delivery in a dynamic multi-obstacle environment. The approach uses option-based hierarchical reinforcement learning to decompose the problem into single-objective subtasks, and a novel network structure is designed to extract and fuse state and obstacle information to facilitate decision-making. A twin delayed deep deterministic policy gradient algorithm is used for centralized training and distributed execution. The approach also introduces a new reward function to speed up convergence. Experimental results demonstrate that the proposed method is effective in completing the task with faster convergence and generalization compared to traditional TD3 algorithm,
Engineering,Computer Science
What problem does this paper attempt to address?