Research on Dynamic Path Planning of Wheeled Robot Based on Deep Reinforcement Learning on the Slope Ground

Peng Wang,Xiaoqiang Li,Chunxiao Song,Shipeng Zhai
DOI: https://doi.org/10.1155/2020/7167243
2020-02-01
Journal of Robotics
Abstract:The existing dynamic path planning algorithm cannot properly solve the problem of the path planning of wheeled robot on the slope ground with dynamic moving obstacles. To solve the problem of slow convergence rate in the training phase of DDQN, the dynamic path planning algorithm based on Tree-Double Deep Q Network (TDDQN) is proposed. The algorithm discards detected incomplete and over-detected paths by optimizing the tree structure, and combines the DDQN method with the tree structure method. Firstly, DDQN algorithm is used to select the best action in the current state after performing fewer actions, so as to obtain the candidate path that meets the conditions. And then, according to the obtained state, the above process is repeatedly executed to form multiple paths of the tree structure. Finally, the non-maximum suppression method is used to select the best path from the plurality of eligible candidate paths. ROS simulation and experiment verify that the wheeled robot can reach the target effectively on the slope ground with moving obstacles. The results show that compared with DDQN algorithm, TDDQN has the advantages of fast convergence and low loss function.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the path planning problem of wheeled robots in the presence of dynamically moving obstacles on sloping ground. Traditional dynamic path planning algorithms cannot effectively solve this problem, especially the problem of slow convergence speed during the training phase. For this reason, the paper proposes a dynamic path planning algorithm based on the Tree - Double Deep Q Network (TDDQN). By optimizing the tree structure to discard the detected incomplete or over - detected paths, and combining the DDQN method with the tree - structure method, it aims to quickly find the optimal path and improve the efficiency and accuracy of path planning. Specifically, this algorithm first uses the DDQN algorithm to select the best action in the current state to obtain eligible candidate paths; then, according to the obtained state, repeat the above process to form multiple tree - structure paths; finally, use the non - maximum suppression method to select the best path from multiple eligible candidate paths. Experimental results show that, compared with the DDQN algorithm, the TDDQN algorithm has a faster convergence speed and a lower loss function value.