Improved Path Planning for Indoor Patrol Robot Based on Deep Reinforcement Learning

Jianfeng Zheng,Shuren Mao,Zhenyu Wu,Pengcheng Kong,Hao Qiang
DOI: https://doi.org/10.3390/sym14010132
2022-01-11
Symmetry
Abstract:To solve the problems of poor exploration ability and convergence speed of traditional deep reinforcement learning in the navigation task of the patrol robot under indoor specified routes, an improved deep reinforcement learning algorithm based on Pan/Tilt/Zoom(PTZ) image information was proposed in this paper. The obtained symmetric image information and target position information are taken as the input of the network, the speed of the robot is taken as the output of the next action, and the circular route with boundary is taken as the test. The improved reward and punishment function is designed to improve the convergence speed of the algorithm and optimize the path so that the robot can plan a safer path while avoiding obstacles first. Compared with Deep Q Network(DQN) algorithm, the convergence speed after improvement is shortened by about 40%, and the loss function is more stable.
What problem does this paper attempt to address?