An Intelligent Robot Motion Planning Method and Application Via LPPO in Unknown Environment

Lingli Yu,Jiawei Luo,Kaijun Zhou
DOI: https://doi.org/10.1109/cyber55403.2022.9907624
2022-01-01
Abstract:Aiming at the problem of motion planning in complex dynamic environment, traditional algorithm which relies on prior map information is unavailable due to the change of environment and suffer from high computational complexity. Proximal policy optimization algorithm suffers from the problem of training failure due to a bad strategy learned in a certain round. In this paper, an improved end-to-end motion planning algorithm with proximal policy optimization is proposed to address this problem by using logarithmic decay strategy dynamically selecting the clipping range and applying the algorithm to robot motion planning. It ensures a larger exploration space for the agent at the beginning of training, and limits the sudden change of the agent's strategy by attenuating the clipping range at the later stage, which improves the stability of the algorithm. We compare our proposed algorithm with several other improved algorithms in different Mujoco environments and our built environment. The results show that our proposed improved algorithm obtains higher reward values and the robot has smoother motion control while navigating obstacle avoidance.
What problem does this paper attempt to address?