Robot Intelligent Trajectory Planning Based on PCM Guided Reinforcement Learning

Xiang Teng, Jian Fu, Cong Li, ZhaoJie Ju
DOI: https://doi.org/10.1007/978-3-030-27529-7_30
2019-01-01
Abstract: Reinforcement Learning (RL) has been successfully applied to multi-degree-of-freedom robots to acquire motor skills; however, it rarely considers the relationships between joints, or considers only linear relationships between them. To capture the nonlinear relationships between degrees of freedom (DOFs), we propose a Pseudo Covariance Matrix (PCM) to guide reinforcement learning for motor skill acquisition. Specifically, the method combines Path Integral Policy Improvement ($\mathrm{PI}^2$) with Kernel Canonical Correlation Analysis (KCCA), where KCCA is used to obtain the PCM in a high-dimensional space, and the PCM is recorded as heuristic information for searching for an optimal or sub-optimal strategy. Experiments on robots (SCARA and UR5) demonstrate that the new method is feasible and effective.
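To make the idea of covariance-guided exploration more concrete, the following is a minimal sketch of a simplified, episodic $\mathrm{PI}^2$-style parameter update in which the exploration noise across DOFs is drawn from a given covariance matrix. It is not the paper's algorithm: the matrix `cov` is a hand-picked placeholder standing in for the PCM (the KCCA step that would produce it is not shown), and the names `cholesky`, `pi2_update`, `n_rollouts`, `h`, the toy quadratic cost, and the target values are all assumptions made for illustration.

```python
import math
import random

def cholesky(cov):
    """Lower-triangular factor L with L * L^T == cov (cov must be positive definite)."""
    n = len(cov)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = cov[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def pi2_update(theta, cost_fn, cov, rng, n_rollouts=20, h=10.0):
    """One simplified PI^2-style update with exploration noise ~ N(0, cov)."""
    n = len(theta)
    L = cholesky(cov)
    # Sample correlated perturbations: eps = L z, with z standard normal.
    eps = []
    for _ in range(n_rollouts):
        z = [rng.gauss(0.0, 1.0) for _ in range(n)]
        eps.append([sum(L[i][k] * z[k] for k in range(i + 1)) for i in range(n)])
    # Evaluate each perturbed parameter vector (one "rollout" each).
    costs = [cost_fn([t + e for t, e in zip(theta, ek)]) for ek in eps]
    # Scale costs to [0, 1] so the temperature h does not depend on cost units.
    lo, hi = min(costs), max(costs)
    span = (hi - lo) or 1.0
    weights = [math.exp(-h * (c - lo) / span) for c in costs]
    total = sum(weights)
    # Probability-weighted average of the perturbations: low cost, high weight.
    return [t + sum(w * ek[i] for w, ek in zip(weights, eps)) / total
            for i, t in enumerate(theta)]

# Toy usage: two "joints" with a hypothetical quadratic cost around a target.
target = [1.0, 0.5]
cost = lambda th: sum((a - b) ** 2 for a, b in zip(th, target))
cov = [[0.10, 0.08], [0.08, 0.10]]  # placeholder for a PCM-like matrix
rng = random.Random(0)
theta = [0.0, 0.0]
initial_cost = cost(theta)
for _ in range(50):
    theta = pi2_update(theta, cost, cov, rng)
final_cost = cost(theta)
```

The off-diagonal entries of `cov` make the two parameters explore jointly rather than independently, which is the role the abstract assigns to the PCM. The full $\mathrm{PI}^2$ algorithm works with per-time-step costs-to-go on a parameterized policy, which this sketch omits.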