Training Musculoskeletal Arm Play Taichi with Deep Reinforcement Learning

Haoran Xu,Xiang Ma,Leiyang Xu,Qiang Wang
DOI: https://doi.org/10.1109/i2mtc43012.2020.9129264
2020-01-01
Abstract:Musculoskeletal robot arm driven by pneumatic artificial muscle actuators is secure, lightweight and compliant, which makes it an ideal solution for prosthetics and rehabilitation equipment. However, there exists conflicts between anatomical accuracy and engineering feasibility, and the nonlinearity of muscle actuators brings difficulty in accurate mathematical modeling. To overcome these problems, in this paper, we propose an optimized musculoskeletal arm design of ten muscles and four degrees-offreedom, employ Deep Deterministic Policy Gradient (DDPG) to train a data driven controller, and combine the off-policy reinforcement learning algorithm with Hindsight Experience Replay (HER) to deal with sparse rewards situation. TaiChi motion sequence is acquired using Perception Neuron and the trajectory tracking experiments are carried out in simulation platform. Experiment results demonstrate that, the controlled musculoskeletal arm is able to track the TaiChi trajectory and learn well in sparse rewards situation.
What problem does this paper attempt to address?