Abstract: Because of their unique adaptability, flexibility, and robustness, musculoskeletal robotic systems are regarded as potential next-generation robots. However, motion learning and generation for such systems remain challenging. This paper presents a neuromuscular control method, TMS-PPO, based on time-varying muscle synergy (TMS) and proximal policy optimization (PPO). The electromyogram (EMG) activation signals of actual human motions are decomposed to obtain TMSs based on the temporal properties of the TMS. Network weights are trained with PPO to generate the scale and phase coefficients. These coefficients modulate the TMSs to generate appropriate activation patterns and thereby optimize motion learning of the musculoskeletal system. To verify the effectiveness of the proposed method, TMSs are extracted from human upper limb muscle activation signals, and TMS-PPO is compared with PPO in the motion learning and generation process of an upper limb musculoskeletal system. The results show that TMS-PPO completes the control tasks, with average joint errors below 0.05 rad. In addition, the TMSs serve as motion primitives of the musculoskeletal system, simulating how the human central nervous system (CNS) controls muscles. Compared with PPO, TMS-PPO reduces energy consumption and significantly accelerates learning: the number of learning episodes drops from 10^4 to 10^3, which indicates that TMS-PPO has a stronger learning ability and a better physiological explanation.

Note to Practitioners: Owing to the advantages of the musculoskeletal system, research on humanoid robots that imitate human actuation mechanisms is being actively pursued worldwide. By taking advantage of human-like characteristics, the musculoskeletal robot provides new opportunities to understand and validate the human mechanisms of muscle control and motion learning, to compare the performance of the robot with that of humans, and to work in the real world in the future, e.g., as human-interactive robots, amusement robots, and medical training robots. However, the strong redundancy, coupling, and nonlinearity of the system also raise many challenges for the control problem. Inspired by how the human CNS controls a musculoskeletal system and realizes motion generalization, this paper proposes TMS-PPO, a novel muscle-synergy-based neuromuscular control method that combines time-varying muscle synergy (TMS) and proximal policy optimization (PPO). The method improves the learning efficiency of PPO and the physiological interpretability of the control process during motion learning and generation of the musculoskeletal system. Preliminary simulation experiments suggest that the method is feasible in terms of control accuracy and efficiency. However, the performance of TMS-PPO is only comparable to that of PPO, without significant improvement. To address this, future work will introduce a cerebellar model into the control method; in humans, the cerebellum adjusts and corrects limb motions to achieve accurate and stable control during actions.
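The way scale and phase coefficients modulate the TMSs can be illustrated with a minimal sketch. It assumes the commonly used time-varying synergy formulation, in which each synergy is a short multi-muscle activation waveform that is scaled by an amplitude coefficient and shifted by an onset delay before being summed; the paper's exact formulation may differ. The function name `combine_synergies`, the array shapes, and the use of integer sample delays for the phase are all assumptions made for illustration. In TMS-PPO the coefficients would come from a PPO-trained network; here they are plain arrays.

```python
import numpy as np

def combine_synergies(synergies, scales, delays, n_steps):
    """Sum scaled, time-shifted synergies into a muscle activation pattern.

    synergies: array (n_syn, n_muscles, syn_len), one waveform per synergy
    scales:    sequence of n_syn amplitude (scale) coefficients
    delays:    sequence of n_syn non-negative onset delays in samples (phase)
    returns:   array (n_muscles, n_steps), clipped to the range [0, 1]
    """
    synergies = np.asarray(synergies, dtype=float)
    n_syn, n_muscles, syn_len = synergies.shape
    activation = np.zeros((n_muscles, n_steps))
    for w, c, d in zip(synergies, scales, delays):
        start = int(d)
        if start < 0:
            raise ValueError("delays must be non-negative")
        if start >= n_steps:
            continue  # synergy starts after the end of the movement window
        end = min(start + syn_len, n_steps)
        # Add the part of the shifted synergy that fits inside the window.
        activation[:, start:end] += c * w[:, : end - start]
    # Muscle activations are conventionally bounded between 0 and 1.
    return np.clip(activation, 0.0, 1.0)

# Hypothetical example: 2 synergies, 3 muscles, 5 samples per synergy.
rng = np.random.default_rng(0)
W = rng.random((2, 3, 5))
m = combine_synergies(W, scales=[0.5, 1.0], delays=[0, 4], n_steps=10)
print(m.shape)  # (3, 10)
```

Under this assumption, the policy only has to output two numbers per synergy rather than one activation per muscle per time step, which is one plausible reason a synergy-based action space could shorten learning.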

Hierarchical Motion Learning for Goal-Oriented Movements With Speed-Accuracy Tradeoff of a Musculoskeletal System

Learning Gait-conditioned Bipedal Locomotion with Motor Adaptation

Learning Control Policies for Imitating Human Gaits

Optimum trajectory learning in musculoskeletal systems with model predictive control and deep reinforcement learning

Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid Locomotion

Scalable muscle-actuated human simulation and control

A memory and attention-based reinforcement learning for musculoskeletal robots with prior knowledge of muscle synergies

From Rough to Precise: Human-Inspired Phased Target Learning Framework for Redundant Musculoskeletal Systems

Autonomously Achieving Bipedal Locomotion Skill Via Hierarchical Motion Modelling

Reinforcement Learning Control of a Biomechanical Model of the Upper Extremity

Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement Learning

Bioinspired Gain-Modulated Recurrent Neural Network for Controlling Musculoskeletal Robot

Proximal Policy Optimization With Time-Varying Muscle Synergy for the Control of an Upper Limb Musculoskeletal System

A language‐directed virtual human motion generation approach based on musculoskeletal models

Hierarchical Learning Framework for Whole-Body Model Predictive Control of a Real Humanoid Robot

Hitting the Gym: Reinforcement Learning Control of Exercise-Strengthened Biohybrid Robots in Simulation

Semiparametric Musculoskeletal Model for Reinforcement Learning-Based Trajectory Tracking

Hierarchical Optimization for Personalized Hand and Wrist Musculoskeletal Modeling and Motion Estimation

A Cerebellum-Inspired Prediction and Correction Model for Motion Control of a Musculoskeletal Robot

μSim: A goal-driven framework for elucidating the neural control of movement through musculoskeletal modeling

Reinforcement Learning of Musculoskeletal Control from Functional Simulations