Hierarchical Motion Learning for Goal-Oriented Movements With Speed-Accuracy Tradeoff of a Musculoskeletal System

Junjie Zhou,Shanlin Zhong,Wei Wu
DOI: https://doi.org/10.1109/tcyb.2021.3109021
IF: 11.8
2021-01-01
IEEE Transactions on Cybernetics
Abstract:Generating various goal-oriented movements via the flexible muscle model of the musculoskeletal system as fast and accurately as possible is a pressing problem, which is also the basis of most human adaptive behaviors, such as reaching, catching, interception, and pointing. This article focuses on the adaptive motion generation of fast goal-oriented motion on the musculoskeletal system by implementing the speed-accuracy tradeoff (SAT) in a hierarchical motion learning framework. First, we introduce Fitts' Law into the modified basal ganglia circuit-inspired iterative decision-making model for achieving dynamic and adaptive decision making. Then, as a time constraint, the decision is decomposed into a series of supervised terms by the proposed striatal FSI-SPN interneuron circuit-inspired velocity modulator to implement the tradeoff smoothly on the musculoskeletal system. Finally, an improved policy gradient algorithm is suggested to generate the muscle excitations of the modulated motion via the proposed muscle co-contraction policy, which promotes general cooperation between flexor and extensor muscles. In experiments, a redundant musculoskeletal arm model is trained to perform the adaptive quick pointing movements. By combining the muscle co-contraction policy with SAT, our algorithm shows the most efficient training and the best performance in the adaptive motion generation among the other three popular reinforcement learning algorithms on the musculoskeletal model.
automation & control systems,computer science, cybernetics, artificial intelligence
What problem does this paper attempt to address?