Value Iteration-based Zero-sum Neuro-optimal Control of Modular and Reconfigurable Robots Via Adaptive Dynamic Programming

Zhian Feng,Yuanchun Li,Tianjiao An,Bo Dong,Guangjun Liu
DOI: https://doi.org/10.1109/cac53003.2021.9727718
2021-01-01
Abstract:An adaptive dynamic programming (ADP) zerosum neuro-optimal control method based on value iteration (VI) algorithm is proposed for the optimal position and velocity tracking control issues of modular and reconfigurable robots (MRRs). An adaptive fuzzy control method is used to identify Coriolis and centripetal force term as well as gravity term of MRRs. The proposed VI algorithm allows any positive semidefinite function to be initialized. In order to ensure that the iterated value function converges to the optimal solution, the convergence analysis is presented. Based on VI and ADP, the Hamilton-Jacobi-Issacs (HJI) equation is solved by using neural network (NN), then the approximated optimal control is achieved. The asymptotic stability of MRR system is proved by Lyapunov theory. Finally, simulation reaults are presented to show the reliability of proposed method.
What problem does this paper attempt to address?