Data-based neural controls for an unknown continuous-time multi-input system with integral reinforcement

Yongfeng Lv, Jun Zhao, Wan Zhang, Huimin Chang
DOI: https://doi.org/10.1007/s11768-024-00238-2
2024-11-30
Control Theory and Technology
Abstract: Integral reinforcement learning (IRL) is an effective tool for solving optimal control problems of nonlinear systems, and it has been widely used in optimal controller design for discrete-time nonlinear systems. However, solving the Hamilton–Jacobi–Bellman (HJB) equations for nonlinear systems requires precise knowledge of complicated dynamics. Moreover, the research and application of IRL in continuous-time (CT) systems need further development. To extend IRL to CT nonlinear systems, a data-based adaptive neural dynamic programming (ANDP) method is proposed to investigate the optimal control problem of uncertain CT multi-input systems, such that knowledge of the dynamics in the HJB equation is unnecessary. First, the multi-input model is approximated using a neural network (NN), which is then used to design an integral reinforcement signal. Subsequently, two criterion networks and one action network are constructed based on the integral reinforcement signal. A nonzero-sum Nash equilibrium can be reached by learning the optimal strategies of the multi-input model. In this scheme, the NN weights are continuously updated using an adaptive algorithm. The weight convergence and the system stability are analyzed in detail. The optimal control problem of a multi-input nonlinear CT system is effectively solved using the ANDP scheme, and the results are verified by a simulation study.
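To make the idea of an integral reinforcement signal concrete, the following is a minimal sketch and not the authors' implementation. It assumes a quadratic running cost and sampled state and input data, and it uses the standard IRL interval relation V(x(t)) = ∫ r dτ + V(x(t+T)). The function names, the scalar weights `q` and `r_weights`, and the trapezoidal rule are all illustrative assumptions that do not appear in the abstract.

```python
from typing import Sequence

Vector = Sequence[float]


def quadratic_cost(x: Vector, controls: Sequence[Vector],
                   q: float, r_weights: Sequence[float]) -> float:
    """Assumed running cost: q*|x|^2 + sum_j r_j*|u_j|^2 (hypothetical form)."""
    state_term = q * sum(v * v for v in x)
    control_term = sum(r_j * sum(v * v for v in u_j)
                       for r_j, u_j in zip(r_weights, controls))
    return state_term + control_term


def integral_reinforcement(xs: Sequence[Vector],
                           us: Sequence[Sequence[Vector]],
                           dt: float, q: float,
                           r_weights: Sequence[float]) -> float:
    """Trapezoidal approximation of the running cost integrated over one window.

    xs[k] is the sampled state and us[k] the list of input vectors at step k;
    samples are assumed to be equally spaced by dt.
    """
    costs = [quadratic_cost(x, u, q, r_weights) for x, u in zip(xs, us)]
    return sum(0.5 * (a + b) * dt for a, b in zip(costs, costs[1:]))


def irl_bellman_residual(v_start: float, v_end: float,
                         reinforcement: float) -> float:
    """Residual of V(x(t)) = integral + V(x(t+T)); zero for a consistent value estimate."""
    return reinforcement + v_end - v_start
```

In a scheme of this kind, a criterion (value) network would typically be tuned to drive such a residual toward zero using measured data only, which is why the drift dynamics need not appear explicitly. How the paper actually constructs and uses the signal is not specified in the abstract.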
automation & control systems