Approximately Optimal Control of Discrete-Time Nonlinear Switched Systems Using Globalized Dual Heuristic Programming

Chaoxu Mu,Kaiju Liao,Ling Ren,Zhongke Gao
DOI: https://doi.org/10.1007/s11063-020-10278-9
IF: 2.565
2020-07-30
Neural Processing Letters
Abstract:Based on the idea of data-driven control, a novel iterative adaptive dynamic programming (ADP) algorithm based on the globalized dual heuristic programming (GDHP) technique is used to solve the optimal control problem of discrete-time nonlinear switched systems. In order to solve the Hamilton–Jacobi–Bellman (HJB) equation of switched systems, the iterative ADP method is proposed and the strict convergence analysis is also provided. Three neural networks are constructed to implement the iterative ADP algorithm, where a novel model network is designed to identify the system dynamics, a critic network is used to approximate the cost function and its partial derivatives, and an action network is provided to obtain the approximate optimal control law. Two simulation examples are described to illustrate the effectiveness of the proposed method by comparing with the heuristic dynamic programming (HDP) and dual heuristic programming (DHP) methods.
computer science, artificial intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the optimal control problem of discrete-time nonlinear switched systems. Specifically, the researchers propose an iterative Adaptive Dynamic Programming (ADP) algorithm based on Globalized Dual Heuristic Programming (GDHP) technology to solve the Hamilton-Jacobi-Bellman (HJB) equation for such systems. By constructing three neural networks (model network, evaluation network, and action network), this method can achieve the identification of system dynamics, approximation of the cost function and its partial derivatives, and ultimately obtain an approximate optimal control strategy. ### Main Contributions: 1. **For the first time, the ADP algorithm based on GDHP technology is applied to the optimal control problem of discrete-time nonlinear switched systems**. 2. **A new model network is designed to estimate state variables, significantly improving learning performance**. 3. **Through simulation results, the iterative HDP and DHP methods are compared with the proposed GDHP method, demonstrating the superiority of the GDHP method**. ### Research Background: - **Switched Systems**: Hybrid control systems composed of two or more subsystems, widely used in power systems, robotic systems, traffic control systems, and process control systems. - **Optimal Control Problem**: Not only requires studying the optimal control law but also considering the optimal switching sequence. - **Dynamic Programming (DP) Algorithm**: A commonly used method to solve optimal control problems, but it encounters the curse of dimensionality during the iterative process. - **Adaptive Dynamic Programming (ADP) Algorithm**: Solves the optimal control problem of high-dimensional systems through iterative methods, avoiding the curse of dimensionality. ### Method Overview: - **Model Network**: Used to identify system dynamics. - **Evaluation Network**: Used to approximate the cost function and its partial derivatives. - **Action Network**: Used to obtain the approximate optimal control law. ### Experimental Verification: - **Simulation Examples**: The effectiveness and superiority of the proposed method are verified through simulations of two discrete-time nonlinear switched systems. The simulation results show that the GDHP method outperforms traditional HDP and DHP methods in control performance. ### Conclusion: - The proposed iterative ADP algorithm based on GDHP can effectively solve the optimal control problem of discrete-time nonlinear switched systems, with high control performance and convergence.