Abstract: Based on the idea of data-driven control, a novel iterative adaptive dynamic programming (ADP) algorithm based on the globalized dual heuristic programming (GDHP) technique is used to solve the optimal control problem of discrete-time nonlinear switched systems. To solve the Hamilton–Jacobi–Bellman (HJB) equation of switched systems, an iterative ADP method is proposed and a rigorous convergence analysis is provided. Three neural networks are constructed to implement the iterative ADP algorithm: a novel model network identifies the system dynamics, a critic network approximates the cost function and its partial derivatives, and an action network produces the approximate optimal control law. Two simulation examples illustrate the effectiveness of the proposed method through comparison with the heuristic dynamic programming (HDP) and dual heuristic programming (DHP) methods.
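
For orientation, the optimality equation mentioned in the abstract can be written in a generic form. The notation below (mode index $v_k$, subsystem dynamics $f_{v}$, stage cost $U$, number of subsystems $M$, iteration index $i$) is assumed here for illustration and is not taken from the paper. The zero initialization $V_0 = 0$ is a common choice in iterative ADP, not a detail confirmed by this page.

$$
\begin{aligned}
x_{k+1} &= f_{v_k}(x_k, u_k), \qquad v_k \in \{1, \dots, M\}, \\
J^*(x_k) &= \min_{v_k} \min_{u_k} \bigl\{ U(x_k, u_k, v_k) + J^*(x_{k+1}) \bigr\}, \\
V_{i+1}(x_k) &= \min_{v_k} \min_{u_k} \bigl\{ U(x_k, u_k, v_k) + V_i\bigl(f_{v_k}(x_k, u_k)\bigr) \bigr\}, \qquad V_0(\cdot) = 0 .
\end{aligned}
$$

The second line is the discrete-time HJB (Bellman) equation, minimized jointly over the switching signal and the control input. The third line is the value-iteration recursion that an iterative ADP scheme approximates with neural networks.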

What problem does this paper attempt to address?

This paper addresses the optimal control of discrete-time nonlinear switched systems. Specifically, the researchers propose an iterative adaptive dynamic programming (ADP) algorithm based on the globalized dual heuristic programming (GDHP) technique to solve the Hamilton-Jacobi-Bellman (HJB) equation for such systems. By constructing three neural networks (a model network, a critic network, and an action network), the method identifies the system dynamics, approximates the cost function and its partial derivatives, and obtains an approximate optimal control law.

### Main Contributions:

1. **For the first time, a GDHP-based ADP algorithm is applied to the optimal control problem of discrete-time nonlinear switched systems**.
2. **A new model network is designed to estimate the state variables, significantly improving learning performance**.
3. **Simulation results compare the iterative HDP and DHP methods with the proposed GDHP method, demonstrating the superiority of the GDHP method**.

### Research Background:

- **Switched Systems**: Hybrid control systems composed of two or more subsystems, widely used in power systems, robotic systems, traffic control systems, and process control systems.
- **Optimal Control Problem**: Requires finding not only the optimal control law but also the optimal switching sequence.
- **Dynamic Programming (DP) Algorithm**: A commonly used method for solving optimal control problems, but one that suffers from the curse of dimensionality during the iterative process.
- **Adaptive Dynamic Programming (ADP) Algorithm**: Solves the optimal control problem of high-dimensional systems through iterative approximation, alleviating the curse of dimensionality.

### Method Overview:

- **Model Network**: Used to identify the system dynamics.
- **Critic Network**: Used to approximate the cost function and its partial derivatives.
- **Action Network**: Used to obtain the approximate optimal control law.

### Experimental Verification:

- **Simulation Examples**: The effectiveness of the proposed method is verified through simulations of two discrete-time nonlinear switched systems. The simulation results show that the GDHP method outperforms the HDP and DHP methods in control performance.

### Conclusion:

- The proposed GDHP-based iterative ADP algorithm can effectively solve the optimal control problem of discrete-time nonlinear switched systems, with high control performance and a convergent iteration.
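
The iteration summarized above can be illustrated on a toy problem where no neural networks are needed. The following is a minimal sketch, not the paper's algorithm. It assumes a hypothetical scalar switched system x_{k+1} = a_v x_k + b_v u_k with two modes, a quadratic stage cost q x^2 + r u^2, and value iteration started from a zero cost function. Under these assumptions every iterate has the form V_i(x) = p_i x^2, so the minimization over the control has a closed form and the minimization over the mode is a comparison of two numbers. All names and numerical values are made up for illustration.

```python
# Hypothetical scalar switched system: x_{k+1} = a_v * x_k + b_v * u_k, v in {0, 1}.
# Stage cost U(x, u) = Q*x^2 + R*u^2. All numbers are illustrative assumptions.
MODES = [(1.2, 1.0), (0.8, 0.3)]  # (a_v, b_v) for each subsystem
Q, R = 1.0, 1.0


def backup(p):
    """One value-iteration step for V_i(x) = p * x^2.

    For each mode, minimizing Q*x^2 + R*u^2 + p*(a*x + b*u)^2 over u gives
    u = -gain * x and the updated coefficient p_v. The mode with the smallest
    p_v is selected. Returns (p_next, best_mode, gain).
    """
    best = None
    for v, (a, b) in enumerate(MODES):
        gain = a * b * p / (R + b * b * p)
        p_v = Q + R * a * a * p / (R + b * b * p)
        if best is None or p_v < best[0]:
            best = (p_v, v, gain)
    return best


def value_iteration(tol=1e-12, max_iter=1000):
    """Iterate from V_0 = 0 until the coefficient stops changing."""
    p = 0.0
    history = [p]
    mode, gain = None, None
    for _ in range(max_iter):
        p_next, mode, gain = backup(p)
        history.append(p_next)
        if abs(p_next - p) < tol:
            break
        p = p_next
    return history, mode, gain


if __name__ == "__main__":
    history, mode, gain = value_iteration()
    p = history[-1]
    print(f"iterations: {len(history) - 1}")
    print(f"cost coefficient p: {p:.6f}")
    print(f"selected mode: {mode}, feedback gain: {gain:.6f}")
    # In this toy case the cost derivative is dV/dx = 2 * p * x.
    print(f"dV/dx at x = 1: {2 * p:.6f}")
```

In this setting the sequence of coefficients is nondecreasing and converges, which mirrors the kind of convergence property the summary attributes to the iterative ADP method. A GDHP critic would approximate both V(x) and its derivative dV/dx at once; here both are available exactly, which is why the example can skip the model, critic, and action networks entirely.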

Approximately Optimal Control of Discrete-Time Nonlinear Switched Systems Using Globalized Dual Heuristic Programming
