Abstract: This article solves the robust tracking problem (RTP) for a class of partially unknown nonlinear systems with asymmetric constrained input by using an improved adaptive dynamic programming (ADP) method based on the experience replay (ER) technique and a critic‐only neural network (NN). First, an identifier neural network (INN) is used to identify the unknown part of the system dynamics. Next, the tracking error and the desired trajectory are used to construct an augmented system, so that the RTP is transformed into a constrained optimal control problem (OCP). It is proved that the control policy designed for the OCP makes the tracking error uniformly ultimately bounded (UUB). The derived Hamilton–Jacobi–Bellman equation (HJBE) is then solved within the ADP framework using the critic‐only NN. The NN weight update law is partially derived using the gradient descent algorithm (GDA) and is then improved using the ER technique and Lyapunov stability theory, so that neither the persistence of excitation (PE) condition nor an initial admissible control is required. In addition, the overall system states and the NN weights are proved to be closed‐loop stable via the Lyapunov technique. Finally, two simulation examples demonstrate that the proposed control scheme is effective.
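To illustrate why replaying stored samples can stand in for the PE condition, the following is a minimal sketch of a gradient-descent critic update with experience replay. It is not the weight update law derived in the article: the Lyapunov-based stabilizing term is omitted, and the residual is assumed to be linear in the critic weights, e = wᵀφ + r. The names `TRUE_WEIGHTS`, `make_sample`, `critic_step`, and `train`, as well as the regressors and costs, are hypothetical and chosen only so that the example can be checked by hand.

```python
# Hypothetical "ideal" critic weights, used only to generate consistent toy data.
TRUE_WEIGHTS = [1.0, -2.0, 0.5]


def make_sample(regressor):
    """Build a (regressor, cost) pair whose residual vanishes at TRUE_WEIGHTS."""
    cost = -sum(w * p for w, p in zip(TRUE_WEIGHTS, regressor))
    return (regressor, cost)


def normalized_gradient(weights, regressor, cost):
    """Gradient of 0.5 * e^2 with e = w.phi + cost, scaled by 1 / (1 + phi.phi)^2."""
    residual = sum(w * p for w, p in zip(weights, regressor)) + cost
    norm = (1.0 + sum(p * p for p in regressor)) ** 2
    return [p * residual / norm for p in regressor]


def critic_step(weights, current, replay_buffer, rate):
    """One descent step using the current sample plus every stored sample."""
    total = normalized_gradient(weights, *current)
    for sample in replay_buffer:
        grad = normalized_gradient(weights, *sample)
        total = [t + g for t, g in zip(total, grad)]
    return [w - rate * t for w, t in zip(weights, total)]


# The online regressor never changes direction, so it is not persistently exciting.
CURRENT_SAMPLE = make_sample([1.0, 0.0, 0.0])

# Stored samples whose regressors span the whole weight space.
REPLAY_BUFFER = [
    make_sample([1.0, 0.0, 0.0]),
    make_sample([0.0, 1.0, 0.0]),
    make_sample([0.0, 0.0, 1.0]),
]


def train(replay_buffer, steps=200, rate=0.5):
    """Run the critic update from zero initial weights."""
    weights = [0.0, 0.0, 0.0]
    for _ in range(steps):
        weights = critic_step(weights, CURRENT_SAMPLE, replay_buffer, rate)
    return weights


weights_with_replay = train(REPLAY_BUFFER)
weights_without_replay = train([])
```

In this toy setting, the run without replay only corrects the weight excited by the single online regressor, while the run with replay recovers all three weights because the stored regressors are linearly independent. That rank condition on recorded data is the usual replacement for PE in experience-replay schemes of this kind.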

Experience replay based online adaptive robust tracking control for partially unknown nonlinear systems with asymmetric constrained‐input