Abstract:Real-world control applications in complex and uncertain environments require adaptability to handle model uncertainties and robustness against disturbances. This paper presents an online, output-feedback, critic-only, model-based reinforcement learning architecture that simultaneously learns and implements an optimal controller while maintaining stability during the learning phase. Using multiplier matrices, a convenient way to search for observer gains is designed along with a controller that learns from simulated experience to ensure stability and convergence of trajectories of the closed-loop system to a neighborhood of the origin. Local uniform ultimate boundedness of the trajectories is established using a Lyapunov-based analysis and demonstrated through simulation results, under mild excitation conditions.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to achieve real - time control of nonlinear systems in complex and uncertain environments, especially for those systems with high model uncertainty and the need for robustness against external disturbances. Specifically, the paper focuses on how to design an online, output - feedback - based, critic - only model - based reinforcement learning architecture that can simultaneously learn and implement the optimal controller during the learning process while maintaining system stability. The main challenges mentioned in the paper include: 1. **Dealing with model uncertainty**: In practical applications, system models are often uncertain, which makes traditional control methods difficult to work effectively. The method proposed in the paper aims to overcome these uncertainties through adaptive learning. 2. **Ensuring stability during the learning process**: During the learning process, ensuring system stability and performance is a key issue. The paper achieves this by designing an observer to estimate the system state and combining it with the model - based predictive control (MBRL) framework, ensuring that system stability can be maintained even during the learning stage. 3. **Solving the control problem of nonlinear systems with partially constrained inputs**: The paper targets nonlinear systems with partially constrained inputs, which are very common in practical applications, such as in robot control, aerospace, etc. The method proposed in the paper can effectively handle the control problems of such systems while ensuring system performance and stability. 4. **Optimizing control performance**: In addition to maintaining system stability, the paper also aims to improve system performance by optimizing the control strategy. By minimizing a given cost function, the method proposed in the paper can find a near - optimal control strategy. In summary, the core objective of this paper is to develop an online adaptive optimal control method that can work effectively in complex and uncertain environments, especially suitable for nonlinear systems with partially constrained inputs, while ensuring stability during the learning process and the final control performance.

Output Feedback Adaptive Optimal Control of Affine Nonlinear systems with a Linear Measurement Model

Adaptive Output-Feedback Control of Nonlinear Systems with Unknown Nonlinearities.

Near Optimal Neural Network-based Output Feedback Control of Affine Nonlinear Discrete-Time Systems

State and Input Constrained Output-Feedback Adaptive Optimal Control of Affine Nonlinear Systems

State and Parameter Estimation for Affine Nonlinear Systems

Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems Using Online Approximators

Online Reinforcement Learning-based Neural Network Controller Design for Affine Nonlinear Discrete-time Systems.

Observer-Based Adaptive Output Feedback Control for Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement Learning.

ℒ1adaptive Controller for a Class of Non-Affine Multi-Input Multi-Output Nonlinear Systems

Robust Adaptive Output Feedback Control of Nonlinearly Parameterized Systems

Online Output-Feedback Optimal Control of Linear Systems Based on Data-Driven Adaptive Learning

Output-feedback Robust Tracking Control of Uncertain Systems via Adaptive Learning

Adaptive Output-Feedback Optimal Control for Continuous-Time Linear Systems Based on Adaptive Dynamic Programming Approach

Robust adaptive output feedback control for uncertain nonlinear systems

Safe adaptive output‐feedback optimal control of a class of linear systems

Online Adaptive Optimal Control for Continuous-Time Nonlinear Systems with Completely Unknown Dynamics.

Observer-Based Neuro-Adaptive Optimized Control of Strict-Feedback Nonlinear Systems with State Constraints

Self-learning Robust Optimal Control for Continuous-Time Nonlinear Systems with Mismatched Disturbances

Output-Feedback Robust Control of Uncertain Systems Via Online Data-Driven Learning

Online accelerated data‐driven learning for optimal feedback control of discrete‐time partially uncertain systems

Neural-Network-Based Online Optimal Control for Uncertain Non-Linear Continuous-Time Systems with Control Constraints