Abstract: The problem of safely learning and controlling a dynamical system - i.e., of stabilizing an originally (partially) unknown system while ensuring that it does not leave a prescribed 'safe set' - has recently received tremendous attention in the controls community. Further complexities arise, however, when the structure of the safe set itself depends on the unknown part of the system's dynamics. In particular, a popular approach based on control Lyapunov functions (CLFs), control barrier functions (CBFs) and Gaussian processes (GPs, used to build a confidence set around the unknown term), which has proved successful when the safe set is known, becomes inefficient as-is: it introduces higher-order terms that must be estimated and bounded with high probability using only system state measurements. In this paper, we build on the recent literature on GPs and reproducing kernels to perform this latter task, and show how to modify the CLF-CBF-based approach accordingly to obtain safety guarantees. Namely, we derive exponential CLF and second-relative-order exponential CBF constraints whose satisfaction guarantees stability and forward invariance of the partially unknown safe set with high probability. To overcome the intractability of verifying these conditions on the continuous domain, we discretize the state space and use Lipschitz continuity properties of the dynamics to derive equivalent CLF and CBF certificates in the discrete state space. Finally, we present an algorithm for control design using the derived certificates.
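The discretization step mentioned in the abstract can be illustrated with a generic one-dimensional sketch. It is not the paper's certificate: the function name `certify_on_grid`, the scalar constraint `c`, and the margin `lipschitz * tau` are assumptions chosen for illustration. The idea is that if `c` is Lipschitz with constant `L` and every point of the domain lies within `tau` of a grid point, then `c(x_grid) <= -L * tau` at all grid points implies `c(x) <= 0` everywhere on the domain.

```python
import math


def certify_on_grid(c, lo, hi, tau, lipschitz):
    """Try to certify c(x) <= 0 for all x in [lo, hi] from grid evaluations.

    Hypothetical helper: grid centres are spaced 2*tau apart, so every x in
    [lo, hi] is within tau of some centre. If c is Lipschitz with constant
    `lipschitz`, then c(centre) <= -lipschitz * tau at every centre implies
    c(x) <= 0 on the whole interval. The test is sufficient, not necessary.
    """
    n = max(1, math.ceil((hi - lo) / (2 * tau)))
    for i in range(n):
        # Clamp the last centre so it never leaves the domain.
        centre = min(lo + tau + 2 * tau * i, hi)
        if c(centre) > -lipschitz * tau:
            return False  # margin not met: no certificate at this resolution
    return True


# c(x) = x^2 - 2 has |c'(x)| <= 2 on [-1, 1], so lipschitz = 2 is valid.
certified = certify_on_grid(lambda x: x * x - 2.0, -1.0, 1.0, 0.1, 2.0)

# c(x) = x^2 - 1 is <= 0 on [-1, 1] but touches 0 at the endpoints,
# so the tightened grid condition cannot certify it.
not_certified = certify_on_grid(lambda x: x * x - 1.0, -1.0, 1.0, 0.1, 2.0)
```

The tightened margin makes the check conservative: a constraint that holds with no slack, as in the second call, is never certified, and a finer grid (smaller `tau`) reduces the conservatism only for constraints that hold with some slack.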

Safe Q-learning for continuous-time linear systems

Optimal Control for Constrained Discrete-Time Nonlinear Systems Based on Safe Reinforcement Learning

A Learning-Based Optimal Tracking Controller for Continuous Linear Systems with Unknown Dynamics: Theory and Case Study

Lagrangian-based online safe reinforcement learning for state-constrained systems

Robust Safe Reinforcement Learning Control of Unknown Continuous-Time Nonlinear Systems with State Constraints and Disturbances

Safety-Aware Learning-Based Control of Systems with Uncertainty Dependent Constraints (extended version)

Q-Learning for Linear Quadratic Optimal Control with Terminal State Constraint

Safe Controller for Output Feedback Linear Systems using Model-Based Reinforcement Learning

Safe adaptive output-feedback optimal control of a class of linear systems

Optimized Control Invariance Conditions for Uncertain Input-Constrained Nonlinear Control Systems

Safe Model-Based Reinforcement Learning for Systems with Parametric Uncertainties

Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems

Reinforcement Learning of Structured Control for Linear Systems with Unknown State Matrix

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control

Learning to Control under Uncertainty with Data-Based Iterative Linear Quadratic Regulator

Model-Based Safe Reinforcement Learning With Time-Varying Constraints: Applications to Intelligent Vehicles

Learning the Linear Quadratic Regulator from Nonlinear Observations