Abstract:We present a three-step method to perform system identification and optimal control of non-linear systems. Our approach is mainly data driven and does not require active excitation of the system to perform system identification. In particular, it is designed for systems for which only historical data under closed-loop control are available and where historical control commands exhibit low variability. In a first step, simple simulation models of the system are built and run under various conditions. In a second step, a neural network architecture is extensively trained on the simulation outputs to learn the system physics, and retrained with historical data from the real system with stopping rules. These constraints avoid overfitting that arise by fitting closed-loop controlled systems. By doing so, we obtain one (or many) system model(s), represented by this architecture, and whose behaviour can be chosen to match more or less the real system. Finally, state-of-the-art reinforcement learning with a variant of domain randomization and distributed learning is used for optimal control of the system. We first illustrate the model identification strategy with a simple example, the pendulum with external torque. We then apply our method to model and optimize the control of a large building facility located in Switzerland. Simulation results demonstrate that this approach generates stable functional controllers which outperform on comfort and energy benchmark rule-based controllers.

Hybrid Q-learning for Data-Based Optimal Control of Non-Linear Switching System

Optimal Control for Hybrid Systems Based on Mixed Dynamic Programming

A nonlinear predictive control algorithm based on fuzzy online modeling and discrete optimization

Study on optimal control strategy for a class of piecewise linear hybrid systems

On Modeling and Optimal Control of a Class of Hybrid Systems

A hybrid model-based optimal control method for nonlinear systems using simultaneous dynamic optimization strategies

Optimal Control for Constrained Discrete-Time Nonlinear Systems Based on Safe Reinforcement Learning.

Fuzzy Optimal Control for a Class of Discrete-Time Switched Nonlinear Systems

Adaptive autonomous soaring of multiple UAVs using Simultaneous Perturbation Stochastic Approximation

Near Optimal Control for a Class of Stochastic Hybrid Systems.

Approximately Optimal Control of Discrete-Time Nonlinear Switched Systems Using Globalized Dual Heuristic Programming

A Combined Policy Gradient and Q-learning Method for Data-driven Optimal Control Problems

Data-Driven Near-Optimal Control of Nonlinear Systems Over Finite Horizon

H∞$$ {h}_{\infty } $$ Optimal Output Tracking Control for Markov Jump Systems: A Reinforcement Learning‐based Approach

Fuzzy $H_{\infty }$ Control of Discrete-Time Nonlinear Markov Jump Systems via a Novel Hybrid Reinforcement $Q$-Learning Method

Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

Study on optimal control of switching systems and characteristic disparity

A hybrid learning method for system identification and optimal control

Optimal Control of Switched Hybrid Systems

Interactions of salts and denaturing agents with a polyacrylamide gel.

Indirect Adaptive Fuzzy-Regulated Optimal Control for Unknown Continuous-Time Nonlinear Systems.