Abstract:This research paper introduces a model-free optimal controller for discrete-time Markovian jump linear systems (MJLSs), employing principles from the methodology of reinforcement learning (RL). While Q-learning methods have demonstrated efficacy in determining optimal controller gains for deterministic systems, their application to systems with Markovian switching remains unexplored. To address this research gap, we propose a Q-function involving the Markovian mode. Subsequently, a Q-learning algorithm is proposed to learn the unknown kernel matrix using raw input-state information from the system. Notably, the study proves the convergence of the proposed Q-learning optimal controller gains to the model-based optimal controller gains after proving the convergence of a value iteration algorithm as the first step. Addition of excitation noise to input which is required to ensure the leaning performance does not lead to any bias. Unlike the conventional optimal controller, the proposed method does not require any knowledge on system dynamics and eliminates the need for solving coupled algebraic Riccati equations arising in optimal control of MJLSs. Finally, the efficiency of the proposed method is demonstrated through a simulation study.

Stochastic LQ optimal control for Markov jumping systems with multiplicative noise using reinforcement learning

Reinforcement Learning-Based $\mathcal{h}_{\infty }$ Control of 2-D Markov Jump Roesser Systems with Optimal Disturbance Attenuation

Adaptive synchronization of delayed Markovian switching neural networks with Lévy noise

Optimal Vibration Control of a Class of Nonlinear Stochastic Systems with Markovian Jump

H∞$$ {h}_{\infty } $$ Optimal Output Tracking Control for Markov Jump Systems: A Reinforcement Learning‐based Approach

Finite-time L2−l∞ Tracking Control for Markov Jump Repeated Scalar Nonlinear Systems with Partly Usable Model Information

Asynchronous Control for Discrete-Time Markovian Jump Systems with Multiplicative Noise

Model-free optimal controller for discrete-time Markovian jump linear systems: A Q-learning approach

Asynchronous Static Output Feedback Control of Discrete-time Markov Jump Systems

H∞ optimal output tracking control for Markov jump systems: A reinforcement learning‐based approach

Average Cost Optimal Control of Stochastic Systems Using Reinforcement Learning

&Lt;inline-Formula> &Lt;tex-Math Notation="latex">$\mathcal H_{\infty }$&Lt;/tex-Math> &Lt;/inline-Formula> Control for 2-D Markov Jump Systems in Roesser Model

$\mathcal H_{\infty }$ Control for 2-D Markov Jump Systems in Roesser Model

Reinforcement learning‐based composite suboptimal control for Markov jump singularly perturbed systems with unknown dynamics

Robust Adaptive H∞ Control for Networked Uncertain Semi-Markov Jump Nonlinear Systems with Input Quantization

Finite-Time Control of Markov Jump Lur'e Systems with Singular Perturbations

Asynchronous Event-Triggered Output-Feedback Control of Singular Markov Jump Systems

Quantized Control of Markov Jump Nonlinear Systems Based on Fuzzy Hidden Markov Model.

A Fuzzy-Model-Based Approach to Optimal Control for Nonlinear Markov Jump Singularly Perturbed Systems: A Novel Integral Reinforcement Learning Scheme

Robust Adaptive Switching Control for Markovian Jump Nonlinear Systems Via Backstepping Technique

Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems