Successive over Relaxation for Model-Free LQR Control of Discrete-Time Markov Jump Systems

Wenwu Fan,Junlin Xiong
DOI: https://doi.org/10.1016/j.automatica.2024.111919
IF: 6.4
2025-01-01
Automatica
Abstract:This paper aims to solve the model-free linear quadratic regulator problem for discrete-time Markov jump linear systems without requiring an initial stabilizing control policy. We propose both modelbased and model-free successive over relaxation algorithms to learn the optimal control policy of discrete-time Markov jump linear systems. The model-free value iteration algorithm is a special case of our model-free algorithm when the relaxation factor equals one. A sufficient condition on the relaxation factor is provided to guarantee the convergence of our algorithms. Moreover, it is proved that our model-free algorithm can obtain an approximate optimal solution when the transition probability matrix is unknown. Finally, a numerical example is used to illustrate our results. (c) 2024 Published by Elsevier Ltd.
What problem does this paper attempt to address?