Stochastic LQ optimal control for Markov jumping systems with multiplicative noise using reinforcement learning

Linwei Ye,Zhonggai Zhao,Fei Liu
DOI: https://doi.org/10.1016/j.sysconle.2024.105765
IF: 2.742
2024-03-07
Systems & Control Letters
Abstract:The stochastic linear quadratic (LQ) problem of discrete-time linear Markov jumping systems with multiplicative noise is investigated in this paper. Two reinforcement learning algorithms, one model-based and one model-free, are designed. The algorithms can be readily adapted to handle nonlinear cases by employing an appropriate function approximator. In the linear case, the network structure is designed to enable the problem to be solved without succumbing to modeling errors. For the model-free algorithm, an approach based on moving averages is proposed to reduce estimation variance. A proof of convergence is also provided. Finally, the effectiveness of the proposed algorithms is demonstrated through a numerical example.
automation & control systems,operations research & management science
What problem does this paper attempt to address?