Reinforcement Learning-Based Direct Adaptive Optimal Control of JLQ Model

XU Yan-kai,CHEN Xi
DOI: https://doi.org/10.3321/j.issn:1001-0920.2008.12.008
2008-01-01
Abstract:The discrete-time direct adaptive optimal control problem of jump linear quadratic(JLQ) model is investigated.Reinforcement learning theory and approaches are applied to JLQ model and Q function-based policy iteration algorithm is designed to optimize system performance.When the system parameters and jump probabilities of modes are unknown,the parameter matrix with respcet to Q function is online estimated by observing system behavior under a given control law with recursive least square algorithm.Moreover,based on this matrix,a new policy which can improve system performanc is constructed.The algorithm can converge to the optimal policy.
What problem does this paper attempt to address?