Learning Algorithm for LQG Model with Constrained Control

Yankai Xu,Xi Chen
DOI: https://doi.org/10.3182/20080706-5-kr-1001.02616
2008-01-01
IFAC Proceedings Volumes
Abstract:The paper considers a discrete-time linear quadratic Gaussian model with constrained control. It is formulated with Markov systems. With the derivative equation, a performance gradient with respect to control parameters is estimated from a sample path. Then a learning algorithm is proposed to obtain a suboptimal feedback policy in affine linear form. The learning algorithm can be implemented on-line. Its improving feature makes the algorithm attain better performance than existing approaches, and the idea can be applied to more general cases.
What problem does this paper attempt to address?