$$H_\infty $$ Control Using Reinforcement Learning

Jinna Li, Frank L. Lewis, Jialu Fan
2023-01-01
Abstract: In this chapter, we first present a model-free off-policy game Q-learning algorithm to solve the $$H_\infty $$ control problem for linear discrete-time multi-player systems with a single source of external disturbances. The primary contribution is that the Q-learning strategy in the proposed algorithm is implemented through off-policy policy iteration rather than on-policy learning. Then, we present a data-driven adaptive dynamic programming method for solving the $$H_\infty $$ output feedback control problem with multiple players subject to multi-source disturbances. Considering the advantages of off-policy RL over on-policy RL, a novel off-policy game Q-learning algorithm dealing with mixed competition and cooperation among players is developed, such that the $$H_\infty $$ control problem can be solved for linear multi-player systems without knowledge of the system dynamics. In addition, rigorous proofs of algorithm convergence and unbiasedness of the solutions are presented. Simulation results demonstrate the effectiveness of the proposed method.
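For orientation, the $$H_\infty $$ control problem named in the abstract is commonly posed as a disturbance-attenuation condition, as sketched below for the single-disturbance case. This uses standard textbook notation and is not necessarily the chapter's own: the matrices $$A$$, $$B_i$$, $$E$$, the weights $$Q$$, $$R_i$$, the number of players $$n$$, and the attenuation level $$\gamma $$ are all assumed symbols.

```latex
% Assumed dynamics: n players with inputs u_{ik}, one disturbance input d_k
x_{k+1} = A x_k + \sum_{i=1}^{n} B_i u_{ik} + E d_k

% Assumed attenuation condition (x_0 = 0, any square-summable disturbance):
% the weighted output energy is at most \gamma^2 times the disturbance energy
\sum_{k=0}^{\infty} \Bigl( x_k^{\top} Q x_k + \sum_{i=1}^{n} u_{ik}^{\top} R_i u_{ik} \Bigr)
  \le \gamma^{2} \sum_{k=0}^{\infty} \lVert d_k \rVert^{2}

% Equivalent zero-sum game view: the players minimize and the disturbance maximizes
J = \sum_{k=0}^{\infty} \Bigl( x_k^{\top} Q x_k + \sum_{i=1}^{n} u_{ik}^{\top} R_i u_{ik}
      - \gamma^{2} d_k^{\top} d_k \Bigr)
```

In this view the players cooperate to minimize $$J$$ while the disturbance acts as an adversary, which is one way to read the "mixed competition and cooperation" described in the abstract.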