Policy Iteration Adaptive Dynamic Programming for Optimal Control of Multi-Player Stackelberg-Nash Games

Mingduo Lin,Bo Zhao,Derong Liu,Yongwei Zhang
DOI: https://doi.org/10.23919/ccc55666.2022.9901882
2022-01-01
Abstract:This paper investigates multi-player Stackelberg-Nash (SN) game problems of nonlinear continuous-time systems via policy iteration adaptive dynamic programming (ADP). To represent different hierarchical roles, the appropriate cost functions of the leader and each follower are designed. By introducing the ADP technique, the policy iteration algorithm is developed to obtain approximate solutions of the coupled HJ equation of each player. Then, the multi-player SN equilibrium is derived to guarantee the stability of the closed-loop system. Furthermore, the developed method is realized by employing the critic neural networks through the gradient-based weight updating algorithm. Finally, simulation example validates the effectiveness of the present method.
What problem does this paper attempt to address?