Abstract:This paper is concerned with the closed-loop Stackelberg strategy for linear-quadratic leader-follower game. Completely different from the open-loop and feedback Stackelberg strategy, the solvability of the closed-loop solution even the linear case remains challenging. The main contribution of the paper is to derive the explicitly linear closed-loop Stackelberg strategy with one-step memory in terms of Riccati equations. The key technique is to apply the constrained maximum principle to the leader-follower game and explicitly solve the corresponding forward and backward difference equations. Numerical examples verify the effectiveness of the results, which achieves better performance than feedback strategy.
What problem does this paper attempt to address?
This paper is primarily dedicated to addressing the closed-loop Stackelberg strategy problem in Linear-Quadratic (LQ) leader-follower games. Specifically, the paper focuses on finding an effective closed-loop Stackelberg strategy in a particular type of non-zero-sum game, namely the leader-follower game (Stackelberg game).
### Main Contributions
1. **Closed-Loop Stackelberg Strategy**: The main contribution of the paper is the derivation of a clear, one-step memory-based linear closed-loop Stackelberg strategy based on the Riccati equation. This strategy considers all state information from the initial moment to the current moment, which is more complex but advantageous compared to open-loop strategies that only consider initial values or feedback strategies that only consider the current state.
2. **Key Techniques**: The constrained maximum principle is used to handle the leader-follower game, and the corresponding forward and backward difference equations are explicitly solved.
3. **Performance Verification**: Numerical examples verify that the proposed closed-loop strategy performs better than traditional feedback strategies.
### Research Background and Challenges
- **Leader-Follower Game**: This has important applications in fields such as economics, engineering, and biology, where there are at least two participants, one called the leader and the other the follower. The leader can enforce its strategy on the follower, leading to the asymmetry and complexity of the Stackelberg solution.
- **Existing Research**: Open-loop and feedback strategies have been extensively studied, but for closed-loop strategies, especially in linear-quadratic games, even the most basic techniques like dynamic programming cannot be directly applied because the follower's response can only be implicitly expressed, resulting in a non-standard optimization problem with implicit constraints.
- **Solution Method**: To overcome these difficulties, the paper proposes a closed-loop one-step memory strategy, where the player's information set includes all states from the initial time to the current time. This method utilizes the constrained maximum principle and reduces the problem to solving forward and backward difference equations.
### Summary
In summary, this paper addresses an important issue in linear-quadratic leader-follower games by introducing a new closed-loop Stackelberg strategy and demonstrates that this method can achieve better results than traditional feedback strategies in practical applications.