From First-Order to Second-Order Rationality: Advancing Game Convergence with Dynamic Weighted Fictitious Play

Qi Ju, Falin Hei, Yuxuan Liu, Zhemei Fang, Yunfeng Luo
2024-09-05
Abstract: Constructing effective algorithms to converge to Nash Equilibrium (NE) is an important problem in algorithmic game theory. Prior research generally posits that the upper bound on the convergence rate for games is $O\left(T^{-1/2}\right)$. This paper introduces a novel perspective, positing that the key to accelerating convergence in game theory is rationality. Based on this concept, we propose a Dynamic Weighted Fictitious Play (DW-FP) algorithm. We demonstrate that this algorithm can converge to a NE and exhibits a convergence rate of $O(T^{-1})$ in experimental evaluations.
Computer Science and Game Theory
What problem does this paper attempt to address?
This paper addresses how to converge to a Nash Equilibrium (NE) more efficiently in algorithmic game theory. Prior research generally posits that the upper bound on the convergence rate for games is \(O(T^{-1/2})\). The paper takes a different view: the key to accelerating convergence is "rationality". Based on this idea, the authors propose the Dynamic Weighted Fictitious Play (DW-FP) algorithm. They show that it converges to a Nash Equilibrium and that, in experimental evaluations, it exhibits a convergence rate of \(O(T^{-1})\), faster than the commonly assumed \(O(T^{-1/2})\) rate.

Specifically, the main contributions of the paper are:

1. **Introducing second-order rationality**: Unlike first-order rationality, which bases decisions only on the agent's own situation, second-order rationality takes into account both the agent's own strategy and the opponents' strategies, which improves training efficiency.
2. **Proposing the DW-FP algorithm**: By disclosing each agent's strategy during training and adjusting the weights according to these strategies, DW-FP can converge to a Nash Equilibrium more quickly.
3. **Experimental verification**: Comparisons with existing algorithms (FP, RM, RM+, and Greedy RM) on multiple standard test games show the advantage of DW-FP in convergence speed.

In conclusion, the paper aims to improve the efficiency of algorithms for computing Nash Equilibria by introducing a new rationality concept and a corresponding algorithm design.
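For context, the following is a minimal Python sketch of classical fictitious play (FP), the baseline that DW-FP builds on, for a two-player zero-sum matrix game. It is not the paper's DW-FP update rule, which this summary does not specify. The function names, the `alpha` parameter, and the matching-pennies example are assumptions made for illustration: `alpha = 0` gives standard FP, while `alpha > 0` weights later best responses more heavily using a fixed schedule rather than a dynamic one.

```python
def best_response(payoff_rows, opponent_mix):
    """Index of the pure action with the highest expected payoff."""
    values = [sum(p * q for p, q in zip(row, opponent_mix)) for row in payoff_rows]
    return max(range(len(values)), key=values.__getitem__)


def normalise(weights):
    """Scale non-negative weights so they sum to one."""
    total = sum(weights)
    return [w / total for w in weights]


def exploitability(A, x, y):
    """Sum of both players' best-response gains in a zero-sum game.

    A[i][j] is the row player's payoff; the value is zero exactly at an NE.
    """
    row_values = [sum(a * q for a, q in zip(row, y)) for row in A]
    col_values = [sum(A[i][j] * x[i] for i in range(len(A))) for j in range(len(A[0]))]
    return max(row_values) - min(col_values)


def weighted_fictitious_play(A, iterations, alpha=0.0):
    """Fictitious play with averaging weight t**alpha (assumed, fixed schedule)."""
    n, m = len(A), len(A[0])
    # Column player's payoff matrix, indexed [column action][row action].
    B = [[-A[i][j] for i in range(n)] for j in range(m)]
    x_weights = [0.0] * n
    y_weights = [0.0] * m
    x_weights[0] = 1.0  # arbitrary initial actions at t = 1
    y_weights[0] = 1.0
    for t in range(2, iterations + 1):
        x_avg, y_avg = normalise(x_weights), normalise(y_weights)
        i = best_response(A, y_avg)  # row player responds to column average
        j = best_response(B, x_avg)  # column player responds to row average
        w = float(t) ** alpha
        x_weights[i] += w
        y_weights[j] += w
    return normalise(x_weights), normalise(y_weights)


# Matching pennies: the unique NE is uniform play for both players.
matching_pennies = [[1.0, -1.0], [-1.0, 1.0]]
for T in (100, 1000, 10000):
    x, y = weighted_fictitious_play(matching_pennies, T)
    print(T, exploitability(matching_pennies, x, y))
```

Printing exploitability against `T` on a log-log scale is one way to read off an empirical convergence rate, which is the kind of comparison the paper's experiments report for DW-FP and its baselines.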