Modeling Other Players with Bayesian Beliefs for Games with Incomplete Information

Zuyuan Zhang, Mahdi Imani, Tian Lan
2024-05-23
Abstract: Bayesian games model interactive decision-making where players have incomplete information -- e.g., regarding payoffs and private data on players' strategies and preferences -- and must actively reason about and update their belief models (with regard to such information) using observation and interaction history. Existing work on counterfactual regret minimization (CFR) has shown great success for games with complete or imperfect information, but not for Bayesian games. To this end, we introduce a new CFR algorithm, Bayesian-CFR, and analyze its regret bound with respect to Bayesian Nash equilibria in Bayesian games. First, we present a method for updating the posterior distribution of beliefs about the game and other players' types. The method uses a kernel-density estimate and is shown to converge to the true distribution. Second, we define Bayesian regret and present a Bayesian-CFR minimization algorithm for computing the Bayesian Nash equilibrium. Finally, we extend this new approach to other existing CFR variants, yielding Bayesian-CFR+ and Deep Bayesian CFR. Experimental results show that our proposed solutions significantly outperform existing methods in classical Texas Hold'em games.
Computer Science and Game Theory
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve

This paper addresses strategy computation in Bayesian games, where players have incomplete information about payoffs and about other players' types. Specifically, it proposes a new Counterfactual Regret Minimization (CFR) algorithm, called Bayesian-CFR, for computing a Bayesian Nash Equilibrium (BNE). Existing CFR algorithms perform well in games with complete or imperfect information but have not shown comparable success in Bayesian games. The main objectives of the paper are:

1. **Updating Belief Models**: Propose a method for updating each player's posterior distribution over the game and other players' types. The method uses kernel density estimation and is shown to converge to the true distribution.
2. **Defining Bayesian Regret**: Introduce the concept of Bayesian regret and propose a Bayesian-CFR minimization algorithm for computing the Bayesian Nash Equilibrium, together with an analysis of its regret bound.
3. **Extending Existing Algorithms**: Extend the Bayesian-CFR approach to other CFR variants, yielding Bayesian-CFR+ and Deep Bayesian-CFR.

The paper reports that the proposed methods significantly outperform existing methods in Texas Hold'em experiments, and that they perform well against different types of player behavior (such as aggressive, neutral, and conservative).
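
To make the first two objectives concrete, the following is a minimal Python sketch of the general idea, not the paper's algorithm. It assumes a finite set of opponent types and a plain Bayes-rule update, whereas the paper uses a kernel-density estimate. The function names (`update_type_belief`, `belief_weighted_regret`, `regret_matching`), the likelihood values, and the payoff table are all hypothetical and chosen only for illustration. Regret matching itself is the standard update used inside CFR.

```python
import numpy as np


def update_type_belief(prior, likelihoods):
    """Bayes-rule update over a finite set of opponent types.

    Simplifying assumption: discrete types with known likelihoods
    P(observation | type). The paper instead uses a kernel-density estimate.
    """
    prior = np.asarray(prior, dtype=float)
    unnormalized = prior * np.asarray(likelihoods, dtype=float)
    total = unnormalized.sum()
    if total == 0.0:
        # Observation has zero probability under every type: keep the prior.
        return prior
    return unnormalized / total


def belief_weighted_regret(belief, utilities, strategy):
    """Instantaneous regret of each action, averaged over opponent types.

    utilities[t, a] is the (hypothetical) payoff of action a against type t.
    """
    expected_per_action = belief @ utilities      # shape: (num_actions,)
    expected_current = expected_per_action @ strategy
    return expected_per_action - expected_current


def regret_matching(cumulative_regret):
    """Standard regret matching: play in proportion to positive regret."""
    positive = np.maximum(cumulative_regret, 0.0)
    total = positive.sum()
    if total > 0.0:
        return positive / total
    # No action has positive regret: fall back to the uniform strategy.
    return np.full(len(positive), 1.0 / len(positive))


# Hypothetical example: three opponent types and two actions (fold, call).
types = ["aggressive", "neutral", "conservative"]
prior = np.array([1 / 3, 1 / 3, 1 / 3])
raise_likelihood = np.array([0.7, 0.4, 0.1])  # assumed P(raise | type)
posterior = update_type_belief(prior, raise_likelihood)

utilities = np.array([
    [-1.0, 2.0],   # vs aggressive: calling pays off
    [-1.0, 0.5],   # vs neutral
    [-1.0, -3.0],  # vs conservative: calling is costly
])
strategy = np.array([0.5, 0.5])
regret = belief_weighted_regret(posterior, utilities, strategy)
next_strategy = regret_matching(regret)
```

After observing a raise, the belief shifts toward the aggressive type, and the belief-weighted regret favors calling. A full CFR-style algorithm would accumulate such regrets over many iterations and over every information set, rather than applying a single update as done here.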