Modeling Other Players with Bayesian Beliefs for Games with Incomplete Information

Zuyuan Zhang, Mahdi Imani, Tian Lan

2024-05-23

Abstract: Bayesian games model interactive decision-making where players have incomplete information -- e.g., regarding payoffs and private data on players' strategies and preferences -- and must actively reason about and update their belief models (with regard to such information) using observation and interaction history. Existing work on counterfactual regret minimization (CFR) has shown great success for games with complete or imperfect information, but not for Bayesian games. To this end, we introduce a new CFR algorithm, Bayesian-CFR, and analyze its regret bound with respect to Bayesian Nash Equilibria in Bayesian games. First, we present a method for updating the posterior distribution of beliefs about the game and other players' types. The method uses a kernel-density estimate and is shown to converge to the true distribution. Second, we define Bayesian regret and present a Bayesian-CFR minimization algorithm for computing the Bayesian Nash equilibrium. Finally, we extend this new approach to other existing algorithms, such as Bayesian-CFR+ and Deep Bayesian CFR. Experimental results show that our proposed solutions significantly outperform existing methods in classical Texas Hold'em games.
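To make the idea of a posterior over other players' types concrete, here is a minimal sketch of a belief update. It is an illustration only: it applies plain Bayes' rule over a small discrete set of types, not the kernel-density method described in the abstract. The type names, the `likelihoods` table, and the function name `update_belief` are all hypothetical.

```python
def update_belief(prior, likelihoods, observed_action):
    """Return the posterior over opponent types after one observed action.

    prior:        dict mapping type name -> prior probability (assumed to sum to 1)
    likelihoods:  dict mapping type name -> {action: P(action | type)}
                  (a hypothetical, hand-specified behavior model)
    """
    # Bayes' rule: posterior is proportional to prior * likelihood.
    unnormalized = {
        t: p * likelihoods[t].get(observed_action, 0.0)
        for t, p in prior.items()
    }
    total = sum(unnormalized.values())
    if total == 0.0:
        # The observation is impossible under every type; keep the prior.
        return dict(prior)
    return {t: w / total for t, w in unnormalized.items()}


# Hypothetical example: two opponent types and one observed "raise".
prior = {"aggressive": 0.5, "conservative": 0.5}
likelihoods = {
    "aggressive": {"raise": 0.8, "fold": 0.2},
    "conservative": {"raise": 0.2, "fold": 0.8},
}
posterior = update_belief(prior, likelihoods, "raise")
```

With these assumed numbers, observing a raise shifts the belief toward the aggressive type, since that type raises four times as often in the hypothetical behavior model.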

Computer Science and Game Theory

What problem does this paper attempt to address?

### The Problem the Paper Attempts to Solve

This paper addresses strategy computation in Bayesian games, where players have incomplete information. It proposes a new Counterfactual Regret Minimization (CFR) algorithm, Bayesian-CFR, for computing a Bayesian Nash Equilibrium (BNE). Existing CFR algorithms perform well in games with complete or imperfect information but have not shown the same success in Bayesian games. The paper's main contributions are:

1. **Updating Belief Models**: Propose a method for updating the posterior distribution of beliefs about the game and other players' types. The method uses kernel density estimation and is shown to converge to the true distribution.
2. **Defining Bayesian Regret**: Introduce the concept of Bayesian regret and propose a Bayesian-CFR minimization algorithm for computing the Bayesian Nash Equilibrium.
3. **Extending Existing Algorithms**: Extend the Bayesian-CFR approach to other existing algorithms, yielding Bayesian-CFR+ and Deep Bayesian CFR.

Experimental results show that the proposed solutions significantly outperform existing methods in classical Texas Hold'em games, across different types of player behavior (such as aggressive, neutral, and conservative).
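As a rough illustration of how regret minimization can interact with a belief over opponent types, the sketch below applies standard regret matching to action utilities averaged under a belief. This is not the paper's Bayesian-CFR algorithm: it covers a single decision point, the belief is fixed, and all names (`regret_matching`, `belief_weighted_utilities`, `update_regret`) and numbers are hypothetical.

```python
def regret_matching(cumulative_regret):
    """Turn cumulative regrets into a strategy (standard regret matching)."""
    positive = [max(r, 0.0) for r in cumulative_regret]
    total = sum(positive)
    n = len(cumulative_regret)
    if total <= 0.0:
        # No positive regret yet: play uniformly at random.
        return [1.0 / n] * n
    return [p / total for p in positive]


def belief_weighted_utilities(belief, utilities_by_type):
    """Average each action's utility over the belief about opponent types.

    belief:            dict mapping type -> probability
    utilities_by_type: dict mapping type -> list of per-action utilities
    """
    n_actions = len(next(iter(utilities_by_type.values())))
    return [
        sum(belief[t] * utilities_by_type[t][a] for t in belief)
        for a in range(n_actions)
    ]


def update_regret(cumulative_regret, strategy, action_utils):
    """Add this round's instantaneous regret for each action."""
    expected = sum(s * u for s, u in zip(strategy, action_utils))
    return [r + (u - expected) for r, u in zip(cumulative_regret, action_utils)]


# Hypothetical example: two actions, two opponent types.
belief = {"aggressive": 0.5, "conservative": 0.5}
utilities_by_type = {"aggressive": [1.0, 0.0], "conservative": [0.0, 1.0]}

regret = [0.0, 0.0]
strategy = regret_matching(regret)
utils = belief_weighted_utilities(belief, utilities_by_type)
regret = update_regret(regret, strategy, utils)
```

In a fuller treatment the belief would itself be updated from observations between iterations, and regrets would be tracked per information set rather than at a single decision point.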

Deep Counterfactual Regret Minimization

D2CFR: Minimize Counterfactual Regret With Deep Dueling Neural Network

Combining Counterfactual Regret Minimization with Information Gain to Solve Extensive Games with Unknown Environments

Kdb-D2CFR: Solving Multiplayer imperfect-information games with knowledge distillation-based DeepCFR

No-Regret Learning in Extensive-Form Games with Imperfect Recall

Imization for extensive games with imperfect information

Double Neural Counterfactual Regret Minimization.

Efficient CFR for Imperfect Information Games with Instant Updates

Model-Free Neural Counterfactual Regret Minimization with Bootstrap Learning

CFR-p: Counterfactual Regret Minimization with Hierarchical Policy Abstraction, and its Application to Two-player Mahjong

Lazy-CFR: a Fast Regret Minimization Algorithm for Extensive Games with Imperfect Information.

Regret Minimization in Non-Zero-Sum Games with Applications to Building Champion Multiplayer Computer Poker Agents

A Survey of Nash Equilibrium Strategy Solving Based on CFR

RM-FSP: Regret Minimization Optimizes Neural Fictitious Self-Play

Monte Carlo Neural Fictitious Self-Play: Achieve Approximate Nash equilibrium of Imperfect-Information Games.

No-Regret Learning in Bayesian Games

A Unified Perspective on Deep Equilibrium Finding

Scalable sub-game solving for imperfect-information games

RLCFR: Minimize counterfactual regret by deep reinforcement learning

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning