Abstract:As a typical example of imperfect information game,Texas Hold'em poker has been an ongoing challenge in artificial intelligence for a long time.Especially in the case of multiplayer no-limit Texas Hold'em,a variety of factors should be considered,combined with its large solution space,making multiplayer no-limit Texas Hold'em AI more challenging.Generally,there are two approaches to handle this problem.The first is game-theoretic method,trying to search the game tree and find out the Nash Equilibrium points with a range of approaches.The second is termed knowledge-based method,learning human player's actions and betting patterns to provide AI with references and make corresponding decisions.In this paper,we propose a gaming policy in multiplayer no-limit Texas Hold'em based on hands prediction,which belongs to the latter category.The basic idea behind our approach is to imitate the process of "hands reading" taken by human beings.Hands reading is an essential skill in playing Texas Hold'em,which refers to predicting the rough probability distribution,or even the precise hands,of the opponents' cards.Hands reading is based on the actions taken by our opponents and the real-time information we perceived.As the development of gaming,players will take several action in each betting round (each player will normally take several actions in a single hand in total).The actions taken by players reveal the information of their hands,making hands reading possible.In this paper,we proposes a complete gaming framework from the perspective of imperfect information game.The applicable scope of this game framework is also discussed in the same section.Then we apply the proposed framework to Texas Hold'em.When designing our poker AI,we focus more on the prediction of the unknown information set.At each time point,our poker AI conducts Monte Carlo simulation to obtain the winning rate based on the current collected information and the predicted information.And winning rate is the most important factor our AI considers to make a decision.This paper presents a detailed description of the AI designation and corresponding implementation.We give the readers valuable references to implement their own multiplayer no-limit Texas Hold'em AI.To our best knowledge,this is the first paper that holistically designs and implements multiplayer (more than 3 players) no-limit Texas Hold'em AI based on hands prediction.Experiments are conducted to demonstrate the performance of our policy and the produced AI.In terms of predicting unknown information set,our approach increases the hands prediction accuracy by 6.65％ than Markov model.To intuitively illustrate the strength of our AI,we select 7 programs from 2015 Huawei Software Elite Challenge for comparison.Experiments are conducted under various conditions,including heads-up (2 players) and multiplayer no-limit Texas Hold'em.We select tournament as the competition rule,with one extra buy-in and a constant blind.Our program gets a winning rate of 89％ in heads-up poker,and places 1.74 in average in eight-player poker.Additionally,it gets the best performance in terms of lasting rounds,winning chips,etc.

Imperfect Information Game in Multiplayer No-limit Texas Hold’em Based on Mean Approximation and Deep CFVnet

Game in Multiplayer No-Limit Texas Hold'Em Based on Hands Prediction

Monte Carlo Neural Fictitious Self-Play: Achieve Approximate Nash equilibrium of Imperfect-Information Games.

DecisionHoldem: Safe Depth-Limited Solving With Diverse Opponents for Imperfect-Information Games

Deep Reinforcement Learning from Self-Play in No-limit Texas Hold'em Poker

Scalable sub-game solving for imperfect-information games

Solving Imperfect Information Poker Games Using Monte Carlo Search and POMDP Models

Monte Carlo Neural Fictitious Self-Play: Approach to Approximate Nash equilibrium of Imperfect-Information Games

PokerGPT: An End-to-End Lightweight Solver for Multi-Player Texas Hold'em via Large Language Model

Kdb-D2CFR: Solving Multiplayer imperfect-information games with knowledge distillation-based DeepCFR

A Monte Carlo Neural Fictitious Self-Play approach to approximate Nash Equilibrium in imperfect-information dynamic games

Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning

DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker

AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning

Opponent Modeling in Poker Games

Regret Minimization in Non-Zero-Sum Games with Applications to Building Champion Multiplayer Computer Poker Agents

OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research

AI Enabled Bridge Bidding Supporting Interactive Visualization

Opponent Modeling in Multiplayer Imperfect-Information Games

Efficient Opponent Exploitation in No-Limit Texas Hold’em Poker: A Neuroevolutionary Method Combined with Reinforcement Learning

Efficient CFR for Imperfect Information Games with Instant Updates