Efficient CFR for Imperfect Information Games with Instant Updates

Hui Li,Kailiang Hu,Yuan Qi,Le Song
2019-01-01
Abstract:Counterfactual regret minimization (CFR) is a framework of iterative algorithms and is empirically the fastest approach to solving large imperfect information games. However, for large games, the convergence speed of the state-of-theart CFR is still the key limitation, especially in real-time applications. We propose a novel counterfactual regret minimization method with instant updates, which has a provably lower convergence bound and a provably tighter space complexity bound. We apply the proposed instant updates into many CFR variants on one Leduc Hold’em instance and five different subgame instances of Heads-Up No-Limit Texas Hold’em (HUNL) generated by DeepStack. The proposed method empirically achieves faster convergence rates than the state-of-the-art CFR. In subgame instances of HUNL, our method converges three times faster than the hybrid method used in DeepStack.
What problem does this paper attempt to address?