AlphaDou: High-Performance End-to-End Doudizhu AI Integrating Bidding

Chang Lei,Huan Lei
2024-09-13
Abstract:Artificial intelligence for card games has long been a popular topic in AI research. In recent years, complex card games like Mahjong and Texas Hold'em have been solved, with corresponding AI programs reaching the level of human experts. However, the game of Doudizhu presents significant challenges due to its vast state/action space and unique characteristics involving reasoning about competition and cooperation, making the game extremely difficult to solve.The RL model Douzero, trained using the Deep Monte Carlo algorithm framework, has shown excellent performance in Doudizhu. However, there are differences between its simplified game environment and the actual Doudizhu environment, and its performance is still a considerable distance from that of human experts. This paper modifies the Deep Monte Carlo algorithm framework by using reinforcement learning to obtain a neural network that simultaneously estimates win rates and expectations. The action space is pruned using expectations, and strategies are generated based on win rates. The modified algorithm enables the AI to perform the full range of tasks in the Doudizhu game, including bidding and cardplay. The model was trained in a actual Doudizhu environment and achieved state-of-the-art performance among publicly available models. We hope that this new framework will provide valuable insights for AI development in other bidding-based games.
Artificial Intelligence,Computer Science and Game Theory,Multiagent Systems
What problem does this paper attempt to address?
The paper attempts to address the problem of developing a high-performance end-to-end AI system for the game of Dou Dizhu, particularly by integrating the card-playing phase and the bidding phase (i.e., scoring). Dou Dizhu is a highly popular three-player card game in China, characterized by its complex state and action space, as well as unique competitive and cooperative reasoning features, making it highly challenging to develop an AI system that can reach the level of human experts. Existing Dou Dizhu AI models, such as Douzero, although performing well in the card-playing phase, still show deficiencies in the bidding phase, with a significant performance gap compared to human experts. Additionally, the simplified game environment used by models like Douzero differs from the actual Dou Dizhu environment, limiting their practical application effectiveness. Therefore, the paper proposes a new reinforcement learning framework—AlphaDou, aiming to improve the Deep Monte Carlo algorithm framework, utilizing reinforcement learning to train neural networks while estimating win rates and expected values, thereby achieving action space pruning and strategy generation. AlphaDou is capable of handling not only the card-playing phase but also effectively bidding, thus demonstrating excellent performance throughout the entire game. Experimental results show that AlphaDou achieves state-of-the-art performance among publicly available models and provides valuable references for AI development in other bidding-required games.