Alternate inference-decision reinforcement learning with generative adversarial inferring for bridge bidding

Jiao Wang, Shijia Wang, Tao Xu
DOI: https://doi.org/10.1007/s00521-024-09860-2
2024-08-26
Neural Computing and Applications
Abstract: Contract bridge is a competitive-cooperative multiplayer game. In the bidding phase, decision-making is complex because much of the information held by the partner and the opponents is inaccessible. Inferring the partner's private information seems to be a viable way to promote cooperation. However, existing research has neglected the reliability and effective utilization of inferred information, which limits adaptability in the face of opponents' unpredictable policies and hidden hands. This paper introduces a novel bidding decision algorithm for contract bridge that uses generative adversarial inferring (GAI) and alternate inference-decision reinforcement learning (AID-RL). Specifically, GAI improves the reliability and rationality of partner information inference by factoring in stochastic noise and using adversarial training. Concurrently, AID-RL strengthens the effective and explicit exploitation of inference by integrating a reinforcement objective, which increases decision-making payoff, into the inference model training alongside the original generative objective. Moreover, a novel self-play approach, augmented by the counterfactual difference reward (CDR) mechanism, is proposed to expedite policy updates and ensure robust decision-making against inference biases and unknown opponents. Empirical results demonstrate the superior performance of the algorithm, which attains a margin of +0.359 IMPs against Wbridge5 and outperforms EPNN and SiB by +0.596 and +0.062 IMPs, respectively.
computer science, artificial intelligence