Strategy Complexity of Büchi Objectives in Concurrent Stochastic Games

Stefan Kiefer, Richard Mayr, Mahsa Shirmohammadi, Patrick Totzke
2024-04-24
Abstract: We study 2-player concurrent stochastic Büchi games on countable graphs. Two players, Max and Min, seek respectively to maximize and minimize the probability of visiting a set of target states infinitely often. We show that there always exist $\varepsilon$-optimal Max strategies that use just a step counter plus 1 bit of public memory. This upper bound holds for all countable graphs, but it is a new result even for the special case of finite graphs. The upper bound is tight in the sense that Max strategies that use just a step counter, or just finite memory, are not sufficient even on finite game graphs.
Computer Science and Game Theory, Probability
What problem does this paper attempt to address?
This paper studies the strategy complexity of Büchi and transience objectives in concurrent stochastic games. The setting is a 2-player concurrent stochastic game on a countable graph: Max seeks to maximize the probability of visiting a set of target states infinitely often, and Min seeks to minimize it. The main contributions are:

1. **ε-optimal strategies for Büchi objectives**:
   - Max always has ε-optimal strategies that use just a step counter plus 1 bit of public memory. This upper bound holds for all countable graphs and is new even for the special case of finite graphs.
   - The upper bound is tight: Max strategies that use just a step counter, or just finite memory, are not sufficient even on finite game graphs.
2. **ε-optimal strategies for transience objectives**:
   - Max has memoryless ε-optimal strategies for transience objectives. This result is relevant to infinite-state games.
3. **ε-optimal strategies for combined Büchi and transience objectives**:
   - Max has ε-optimal strategies that use just 1 bit of public memory for the combination of Büchi and transience objectives. This result is stronger than the one for the Büchi objective alone.

The results are established by rigorous proofs that construct the strategies explicitly. They clarify how much memory Max needs in concurrent stochastic games with Büchi and transience objectives.
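To make the memory model concrete, the following is a minimal Python sketch of what a strategy with "a step counter plus 1 bit of public memory" looks like as a data structure. It is not the construction from the paper: the class `CounterBitStrategy`, the target set `TARGET`, the action names `safe` and `bold`, and the rules in `choose` and `update_bit` are all hypothetical and chosen only for illustration. The point is the shape of the memory: the mixed action may depend only on the current state, the number of steps played so far, and a single bit.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, Hashable

State = Hashable
Action = Hashable
Dist = Dict[Action, float]  # mixed action: action -> probability


@dataclass
class CounterBitStrategy:
    """A strategy whose only memory is a step counter and one bit (illustrative)."""
    choose: Callable[[State, int, bool], Dist]      # (state, step, bit) -> mixed action
    update_bit: Callable[[State, int, bool], bool]  # (new state, step, bit) -> new bit
    step: int = 0
    bit: bool = False

    def act(self, state: State, rng: random.Random) -> Action:
        # Sample an action from the mixed action prescribed for (state, step, bit).
        dist = self.choose(state, self.step, self.bit)
        actions = list(dist)
        return rng.choices(actions, weights=[dist[a] for a in actions])[0]

    def observe(self, next_state: State) -> None:
        # The counter advances every round; the bit is updated from observable data only.
        self.step += 1
        self.bit = self.update_bit(next_state, self.step, self.bit)


TARGET = {"t"}  # hypothetical set of target states


def choose(state: State, step: int, bit: bool) -> Dist:
    # Hypothetical rule: when the bit is set, play "bold" with a probability
    # that shrinks as the step counter grows; otherwise always play "safe".
    p = 2.0 ** -(step + 1) if bit else 0.0
    return {"safe": 1.0 - p, "bold": p}


def update_bit(state: State, step: int, bit: bool) -> bool:
    # Hypothetical rule: flip the bit on every visit to a target state.
    return (not bit) if state in TARGET else bit
```

A strategy that uses just a step counter corresponds to ignoring `bit` in `choose`, and a memoryless strategy corresponds to ignoring both `step` and `bit`. In this sketch the bit is updated only from the observed state and the step count, which is one simple way to keep it visible to both players.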