Abstract: Turn-based discounted-sum games are two-player zero-sum games played on finite directed graphs. The vertices of the graph are partitioned between player 1 and player 2. Plays are infinite walks on the graph in which the next vertex is chosen by the player who owns the current vertex. Each edge is assigned an integer weight, and the payoff of a play is the discounted sum of the weights along the play. The goal of player 1 is to maximize the discounted-sum payoff against the adversarial player 2. These games lie in NP and coNP and are among the rare combinatorial problems in this complexity class for which the existence of a polynomial-time algorithm is a major open question. Since breaking the general exponential barrier has been a challenging problem, faster parameterized algorithms have been considered. If the discount factor is expressed in unary, then discounted-sum games can be solved in polynomial time. However, if the discount factor is arbitrary (or expressed in binary) but the weights are in unary, none of the existing approaches yields a sub-exponential bound. Our main result is a new analysis technique for a classical algorithm (namely, the strategy iteration algorithm) that yields a new runtime bound of $n^{O ( W^{1/4} \sqrt{n} )}$ for game graphs with $n$ vertices and maximum absolute weight at most $W$. In particular, our result yields a deterministic sub-exponential bound for games with weights that are constant or represented in unary.

What problem does this paper attempt to address?

The core problem this paper addresses is obtaining a deterministic sub-exponential-time algorithm for discounted-sum games with unary weights. Specifically, the paper improves the running-time analysis of the strategy iteration algorithm, giving a new bound of $n^{O(W^{1/4}\sqrt{n})}$ for game graphs with $n$ vertices and maximum absolute weight at most $W$.

### Problem Background

Discounted-sum games are two-player zero-sum games in which the players move along a finite directed graph for infinitely many steps; one player maximizes the discounted-sum payoff and the other minimizes it. These games lie in NP ∩ coNP, but whether a polynomial-time algorithm exists remains an important open problem. In particular, when the discount factor is represented in binary, none of the existing approaches yields a sub-exponential bound, even if the weights are represented in unary.

### Main Contributions of the Paper

The main result is an improved analysis of the classical strategy iteration algorithm, based on a new polynomial root-bound technique, which yields a new running-time bound. Specifically:

- **New Lower Bound**: For the roots of polynomials with integer coefficients, the paper proves a sub-exponential lower bound.
- **Constructive Upper Bound**: The paper provides an explicit polynomial construction such that the closeness of its roots to 1 can be described by a quasi-polynomial upper bound.
- **Non-Constructive Upper Bound**: The paper also proves that there exist polynomials whose closeness of roots to 1 can be described by a sub-exponential, non-constructive upper bound.

Together, these results establish a deterministic sub-exponential running time for the strategy iteration algorithm on discounted-sum games with unary weights.
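To make the strategy iteration algorithm mentioned above concrete, here is a minimal sketch in its textbook form: evaluate the current pair of positional strategies exactly, let player 2 compute a best response, then let player 1 switch to strictly better edges, and repeat until no switch is possible. The game encoding (`edges`, `owner`) and the function names `evaluate`, `improve`, and `strategy_iteration` are assumptions made for this illustration, and the switching rule shown (switch every improvable vertex) is not necessarily the exact variant analyzed in the paper.

```python
from fractions import Fraction

def evaluate(succ, lam):
    """Exact values when every vertex v has one chosen edge succ[v] = (target, weight)."""
    val = [None] * len(succ)
    for start in range(len(succ)):
        path, pos = [], {}
        v = start
        # Walk until we hit an already evaluated vertex or close a cycle.
        while val[v] is None and v not in pos:
            pos[v] = len(path)
            path.append(v)
            v = succ[v][0]
        if val[v] is None:
            # New cycle starting at v: geometric series over one cycle traversal.
            cycle = path[pos[v]:]
            one_round = sum(succ[u][1] * lam**i for i, u in enumerate(cycle))
            val[v] = one_round / (1 - lam**len(cycle))
        # Back-propagate val[u] = weight + lam * val[successor] along the walk.
        for u in reversed(path):
            if val[u] is None:
                target, weight = succ[u]
                val[u] = weight + lam * val[target]
    return val

def improve(edges, owner, strat, val, lam, player):
    """Switch each vertex of `player` to a strictly better edge; report any change."""
    changed = False
    for v, out in enumerate(edges):
        if owner[v] != player:
            continue
        best, best_val = strat[v], val[v]
        for target, weight in out:
            cand = weight + lam * val[target]
            # Player 1 maximizes, player 2 minimizes.
            if (cand > best_val) if player == 1 else (cand < best_val):
                best, best_val = (target, weight), cand
        if best != strat[v]:
            strat[v], changed = best, True
    return changed

def strategy_iteration(edges, owner, lam):
    """edges[v]: non-empty list of (target, weight); owner[v] in {1, 2}; 0 < lam < 1."""
    lam = Fraction(lam)
    strat = [out[0] for out in edges]  # arbitrary initial choice at every vertex
    while True:
        # Inner loop: player 2's best response to player 1's current strategy.
        while True:
            val = evaluate(strat, lam)
            if not improve(edges, owner, strat, val, lam, player=2):
                break
        # Outer step: player 1 switches; stop when no edge is strictly better.
        if not improve(edges, owner, strat, val, lam, player=1):
            return val, strat
```

As a usage example, take a two-vertex game where vertex 0 belongs to player 1 and vertex 1 to player 2, with `edges = [[(0, 0), (1, 2)], [(0, -2), (1, 1)]]`, `owner = [1, 2]`, and discount factor `Fraction(1, 2)`. Calling `strategy_iteration(edges, owner, Fraction(1, 2))` returns exact rational values for both vertices. Exact arithmetic via `Fraction` is a design choice here so that the "strictly better" test is not affected by rounding.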
### Practical Significance

Beyond their theoretical importance, these games have practical applications in the analysis of reactive systems. For example, when analyzing system state transitions, small or constant weights can naturally represent the importance of different events. Improved algorithms for these games therefore have both theoretical and practical value.

### Formula Summary

The key formulas in the paper include:

- Discounted-sum payoff function of a play $\omega = v_0 v_1 v_2 \dots$:

  \[ \text{Disc}_\lambda(\omega)=\sum_{i \geq 0} \lambda^i r(v_i, v_{i + 1}) \]

  where $\lambda$ is the discount factor and $r(v_i, v_{i + 1})$ is the weight of the edge $(v_i, v_{i + 1})$.

- New running-time bound:

  \[ n^{O\left(W^{1/4}\sqrt{n}\right)} \]
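As an illustration of the discounted-sum payoff, the sketch below evaluates it exactly for an eventually periodic play, that is, a finite prefix of edge weights followed by a cycle of weights repeated forever. The function name `discounted_sum` and the prefix/cycle encoding are assumptions made for this example; the closed form used for the periodic tail is the standard geometric series.

```python
from fractions import Fraction

def discounted_sum(prefix, cycle, lam):
    """Exact discounted sum of the weight sequence prefix, cycle, cycle, ...

    prefix, cycle: lists of integer edge weights (cycle must be non-empty).
    lam: discount factor with 0 < lam < 1.
    """
    lam = Fraction(lam)
    if not 0 < lam < 1:
        raise ValueError("discount factor must lie strictly between 0 and 1")
    if not cycle:
        raise ValueError("cycle must contain at least one weight")
    # Finite part: weights of the prefix, discounted by position.
    head = sum(w * lam**i for i, w in enumerate(prefix))
    # One traversal of the cycle, discounted from its own start.
    one_round = sum(w * lam**i for i, w in enumerate(cycle))
    # The cycle repeats every len(cycle) steps, giving a geometric series.
    tail = lam**len(prefix) * one_round / (1 - lam**len(cycle))
    return head + tail
```

For instance, `discounted_sum([3], [1, -1], Fraction(1, 2))` evaluates the play whose weights are 3, 1, -1, 1, -1, and so on, and returns `Fraction(10, 3)`.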

Deterministic Sub-exponential Algorithm for Discounted-sum Games with Unary Weights
