Deterministic Sub-exponential Algorithm for Discounted-sum Games with Unary Weights

Ali Asadi,Krishnendu Chatterjee,Raimundo Saona,Jakub Svoboda
2024-05-20
Abstract:Turn-based discounted-sum games are two-player zero-sum games played on finite directed graphs. The vertices of the graph are partitioned between player 1 and player 2. Plays are infinite walks on the graph where the next vertex is decided by a player that owns the current vertex. Each edge is assigned an integer weight and the payoff of a play is the discounted-sum of the weights of the play. The goal of player 1 is to maximize the discounted-sum payoff against the adversarial player 2. These games lie in NP and coNP and are among the rare combinatorial problems that belong to this complexity class and the existence of a polynomial-time algorithm is a major open question. Since breaking the general exponential barrier has been a challenging problem, faster parameterized algorithms have been considered. If the discount factor is expressed in unary, then discounted-sum games can be solved in polynomial time. However, if the discount factor is arbitrary (or expressed in binary), but the weights are in unary, none of the existing approaches yield a sub-exponential bound. Our main result is a new analysis technique for a classical algorithm (namely, the strategy iteration algorithm) that present a new runtime bound which is $n^{O ( W^{1/4} \sqrt{n} )}$, for game graphs with $n$ vertices and maximum absolute weight of at most $W$. In particular, our result yields a deterministic sub-exponential bound for games with weights that are constant or represented in unary.
Computer Science and Game Theory
What problem does this paper attempt to address?
The core problem that this paper attempts to solve is to design a deterministic sub - exponential - time algorithm for discounted - sum games with unary weights. Specifically, the goal of the paper is to improve the running - time analysis of the strategy iteration algorithm, thereby providing a new running - time bound, that is, for a game graph with 𝑛 vertices and a maximum absolute weight of 𝑊, its running - time is 𝑛𝑂(𝑊^(1/4)√𝑛). ### Problem Background Discounted - sum games are a class of two - person zero - sum games where players interact for an infinite number of steps on a finite directed graph, with the goal of maximizing or minimizing the discounted - sum payoff. This type of game belongs to the NP ∩ coNP complexity class, but whether there exists a polynomial - time algorithm remains an important open problem. In particular, when the discount factor is represented in binary, even if the weights are represented in unary, existing methods cannot break through the exponential - time barrier. ### Main Contributions of the Paper The main result of the paper is to improve the analysis of the classical strategy iteration algorithm by introducing a new polynomial root - bound technique, obtaining a new running - time bound. Specifically: - **New Lower Bound**: For the roots of polynomials with integer coefficients, the paper proves a sub - exponential lower bound. - **Constructive Upper Bound**: The paper provides an explicit polynomial construction such that the closeness of its roots to 1 can be described by a quasi - polynomial upper bound. - **Non - Constructive Upper Bound**: The paper also proves that there exist certain polynomials whose closeness of roots to 1 can be described by a sub - exponential non - constructive upper bound. These results together establish the deterministic sub - exponential - time complexity of the strategy iteration algorithm when dealing with discounted - sum games with unary weights. ### Practical Significance In addition to its theoretical importance, this type of game also has practical applications in the analysis of reactive systems. For example, when analyzing system state transitions, small or constant weights can naturally represent the importance of different events. Therefore, improving the algorithms for this type of game has both theoretical and practical application value. ### Formula Summary The key formulas involved in the paper include: - Discounted - sum payoff function: \[ \text{Disc}_\lambda(\omega)=\sum_{i \geq 0} \lambda^i r(v_i, v_{i + 1}) \] where \( \lambda \) is the discount factor and \( r(v_i, v_{i + 1}) \) is the weight of the edge \( (v_i, v_{i + 1}) \). - New running - time bound: \[ n^{O\left(W^{1/4}\sqrt{n}\right)} \] The correctness and readability of these formulas are guaranteed by the Markdown format.