Abstract:Classical reactive synthesis approaches aim to synthesize a reactive system that always satisfies a given specifications. These approaches often reduce to playing a two-player zero-sum game where the goal is to synthesize a winning strategy. However, in many pragmatic domains, such as robotics, a winning strategy does not always exist, yet it is desirable for the system to make an effort to satisfy its requirements instead of "giving up". To this end, this paper investigates the notion of admissible strategies, which formalize "doing-your-best", in quantitative reachability games. We show that, unlike the qualitative case, quantitative admissible strategies are history-dependent even for finite payoff functions, making synthesis a challenging task. In addition, we prove that admissible strategies always exist but may produce undesirable optimistic behaviors. To mitigate this, we propose admissible winning strategies, which enforce the best possible outcome while being admissible. We show that both strategies always exist but are not memoryless. We provide necessary and sufficient conditions for the existence of both strategies and propose synthesis algorithms. Finally, we illustrate the strategies on gridworld and robot manipulator domains.

What problem does this paper attempt to address?

This paper attempts to solve the problem of how to ensure that the system (Sys) meets its requirements as much as possible when there is no winning strategy in quantitative reachability games. Specifically, the paper explores the concept of "doing one's best" and introduces admissible strategies and admissible winning strategies to deal with the situation where there may be no winning strategy in practical applications such as robots. ### Main problems 1. **Handling when there is no winning strategy**: - In many practical scenarios, such as robot operations, there may be environmental factors (Env) that prevent the system from always being able to complete the task. At this time, the system needs a mechanism to try to meet its goals as much as possible instead of giving up directly. 2. **Properties and synthesis of admissible strategies**: - An admissible strategy is a strategy that does its best even in the worst - case scenario. The paper analyzes the characteristics of these strategies and proves that they always exist, but may produce overly optimistic behavior. 3. **Introduction of admissible winning strategies**: - In order to mitigate the overly optimistic behavior that admissible strategies may produce, the paper proposes admissible winning strategies. Such strategies not only do their best but also ensure the best possible results when possible. ### Specific content - **Definitions and background**: - The paper first defines a 2 - player quantitative game and introduces related basic concepts such as states, actions, transition functions, cost functions, etc. - **Analysis of admissible strategies**: - The paper analyzes the properties of admissible strategies in quantitative reachability games, pointing out that these strategies are usually history - dependent and provides necessary and sufficient conditions for their existence. - **Proposal of admissible winning strategies**: - To deal with the overly optimistic behavior that admissible strategies may produce, the paper proposes the concept of admissible winning strategies and proves that such strategies also always exist and require finite memory. - **Algorithms and examples**: - The paper provides algorithms for synthesizing admissible strategies and admissible winning strategies and demonstrates the effectiveness and flexibility of these strategies through examples such as grid worlds and robotic arm operations. ### Conclusions The main contributions of the paper include: 1. Analyzing the existence and properties of admissible strategies in quantitative reachability games that do not depend on the environmental rationality assumption. 2. Introducing the concept of admissible winning strategies to mitigate the overly optimistic behavior that admissible strategies may produce. 3. Proposing algorithms for synthesizing these two types of strategies. 4. Demonstrating the effectiveness and flexibility of these strategies in practical applications through multiple robot application examples. Through these works, the paper provides a new method for designing more robust reactive systems in complex environments.

Beyond Winning Strategies: Admissible and Admissible Winning Strategies for Quantitative Reachability Games

Admissibility Over Winning: A New Approach to Reactive Synthesis in Robotics

Reactive Synthesis Without Regret

Synthesis for multi-objective stochastic games: an application to autonomous urban driving

Sampling-based Reactive Synthesis for Nondeterministic Hybrid Systems

Stackelberg-Pareto Synthesis with Quantitative Reachability Objectives

A learning-based synthesis approach of reward asynchronous probabilistic games against the linear temporal logic winning condition

Strategy Improvement for Concurrent Reachability and Safety Games

Synthesizing Dominant Strategies for Liveness (Full Version)

Minimizing Expected Cost Under Hard Boolean Constraints, with Applications to Quantitative Synthesis

Multi-weighted Reachability Games and Their Application to Permissiveness

Multi-Agent Systems with Quantitative Satisficing Goals

On the Existence of Reactive Strategies Resilient to Delay

Peer Review #3 of "A Learning-Based Synthesis Approach of Reward Asynchronous Probabilistic Games Against the Linear Temporal Logic Winning Condition (V0.1)"

Synthesis of Opacity-Enforcing Winning Strategies Against Colluded Opponent

Towards the Usage of Window Counting Constraints in the Synthesis of Reactive Systems to Reduce State Space Explosion

Integrated Resource Allocation and Strategy Synthesis in Safety Games on Graphs with Deception

On Equilibria in Quantitative Games with Reachability/Safety Objectives

Contract-Based Distributed Synthesis in Two-Objective Parity Games

Strategy Complexity of Reachability in Countable Stochastic 2-Player Games

Weak Subgame Perfect Equilibria and their Application to Quantitative Reachability