Beyond Winning Strategies: Admissible and Admissible Winning Strategies for Quantitative Reachability Games

Karan Muvvala,Qi Heng Ho,Morteza Lahijanian
2024-08-24
Abstract:Classical reactive synthesis approaches aim to synthesize a reactive system that always satisfies a given specifications. These approaches often reduce to playing a two-player zero-sum game where the goal is to synthesize a winning strategy. However, in many pragmatic domains, such as robotics, a winning strategy does not always exist, yet it is desirable for the system to make an effort to satisfy its requirements instead of "giving up". To this end, this paper investigates the notion of admissible strategies, which formalize "doing-your-best", in quantitative reachability games. We show that, unlike the qualitative case, quantitative admissible strategies are history-dependent even for finite payoff functions, making synthesis a challenging task. In addition, we prove that admissible strategies always exist but may produce undesirable optimistic behaviors. To mitigate this, we propose admissible winning strategies, which enforce the best possible outcome while being admissible. We show that both strategies always exist but are not memoryless. We provide necessary and sufficient conditions for the existence of both strategies and propose synthesis algorithms. Finally, we illustrate the strategies on gridworld and robot manipulator domains.
Computer Science and Game Theory,Formal Languages and Automata Theory,Logic in Computer Science,Robotics
What problem does this paper attempt to address?
This paper attempts to solve the problem of how to ensure that the system (Sys) meets its requirements as much as possible when there is no winning strategy in quantitative reachability games. Specifically, the paper explores the concept of "doing one's best" and introduces admissible strategies and admissible winning strategies to deal with the situation where there may be no winning strategy in practical applications such as robots. ### Main problems 1. **Handling when there is no winning strategy**: - In many practical scenarios, such as robot operations, there may be environmental factors (Env) that prevent the system from always being able to complete the task. At this time, the system needs a mechanism to try to meet its goals as much as possible instead of giving up directly. 2. **Properties and synthesis of admissible strategies**: - An admissible strategy is a strategy that does its best even in the worst - case scenario. The paper analyzes the characteristics of these strategies and proves that they always exist, but may produce overly optimistic behavior. 3. **Introduction of admissible winning strategies**: - In order to mitigate the overly optimistic behavior that admissible strategies may produce, the paper proposes admissible winning strategies. Such strategies not only do their best but also ensure the best possible results when possible. ### Specific content - **Definitions and background**: - The paper first defines a 2 - player quantitative game and introduces related basic concepts such as states, actions, transition functions, cost functions, etc. - **Analysis of admissible strategies**: - The paper analyzes the properties of admissible strategies in quantitative reachability games, pointing out that these strategies are usually history - dependent and provides necessary and sufficient conditions for their existence. - **Proposal of admissible winning strategies**: - To deal with the overly optimistic behavior that admissible strategies may produce, the paper proposes the concept of admissible winning strategies and proves that such strategies also always exist and require finite memory. - **Algorithms and examples**: - The paper provides algorithms for synthesizing admissible strategies and admissible winning strategies and demonstrates the effectiveness and flexibility of these strategies through examples such as grid worlds and robotic arm operations. ### Conclusions The main contributions of the paper include: 1. Analyzing the existence and properties of admissible strategies in quantitative reachability games that do not depend on the environmental rationality assumption. 2. Introducing the concept of admissible winning strategies to mitigate the overly optimistic behavior that admissible strategies may produce. 3. Proposing algorithms for synthesizing these two types of strategies. 4. Demonstrating the effectiveness and flexibility of these strategies in practical applications through multiple robot application examples. Through these works, the paper provides a new method for designing more robust reactive systems in complex environments.