SEA-PARAM: Exploring Schedulers in Parametric MDPs

Sebastian Arming,Ezio Bartocci,Ana Sokolova
DOI: https://doi.org/10.4204/EPTCS.250.3
2017-07-13
Abstract:We study parametric Markov decision processes (PMDPs) and their reachability probabilities "independent" of the parameters. Different to existing work on parameter synthesis (implemented in the tools PARAM and PRISM), our main focus is on describing different types of optimal deterministic memoryless schedulers for the whole parameter range. We implement a simple prototype tool SEA-PARAM that computes these optimal schedulers and show experimental results.
Logic in Computer Science
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper mainly studies **the reachability probability in parameterized Markov decision processes (PMDPs)** and attempts to describe different types of optimal deterministic memoryless schedulers within the parameter range. Specifically, the paper addresses the following problems: 1. **Analysis of reachability probability under parameter uncertainty**: Unlike traditional Markov decision processes (MDPs), the transition probabilities in PMDPs depend on a set of parameters. Therefore, the probability of reaching the target state from the starting state depends not only on the selected scheduler but also on the specific values of the parameters. The paper aims to explore how to set bounds for the reachability probability across the entire parameter range in the case of parameter uncertainty. 2. **Classification and identification of optimal schedulers**: The paper proposes a method to enumerate all simple schedulers and calculate their corresponding rational functions. Based on these rational functions, the paper defines ten types of optimal schedulers, including: - **Dominant Scheduler**: It can provide the maximum reachability probability under any parameter values. - **Optimistic Scheduler**: It can achieve the maximum reachability probability under certain parameter values. - **Pessimistic Scheduler**: It can ensure better performance than other schedulers even in the worst - case scenario. - **Bound Scheduler**: It has the smallest range of variation in reachability probability. - **Expectation Scheduler**: It has the maximum expected value in the parameter space. - **Stable Scheduler**: It has the minimum variance, indicating more stable performance. 3. **Tool implementation**: The paper has developed a prototype tool named SEA - PARAM, which can calculate these optimal schedulers and presents the experimental results. By using the existing efficient tools PARAM and PRISM, SEA - PARAM can handle complex multi - variable rational function calculations. ### Formula representation To express the above concepts more clearly, here are some of the key formulas: - **Rational function of reachability probability**: \[ f_\xi(v)=\Pr_{M^{\xi,v}}(s, t) \] where \( f_\xi \) is the rational function corresponding to the scheduler \(\xi\), \( v \) is the parameter valuation, and \( M^{\xi,v} \) is the Markov chain induced by the scheduler \(\xi\) and parameter valuation \( v \). - **Maximum/minimum reachability probability**: \[ \Pr_M^{\max}(s, t)=\max_{\xi}\max_{v}\Pr_{M^{\xi,v}}(s, t) \] \[ \Pr_M^{\min}(s, t)=\min_{\xi}\min_{v}\Pr_{M^{\xi,v}}(s, t) \] - **Expected value**: \[ E(\xi)=\int f_\xi \, dp \] where \( p \) is the probability density function in the parameter space. ### Summary In general, this paper is committed to providing a systematic method to identify and classify optimal schedulers through the study of PMDPs in the case of parameter uncertainty, thereby providing theoretical support and practical tools for the control and reliability analysis of complex systems.