Fast Value Iteration for Goal-Directed Markov Decision Processes

Nevin Lianwen Zhang,Weihong Zhang
DOI: https://doi.org/10.48550/arXiv.1302.1575
2013-02-06
Abstract:Planning problems where effects of actions are non-deterministic can be modeled as Markov decision processes. Planning problems are usually goal-directed. This paper proposes several techniques for exploiting the goal-directedness to accelerate value iteration, a standard algorithm for solving Markov decision processes. Empirical studies have shown that the techniques can bring about significant speedups.
Artificial Intelligence
What problem does this paper attempt to address?