Open Access Proceedings Article

Structured reachability analysis for Markov decision processes

TL;DR
A family of algorithms for structured reachability analysis of MDPs, suitable when an initial state (or set of states) is known, that can be used to eliminate variables or variable values from the problem description, reducing the size of the MDP and making it easier to solve.
Abstract
Recent research in decision-theoretic planning has focused on making the solution of Markov decision processes (MDPs) more feasible. We develop a family of algorithms for structured reachability analysis of MDPs that are suitable when an initial state (or set of states) is known. Using compact, structured representations of MDPs (e.g., Bayesian networks), our methods, which vary in the tradeoff between complexity and accuracy, produce structured descriptions of (estimated) reachable states that can be used to eliminate variables or variable values from the problem description, reducing the size of the MDP and making it easier to solve. One contribution of our work is the extension of ideas from GRAPHPLAN to deal with the distributed nature of action representations typically embodied within Bayes nets and the problem of correlated action effects. We also demonstrate that our algorithm can be made more complete by using k-ary constraints instead of binary constraints. Another contribution is the illustration of how the compact representation of reachability constraints can be exploited by several existing (exact and approximate) abstraction algorithms for MDPs.
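To make the idea concrete, here is a minimal sketch of the kind of reachability fixpoint the abstract describes, under a deliberately simplified assumption: each action is encoded as a pair of precondition and effect sets over (variable, value) pairs. This encoding and all function names are illustrative, not the authors' actual Bayes-net machinery.

```python
# Hedged sketch: over-approximate reachability analysis for a factored MDP.
# Assumes each action is given as (preconditions, effects), where both are
# sets of (variable, value) pairs -- a simplification of the Bayes-net
# action representation discussed in the paper.

def reachable_values(initial_state, actions):
    """Fixpoint over the variable values reachable from the initial state.

    initial_state: dict mapping variable -> value
    actions: list of (preconds, effects) pairs, each a set of (var, value)
    Returns a dict mapping variable -> set of reachable values (an
    over-approximation, in the spirit of GRAPHPLAN's relaxed reachability).
    """
    reached = {var: {val} for var, val in initial_state.items()}
    changed = True
    while changed:
        changed = False
        for preconds, effects in actions:
            # Treat an action as applicable if each precondition value has
            # been reached individually; ignoring correlations between
            # values is exactly what makes this an over-approximation.
            if all(val in reached.get(var, set()) for var, val in preconds):
                for var, val in effects:
                    if val not in reached.setdefault(var, set()):
                        reached[var].add(val)
                        changed = True
    return reached

def prune_domains(domains, reached):
    """Drop values never marked reachable; a variable left with a single
    value carries no information and can be eliminated from the MDP."""
    return {var: vals & reached.get(var, set()) for var, vals in domains.items()}
```

Tracking values one at a time corresponds to the cruder end of the tradeoff; the k-ary variant mentioned in the abstract would presumably track joint constraints over tuples of values, capturing correlated action effects at additional cost.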


Citations
Journal Article

Decision-theoretic planning: structural assumptions and computational leverage

TL;DR: In this article, the authors present an overview and synthesis of MDP-related methods, showing how they provide a unifying framework for modeling many classes of planning problems studied in AI.
Journal Article

Stochastic dynamic programming with factored representations

TL;DR: This work uses dynamic Bayesian networks (with decision trees representing the local families of conditional probability distributions) to represent stochastic actions in an MDP, together with a decision-tree representation of rewards, and develops versions of standard dynamic programming algorithms that directly manipulate decision-tree representations of policies and value functions.
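As a hedged illustration of the representation this summary describes (not the paper's actual data structures), a value function stored as a decision tree over state variables lets dynamic programming operate on regions of state space rather than on individual states:

```python
# Illustrative sketch: a decision-tree value function over boolean state
# variables, evaluated without enumerating the flat state space. The class
# and variable names here are hypothetical.

class Leaf:
    def __init__(self, value):
        self.value = value

class Node:
    def __init__(self, var, if_true, if_false):
        self.var, self.if_true, self.if_false = var, if_true, if_false

def evaluate(tree, state):
    """Walk the tree using the state's variable assignments."""
    while isinstance(tree, Node):
        tree = tree.if_true if state[tree.var] else tree.if_false
    return tree.value

# States differing only in variables the tree never tests share a leaf:
V = Node("has_key", Leaf(10.0), Node("near_door", Leaf(2.0), Leaf(0.0)))
print(evaluate(V, {"has_key": False, "near_door": True, "raining": True}))  # 2.0
```

Structured dynamic programming backups combine such trees directly, so their cost scales with the number of distinct leaves rather than the number of states.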
Posted Content

Heuristic Search Value Iteration for POMDPs

TL;DR: Heuristic search value iteration (HSVI) is an anytime algorithm for solving POMDPs that returns a policy and a provable bound on its regret with respect to the optimal policy.
Proceedings Article

Heuristic search value iteration for POMDPs

TL;DR: HSVI is an anytime algorithm that returns a policy and a provable bound on its regret with respect to the optimal policy and is applied to a new rover exploration problem 10 times larger than most POMDP problems in the literature.
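The anytime loop both HSVI summaries describe can be sketched schematically as follows; `Bounds`, `explore_trial`, and the other names are placeholders for HSVI's actual bound representations and heuristic forward search, so this is the shape of the algorithm rather than an implementation.

```python
# Schematic of HSVI's anytime outer loop (all names are placeholders).
# Each trial runs a heuristic forward search that tightens upper and lower
# bounds on the optimal value; the remaining gap at the initial belief b0
# is a provable bound on the regret of the policy derived from the bounds.

def hsvi_outer_loop(b0, bounds, epsilon=1e-3):
    while bounds.upper(b0) - bounds.lower(b0) > epsilon:
        bounds.explore_trial(b0)  # one forward-search trial from b0
    return bounds.upper(b0) - bounds.lower(b0)  # certified regret bound
```

Because the gap only shrinks, the loop can be interrupted at any time and still return a policy with a valid (if looser) regret bound, which is what makes the algorithm anytime.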
Book

Automated Planning and Acting

TL;DR: This book presents a comprehensive paradigm of planning and acting using the most recent and advanced automated-planning techniques, and explains the computational deliberation capabilities that allow an actor to reason about its actions, choose them, organize them purposefully, and act deliberately to achieve an objective.
References
Book

Dynamic Programming

TL;DR: The more the authors study the information processing aspects of the mind, the more perplexed and impressed they become, and it will be a very long time before they understand these processes sufficiently to reproduce them.

Book

Neuro-dynamic programming

TL;DR: This is the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.