Open Access Proceedings Article

Symbolic dynamic programming for first-order MDPs

TL;DR: This technique uses an MDP whose dynamics is represented in a variant of the situation calculus allowing for stochastic actions, and produces a logical description of the optimal value function and policy by constructing a set of first-order formulae that minimally partition the state space according to distinctions made by the value function and policy.
Abstract
We present a dynamic programming approach for the solution of first-order Markov decision processes. This technique uses an MDP whose dynamics is represented in a variant of the situation calculus allowing for stochastic actions. It produces a logical description of the optimal value function and policy by constructing a set of first-order formulae that minimally partition the state space according to distinctions made by the value function and policy. This is achieved through the use of an operation known as decision-theoretic regression. In effect, our algorithm performs value iteration without explicit enumeration of either the state or action spaces of the MDP. This allows problems involving relational fluents and quantification to be solved without requiring explicit state space enumeration or conversion to propositional form.
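The abstract's contribution is value iteration that avoids enumerating states and actions. As a point of contrast, the classical, enumerated form of value iteration can be sketched on a toy two-state MDP; the states, transition probabilities, and rewards below are illustrative assumptions, not from the paper:

```python
# Classical value iteration over an explicitly enumerated MDP.
# The paper's symbolic approach avoids exactly this enumeration by
# operating on first-order logical descriptions of states instead;
# this propositional sketch is shown only for contrast.

GAMMA = 0.9  # discount factor

# P[s][a] = list of (next_state, probability); R[s][a] = immediate reward.
# Toy two-state MDP: action "a" in s1 keeps collecting reward 1.
P = {
    "s0": {"a": [("s0", 0.5), ("s1", 0.5)], "b": [("s0", 1.0)]},
    "s1": {"a": [("s1", 1.0)], "b": [("s0", 1.0)]},
}
R = {
    "s0": {"a": 0.0, "b": 0.0},
    "s1": {"a": 1.0, "b": 0.0},
}

def value_iteration(P, R, gamma=GAMMA, eps=1e-6):
    """Iterate the Bellman optimality backup until the value change
    falls below eps; return the value function and a greedy policy."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            # Q-value of each action: reward plus discounted expected value.
            q = {a: R[s][a] + gamma * sum(p * V[t] for t, p in P[s][a])
                 for a in P[s]}
            best = max(q.values())
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < eps:
            break
    # Greedy policy with respect to the converged value function.
    policy = {s: max(P[s], key=lambda a: R[s][a] + gamma *
                     sum(p * V[t] for t, p in P[s][a]))
              for s in P}
    return V, policy

V, pi = value_iteration(P, R)
```

On this toy MDP the optimal policy chooses "a" everywhere, and V("s1") converges to 1/(1 - 0.9) = 10. The cost of this algorithm grows with the number of states, which is what motivates the first-order, logic-based representation in the paper.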


Citations

Dynamic Bayesian networks: representation, inference and learning

TL;DR: This thesis discusses how to represent many different kinds of models as DBNs, how to perform exact and approximate inference in DBNs, and how to learn DBN models from sequential data.
Journal Article

Markov Decision Processes

TL;DR: The theory of Markov Decision Processes is the theory of controlled Markov chains, which has found applications in areas such as computer science, engineering, operations research, biology, and economics.
Book

Markov Logic: An Interface Layer for Artificial Intelligence

TL;DR: Most subfields of computer science have an interface layer through which applications communicate with the infrastructure, and this is key to their success; such an interface layer has been missing in AI.
Journal Article

Probabilistic reasoning with answer sets

TL;DR: P-log is a declarative language that combines logical and probabilistic arguments in its reasoning: answer set Prolog serves as the logical foundation, while causal Bayes nets serve as the probabilistic foundation.
Journal Article

A Concise Introduction to Models and Methods for Automated Planning

TL;DR: The goal is to provide a modern and coherent view of planning that is precise, concise, and mostly self-contained, without being shallow.
References
Book

Dynamic Programming

TL;DR: The more the authors study the information processing aspects of the mind, the more perplexed and impressed they become, and it will be a very long time before they understand these processes sufficiently to reproduce them.
Book

Markov Decision Processes: Discrete Stochastic Dynamic Programming

TL;DR: Puterman provides a uniquely up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models, focusing primarily on infinite-horizon discrete-time models with discrete state spaces while also examining models with arbitrary state spaces, finite-horizon models, and continuous-time discrete-state models.

Book

Neuro-Dynamic Programming

TL;DR: This is the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.
Journal Article

Decision-theoretic planning: structural assumptions and computational leverage

TL;DR: The authors present an overview and synthesis of MDP-related methods, showing how they provide a unifying framework for modeling many classes of planning problems studied in AI.