Point-based value iteration: an anytime algorithm for POMDPs

Open Access Proceedings Article

pp. 1025-1030

TLDR

This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning, and presents results on a robotic laser tag problem as well as three test domains from the literature.

Abstract:

This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution by selecting a small set of representative belief points and then tracking the value and its derivative for those points only. By using stochastic trajectories to choose belief points, and by maintaining only one value hyper-plane per point, PBVI successfully solves large problems: we present results on a robotic laser tag problem as well as three test domains from the literature.
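The idea of keeping one value hyperplane per belief point can be illustrated with a minimal sketch. It assumes a small tabular POMDP stored in hypothetical nested lists `T[a][s][s2]` (transition probabilities), `Z[a][s2][o]` (observation probabilities) and `R[a][s]` (rewards); these names, the toy model, and the helper functions are illustrative assumptions and not the authors' implementation. Belief-point selection by stochastic trajectories is omitted; the belief set is simply given.

```python
def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(u, v))


def point_backup(b, alphas, T, Z, R, gamma):
    """One Bellman backup at belief b; returns a single alpha vector."""
    n = len(b)
    best_vec, best_val = None, float("-inf")
    for a in range(len(T)):
        vec = list(R[a])  # start from the immediate reward R[a][s]
        for o in range(len(Z[a][0])):
            # Project every current alpha vector back through (a, o):
            # g(s) = sum over s2 of T[a][s][s2] * Z[a][s2][o] * alpha[s2]
            projected = [
                [sum(T[a][s][s2] * Z[a][s2][o] * alpha[s2] for s2 in range(n))
                 for s in range(n)]
                for alpha in alphas
            ]
            # Keep only the projection that is best at this belief point.
            g = max(projected, key=lambda p: dot(b, p))
            vec = [v + gamma * gs for v, gs in zip(vec, g)]
        val = dot(b, vec)
        if val > best_val:
            best_vec, best_val = vec, val
    return best_vec


def pbvi_sweep(beliefs, alphas, T, Z, R, gamma):
    """Back up each belief point once: one hyperplane per point."""
    return [point_backup(b, alphas, T, Z, R, gamma) for b in beliefs]


def value(b, alphas):
    """Value at belief b: the upper surface of the alpha vectors."""
    return max(dot(b, alpha) for alpha in alphas)


# Hypothetical toy model: 2 states, 2 actions, 1 (uninformative) observation.
identity = [[1.0, 0.0], [0.0, 1.0]]
T = [identity, identity]                   # the state never changes
Z = [[[1.0], [1.0]], [[1.0], [1.0]]]       # Z[a][s2][o]
R = [[1.0, 0.0], [0.0, 1.0]]               # action a pays 1 in state a
beliefs = [[0.9, 0.1], [0.1, 0.9]]

first = pbvi_sweep(beliefs, [[0.0, 0.0]], T, Z, R, 0.5)
second = pbvi_sweep(beliefs, first, T, Z, R, 0.5)
```

Each sweep returns exactly `len(beliefs)` vectors, so the size of the value-function representation is bounded by the number of belief points rather than growing with every backup as in exact value iteration.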

Citations

Journal Article

Partially observable Markov decision processes for spoken dialog systems

Jason D. Williams, +1 more

- 01 Apr 2007 -

Computer Speech & Language

TL;DR: This paper casts a spoken dialog system as a partially observable Markov decision process (POMDP) and shows how this formulation unifies and extends existing techniques to form a single principled framework.

Journal Article

POMDP-Based Statistical Spoken Dialog Systems: A Review

Steve Young, +3 more

TL;DR: This review article provides an overview of the current state of the art in the development of POMDP-based spoken dialog systems.

Proceedings Article

SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces

Hanna Kurniawati, +2 more

TL;DR: This work develops a new point-based POMDP algorithm that exploits the notion of optimally reachable belief spaces to improve computational efficiency, and it substantially outperformed one of the fastest existing point-based algorithms.

Journal Article

Perseus: randomized point-based value iteration for POMDPs

Matthijs T. J. Spaan, +1 more

- 01 Jul 2005 -

Journal of Artificial Intelligence Resea...

TL;DR: This work presents a randomized point-based value iteration algorithm called PERSEUS, which backs up only a (randomly selected) subset of points in the belief set, sufficient for improving the value of each belief point in the set.

Journal Article

The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management

Steve Young, +6 more

- 01 Apr 2010 -

Computer Speech & Language

TL;DR: This paper explains how Partially Observable Markov Decision Processes (POMDPs) can provide a principled mathematical framework for modelling the inherent uncertainty in spoken dialogue systems and describes a form of approximation called the Hidden Information State model which does scale and which can be used to build practical systems.

References

Journal Article

Planning and Acting in Partially Observable Stochastic Domains

Leslie Pack Kaelbling, +2 more

- 01 May 1998 -

Artificial Intelligence

TL;DR: This paper outlines a novel algorithm for solving POMDPs offline and shows how, in some cases, a finite-memory controller can be extracted from the solution to a POMDP.

Book Chapter

Learning policies for partially observable environments: scaling up

Michael L. Littman, +2 more

TL;DR: This paper discusses several simple solution methods and shows that all are capable of finding near-optimal policies for a selection of extremely small POMDPs taken from the learning literature, but that none can solve a slightly larger and noisier problem based on robot navigation.

Journal Article

Value-function approximations for partially observable Markov decision processes

Milos Hauskrecht

- 01 Aug 2000 -

Journal of Artificial Intelligence Resea...

TL;DR: This work surveys various approximation methods, analyzes their properties and relations and provides some new insights into their differences, and presents a number of new approximation methods and novel refinements of existing techniques.

Proceedings Article

Incremental pruning: a simple, fast, exact method for partially observable Markov decision processes

Anthony R. Cassandra, +2 more

TL;DR: It is found that incremental pruning is presently the most efficient exact method for solving POMDPs.

The optimal control of partially observable Markov processes

Edward Jay Sondik
