scispace - formally typeset
Open AccessProceedings Article

Point-based value iteration: an anytime algorithm for POMDPs

TLDR
This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning, and presents results on a robotic laser tag problem as well as three test domains from the literature.
Abstract
This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution by selecting a small set of representative belief points and then tracking the value and its derivative for those points only. By using stochastic trajectories to choose belief points, and by maintaining only one value hyper-plane per point, PBVI successfully solves large problems: we present results on a robotic laser tag problem as well as three test domains from the literature.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Partially observable Markov decision processes for spoken dialog systems

TL;DR: This paper cast a spoken dialog system as a partially observable Markov decision process (POMDP) and shows how this formulation unifies and extends existing techniques to form a single principled framework.
Journal ArticleDOI

POMDP-Based Statistical Spoken Dialog Systems: A Review

TL;DR: This review article provides an overview of the current state of the art in the development of POMDP-based spoken dialog systems.
Proceedings ArticleDOI

SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces

TL;DR: This work has developed a new point-based POMDP algorithm that exploits the notion of optimally reachable belief spaces to improve com- putational efficiency and substantially outperformed one of the fastest existing point- based algorithms.
Journal ArticleDOI

Perseus: randomized point-based value iteration for POMDPs

TL;DR: This work presents a randomized point-based value iteration algorithm called PERSEUS, which backs up only a (randomly selected) subset of points in the belief set, sufficient for improving the value of each belief point in the set.
Journal ArticleDOI

The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management

TL;DR: This paper explains how Partially Observable Markov Decision Processes (POMDPs) can provide a principled mathematical framework for modelling the inherent uncertainty in spoken dialogue systems and describes a form of approximation called the Hidden Information State model which does scale and which can be used to build practical systems.
References
More filters
Journal ArticleDOI

Planning and Acting in Partially Observable Stochastic Domains

TL;DR: A novel algorithm for solving pomdps off line and how, in some cases, a finite-memory controller can be extracted from the solution to a POMDP is outlined.
Book ChapterDOI

Learning policies for partially observable environments: scaling up

TL;DR: This paper discusses several simple solution methods and shows that all are capable of finding near- optimal policies for a selection of extremely small POMDP'S taken from the learning literature, but shows that none are able to solve a slightly larger and noisier problem based on robot navigation.
Journal ArticleDOI

Value-function approximations for partially observable Markov decision processes

TL;DR: This work surveys various approximation methods, analyzes their properties and relations and provides some new insights into their differences, and presents a number of new approximation methods and novel refinements of existing techniques.
Proceedings Article

Incremental pruning: a simple, fast, exact method for partially observable Markov decision processes

TL;DR: It is found that incremental pruning is presently the most efficient exact method for solving POMDPS.

The optimal control of partially observable Markov processes

TL;DR: In this paper, a beam splitter was placed between an exit slit of an excitation monochromator and a specimen, and part of the excitation radiation was conducted to a first-light quantum meter by said splitter and a reference photomultiplier was provided for receiving fluorescence from said first light quantum meter.
Related Papers (5)