Open AccessProceedings Article
Point-based value iteration: an anytime algorithm for POMDPs
Joelle Pineau,Geoff Gordon,Sebastian Thrun +2 more
- pp 1025-1030
TLDR
This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning, and presents results on a robotic laser tag problem as well as three test domains from the literature.Abstract:
This paper introduces the Point-Based Value Iteration (PBVI) algorithm for POMDP planning. PBVI approximates an exact value iteration solution by selecting a small set of representative belief points and then tracking the value and its derivative for those points only. By using stochastic trajectories to choose belief points, and by maintaining only one value hyper-plane per point, PBVI successfully solves large problems: we present results on a robotic laser tag problem as well as three test domains from the literature.read more
Citations
More filters
Journal ArticleDOI
Partially observable Markov decision processes for spoken dialog systems
Jason D. Williams,Steve Young +1 more
TL;DR: This paper cast a spoken dialog system as a partially observable Markov decision process (POMDP) and shows how this formulation unifies and extends existing techniques to form a single principled framework.
Journal ArticleDOI
POMDP-Based Statistical Spoken Dialog Systems: A Review
TL;DR: This review article provides an overview of the current state of the art in the development of POMDP-based spoken dialog systems.
Proceedings ArticleDOI
SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces
TL;DR: This work has developed a new point-based POMDP algorithm that exploits the notion of optimally reachable belief spaces to improve com- putational efficiency and substantially outperformed one of the fastest existing point- based algorithms.
Journal ArticleDOI
Perseus: randomized point-based value iteration for POMDPs
TL;DR: This work presents a randomized point-based value iteration algorithm called PERSEUS, which backs up only a (randomly selected) subset of points in the belief set, sufficient for improving the value of each belief point in the set.
Journal ArticleDOI
The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management
Steve Young,Milica Gasic,Simon Keizer,François Mairesse,Jost Schatzmann,Blaise Thomson,Kai Yu +6 more
TL;DR: This paper explains how Partially Observable Markov Decision Processes (POMDPs) can provide a principled mathematical framework for modelling the inherent uncertainty in spoken dialogue systems and describes a form of approximation called the Hidden Information State model which does scale and which can be used to build practical systems.
References
More filters
Journal ArticleDOI
Planning and Acting in Partially Observable Stochastic Domains
TL;DR: A novel algorithm for solving pomdps off line and how, in some cases, a finite-memory controller can be extracted from the solution to a POMDP is outlined.
Book ChapterDOI
Learning policies for partially observable environments: scaling up
TL;DR: This paper discusses several simple solution methods and shows that all are capable of finding near- optimal policies for a selection of extremely small POMDP'S taken from the learning literature, but shows that none are able to solve a slightly larger and noisier problem based on robot navigation.
Journal ArticleDOI
Value-function approximations for partially observable Markov decision processes
TL;DR: This work surveys various approximation methods, analyzes their properties and relations and provides some new insights into their differences, and presents a number of new approximation methods and novel refinements of existing techniques.
Proceedings Article
Incremental pruning: a simple, fast, exact method for partially observable Markov decision processes
TL;DR: It is found that incremental pruning is presently the most efficient exact method for solving POMDPS.
The optimal control of partially observable Markov processes
TL;DR: In this paper, a beam splitter was placed between an exit slit of an excitation monochromator and a specimen, and part of the excitation radiation was conducted to a first-light quantum meter by said splitter and a reference photomultiplier was provided for receiving fluorescence from said first light quantum meter.