Open Access · Posted Content

Approximating the Stationary Hamilton-Jacobi-Bellman Equation by Hierarchical Tensor Products

TLDR
This work treats infinite horizon optimal control problems by numerically solving the associated stationary Hamilton-Jacobi-Bellman (HJB) equation to compute the value function and an optimal feedback law, using low-rank hierarchical tensor product approximation (tree-based tensor formats, in particular tensor trains, TT tensors) and multi-polynomials together with high-dimensional quadrature.
Abstract
We treat infinite horizon optimal control problems by solving the associated stationary Hamilton-Jacobi-Bellman (HJB) equation numerically, computing the value function and an optimal feedback law. The dynamical systems under consideration are spatial discretizations of nonlinear parabolic partial differential equations (PDEs), which means that the HJB equation suffers from the curse of dimensionality. To overcome numerical infeasibility we use low-rank hierarchical tensor product approximation, or tree-based tensor formats, in particular tensor trains (TT tensors) and multi-polynomials, since the resulting value function is expected to be smooth. To this end we reformulate the policy iteration algorithm as a linearization of the HJB equation. The resulting linear hyperbolic PDE remains the computational bottleneck due to its high dimension. By the method of characteristics it can be reformulated via the Koopman operator in the spirit of dynamic programming. We use a low-rank tensor representation to approximate the value function. The resulting operator equation is solved using high-dimensional quadrature, e.g. variational Monte Carlo methods. From the knowledge of the value function at computable samples $x_i$ we infer the function $x \mapsto v(x)$, and we investigate the convergence of this procedure. Numerical evidence is given by controlling destabilized versions of the viscous Burgers and Schloegl equations.
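To make the pipeline of the abstract concrete, here is a minimal policy-iteration sketch under loudly-flagged simplifications: a one-dimensional control-affine model problem dx/dt = x^3 + u with running cost x^2 + u^2 stands in for the paper's discretized Burgers/Schloegl systems, and a plain monomial ansatz replaces the TT-tensor multi-polynomials. All names and parameters are illustrative assumptions, not the authors' implementation.

```python
# Policy iteration sketch: evaluate the value function along trajectories
# (method of characteristics / Koopman picture), regress it onto a
# polynomial ansatz from Monte-Carlo samples, then improve the feedback.
import numpy as np

rng = np.random.default_rng(0)
deg = 6                                       # degree of the value-function ansatz
coeffs = np.zeros(deg + 1)                    # v(x) ~ sum_k coeffs[k] * x^k

def phi(x):                                   # monomial features 1, x, ..., x^deg
    return np.vander(x, deg + 1, increasing=True)

def v_grad(x):                                # dv/dx = sum_k k * coeffs[k] * x^(k-1)
    return (np.vander(x, deg, increasing=True) * np.arange(1, deg + 1)) @ coeffs[1:]

def rollout_cost(x0, policy, dt=1e-3, T=5.0):
    """Policy evaluation by the method of characteristics:
    v(x0) is the running cost integrated along the closed-loop trajectory."""
    x, J = x0.copy(), np.zeros_like(x0)
    for _ in range(int(T / dt)):
        u = policy(x)
        J += (x**2 + u**2) * dt               # accumulate running cost
        x += (x**3 + u) * dt                  # explicit Euler step of the dynamics
    return J

policy = lambda x: -2.0 * x                   # initial stabilizing feedback
for _ in range(5):                            # policy iteration
    xs = rng.uniform(-1.0, 1.0, 400)          # Monte-Carlo samples x_i
    vs = rollout_cost(xs, policy)             # v_k(x_i) at computable samples
    coeffs, *_ = np.linalg.lstsq(phi(xs), vs, rcond=None)  # regression step
    policy = lambda x: -0.5 * v_grad(x)       # improvement: u* = -(1/2) dv/dx

print("approx. value at x=0.5:", phi(np.array([0.5])) @ coeffs)
```

The least-squares fit here plays the role of the variational Monte Carlo step in the abstract; in the paper's setting the flat coefficient vector would instead be a TT tensor fitted by alternating schemes.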


Citations

Posted Content

Solving high-dimensional Hamilton-Jacobi-Bellman PDEs using neural networks: perspectives from the theory of controlled diffusions and measures on path space

TL;DR: In this paper, the authors investigate the potential of iterative diffusion optimisation techniques and develop a principled framework based on divergences between path measures, in particular considering applications in importance sampling and rare event simulation, and focusing on problems without diffusion control, with linearly controlled drift, and with running costs that depend quadratically on the control.

Posted Content

Tensor Decomposition Methods for High-dimensional Hamilton-Jacobi-Bellman Equations

TL;DR: In this paper, a tensor decomposition approach is presented for the solution of high-dimensional, fully nonlinear Hamilton-Jacobi-Bellman equations arising in optimal feedback control of nonlinear dynamics.
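Since TT tensors are central to this page, a generic TT-SVD compression sketch may be useful; this is the standard successive-SVD construction for illustration only, not the cited paper's solver, and the helper name tt_svd and the toy tensor are assumptions.

```python
# Generic TT-SVD: compress a full d-way array into tensor-train cores
# by successive truncated SVDs of the mode unfoldings.
import numpy as np

def tt_svd(A, eps=1e-10):
    """Return TT cores G_k of shape (r_{k-1}, n_k, r_k) with G_1 ... G_d ~ A."""
    dims, d = A.shape, A.ndim
    cores, r, C = [], 1, A.reshape(1, -1)
    for k in range(d - 1):
        C = C.reshape(r * dims[k], -1)               # unfold: current mode vs rest
        U, s, Vt = np.linalg.svd(C, full_matrices=False)
        rk = max(1, int(np.sum(s > eps * s[0])))     # truncated TT rank
        cores.append(U[:, :rk].reshape(r, dims[k], rk))
        C = s[:rk, None] * Vt[:rk]                   # carry remainder to next mode
        r = rk
    cores.append(C.reshape(r, dims[-1], 1))
    return cores

# Usage: exp(-(x1+x2+x3+x4)) is a product of 1D factors, so all TT ranks are 1.
x = np.linspace(0.0, 1.0, 8)
X1, X2, X3, X4 = np.meshgrid(x, x, x, x, indexing="ij")
A = np.exp(-(X1 + X2 + X3 + X4))
cores = tt_svd(A)
B = cores[0]
for G in cores[1:]:
    B = np.tensordot(B, G, axes=(-1, 0))             # contract the train back
assert np.allclose(B.reshape(A.shape), A)
print([G.shape for G in cores])
```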
Posted Content

Approximative Policy Iteration for Exit Time Feedback Control Problems driven by Stochastic Differential Equations using Tensor Train format

TL;DR: In this paper, the authors consider a stochastic optimal exit time feedback control problem, where the Bellman equation is solved approximately via the policy iteration algorithm on a polynomial ansatz space, leading to a sequence of linear equations.
Posted Content

Actor-Critic Method for High Dimensional Static Hamilton-Jacobi-Bellman Partial Differential Equations based on Neural Networks.

TL;DR: In this paper, an actor-critic framework inspired by reinforcement learning is proposed for solving high-dimensional elliptic partial differential equations (PDEs): the authors employ a policy gradient approach to improve the control and derive a variance-reduced least-squares temporal difference method (VR-LSTD) using stochastic calculus.
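To make the temporal-difference idea in the last entry concrete, here is a plain, textbook-style least-squares temporal-difference (LSTD) evaluation step. This is a generic sketch, not the cited paper's variance-reduced VR-LSTD; the function name lstd, the feature map, and the toy transitions are illustrative assumptions.

```python
# Plain LSTD policy evaluation: fit v(x) ~ phi(x) @ w from one-step
# transitions (x_i, r_i, x_i') by solving the projected Bellman equation.
import numpy as np

def lstd(phi, X, X_next, R, gamma=0.99):
    P, Pn = phi(X), phi(X_next)
    A = P.T @ (P - gamma * Pn)      # features against one-step feature differences
    b = P.T @ R                     # feature-weighted one-step rewards
    return np.linalg.solve(A, b)

# Hypothetical usage on toy 1D transitions with quadratic features:
# dynamics x' = 0.9 x and reward x^2 give the exact value v(x) ~ 5.05 x^2.
phi = lambda x: np.stack([np.ones_like(x), x, x**2], axis=1)
X = np.random.uniform(-1, 1, 1000)
w = lstd(phi, X, 0.9 * X, X**2)
print("fitted weights:", w)
```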