Topic

Bellman equation

About: Bellman equation is a research topic. Over the lifetime, 5884 publications have been published within this topic receiving 135589 citations.
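The Bellman equation characterizes the optimal value function of a Markov decision process as a fixed point: V(s) = max_a [ r(s,a) + γ Σ_s' P(s'|s,a) V(s') ]. A minimal value-iteration sketch on a hypothetical two-state, two-action MDP (all transition probabilities and rewards below are illustrative, not from any of the papers listed):

```python
# Value iteration for the Bellman optimality equation on a
# hypothetical 2-state, 2-action MDP (all numbers illustrative).
# P[s][a] = list of (prob, next_state); R[s][a] = immediate reward.
P = {0: {0: [(1.0, 0)], 1: [(0.8, 1), (0.2, 0)]},
     1: {0: [(1.0, 0)], 1: [(1.0, 1)]}}
R = {0: {0: 0.0, 1: 1.0}, 1: {0: 0.0, 1: 2.0}}
gamma = 0.9

V = {0: 0.0, 1: 0.0}
for _ in range(500):  # iterate the Bellman operator to its fixed point
    V = {s: max(R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])
                for a in P[s])
         for s in P}

# Greedy policy extracted from the converged value function
policy = {s: max(P[s], key=lambda a: R[s][a] + gamma *
                 sum(p * V[s2] for p, s2 in P[s][a]))
         for s in P}
```

Because the Bellman operator is a γ-contraction, the iteration converges geometrically to the unique optimal value function regardless of the initial guess.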


Papers
Journal ArticleDOI
TL;DR: The variational properties of the value functions for a broad class of convex formulations, which are not all covered by standard Lagrange multiplier theory, are characterized and an inverse function theorem is given that links the value functions of different regularization formulations (not necessarily convex).
Abstract: Regularization plays a key role in a variety of optimization formulations of inverse problems. A recurring question in regularization approaches is the selection of regularization parameters, and its effect on the solution and on the optimal value of the optimization problem. The sensitivity of the value function to the regularization parameter can be linked directly to the Lagrange multipliers. In this paper, we fully characterize the variational properties of the value functions for a broad class of convex formulations, which are not all covered by standard Lagrange multiplier theory. We also present an inverse function theorem that links the value functions of different regularization formulations (not necessarily convex). These results have implications for the selection of regularization parameters, and the development of specialized algorithms. We give numerical examples that illustrate the theoretical results.

53 citations

Journal ArticleDOI
TL;DR: A price model is developed that presents the stochastic dynamics of a geometric Brownian motion and incorporates a log-linear effect of the investor's transactions and derives an explicit solution to the optimal execution problem if the time horizon is infinite.
Abstract: We consider the so-called optimal execution problem in algorithmic trading, which is the problem faced by an investor who has a large number of stock shares to sell over a given time horizon and whose actions have an impact on the stock price. In particular, we develop and study a price model that presents the stochastic dynamics of a geometric Brownian motion and incorporates a log-linear effect of the investor's transactions. We then formulate the optimal execution problem as a degenerate singular stochastic control problem. Using both analytic and probabilistic techniques, we establish simple conditions for the market to allow for no arbitrage or price manipulation and develop a detailed characterization of the value function and the optimal strategy. In particular, we derive an explicit solution to the problem if the time horizon is infinite.

53 citations

Journal ArticleDOI
TL;DR: Some recent research by the authors on approximate policy iteration algorithms that offer convergence guarantees for both parametric and nonparametric architectures for the value function are described.
Abstract: We review the literature on approximate dynamic programming, with the goal of better understanding the theory behind practical algorithms for solving dynamic programs with continuous and vector-valued states and actions and complex information processes. We build on the literature that has addressed the well-known problem of multidimensional (and possibly continuous) states, and the extensive literature on model-free dynamic programming, which also assumes that the expectation in Bellman’s equation cannot be computed. However, we point out complications that arise when the actions/controls are vector-valued and possibly continuous. We then describe some recent research by the authors on approximate policy iteration algorithms that offer convergence guarantees (with technical assumptions) for both parametric and nonparametric architectures for the value function.
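In the model-free setting mentioned above, the expectation in Bellman's equation is not computed from a known model but replaced by sampled transitions, as in Q-learning. A minimal tabular sketch on a hypothetical deterministic three-state chain (the dynamics, rewards, and exhaustive sweeps standing in for exploration are all illustrative):

```python
# Model-free Q-learning sketch: the expectation in Bellman's equation
# is replaced by sampled transitions from a hypothetical deterministic
# 3-state chain (all dynamics and rewards illustrative).
gamma, alpha = 0.9, 0.5

def step(s, a):
    """Environment oracle: returns (reward, next_state); no model exposed."""
    if a == 1:                       # "advance" toward the rewarding state 2
        return (1.0 if s == 2 else 0.0, min(s + 1, 2))
    return (0.0, 0)                  # "reset" to state 0

Q = {(s, a): 0.0 for s in range(3) for a in range(2)}
for _ in range(200):                 # repeated sweeps stand in for exploration
    for s in range(3):
        for a in range(2):
            r, s2 = step(s, a)       # sample a transition instead of a model
            Q[(s, a)] += alpha * (r + gamma * max(Q[(s2, 0)], Q[(s2, 1)])
                                  - Q[(s, a)])

policy = {s: max((0, 1), key=lambda a: Q[(s, a)]) for s in range(3)}
```

With deterministic transitions and a constant step size, the updates converge to the exact optimal Q-values; the vector-valued and continuous action spaces discussed in the review are precisely where this tabular scheme stops being applicable.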

52 citations

Proceedings ArticleDOI
14 May 2012
TL;DR: The proposed incremental Markov Decision Process (iMDP) provides an anytime approach to the computation of optimal control policies of the continuous problem and is demonstrated on motion planning and control problems in cluttered environments in the presence of process noise.
Abstract: In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning, we propose a novel algorithm called the incremental Markov Decision Process (iMDP) to compute incrementally control policies that approximate arbitrarily well an optimal policy in terms of the expected cost. The main idea behind the algorithm is to generate a sequence of finite discretizations of the original problem through random sampling of the state space. At each iteration, the discretized problem is a Markov Decision Process that serves as an incrementally refined model of the original problem. We show that with probability one, (i) the sequence of the optimal value functions for each of the discretized problems converges uniformly to the optimal value function of the original stochastic optimal control problem, and (ii) the original optimal value function can be computed efficiently in an incremental manner using asynchronous value iterations. Thus, the proposed algorithm provides an anytime approach to the computation of optimal control policies of the continuous problem. The effectiveness of the proposed approach is demonstrated on motion planning and control problems in cluttered environments in the presence of process noise.
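The core idea of sampling-based discretization can be sketched in a few lines: draw random states from the continuous state space, map transitions to the nearest sample, and run value iteration on the resulting finite MDP. The sketch below is illustrative only, not the paper's iMDP algorithm; the 1-D dynamics, quadratic cost, and nearest-neighbor transition rule are all hypothetical:

```python
import random
random.seed(1)

# Illustrative sampling-based discretization (not the iMDP algorithm
# itself): a 1-D deterministic problem with hypothetical dynamics
# s' = clamp(s + a*dt) and running cost s^2 * dt, approximated by
# value iteration on a randomly sampled finite state set.
gamma, dt = 0.95, 0.1
S = sorted(random.uniform(0.0, 1.0) for _ in range(50))  # sampled states

def nearest(x):
    """Index of the sampled state closest to x (transition snapping)."""
    return min(range(len(S)), key=lambda i: abs(S[i] - x))

actions = (-1.0, 1.0)
V = [0.0] * len(S)
for _ in range(300):                 # value iteration on the sampled MDP
    V = [min(S[i] ** 2 * dt
             + gamma * V[nearest(min(max(S[i] + a * dt, 0.0), 1.0))]
             for a in actions)
         for i in range(len(S))]
```

Refining the approximation then amounts to adding samples and re-running (or warm-starting) value iteration; the paper's contribution is doing this incrementally, with asynchronous updates and almost-sure convergence guarantees.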

52 citations

Journal ArticleDOI
TL;DR: In this article, the authors use the method of characteristics to extend the Jacobi conjugate points theory to the Bolza problem arising in nonlinear optimal control, which yields necessary and sufficient optimality conditions for weak and strong local minima stated in terms of the existence of a solution to a corresponding matrix Riccati differential equation.
Abstract: In this paper the authors use the method of characteristics to extend the Jacobi conjugate points theory to the Bolza problem arising in nonlinear optimal control. This yields necessary and sufficient optimality conditions for weak and strong local minima stated in terms of the existence of a solution to a corresponding matrix Riccati differential equation. The same approach also allows us to investigate the smoothness of the value function.

52 citations


Network Information
Related Topics (5)
Optimal control
68K papers, 1.2M citations
87% related
Bounded function
77.2K papers, 1.3M citations
85% related
Markov chain
51.9K papers, 1.3M citations
85% related
Linear system
59.5K papers, 1.4M citations
84% related
Optimization problem
96.4K papers, 2.1M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
Year  Papers
2023  261
2022  537
2021  369
2020  411
2019  348
2018  353