Topic

Bellman equation

About: The Bellman equation is a research topic. Over its lifetime, 5,884 publications have been published within this topic, receiving 135,589 citations.

Papers

Journal Article

Reflected solutions of backward SDE's, and related obstacle problems for PDE's

N. El Karoui, C. Kapoudjian, Etienne Pardoux, Shige Peng, M. C. Quenez

01 Apr 1997-Annals of Probability

TL;DR: In this article, reflected solutions of one-dimensional backward stochastic differential equations are studied and the authors prove uniqueness and existence both by a fixed point argument and by approximation via penalization.

Abstract: We study reflected solutions of one-dimensional backward stochastic differential equations. The “reflection” keeps the solution above a given stochastic process. We prove uniqueness and existence both by a fixed point argument and by approximation via penalization. We show that when the coefficient has a special form, then the solution of our problem is the value function of a mixed optimal stopping–optimal stochastic control problem. We finally show that, when put in a Markovian framework, the solution of our reflected BSDE provides a probabilistic formula for the unique viscosity solution of an obstacle problem for a parabolic partial differential equation.
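
For orientation, the reflected equation described in this abstract is usually written in the form below. The symbols are notation assumed here rather than taken from the listing: terminal value \xi, driver f, obstacle process S, Brownian motion B, and a continuous increasing process K with K_0 = 0 that supplies the minimal push needed to keep Y above S.

```latex
\begin{aligned}
Y_t &= \xi + \int_t^T f(s, Y_s, Z_s)\,ds + K_T - K_t - \int_t^T Z_s\,dB_s,
      \qquad 0 \le t \le T, \\
Y_t &\ge S_t, \qquad 0 \le t \le T, \\
\int_0^T (Y_t - S_t)\,dK_t &= 0 .
\end{aligned}
```

The last line is the minimality condition: K can increase only at times when Y touches the obstacle S.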

781 citations

Journal Article

Regression methods for pricing complex American-style options

John N. Tsitsiklis¹, B. Van Roy²

Massachusetts Institute of Technology¹, Stanford University²

01 Jul 2001-IEEE Transactions on Neural Networks

TL;DR: A simulation-based approximate dynamic programming method for pricing complex American-style options, with a possibly high-dimensional underlying state space, and a related method which uses a single (parameterized) value function, which is a function of the time-state pair.

Abstract: We introduce and analyze a simulation-based approximate dynamic programming method for pricing complex American-style options, with a possibly high-dimensional underlying state space. We work within a finitely parameterized family of approximate value functions, and introduce a variant of value iteration, adapted to this parametric setting. We also introduce a related method which uses a single (parameterized) value function, which is a function of the time-state pair, as opposed to using a separate (independently parameterized) value function for each time. Our methods involve the evaluation of value functions at a finite set, consisting of "representative" elements of the state space. We show that with an arbitrary choice of this set, the approximation error can grow exponentially with the time horizon (time to expiration). On the other hand, if representative states are chosen by simulating the state process using the underlying risk-neutral probability distribution, then the approximation error remains bounded.
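
The idea in this abstract can be illustrated with a minimal sketch in the spirit of regression-based value iteration; it is not the authors' exact algorithm. Everything concrete here is an assumption made for illustration: a single-asset put under geometric Brownian motion, a cubic polynomial basis, a separate coefficient vector for each exercise date, and the function name `tvr_price_put` with its default parameters. The representative states are the simulated risk-neutral paths, which is the choice the abstract says keeps the approximation error bounded.

```python
import numpy as np


def tvr_price_put(s0=100.0, strike=100.0, r=0.05, sigma=0.2,
                  T=1.0, steps=50, paths=20000, seed=0):
    """Approximate value iteration for a Bermudan put (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    dt = T / steps
    disc = np.exp(-r * dt)

    # Simulate risk-neutral GBM paths; s[:, k] is the price at time (k + 1) * dt.
    z = rng.standard_normal((paths, steps))
    log_incr = (r - 0.5 * sigma ** 2) * dt + sigma * np.sqrt(dt) * z
    s = s0 * np.exp(np.cumsum(log_incr, axis=1))

    def payoff(x):
        return np.maximum(strike - x, 0.0)

    # Value at expiry is the exercise payoff.
    v_next = payoff(s[:, -1])

    # Backward recursion: fit the continuation value by least squares on the
    # basis, then apply the Bellman operator max(exercise, continuation).
    for k in range(steps - 2, -1, -1):
        x = s[:, k] / strike
        basis = np.column_stack([np.ones_like(x), x, x ** 2, x ** 3])
        coef, *_ = np.linalg.lstsq(basis, disc * v_next, rcond=None)
        continuation = basis @ coef
        v_next = np.maximum(payoff(s[:, k]), continuation)

    # At time 0 the state is known, so the continuation value is a plain average.
    return float(max(payoff(s0), disc * v_next.mean()))


if __name__ == "__main__":
    print(tvr_price_put())
```

The update uses the fitted value at the next date rather than realized cash flows along each path, which is what makes this a value-iteration scheme; estimates of this type are generally expected to be biased, so the output should be read as a rough approximation.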

695 citations

Journal Article

Optimal control of production rate in a failure prone manufacturing system

Ram Akella¹, P. R. Kumar²

Massachusetts Institute of Technology¹, University of Illinois at Urbana–Champaign²

01 Feb 1986-IEEE Transactions on Automatic Control

TL;DR: A manufacturing system can be in one of two states, functional or failed, and moves back and forth between them as a continuous-time Markov chain, with mean time between failures 1/q1 and mean time to repair 1/q2.

Abstract: We address the problem of controlling the production rate of a failure prone manufacturing system so as to minimize the discounted inventory cost, where certain cost rates are specified for both positive and negative inventories, and there is a constant demand rate for the commodity produced. The underlying theoretical problem is the optimal control of a continuous-time system with jump Markov disturbances, with an infinite horizon discounted cost criterion. We use two complementary approaches. First, proceeding informally, and using a combination of stochastic coupling, linear system arguments, stable and unstable eigenspaces, renewal theory, parametric optimization, etc., we arrive at a conjecture for the optimal policy. Then we address the previously ignored mathematical difficulties associated with differential equations with discontinuous right-hand sides, singularity of the optimal control problem, smoothness, and validity of the dynamic programming equation, etc., to give a rigorous proof of optimality of the conjectured policy. It is hoped that both approaches will find uses in other such problems also. We obtain the complete solution and show that the optimal solution is simply characterized by a certain critical number, which we call the optimal inventory level. If the current inventory level exceeds the optimal, one should not produce at all; if less, one should produce at the maximum rate; while if exactly equal, one should produce exactly enough to meet demand. We also give a simple explicit formula for the optimal inventory level.
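
The threshold policy stated in this abstract can be written as a short function. This is a sketch under assumptions: the function name, the parameter names, the `functional` flag (a failed machine produces nothing), and the floating-point tolerance are all hypothetical, and the critical inventory level is passed in rather than computed, since the explicit formula is not reproduced in the listing. It also assumes the maximum rate is at least the demand rate.

```python
def production_rate(inventory, optimal_level, demand_rate, max_rate,
                    functional=True, tol=1e-9):
    """Return the production rate prescribed by the threshold policy."""
    if not functional:
        # Assumption: a failed machine cannot produce.
        return 0.0
    if inventory > optimal_level + tol:
        # Above the critical level: do not produce at all.
        return 0.0
    if inventory < optimal_level - tol:
        # Below the critical level: produce at the maximum rate.
        return max_rate
    # Exactly at the critical level: produce just enough to meet demand.
    return demand_rate
```

For example, with a critical level of 3, demand rate 1, and maximum rate 2, an inventory of 1 calls for producing at rate 2, while an inventory of 5 calls for no production.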

643 citations

Journal Article

Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation

Randal W. Beard¹, George N. Saridis², John T. Wen¹

Brigham Young University¹, Rensselaer Polytechnic Institute²

01 Dec 1997-Automatica

TL;DR: This paper states sufficient conditions that guarantee that the Galerkin approximation converges to the solution of the GHJB equation and that the resulting approximate control is stabilizing on the same region as the initial control.

580 citations

Journal Article

A quartet of semigroups for model specification, robustness, prices of risk, and model detection

Evan W. Anderson¹, Lars Peter Hansen², Thomas J. Sargent³

University of North Carolina at Chapel Hill¹, University of Chicago², New York University³

01 Mar 2003-Journal of the European Economic Association

TL;DR: In this article, the authors use a statistical theory of detection to quantify how much model misspecification the decision maker should fear, given his historical data record, and establish a tight link between the market price of uncertainty and a bound on the error in statistically discriminating between an approximating and a worst case model.

Abstract: A representative agent fears that his model, a continuous time Markov process with jump and diffusion components, is misspecified and therefore uses robust control theory to make decisions. Under the decision maker’s approximating model, cautious behavior puts adjustments for model misspecification into market prices for risk factors. We use a statistical theory of detection to quantify how much model misspecification the decision maker should fear, given his historical data record. A semigroup is a collection of objects connected by something like the law of iterated expectations. The law of iterated expectations defines the semigroup for a Markov process, while similar laws define other semigroups. Related semigroups describe (1) an approximating model; (2) a model misspecification adjustment to the continuation value in the decision maker’s Bellman equation; (3) asset prices; and (4) the behavior of the model detection statistics that we use to calibrate how much robustness the decision maker prefers. Semigroups 2, 3, and 4 establish a tight link between the market price of uncertainty and a bound on the error in statistically discriminating between an approximating and a worst case model. (JEL: C00, D51, D81, E1, G12)

534 citations

Performance

Metrics

Papers: 6,698

Citations: 155,793

No. of papers in the topic in previous years
Year	Papers
2023	261
2022	537
2021	369
2020	411
2019	348
2018	353
