Topic

Bellman equation

About: Bellman equation is a research topic. To date, 5,884 publications have been published within this topic, receiving 135,589 citations.


Papers
Book Chapter
Yves Achdou
01 Jan 2013
TL;DR: In this paper, several aspects of a finite difference method used to approximate the previously mentioned system of PDEs are discussed, including: existence and uniqueness properties, a priori bounds on the solutions of the discrete schemes, convergence, and algorithms for solving the resulting nonlinear systems of equations.
Abstract: Mean field type models describing the limiting behavior of stochastic differential game problems as the number of players tends to +∞ have been recently introduced by J-M. Lasry and P-L. Lions. They may lead to systems of evolutive partial differential equations coupling a forward Bellman equation and a backward Fokker–Planck equation. The forward-backward structure is an important feature of this system, which makes it necessary to design new strategies for mathematical analysis and numerical approximation. In this survey, several aspects of a finite difference method used to approximate the previously mentioned system of PDEs are discussed, including: existence and uniqueness properties, a priori bounds on the solutions of the discrete schemes, convergence, and algorithms for solving the resulting nonlinear systems of equations. Some numerical experiments are presented. Finally, the optimal planning problem is considered, i.e. the problem in which the positions of a very large number of identical rational agents, with a common value function, evolve from a given initial spatial density to a desired target density at the final horizon time.

76 citations

Book Chapter
01 Jan 1988
TL;DR: In this article, the authors investigate the Bellman equation that arises in the optimal control of Markov processes and obtain existence and uniqueness results for it in the framework of viscosity solutions.
Abstract: We investigate the Bellman equation that arises in the optimal control of Markov processes. This is a fully nonlinear integro-differential equation. The notion of viscosity solutions is introduced, and then existence and uniqueness results are obtained. Also, the connection between the optimal control problem and the Bellman equation is developed.

76 citations

Journal Article
TL;DR: In this paper, a queue-aware power and rate allocation with constraints of average fronthaul consumption for delay-sensitive traffic is formulated as an infinite horizon constrained partially observed Markov decision process, which takes both the urgent queue state information and the imperfect channel state information at transmitters (CSIT) into account.
Abstract: The cloud radio access network (C-RAN) provides high spectral and energy efficiency, low expenditures, and intelligent centralized system structures to operators, which has attracted intense interest in both academia and industry. In this paper, a hybrid coordinated multipoint transmission (H-CoMP) scheme is designed for the downlink transmission in C-RANs and fulfills the flexible tradeoff between cooperation gain and fronthaul consumption. The queue-aware power and rate allocation with constraints of average fronthaul consumption for the delay-sensitive traffic is formulated as an infinite horizon constrained partially observed Markov decision process, which takes both the urgent queue state information and the imperfect channel state information at transmitters (CSIT) into account. To deal with the curse of dimensionality involved with the equivalent Bellman equation, the linear approximation of postdecision value functions is utilized. A stochastic gradient algorithm is presented to allocate the queue-aware power and transmission rate with H-CoMP, which is robust against unpredicted traffic arrivals and uncertainties caused by the imperfect CSIT. Furthermore, to substantially reduce the computing complexity, an online learning algorithm is proposed to estimate the per-queue postdecision value functions and update the Lagrange multipliers. The simulation results demonstrate performance gains of the proposed stochastic gradient algorithms and confirm the asymptotic convergence of the proposed online learning algorithm.

76 citations

Proceedings Article
27 Jul 1997
TL;DR: Novel incremental versions of grid-based linear interpolation method and simple lower bound method with Sondik's updates are introduced and a new method for computing an initial upper bound - the fast informed bound method is introduced.
Abstract: Partially observable Markov decision processes (POMDPs) allow one to model complex dynamic decision or control problems that include both action outcome uncertainty and imperfect observability. The control problem is formulated as a dynamic optimization problem with a value function combining costs or rewards from multiple steps. In this paper we propose, analyse and test various incremental methods for computing bounds on the value function for control problems with infinite discounted horizon criteria. The methods described and tested include novel incremental versions of the grid-based linear interpolation method and the simple lower bound method with Sondik's updates. Both of these can work with arbitrary points of the belief space and can be enhanced by various heuristic point selection strategies. Also introduced is a new method for computing an initial upper bound - the fast informed bound method. This method is able to improve significantly on the standard and commonly used upper bound computed by the MDP-based method. The quality of the resulting bounds is tested on a maze navigation problem with 20 states, 6 actions and 8 observations.

76 citations
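
The abstract above contrasts the MDP-based upper bound with the fast informed bound for discounted infinite-horizon POMDPs. The sketch below is not the paper's code: it assumes the commonly cited update rules for the two bounds, and the tiny two-state model (`T`, `O`, `R`, `gamma`) and all function names are hypothetical, chosen only for illustration. The MDP-based bound takes the maximum over next actions once per next state, while the fast informed bound takes it once per observation, which can only make the bound tighter.

```python
def mdp_bound(T, R, gamma, iters=200):
    """MDP-based upper bound: value iteration on the fully observable MDP.
    T[s][a][s2] = P(s2 | s, a); R[s][a] = immediate reward."""
    n_s, n_a = len(R), len(R[0])
    Q = [[0.0] * n_a for _ in range(n_s)]
    for _ in range(iters):
        V = [max(row) for row in Q]  # best action known per next state
        Q = [[R[s][a] + gamma * sum(T[s][a][s2] * V[s2] for s2 in range(n_s))
              for a in range(n_a)] for s in range(n_s)]
    return Q


def fast_informed_bound(T, O, R, gamma, iters=200):
    """Fast informed bound (assumed update rule): the max over next actions
    is taken once per observation. O[a][s2][o] = P(o | s2, a)."""
    n_s, n_a, n_o = len(R), len(R[0]), len(O[0][0])
    Q = [[0.0] * n_a for _ in range(n_s)]
    for _ in range(iters):
        Q = [[R[s][a] + gamma * sum(
                  max(sum(T[s][a][s2] * O[a][s2][o] * Q[s2][a2]
                          for s2 in range(n_s))
                      for a2 in range(n_a))
                  for o in range(n_o))
              for a in range(n_a)] for s in range(n_s)]
    return Q


def bound_at_belief(Q, belief):
    """Upper bound on the value of a belief: max_a sum_s b(s) Q(s, a)."""
    return max(sum(b * Q[s][a] for s, b in enumerate(belief))
               for a in range(len(Q[0])))


# Hypothetical 2-state, 2-action, 2-observation model (illustration only).
T = [[[0.9, 0.1], [0.5, 0.5]],
     [[0.1, 0.9], [0.5, 0.5]]]      # T[s][a][s2]
O = [[[0.8, 0.2], [0.2, 0.8]],
     [[0.5, 0.5], [0.5, 0.5]]]      # O[a][s2][o]
R = [[1.0, 0.0],
     [0.0, 2.0]]                    # R[s][a]
gamma = 0.9

q_mdp = mdp_bound(T, R, gamma)
q_fib = fast_informed_bound(T, O, R, gamma)
belief = [0.5, 0.5]
print(bound_at_belief(q_mdp, belief), bound_at_belief(q_fib, belief))
```

Both routines start from the same zero initialization, so the fast informed bound stays at or below the MDP-based bound at every iteration; the two coincide when observations carry no information about the next state.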

Journal Article
TL;DR: This paper extends the classical consumption and portfolio rules model in continuous time to the framework of decision-makers with time-inconsistent preferences and derives a modified HJB (Hamilton-Jacobi-Bellman) equation to solve the problem for sophisticated agents.

76 citations


Network Information
Related Topics (5)
Optimal control
68K papers, 1.2M citations
87% related
Bounded function
77.2K papers, 1.3M citations
85% related
Markov chain
51.9K papers, 1.3M citations
85% related
Linear system
59.5K papers, 1.4M citations
84% related
Optimization problem
96.4K papers, 2.1M citations
83% related
Performance Metrics
No. of papers in the topic in previous years
Year    Papers
2023    261
2022    537
2021    369
2020    411
2019    348
2018    353