Topic

Bellman equation

About: The Bellman equation is a research topic. Over its lifetime, 5884 publications have been published within this topic, receiving 135589 citations.



Papers


Book Chapter•DOI•

Finite Difference Methods for Mean Field Games


Yves Achdou¹•Institutions (1)

University of Paris¹

01 Jan 2013

TL;DR: This survey discusses several aspects of a finite difference method for approximating the mean field game system of PDEs (a forward Bellman equation coupled with a backward Fokker–Planck equation), including existence and uniqueness properties, a priori bounds on the solutions of the discrete schemes, convergence, and algorithms for solving the resulting nonlinear systems of equations.


Abstract: Mean field type models describing the limiting behavior of stochastic differential game problems as the number of players tends to +∞ have been recently introduced by J.-M. Lasry and P.-L. Lions. They may lead to systems of evolutive partial differential equations coupling a forward Bellman equation and a backward Fokker–Planck equation. The forward-backward structure is an important feature of this system, which makes it necessary to design new strategies for mathematical analysis and numerical approximation. In this survey, several aspects of a finite difference method used to approximate the previously mentioned system of PDEs are discussed, including: existence and uniqueness properties, a priori bounds on the solutions of the discrete schemes, convergence, and algorithms for solving the resulting nonlinear systems of equations. Some numerical experiments are presented. Finally, the optimal planning problem is considered, i.e. the problem in which the positions of a very large number of identical rational agents, with a common value function, evolve from a given initial spatial density to a desired target density at the final horizon time.


76 citations
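
For orientation, the forward-backward system described in the abstract above is often written in the following form. This is a sketch under assumptions rather than a quotation from the chapter: the symbols ν (diffusion coefficient), H (Hamiltonian), and Φ (coupling term) do not appear on this page, sign conventions vary between references, and the initial and terminal conditions are omitted because they differ between the game and the optimal planning problem.

```latex
% Assumed notation: u = value function, m = density of agents,
% \nu > 0 diffusion coefficient, H(x,p) Hamiltonian, \Phi[m] coupling term.
\begin{aligned}
\frac{\partial u}{\partial t} - \nu \Delta u + H(x, \nabla u) &= \Phi[m]
  && \text{(forward Bellman equation)} \\
\frac{\partial m}{\partial t} + \nu \Delta m
  + \operatorname{div}\!\left( m \, \frac{\partial H}{\partial p}(x, \nabla u) \right) &= 0
  && \text{(backward Fokker--Planck equation)}
\end{aligned}
```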

Book Chapter•DOI•

Optimal Control of Jump-Markov Processes and Viscosity Solutions


Halil Mete Soner¹•Institutions (1)

Carnegie Mellon University¹

01 Jan 1988

TL;DR: In this article, the authors investigate the Bellman equation that arises in the optimal control of Markov processes, introduce the notion of viscosity solutions for it, and obtain existence and uniqueness results.


Abstract: We investigate the Bellman equation that arises in the optimal control of Markov processes. This is a fully nonlinear integro-differential equation. The notion of viscosity solutions is introduced and then existence and uniqueness results are obtained. Also, the connection between the optimal control problem and the Bellman equation is developed.


76 citations

Journal Article•DOI•

Resource Allocation Optimization for Delay-Sensitive Traffic in Fronthaul Constrained Cloud Radio Access Networks


Jian Li¹, Mugen Peng¹, Aolin Cheng¹, Yuling Yu¹, Chonggang Wang²•Institutions (2)

Beijing University of Posts and Telecommunications¹, InterDigital, Inc.²

01 Dec 2017-IEEE Systems Journal

TL;DR: In this paper, a queue-aware power and rate allocation with constraints of average fronthaul consumption for delay-sensitive traffic is formulated as an infinite horizon constrained partially observed Markov decision process, which takes both the urgent queue state information and the imperfect channel state information at transmitters (CSIT) into account.


Abstract: The cloud radio access network (C-RAN) offers operators high spectral and energy efficiency, low expenditures, and intelligent centralized system structures, which has attracted intense interest in both academia and industry. In this paper, a hybrid coordinated multipoint transmission (H-CoMP) scheme is designed for the downlink transmission in C-RANs that provides a flexible tradeoff between cooperation gain and fronthaul consumption. The queue-aware power and rate allocation with constraints of average fronthaul consumption for the delay-sensitive traffic is formulated as an infinite horizon constrained partially observed Markov decision process, which takes both the urgent queue state information and the imperfect channel state information at transmitters (CSIT) into account. To deal with the curse of dimensionality involved with the equivalent Bellman equation, the linear approximation of postdecision value functions is utilized. A stochastic gradient algorithm is presented to allocate the queue-aware power and transmission rate with H-CoMP, which is robust against unpredicted traffic arrivals and uncertainties caused by the imperfect CSIT. Furthermore, to substantially reduce the computing complexity, an online learning algorithm is proposed to estimate the per-queue postdecision value functions and update the Lagrange multipliers. The simulation results demonstrate performance gains of the proposed stochastic gradient algorithms and confirm the asymptotic convergence of the proposed online learning algorithm.


76 citations

Proceedings Article•

Incremental methods for computing bounds in partially observable Markov decision processes


Milos Hauskrecht¹•Institutions (1)

Massachusetts Institute of Technology¹

27 Jul 1997

TL;DR: Novel incremental versions of the grid-based linear interpolation method and the simple lower bound method with Sondik's updates are introduced, along with a new method for computing an initial upper bound: the fast informed bound method.


Abstract: Partially observable Markov decision processes (POMDPs) allow one to model complex dynamic decision or control problems that include both action outcome uncertainty and imperfect observability. The control problem is formulated as a dynamic optimization problem with a value function combining costs or rewards from multiple steps. In this paper we propose, analyse and test various incremental methods for computing bounds on the value function for control problems with infinite discounted horizon criteria. The methods described and tested include novel incremental versions of the grid-based linear interpolation method and the simple lower bound method with Sondik's updates. Both of these can work with arbitrary points of the belief space and can be enhanced by various heuristic point selection strategies. Also introduced is a new method for computing an initial upper bound: the fast informed bound method. This method is able to improve significantly on the standard and commonly used upper bound computed by the MDP-based method. The quality of the resulting bounds is tested on a maze navigation problem with 20 states, 6 actions and 8 observations.


76 citations
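
The two upper bounds named in the abstract above, the MDP-based bound and the fast informed bound, can be sketched on a toy problem. This is a minimal sketch under stated assumptions, not the paper's code: the update rules are the commonly cited forms of the two bounds, the POMDP is a small random model with a reward (rather than cost) criterion, and every name here (`mdp_bound`, `fast_informed_bound`, `upper_bound`, the array layouts) is hypothetical.

```python
import numpy as np

# Hypothetical toy POMDP: sizes and random numbers are illustrative only,
# not the 20-state maze problem mentioned in the abstract.
S, A, Z = 4, 2, 3                      # states, actions, observations
gamma = 0.9                            # discount factor
rng = np.random.default_rng(0)

T = rng.random((A, S, S))              # T[a, s, t] = P(t | s, a)
T /= T.sum(axis=2, keepdims=True)
O = rng.random((A, S, Z))              # O[a, t, o] = P(o | t, a)
O /= O.sum(axis=2, keepdims=True)
R = rng.random((S, A))                 # R[s, a] = expected immediate reward


def mdp_bound(T, R, gamma, iters=500):
    """MDP-based bound: value iteration on the fully observable MDP.

    Q[s, a] = R[s, a] + gamma * sum_t T[a, s, t] * max_b Q[t, b]
    """
    Q = np.zeros_like(R)
    for _ in range(iters):
        V = Q.max(axis=1)
        Q = R + gamma * np.einsum("ast,t->sa", T, V)
    return Q


def fast_informed_bound(T, O, R, gamma, iters=500):
    """Fast informed bound in its commonly cited form (an assumption here).

    alpha[s, a] = R[s, a]
        + gamma * sum_o max_b sum_t T[a, s, t] * O[a, t, o] * alpha[t, b]
    """
    alpha = np.zeros_like(R)
    for _ in range(iters):
        # inner[a, s, o, b] = sum_t T[a, s, t] * O[a, t, o] * alpha[t, b]
        inner = np.einsum("ast,ato,tb->asob", T, O, alpha)
        alpha = R + gamma * inner.max(axis=3).sum(axis=2).T
    return alpha


def upper_bound(vectors, belief):
    """Bound at a belief: best action's expected value under that belief."""
    return float((belief @ vectors).max())


Q_mdp = mdp_bound(T, R, gamma)
alpha_fib = fast_informed_bound(T, O, R, gamma)
b = np.full(S, 1.0 / S)                # uniform belief over states

print("MDP-based bound at b:    ", upper_bound(Q_mdp, b))
print("Fast informed bound at b:", upper_bound(alpha_fib, b))
```

Because the maximization over next actions sits inside the sum over observations rather than inside the sum over next states, the fast informed bound is never larger than the MDP-based bound in this sketch, which is the sense in which it is tighter.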

Journal Article•DOI•

Consumption and portfolio rules for time-inconsistent investors


Jesús Marín-Solano¹, Jorge Navas¹•Institutions (1)

University of Barcelona¹

16 Mar 2010-European Journal of Operational Research

TL;DR: This paper extends the classical consumption and portfolio rules model in continuous time to the framework of decision-makers with time-inconsistent preferences and derives a modified HJB (Hamilton-Jacobi-Bellman) equation to solve the problem for sophisticated agents.


76 citations


Performance Metrics

Papers: 6,698
Citations: 155,793

No. of papers in the topic in previous years
Year	Papers
2023	261
2022	537
2021	369
2020	411
2019	348
2018	353
