scispace - formally typeset

Bellman equation

About: Bellman equation is a research topic. Over the lifetime, 5884 publications have been published within this topic receiving 135589 citations.
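As context for the papers below: the Bellman optimality equation V(s) = max_a [R(s,a) + γ Σ_s' P(s'|s,a) V(s')] is the fixed-point condition that value iteration solves by repeated backups. A minimal sketch (the 2-state, 2-action MDP here is a made-up example, not drawn from any paper on this page):

```python
import numpy as np

# Hypothetical 2-state, 2-action MDP.
# P[a][s][s'] = transition probability, R[a][s] = immediate reward.
P = np.array([[[0.9, 0.1], [0.2, 0.8]],   # action 0
              [[0.5, 0.5], [0.1, 0.9]]])  # action 1
R = np.array([[1.0, 0.0],                 # action 0
              [0.5, 2.0]])                # action 1
gamma = 0.9

# Value iteration: repeatedly apply the Bellman optimality backup
#   V(s) <- max_a [ R(s,a) + gamma * sum_s' P(s'|s,a) V(s') ]
V = np.zeros(2)
for _ in range(1000):
    Q = R + gamma * P @ V          # Q[a][s], one backup for all (s, a)
    V_new = Q.max(axis=0)          # maximize over actions
    if np.max(np.abs(V_new - V)) < 1e-10:
        break                      # contraction => geometric convergence
    V = V_new

policy = Q.argmax(axis=0)          # greedy policy w.r.t. the converged values
```

Because the backup operator is a γ-contraction in the sup norm, the iterates converge to the unique fixed point regardless of the initial V.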


Papers
Journal ArticleDOI
TL;DR: In this paper, the authors provide a rationale for central place theory via a dynamic programming formulation of the social planner's problem of city hierarchy, and show that in any optimal solution there must be one and only one immediately smaller city between two neighboring larger cities.

39 citations

Journal ArticleDOI
TL;DR: In this article, an asymptotic analysis of hierarchical production planning in a manufacturing system with serial machines that are subject to breakdown and repair, and with convex costs is presented.
Abstract: This paper presents an asymptotic analysis of hierarchical production planning in a manufacturing system with serial machines that are subject to breakdown and repair, and with convex costs. The machines' capacities are modeled as Markov chains. Since the number of parts in the internal buffers between any two machines must be non-negative, the problem is inherently state constrained. As the rate of change in the machine states approaches infinity, the analysis yields a limiting problem in which the stochastic machine capacity is replaced by its equilibrium mean. A method of “lifting” and “modification” is introduced to construct near-optimal controls for the original problem from near-optimal controls of the limiting problem. The value function of the original problem is shown to converge to the value function of the limiting problem, and the convergence rate is obtained from a priori estimates of the asymptotic behavior of the Markov chains. As a result, an ...

39 citations

Proceedings Article
22 Jul 2012
TL;DR: This work shows how the continuous action maximization step in the dynamic programming backup can be evaluated optimally and symbolically and further integrates this technique to work with an efficient and compact data structure for SDP -- the extended algebraic decision diagram (XADD).
Abstract: Many real-world decision-theoretic planning problems are naturally modeled using both continuous state and action (CSA) spaces, yet little work has provided exact solutions for the case of continuous actions. In this work, we propose a symbolic dynamic programming (SDP) solution to obtain the optimal closed-form value function and policy for CSA-MDPs with multivariate continuous state and actions, discrete noise, piecewise linear dynamics, and piecewise linear (or restricted piecewise quadratic) reward. Our key contribution over previous SDP work is to show how the continuous action maximization step in the dynamic programming backup can be evaluated optimally and symbolically -- a task which amounts to symbolic constrained optimization subject to unknown state parameters; we further integrate this technique to work with an efficient and compact data structure for SDP -- the extended algebraic decision diagram (XADD). We demonstrate empirical results on a didactic nonlinear planning example and two domains from operations research to show the first automated exact solution to these problems.

39 citations
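The continuous-action maximization step this paper performs symbolically reduces, in the scalar piecewise-quadratic case, to a case analysis over the quadratic's vertex and the interval endpoints. A minimal numeric sketch of that case analysis (coefficients and bounds are hypothetical; the paper carries it out symbolically, with coefficients depending on unknown state parameters):

```python
def argmax_quadratic(a2, a1, a0, lo, hi):
    """Maximize q(u) = a2*u^2 + a1*u + a0 over u in [lo, hi].

    For a concave piece (a2 < 0) the unconstrained maximizer is the
    vertex -a1 / (2*a2), clamped to the interval; otherwise the
    maximum is attained at an endpoint. This mirrors, in a scalar
    numeric setting, the case analysis a symbolic max performs.
    """
    candidates = [lo, hi]
    if a2 < 0:
        candidates.append(min(max(-a1 / (2 * a2), lo), hi))
    q = lambda u: a2 * u * u + a1 * u + a0
    return max(candidates, key=q)

# e.g. maximize -(u - 1)^2 + 3 = -u^2 + 2u + 2 on [0, 2]  ->  u* = 1
u_star = argmax_quadratic(-1.0, 2.0, 2.0, 0.0, 2.0)
```

The XADD machinery in the paper essentially organizes this endpoint-vs-vertex case analysis into a compact decision-diagram representation over the state space.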

Journal ArticleDOI
TL;DR: A numerical relaxation framework is developed to efficiently compute a control strategy with a guaranteed performance upper bound and it is proved that by choosing the relaxation parameter sufficiently small, the performance of the resulting control strategy can be made arbitrarily close to the optimal one.

39 citations

Journal ArticleDOI
TL;DR: A new self-learning parallel control method, based on the adaptive dynamic programming (ADP) technique, is developed for solving the optimal control problem of discrete-time time-varying nonlinear systems; it aims to obtain an approximate optimal control law sequence.
Abstract: In this article, a new self-learning parallel control method, based on the adaptive dynamic programming (ADP) technique, is developed for solving the optimal control problem of discrete-time time-varying nonlinear systems. It aims to obtain an approximate optimal control law sequence while guaranteeing the convergence of the value function. After establishing the time-varying artificial system via neural networks over a given time horizon, a control-sequence-improvement ADP algorithm is developed to obtain the control law sequence. For the first time, criteria for the parallel execution are presented such that the value function is proven to converge to a finite neighborhood of the optimal performance index function. Finally, numerical results and analysis are presented to demonstrate the effectiveness of the parallel control method.

39 citations
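The "optimal control law sequence" for a time-varying system has a closed form in the linear-quadratic special case, via the backward Riccati recursion. The sketch below (a hypothetical scalar system, not the paper's neural-network scheme) computes the exact time-varying gain sequence that an ADP method would approximate when the dynamics are nonlinear or unknown:

```python
import numpy as np

# Hypothetical finite-horizon scalar LQ problem:
#   x_{t+1} = a_t x_t + b_t u_t,
#   cost = sum_t (q x_t^2 + r u_t^2) + qN x_N^2.
N = 5
a = np.linspace(1.0, 1.2, N)   # time-varying dynamics coefficients
b = np.full(N, 0.5)
q, r, qN = 1.0, 0.1, 1.0

# Backward Riccati recursion: the optimal control law sequence is
# u_t = -K[t] * x_t, with P_t the quadratic value-function coefficient.
P_t = qN
K = np.zeros(N)
for t in reversed(range(N)):
    K[t] = (b[t] * P_t * a[t]) / (r + b[t] ** 2 * P_t)
    P_t = q + a[t] ** 2 * P_t - a[t] * P_t * b[t] * K[t]
```

Here the value function V_t(x) = P_t x^2 is known exactly at every stage; the paper's contribution is establishing convergence of an iteratively learned value function to a neighborhood of this kind of optimum in the nonlinear, neural-network-approximated setting.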


Network Information
Related Topics (5)
- Optimal control: 68K papers, 1.2M citations, 87% related
- Bounded function: 77.2K papers, 1.3M citations, 85% related
- Markov chain: 51.9K papers, 1.3M citations, 85% related
- Linear system: 59.5K papers, 1.4M citations, 84% related
- Optimization problem: 96.4K papers, 2.1M citations, 83% related
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2023    268
2022    556
2021    375
2020    418
2019    353
2018    356