Topic
Bellman equation
About: The Bellman equation is a research topic. Over its lifetime, 5884 publications have been published on this topic, receiving 135589 citations.
Papers published on a yearly basis
Papers
01 Jan 2009
TL;DR: The need for partial knowledge of the nonlinear system dynamics is relaxed by developing a novel approach to ADP as a two-part process: online system identification and offline optimal control training.
Abstract: The optimal control of linear systems with quadratic cost functions can be achieved by solving the well-known Riccati equation. However, the optimal control of nonlinear discrete-time systems is a much more challenging task that often requires solving the nonlinear Hamilton–Jacobi–Bellman (HJB) equation. In the recent literature, discrete-time approximate dynamic programming (ADP) techniques have been widely used to determine optimal or near-optimal control policies for affine nonlinear discrete-time systems. However, an inherent assumption of ADP is that the value of the controlled system one step ahead, and at least partial knowledge of the system dynamics, are known. In this work, the need for partial knowledge of the nonlinear system dynamics is relaxed by developing a novel approach to ADP as a two-part process: online system identification and offline optimal control training. First, in the system identification process, a neural network (NN) is tuned online using novel tuning laws to learn the complete plant dynamics, so that local asymptotic stability of the identification error can be shown. Then, using only the learned NN system model, offline ADP is performed, resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics, as only the learned NN model is needed. The proof of convergence is demonstrated, and simulation results verify the theoretical conjecture.
131 citations
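The abstract above opens with the standard fact that linear-quadratic optimal control reduces to the Riccati equation. A minimal sketch of that baseline, solving the discrete-time algebraic Riccati equation by fixed-point (value) iteration with hypothetical system matrices (this is the classical LQR recursion, not the paper's NN-based scheme):

```python
import numpy as np

def dare_iterate(A, B, Q, R, iters=500):
    """Fixed-point iteration for the discrete-time algebraic Riccati equation:
    P <- Q + A' P A - A' P B (R + B' P B)^{-1} B' P A."""
    P = Q.copy()
    for _ in range(iters):
        BtP = B.T @ P
        K = np.linalg.solve(R + BtP @ B, BtP @ A)  # optimal feedback gain
        P = Q + A.T @ P @ (A - B @ K)              # Riccati update
    return P, K

# Hypothetical discretized double integrator
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
Q = np.eye(2)
R = np.array([[1.0]])
P, K = dare_iterate(A, B, Q, R)
# u = -K x is the optimal control; A - B K is Schur stable
```

This recursion is exactly value iteration on the quadratic value function V(x) = x'Px, which is why the nonlinear case in the abstract, where no such closed form exists, requires function approximation instead.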
TL;DR: In this article, several characterizations of optimal trajectories for the classical Mayer problem in optimal control are provided, and the problem of optimal design is addressed, obtaining sufficient conditions for optimality.
Abstract: Several characterizations of optimal trajectories for the classical Mayer problem in optimal control are provided. For this purpose the regularity of directional derivatives of the value function is studied: for instance, it is shown that for smooth control systems the value function V is continuously differentiable along an optimal trajectory $x:[t_0, 1] \to {\bf R}^n$ provided V is differentiable at the initial point $(t_0, x(t_0))$. Then the upper semicontinuity of the optimal feedback map is deduced. The problem of optimal design is addressed, obtaining sufficient conditions for optimality. Finally, it is shown that the optimal control problem may be reduced to a viability one.
130 citations
TL;DR: The key step in the proof of these new estimates is the introduction of a switching system which allows the construction of approximate, (almost) smooth supersolutions for the Hamilton–Jacobi–Bellman equation.
Abstract: We obtain error bounds for monotone approximation schemes of Hamilton–Jacobi–Bellman equations. These bounds improve previous results of Krylov and the authors. The key step in the proof of these new estimates is the introduction of a switching system which allows the construction of approximate, (almost) smooth supersolutions for the Hamilton–Jacobi–Bellman equation.
129 citations
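To make "monotone approximation scheme" concrete, here is a toy sketch (not the paper's scheme) of the classic Godunov upwind discretization of a static Hamilton–Jacobi equation, the 1D eikonal problem |u'(x)| = 1 on (0,1) with u(0) = u(1) = 0. Monotonicity of the update in its stencil arguments is what guarantees convergence to the viscosity solution:

```python
import numpy as np

def solve_eikonal(n=101, iters=500):
    """Monotone (Godunov upwind) scheme for |u'| = 1 on (0,1), u(0)=u(1)=0.
    Fixed-point update u_i = min(u_{i-1}, u_{i+1}) + h is nondecreasing in
    each neighbor value, i.e. the scheme is monotone."""
    h = 1.0 / (n - 1)
    u = np.full(n, np.inf)
    u[0] = u[-1] = 0.0
    for _ in range(iters):
        u[1:-1] = np.minimum(u[:-2], u[2:]) + h  # Jacobi sweep
    return u

u = solve_eikonal()
x = np.linspace(0.0, 1.0, 101)
# converges to the viscosity solution u(x) = min(x, 1 - x)
```

The error bounds in the abstract concern exactly this class of schemes, in the much harder fully nonlinear second-order HJB setting.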
TL;DR: In this article, the authors present a general framework for deriving continuous dependence estimates for, possibly polynomially growing, viscosity solutions of fully nonlinear degenerate parabolic integro-PDEs.
128 citations
TL;DR: Monotonicity of the local value iteration ADP algorithm is presented, which shows that under some special conditions of the initial value function and the learning rate function, the iterative value function can monotonically converge to the optimum.
Abstract: In this paper, convergence properties are established for the newly developed discrete-time local value iteration adaptive dynamic programming (ADP) algorithm. The present local iterative ADP algorithm permits an arbitrary positive semidefinite function to initialize the algorithm. Employing a state-dependent learning rate function, for the first time, the iterative value function and iterative control law can be updated in a subset of the state space instead of the whole state space, which effectively reduces the computational burden. A new analysis method for the convergence property is developed to prove that the iterative value functions converge to the optimum under some mild constraints. Monotonicity of the local value iteration ADP algorithm is presented, which shows that under some special conditions on the initial value function and the learning rate function, the iterative value function converges monotonically to the optimum. Finally, three simulation examples and comparisons are given to illustrate the performance of the developed algorithm.
128 citations
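The monotonicity result above has a simple analogue in the tabular setting. A minimal sketch of classical value iteration on a hypothetical two-state, two-action MDP (the paper's local ADP algorithm with state-dependent learning rates is far more general than this): with nonnegative rewards and a zero initialization, each Bellman update can only raise the value estimate, so the iterates increase monotonically to the optimum.

```python
import numpy as np

def value_iteration(P, r, gamma=0.9, iters=200):
    """Classical value iteration: V <- max_a [ r(a,s) + gamma * sum_s' P(a,s,s') V(s') ].
    P has shape (A, S, S) (row-stochastic in s'), r has shape (A, S).
    With r >= 0 and V = 0 initially, the iterates are monotonically
    nondecreasing because the Bellman operator is monotone."""
    V = np.zeros(P.shape[1])
    for _ in range(iters):
        V = np.max(r + gamma * (P @ V), axis=0)  # Bellman optimality update
    return V

# Hypothetical MDP: 2 states, 2 actions
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.5, 0.5], [0.0, 1.0]]])
r = np.array([[1.0, 0.0],
              [0.0, 2.0]])
V = value_iteration(P, r)
# V now satisfies the Bellman optimality equation to within gamma^iters
```

Since the update is a gamma-contraction in the sup norm, 200 iterations leave a residual on the order of 0.9^200, i.e. numerically zero.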