Author

Kyriakos G. Vamvoudakis

Bio: Kyriakos G. Vamvoudakis is an academic researcher from the Georgia Institute of Technology. The author has contributed to research in the topics of reinforcement learning and optimal control. The author has an h-index of 27 and has co-authored 153 publications receiving 5,735 citations. Previous affiliations of Kyriakos G. Vamvoudakis include the University of California and the University of Texas System.


Papers
Journal ArticleDOI
TL;DR: An online policy-iteration algorithm for learning the continuous-time optimal control solution with infinite-horizon cost for nonlinear systems with known dynamics; it finds, in real time, suitable approximations of both the optimal cost and the optimal control policy while guaranteeing closed-loop stability.

1,012 citations
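In the linear-quadratic special case, continuous-time policy iteration of the kind described in the entry above reduces to Kleinman's algorithm: repeatedly evaluate a stabilizing gain via a Lyapunov equation, then improve it. A minimal sketch, assuming illustrative double-integrator dynamics and weights (A, B, Q, R, and the initial gain K are my own choices, not from the paper):

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov, solve_continuous_are

# Double-integrator dynamics (illustrative choice, not from the paper).
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)          # state cost
R = np.array([[1.0]])  # control cost

# Initial stabilizing gain (policy iteration requires one to start from).
K = np.array([[1.0, 2.0]])

for _ in range(10):
    # Policy evaluation: solve (A - BK)^T P + P (A - BK) + Q + K^T R K = 0.
    Acl = A - B @ K
    P = solve_continuous_lyapunov(Acl.T, -(Q + K.T @ R @ K))
    # Policy improvement: K <- R^{-1} B^T P.
    K = np.linalg.solve(R, B.T @ P)

# The iterates converge to the algebraic Riccati equation solution.
P_are = solve_continuous_are(A, B, Q, R)
print(np.allclose(P, P_are, atol=1e-6))
```

The online algorithm in the paper approximates this evaluation/improvement loop in real time for nonlinear systems; the sketch only shows the underlying fixed-point structure.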

Journal ArticleDOI
TL;DR: In this article, the authors describe the use of reinforcement learning to design feedback controllers for discrete and continuous-time dynamical systems that combine features of adaptive control and optimal control, which are not usually designed to be optimal in the sense of minimizing user-prescribed performance functions.
Abstract: This article describes the use of principles of reinforcement learning to design feedback controllers for discrete- and continuous-time dynamical systems that combine features of adaptive control and optimal control. Adaptive control [1], [2] and optimal control [3] represent different philosophies for designing feedback controllers. Optimal controllers are normally designed offline by solving Hamilton-Jacobi-Bellman (HJB) equations, for example, the Riccati equation, using complete knowledge of the system dynamics. Determining optimal control policies for nonlinear systems requires the offline solution of nonlinear HJB equations, which are often difficult or impossible to solve. By contrast, adaptive controllers learn online to control unknown systems using data measured in real time along the system trajectories. Adaptive controllers are not usually designed to be optimal in the sense of minimizing user-prescribed performance functions. Indirect adaptive controllers use system identification techniques to first identify the system parameters and then use the obtained model to solve optimal design equations [1]. Adaptive controllers may satisfy certain inverse optimality conditions [4].

841 citations
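As a concrete instance of the offline optimal design contrasted with adaptive control in the abstract above, the sketch below solves the continuous-time algebraic Riccati equation for a linear system with full model knowledge (the matrices are illustrative assumptions, not from the article):

```python
import numpy as np
from scipy.linalg import solve_continuous_are

# Hypothetical linear system; complete knowledge of (A, B) is assumed,
# which is exactly what the offline optimal design requires.
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
B = np.array([[0.0], [1.0]])
Q = np.diag([10.0, 1.0])  # user-prescribed state weight
R = np.array([[0.1]])     # user-prescribed control weight

# Offline design: solve A^T P + P A - P B R^{-1} B^T P + Q = 0.
P = solve_continuous_are(A, B, Q, R)
K = np.linalg.solve(R, B.T @ P)  # optimal feedback u = -K x

# The optimal closed loop A - BK is Hurwitz (all eigenvalues in the LHP).
print(np.all(np.linalg.eigvals(A - B @ K).real < 0))
```

For nonlinear systems the Riccati equation becomes the nonlinear HJB equation, which generally has no closed-form solution; that gap is what motivates the online RL methods in these papers.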

Proceedings ArticleDOI
14 Jun 2009
TL;DR: This paper presents an online adaptive algorithm implemented as an actor/critic structure which involves simultaneous continuous-time adaptation of both actor and critic neural networks, and calls this ‘synchronous’ policy iteration.
Abstract: In this paper we discuss an online algorithm based on policy iteration for learning the continuous-time (CT) optimal control solution with infinite horizon cost for nonlinear systems with known dynamics. We present an online adaptive algorithm implemented as an actor/critic structure which involves simultaneous continuous-time adaptation of both actor and critic neural networks. We call this ‘synchronous’ policy iteration. A persistence of excitation condition is shown to guarantee convergence of the critic to the actual optimal value function. Novel tuning algorithms are given for both critic and actor networks, with extra terms in the actor tuning law being required to guarantee closed-loop dynamical stability. The convergence to the optimal controller is proven, and stability of the system is also guaranteed. Simulation examples show the effectiveness of the new algorithm.

648 citations

Journal ArticleDOI
TL;DR: Q-learning and the integral RL algorithm are discussed as core algorithms for discrete-time (DT) and continuous-time (CT) systems, respectively, and a new direction of off-policy RL for both CT and DT systems is discussed.
Abstract: This paper reviews the current state of the art in reinforcement learning (RL)-based feedback control solutions to optimal regulation and tracking of single and multiagent systems. Existing RL solutions to both optimal $\mathcal{H}_2$ and $\mathcal{H}_\infty$ control problems, as well as graphical games, will be reviewed. RL methods learn the solution to optimal control and game problems online, using measured data along the system trajectories. We discuss Q-learning and the integral RL algorithm as core algorithms for discrete-time (DT) and continuous-time (CT) systems, respectively. Moreover, we discuss a new direction of off-policy RL for both CT and DT systems. Finally, we review several applications.

536 citations
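A minimal sketch of the discrete-time Q-learning idea surveyed above, run on a toy 4-state chain MDP of my own construction (the environment and hyperparameters are assumptions, not from the paper):

```python
import numpy as np

# Toy chain MDP: states 0..3, actions 0 = left, 1 = right.
# Reaching state 3 yields reward 1 and ends the episode.
n_states, n_actions = 4, 2
gamma, alpha, eps = 0.9, 0.5, 0.1
rng = np.random.default_rng(0)

def step(s, a):
    s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
    r = 1.0 if s2 == n_states - 1 else 0.0
    return s2, r, s2 == n_states - 1

Q = np.zeros((n_states, n_actions))
for _ in range(2000):
    s, done = 0, False
    while not done:
        # Epsilon-greedy exploration over the current Q estimate.
        a = rng.integers(n_actions) if rng.random() < eps else int(np.argmax(Q[s]))
        s2, r, done = step(s, a)
        target = r if done else r + gamma * Q[s2].max()  # Bellman backup
        Q[s, a] += alpha * (target - Q[s, a])            # TD update
        s = s2

# The learned greedy policy moves right in every non-terminal state.
print([int(np.argmax(Q[s])) for s in range(n_states - 1)])
```

Model-free updates like this work directly on measured transitions, which is the DT analogue of the integral RL idea the survey develops for continuous-time systems.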

Journal ArticleDOI
TL;DR: An online adaptive reinforcement learning-based solution is developed for the infinite-horizon optimal control problem for continuous-time uncertain nonlinear systems using a novel actor-critic-identifier (ACI) architecture to approximate the Hamilton-Jacobi-Bellman equation.

447 citations


Cited by
Journal ArticleDOI


08 Dec 2001-BMJ
TL;DR: There is, I think, something ethereal about i, the square root of minus one; it seemed an odd beast, an intruder hovering on the edge of reality.
Abstract: There is, I think, something ethereal about i —the square root of minus one. I remember first hearing about it at school. It seemed an odd beast at that time—an intruder hovering on the edge of reality. Usually familiarity dulls this sense of the bizarre, but in the case of i it was the reverse: over the years the sense of its surreal nature intensified. It seemed that it was impossible to write mathematics that described the real world in …

33,785 citations

Book ChapterDOI
31 Jan 1963

2,885 citations

Journal Article
TL;DR: In this book, two major figures in adaptive control provide a wealth of material for researchers, practitioners, and students; it can enhance their work through information on many new theoretical developments and can be used by mathematical control theory specialists to adapt their research to practical needs.
Abstract: This book, written by two major figures in adaptive control, provides a wealth of material for researchers, practitioners, and students. While some researchers in adaptive control may note the absence of a particular topic, the book's scope represents a high-gain instrument. It can be used by designers of control systems to enhance their work through the information on many new theoretical developments, and can be used by mathematical control theory specialists to adapt their research to practical needs. The book is strongly recommended to anyone interested in adaptive control.

1,814 citations

Book ChapterDOI
11 Dec 2012

1,704 citations