Home
/
Authors
/
Gabriel A. D. Lopes

Author

Gabriel A. D. Lopes

Other affiliations: Instituto Superior Técnico, University of Michigan

Bio: Gabriel A. D. Lopes is an academic researcher from Delft University of Technology. The author has contributed to research in topics: Reinforcement learning & Visual servoing. The author has an hindex of 15, co-authored 57 publications receiving 1366 citations. Previous affiliations of Gabriel A. D. Lopes include Instituto Superior Técnico & University of Michigan.

Papers published on a yearly basis

2018
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2001
2000
1998

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients

[...]

I. Grondman¹, Lucian Busoniu, Gabriel A. D. Lopes¹, Robert Babuska¹•Institutions (1)

Delft University of Technology¹

01 Nov 2012

TL;DR: The workings of the natural gradient is described, which has made its way into many actor-critic algorithms over the past few years, and a review of several standard and natural actor-Critic algorithms is given.

...read moreread less

Abstract: Policy-gradient-based actor-critic algorithms are amongst the most popular algorithms in the reinforcement learning framework. Their advantage of being able to search for optimal policies using low-variance gradient estimates has made them useful in several real-life applications, such as robotics, power control, and finance. Although general surveys on reinforcement learning techniques already exist, no survey is specifically dedicated to actor-critic algorithms in particular. This paper, therefore, describes the state of the art of actor-critic algorithms, with a focus on methods that can work in an online setting and use function approximation in order to deal with continuous state and action spaces. After starting with a discussion on the concepts of reinforcement learning and the origins of actor-critic algorithms, this paper describes the workings of the natural gradient, which has made its way into many actor-critic algorithms over the past few years. A review of several standard and natural actor-critic algorithms is given, and the paper concludes with an overview of application areas and a discussion on open issues.

...read moreread less

764 citations

Proceedings Article•DOI•

Automated gait adaptation for legged robots

[...]

J.D. Weingarten¹, Gabriel A. D. Lopes¹, Martin Buehler, Richard E. Groff, Daniel E. Koditschek - Show less +1 more•Institutions (1)

University of Michigan¹

06 Jul 2004

TL;DR: This paper presents a system for gait adaptation in the RHex series of hexapedal robots that renders this arduous process nearly autonomous, by recourse to a modified version of Nelder-Mead descent.

...read moreread less

Abstract: Gait parameter adaptation on a physical robot is an error-prone, tedious and time-consuming process. In this paper we present a system for gait adaptation in our RHex series of hexapedal robots that renders this arduous process nearly autonomous. The robot adapts its gait parameters by recourse to a modified version of Nelder-Mead descent, while managing its self-experiments and measuring the outcome by visual servoing within a partially engineered environment The resulting performance gains extend considerably beyond what we have managed with hand tuning. For example, the best hand tuned alternating tripod gaits never exceeded 0.8 m/s nor achieved specific resistance below 2.0. In contrast, Nelder-Mead based tuning has yielded alternating tripod gaits at 2.7 m/s (well over 5 body lengths per second) and reduced specific resistance to 0.6 while requiring little human intervention at low and moderate speeds. Comparable gains have been achieved on the much larger ruggedized version of this machine.

...read moreread less

151 citations

Journal Article•DOI•

Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning

[...]

Hamidreza Modares¹, Subramanya Nageshrao², Gabriel A. D. Lopes², Robert Babuska², Frank L. Lewis³ - Show less +1 more•Institutions (3)

University of Texas at Arlington¹, Delft University of Technology², Northeastern University (China)³

01 Sep 2016-Automatica

TL;DR: This paper considers optimal output synchronization of heterogeneous linear multi-agent systems and shows that this optimal distributed approach implicitly solves the output regulation equations without actually doing so.

...read moreread less

128 citations

Journal Article•DOI•

Port-Hamiltonian Systems in Adaptive and Learning Control: A Survey

[...]

Subramanya Nageshrao¹, Gabriel A. D. Lopes¹, Dimitri Jeltsema¹, Robert Babuska¹•Institutions (1)

Delft University of Technology¹

01 May 2016-IEEE Transactions on Automatic Control

TL;DR: A comprehensive review of the current learning and adaptive control methodologies that have been adapted specifically to PH systems, and highlights the changes from the general setting due to PH model, followed by a detailed presentation of the respective control algorithm.

...read moreread less

Abstract: Port-Hamiltonian (PH) theory is a novel, but well established modeling framework for nonlinear physical systems. Due to the emphasis on the physical structure and modular framework, PH modeling has become a prime focus in system theory. This has led to a considerable research interest in the control of PH systems, resulting in numerous nonlinear control techniques. General nonlinear control methodologies are classified in a spectrum from model-based to model-free, where adaptation and learning typically lie close to the end of the range. Various articles and monographs have provided a detailed overview of model-based control techniques on PH models, but no survey is specifically dedicated to the learning and adaptive control methods that can benefit from the PH structure. To this end, we provide a comprehensive review of the current learning and adaptive control methodologies that have been adapted specifically to PH systems. After establishing the required theoretical background, we elaborate on various general machine learning, iterative learning, and adaptive control techniques and their application to PH systems. For each method we highlight the changes from the general setting due to PH model, followed by a detailed presentation of the respective control algorithm. In general, the advantages of using PH models in learning and adaptive controllers are: i) Prior knowledge in the form of PH model speeds up the learning. ii) In some instances new stability or convergence guarantees are obtained by having a PH model. iii) The resulting control laws can be interpreted in the context of physical systems. We conclude the paper with notes on open research issues.

...read moreread less

60 citations

Journal Article•DOI•

A fast sampling method for estimating the domain of attraction

[...]

Esmaeil Najafi¹, Esmaeil Najafi², Robert Babuska¹, Gabriel A. D. Lopes¹•Institutions (2)

Delft University of Technology¹, University of Tehran²

09 Jul 2016-Nonlinear Dynamics

TL;DR: In this paper, a sampling approach is proposed to estimate the domain of attraction (DoA) of nonlinear systems in real time, which is validated to approximate the DoAs of stable equilibria.

...read moreread less

Abstract: Most stabilizing controllers designed for nonlinear systems are valid only within a specific region of the state space, called the domain of attraction (DoA). Computation of the DoA is usually costly and time-consuming. This paper proposes a computationally effective sampling approach to estimate the DoAs of nonlinear systems in real time. This method is validated to approximate the DoAs of stable equilibria in several nonlinear systems. In addition, it is implemented for the passivity-based learning controller designed for a second-order dynamical system. Simulation and experimental results show that, in all cases studied, the proposed sampling technique quickly estimates the DoAs, corroborating its suitability for real-time applications.

...read moreread less

54 citations

1
2
3
4
…
5
6
7
8
9
10
11
12

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Deep learning in neural networks

[...]

Jürgen Schmidhuber¹•Institutions (1)

University of Lugano¹

01 Jan 2015-Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, review deep supervised learning, unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.

...read moreread less

14,635 citations

Journal Article•DOI•

A survey of advances in vision-based human motion capture and analysis

[...]

Thomas B. Moeslund¹, Adrian Hilton², Volker Krüger³•Institutions (3)

Aalborg University¹, University of Surrey², Aalborg University – Copenhagen³

01 Nov 2006-Computer Vision and Image Understanding

TL;DR: This survey reviews recent trends in video-based human capture and analysis, as well as discussing open problems for future research to achieve automatic visual analysis of human movement.

...read moreread less

2,738 citations

Book•

Neural Networks and Deep Learning

[...]

Charu C. Aggarwal

01 Jan 2018

2,291 citations

Posted Content•

Deep Reinforcement Learning: An Overview

[...]

Yuxi Li

25 Jan 2017-arXiv: Learning

TL;DR: This work discusses core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration, and important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn.

...read moreread less

Abstract: We give an overview of recent exciting achievements of deep reinforcement learning (RL). We discuss six core elements, six important mechanisms, and twelve applications. We start with background of machine learning, deep learning and reinforcement learning. Next we discuss core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration. After that, we discuss important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn. Then we discuss various applications of RL, including games, in particular, AlphaGo, robotics, natural language processing, including dialogue systems, machine translation, and text generation, computer vision, neural architecture design, business management, finance, healthcare, Industry 4.0, smart grid, intelligent transportation systems, and computer systems. We mention topics not reviewed yet, and list a collection of RL resources. After presenting a brief summary, we close with discussions. Please see Deep Reinforcement Learning, arXiv:1810.06339, for a significant update.

...read moreread less

935 citations

Journal Article•DOI•

Lecture Notes in Economics and Mathematical Systems

[...]

Robert J. Weiner

23 Jan 1985-Journal of Policy Analysis and Management

789 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse