Author

Yu Jiang

Bio: Yu Jiang is an academic researcher from New York University. The author has contributed to research topics including Dynamic programming & Optimal control. The author has an h-index of 18 and has co-authored 45 publications receiving 2,278 citations. Previous affiliations of Yu Jiang include Mitsubishi Electric Research Laboratories & South China University of Technology.

Papers
Journal ArticleDOI
TL;DR: This paper presents a novel policy iteration approach for finding online adaptive optimal controllers for continuous-time linear systems with completely unknown system dynamics; the approximate/adaptive dynamic programming technique iteratively solves the algebraic Riccati equation from online state and input information.
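
For context, the data-driven scheme described above emulates the classical model-based policy iteration (Kleinman's algorithm) for the algebraic Riccati equation; a minimal sketch of that model-based recursion is given below, assuming known matrices A, B and weights Q, R. The paper's contribution is to carry out the equivalent updates from online state and input data without knowing A or B.

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov

def kleinman_policy_iteration(A, B, Q, R, K0, iters=20):
    """Model-based policy iteration for the continuous-time ARE.

    Starting from a stabilizing gain K0, alternate:
      1) policy evaluation: solve (A - B K)^T P + P (A - B K) + Q + K^T R K = 0
      2) policy improvement: K <- R^{-1} B^T P
    P converges to the stabilizing solution of the algebraic Riccati equation.
    """
    K = K0
    for _ in range(iters):
        Ak = A - B @ K
        # SciPy solves  M X + X M^T = C;  here M = Ak^T, C = -(Q + K^T R K)
        P = solve_continuous_lyapunov(Ak.T, -(Q + K.T @ R @ K))
        K = np.linalg.solve(R, B.T @ P)  # K = R^{-1} B^T P
    return P, K
```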

723 citations

Journal ArticleDOI
TL;DR: The proposed RADP methodology can be viewed as an extension of ADP to uncertain nonlinear systems and has been applied to the controller design problems for a jet engine and a one-machine power system.
Abstract: This paper studies the robust optimal control design for a class of uncertain nonlinear systems from a perspective of robust adaptive dynamic programming (RADP). The objective is to fill up a gap in the past literature of adaptive dynamic programming (ADP) where dynamic uncertainties or unmodeled dynamics are not addressed. A key strategy is to integrate tools from modern nonlinear control theory, such as the robust redesign and the backstepping techniques as well as the nonlinear small-gain theorem, with the theory of ADP. The proposed RADP methodology can be viewed as an extension of ADP to uncertain nonlinear systems. Practical learning algorithms are developed in this paper, and have been applied to the controller design problems for a jet engine and a one-machine power system.
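
The nonlinear small-gain theorem mentioned in the abstract is the tool used to account for dynamic uncertainties; for reference, its standard ISS form is stated below generically (this notation is illustrative, not the paper's): the composed gains of the two interconnected subsystems must form a strict contraction.

```latex
% ISS small-gain condition for two interconnected subsystems with
% class-K ISS gains \gamma_1 and \gamma_2: the interconnection is
% (input-to-state) stable provided
\gamma_1\!\left(\gamma_2(s)\right) < s, \qquad \text{for all } s > 0.
```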

328 citations

Journal ArticleDOI
TL;DR: In this article, a novel method of global adaptive dynamic programming (ADP) for the adaptive optimal control of nonlinear polynomial systems is presented, which consists of relaxing the problem of solving the Hamilton-Jacobi-Bellman (HJB) equation to an optimization problem, which is solved via a new policy iteration method.
Abstract: This paper presents a novel method of global adaptive dynamic programming (ADP) for the adaptive optimal control of nonlinear polynomial systems. The strategy consists of relaxing the problem of solving the Hamilton-Jacobi-Bellman (HJB) equation to an optimization problem, which is solved via a new policy iteration method. The proposed method distinguishes from previously known nonlinear ADP methods in that the neural network approximation is avoided, giving rise to significant computational improvement. Instead of semiglobally or locally stabilizing, the resultant control policy is globally stabilizing for a general class of nonlinear polynomial systems. Furthermore, in the absence of the a priori knowledge of the system dynamics, an online learning method is devised to implement the proposed policy iteration technique by generalizing the current ADP theory. Finally, three numerical examples are provided to validate the effectiveness of the proposed method.
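
For readers unfamiliar with the relaxation step, a minimal sketch of the idea is given below in LaTeX, assuming affine dynamics and a cost quadratic in the control (this notation is illustrative and not taken from the paper): the HJB equality is weakened to an inequality, and any feasible value function upper-bounds the optimal cost, so optimizing over feasible polynomial candidates tightens the bound without neural-network approximation.

```latex
% Dynamics \dot{x} = f(x) + g(x)u, cost J = \int_0^\infty \big(q(x) + u^\top R u\big)\,dt.
% Standard HJB equation for the optimal value function V^*:
\nabla V^\top f(x) + q(x) - \tfrac{1}{4}\,\nabla V^\top g(x) R^{-1} g(x)^\top \nabla V = 0.
% Relaxed problem: replace the equality by an inequality over a class of
% polynomial candidates; any feasible V satisfies V \ge V^*, since along the
% policy u_V = -\tfrac{1}{2} R^{-1} g^\top \nabla V one has
% \dot{V} \le -\big(q(x) + u_V^\top R\, u_V\big):
\nabla V^\top f(x) + q(x) - \tfrac{1}{4}\,\nabla V^\top g(x) R^{-1} g(x)^\top \nabla V \le 0.
```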

195 citations

Journal ArticleDOI
TL;DR: A novel optimal control design scheme is proposed for continuous-time nonaffine nonlinear dynamic systems with unknown dynamics by adaptive dynamic programming (ADP), which iteratively updates the control policy online by using the state and input information without identifying the system dynamics.

184 citations

Journal ArticleDOI
TL;DR: The obtained adaptive and optimal output-feedback controllers differ from the existing ADP literature in that they are derived from sampled-data systems theory and are guaranteed to be robust to dynamic uncertainties.

183 citations


Cited by
Book ChapterDOI
11 Dec 2012

1,704 citations

Journal ArticleDOI
TL;DR: This book develops the theory and methods of approximation: existence and uniqueness of best approximations, approximation operators, polynomial interpolation, minimax and least-squares approximation, and spline methods.
Abstract: Preface 1. The approximation problem and existence of best approximations 2. The uniqueness of best approximations 3. Approximation operators and some approximating functions 4. Polynomial interpolation 5. Divided differences 6. The uniform convergence of polynomial approximations 7. The theory of minimax approximation 8. The exchange algorithm 9. The convergence of the exchange algorithm 10. Rational approximation by the exchange algorithm 11. Least squares approximation 12. Properties of orthogonal polynomials 13. Approximation of periodic functions 14. The theory of best L1 approximation 15. An example of L1 approximation and the discrete case 16. The order of convergence of polynomial approximations 17. The uniform boundedness theorem 18. Interpolation by piecewise polynomials 19. B-splines 20. Convergence properties of spline approximations 21. Knot positions and the calculation of spline approximations 22. The Peano kernel theorem 23. Natural and perfect splines 24. Optimal interpolation Appendices Index.
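
To make two of the listed topics concrete (least squares approximation, Chapter 11; B-splines, Chapters 18-20), a small illustrative Python snippet using NumPy/SciPy is given below; it is not tied to the book's own algorithms.

```python
import numpy as np
from scipy.interpolate import make_interp_spline

# Sample a smooth test function on [0, 1]
x = np.linspace(0.0, 1.0, 21)
f = lambda t: np.exp(t) * np.sin(2 * np.pi * t)
y = f(x)

# Least-squares polynomial approximation of degree 5
coeffs = np.polynomial.polynomial.polyfit(x, y, deg=5)
poly = np.polynomial.polynomial.Polynomial(coeffs)

# Cubic B-spline interpolant through the same data
spline = make_interp_spline(x, y, k=3)

# Compare maximum errors on a finer grid
xx = np.linspace(0.0, 1.0, 401)
print("max |f - poly|  :", np.max(np.abs(f(xx) - poly(xx))))
print("max |f - spline|:", np.max(np.abs(f(xx) - spline(xx))))
```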

841 citations

Journal ArticleDOI
TL;DR: This paper presents a novel policy iteration approach for finding online adaptive optimal controllers for continuous-time linear systems with completely unknown system dynamics; the approximate/adaptive dynamic programming technique iteratively solves the algebraic Riccati equation from online state and input information.

723 citations

Journal ArticleDOI
TL;DR: Q-learning and the integral RL algorithm are discussed as core algorithms for discrete-time (DT) and continuous-time (CT) systems, respectively, along with a new direction of off-policy RL for both CT and DT systems.
Abstract: This paper reviews the current state of the art on reinforcement learning (RL)-based feedback control solutions to optimal regulation and tracking of single and multiagent systems. Existing RL solutions to both optimal $\mathcal {H}_{2}$ and $\mathcal {H}_\infty $ control problems, as well as graphical games, will be reviewed. RL methods learn the solution to optimal control and game problems online and using measured data along the system trajectories. We discuss Q-learning and the integral RL algorithm as core algorithms for discrete-time (DT) and continuous-time (CT) systems, respectively. Moreover, we discuss a new direction of off-policy RL for both CT and DT systems. Finally, we review several applications.
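
As a pointer for readers, the discrete-time core algorithm named in the review, Q-learning, is summarized by a one-line temporal-difference update; a minimal tabular sketch follows (generic illustration, not the review's notation; the continuous-time integral RL counterpart replaces the one-step reward with an integral of the reward over a sampling interval).

```python
import numpy as np

def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.99):
    """One tabular Q-learning step:
    Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s_next, a') - Q(s, a))
    Q is an (n_states, n_actions) array; s, a, s_next are integer indices.
    """
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q
```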

536 citations