Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network ADP

doi:10.1109/TSMCB.2012.2203336

Journal ArticleDOI

Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network ADP

Huaguang Zhang, +2 more

- 01 Feb 2013 -

IEEE Transactions on Systems, Man, and C...

- Vol. 43, Iss: 1, pp 206-216

TLDR

The novel weight tuning laws for critic neural networks are proposed, which not only ensure the Nash equilibrium to be reached but also guarantee the system to be stable and demonstrate the uniform ultimate boundedness of the closed-loop system.

Abstract:

In this paper, a near-optimal control scheme is proposed to solve the nonzero-sum differential games of continuous-time nonlinear systems. The single-network adaptive dynamic programming (ADP) is utilized to obtain the optimal control policies which make the cost functions reach the Nash equilibrium of nonzero-sum differential games, where only one critic network is used for each player instead of the action-critic dual network used in a typical ADP architecture. Furthermore, the novel weight tuning laws for critic neural networks are proposed, which not only ensure the Nash equilibrium to be reached but also guarantee the system to be stable. No initial stabilizing control policy is required for each player. Moreover, Lyapunov theory is utilized to demonstrate the uniform ultimate boundedness of the closed-loop system. Finally, a simulation example is given to verify the effectiveness of the proposed near-optimal control scheme.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone

Yan-Jun Liu, +3 more

- 01 Feb 2016 -

IEEE Transactions on Fuzzy Systems

TL;DR: An adaptive fuzzy optimal control design is addressed for a class of unknown nonlinear discrete-time systems that contain unknown functions and nonsymmetric dead-zone and can be proved based on the difference Lyapunov function method.

...read moreread less

Journal ArticleDOI

Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems With Unknown Dynamics Using Reinforcement Learning Method

Huaguang Zhang, +3 more

- 01 May 2017 -

IEEE Transactions on Industrial Electron...

TL;DR: A data-based adaptive dynamic programming method is presented using the current and past system data rather than the accurate system models also instead of the traditional identification scheme which would cause the approximation residual errors.

...read moreread less

Journal ArticleDOI

Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints

Derong Liu, +3 more

- 09 Apr 2015 -

IEEE Transactions on Systems, Man, and C...

TL;DR: A novel RL-based robust adaptive control algorithm is developed for a class of continuous-time uncertain nonlinear systems subject to input constraints that is converted to the constrained optimal control problem with appropriately selecting value functions for the nominal system.

...read moreread less

Journal ArticleDOI

Off-Policy Reinforcement Learning for $ H_\infty $ Control Design

Biao Luo, +2 more

- 01 Jan 2015 -

IEEE Transactions on Systems, Man, and C...

TL;DR: An off-policy reinforcement leaning (RL) method is introduced to learn the solution of HJI equation from real system data instead of mathematical system model, and its convergence is proved.

...read moreread less

Journal ArticleDOI

Adaptive Critic Nonlinear Robust Control: A Survey

Ding Wang, +2 more

- 03 Jul 2017 -

IEEE Transactions on Systems, Man, and C...

TL;DR: This survey reviews the recent main results of adaptive-critic-based robust control design of continuous-time nonlinear systems and promotes the development of adaptive critic control methods with robustness guarantee and the construction of higher level intelligent systems.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Dynamic Programming

Richard Ernest Bellman

TL;DR: The more the authors study the information processing aspects of the mind, the more perplexed and impressed they become, and it will be a very long time before they understand these processes sufficiently to reproduce them.

...read moreread less

Journal ArticleDOI

What is dynamic programming

Sean R. Eddy

- 01 Jul 2004 -

Nature Biotechnology

TL;DR: Sequence alignment methods often use something called a 'dynamic programming' algorithm, which can be a good idea or a bad idea, depending on the method used.

...read moreread less

Book

Dynamic Noncooperative Game Theory

Tamer Basar, +1 more

TL;DR: In this paper, the authors present a general formulation of non-cooperative finite games: N-Person nonzero-sum games, Pursuit-Evasion games, and Stackelberg Equilibria of infinite dynamic games.

...read moreread less

Neuro-Dynamic Programming.

Dimitri P. Bertsekas

TL;DR: In this article, the authors present the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.

...read moreread less

Book

Neuro-dynamic programming

Dimitri P. Bertsekas, +1 more

TL;DR: This is the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.

...read moreread less