scispace - formally typeset
Journal ArticleDOI

Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network ADP

TLDR
The novel weight tuning laws for critic neural networks are proposed, which not only ensure the Nash equilibrium to be reached but also guarantee the system to be stable and demonstrate the uniform ultimate boundedness of the closed-loop system.
Abstract
In this paper, a near-optimal control scheme is proposed to solve the nonzero-sum differential games of continuous-time nonlinear systems. The single-network adaptive dynamic programming (ADP) is utilized to obtain the optimal control policies which make the cost functions reach the Nash equilibrium of nonzero-sum differential games, where only one critic network is used for each player instead of the action-critic dual network used in a typical ADP architecture. Furthermore, the novel weight tuning laws for critic neural networks are proposed, which not only ensure the Nash equilibrium to be reached but also guarantee the system to be stable. No initial stabilizing control policy is required for each player. Moreover, Lyapunov theory is utilized to demonstrate the uniform ultimate boundedness of the closed-loop system. Finally, a simulation example is given to verify the effectiveness of the proposed near-optimal control scheme.

read more

Citations
More filters
Journal ArticleDOI

Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone

TL;DR: An adaptive fuzzy optimal control design is addressed for a class of unknown nonlinear discrete-time systems that contain unknown functions and nonsymmetric dead-zone and can be proved based on the difference Lyapunov function method.
Journal ArticleDOI

Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems With Unknown Dynamics Using Reinforcement Learning Method

TL;DR: A data-based adaptive dynamic programming method is presented using the current and past system data rather than the accurate system models also instead of the traditional identification scheme which would cause the approximation residual errors.
Journal ArticleDOI

Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints

TL;DR: A novel RL-based robust adaptive control algorithm is developed for a class of continuous-time uncertain nonlinear systems subject to input constraints that is converted to the constrained optimal control problem with appropriately selecting value functions for the nominal system.
Journal ArticleDOI

Off-Policy Reinforcement Learning for $ H_\infty $ Control Design

TL;DR: An off-policy reinforcement leaning (RL) method is introduced to learn the solution of HJI equation from real system data instead of mathematical system model, and its convergence is proved.
Journal ArticleDOI

Adaptive Critic Nonlinear Robust Control: A Survey

TL;DR: This survey reviews the recent main results of adaptive-critic-based robust control design of continuous-time nonlinear systems and promotes the development of adaptive critic control methods with robustness guarantee and the construction of higher level intelligent systems.
References
More filters
Book

Dynamic Programming

TL;DR: The more the authors study the information processing aspects of the mind, the more perplexed and impressed they become, and it will be a very long time before they understand these processes sufficiently to reproduce them.
Journal ArticleDOI

What is dynamic programming

TL;DR: Sequence alignment methods often use something called a 'dynamic programming' algorithm, which can be a good idea or a bad idea, depending on the method used.
Book

Dynamic Noncooperative Game Theory

TL;DR: In this paper, the authors present a general formulation of non-cooperative finite games: N-Person nonzero-sum games, Pursuit-Evasion games, and Stackelberg Equilibria of infinite dynamic games.

Neuro-Dynamic Programming.

TL;DR: In this article, the authors present the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.
Book

Neuro-dynamic programming

TL;DR: This is the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.
Related Papers (5)