Journal ArticleDOI
Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network ADP
TLDR
The novel weight tuning laws for critic neural networks are proposed, which not only ensure the Nash equilibrium to be reached but also guarantee the system to be stable and demonstrate the uniform ultimate boundedness of the closed-loop system.Abstract:
In this paper, a near-optimal control scheme is proposed to solve the nonzero-sum differential games of continuous-time nonlinear systems. The single-network adaptive dynamic programming (ADP) is utilized to obtain the optimal control policies which make the cost functions reach the Nash equilibrium of nonzero-sum differential games, where only one critic network is used for each player instead of the action-critic dual network used in a typical ADP architecture. Furthermore, the novel weight tuning laws for critic neural networks are proposed, which not only ensure the Nash equilibrium to be reached but also guarantee the system to be stable. No initial stabilizing control policy is required for each player. Moreover, Lyapunov theory is utilized to demonstrate the uniform ultimate boundedness of the closed-loop system. Finally, a simulation example is given to verify the effectiveness of the proposed near-optimal control scheme.read more
Citations
More filters
Journal ArticleDOI
Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone
TL;DR: An adaptive fuzzy optimal control design is addressed for a class of unknown nonlinear discrete-time systems that contain unknown functions and nonsymmetric dead-zone and can be proved based on the difference Lyapunov function method.
Journal ArticleDOI
Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems With Unknown Dynamics Using Reinforcement Learning Method
TL;DR: A data-based adaptive dynamic programming method is presented using the current and past system data rather than the accurate system models also instead of the traditional identification scheme which would cause the approximation residual errors.
Journal ArticleDOI
Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints
TL;DR: A novel RL-based robust adaptive control algorithm is developed for a class of continuous-time uncertain nonlinear systems subject to input constraints that is converted to the constrained optimal control problem with appropriately selecting value functions for the nominal system.
Journal ArticleDOI
Off-Policy Reinforcement Learning for $ H_\infty $ Control Design
TL;DR: An off-policy reinforcement leaning (RL) method is introduced to learn the solution of HJI equation from real system data instead of mathematical system model, and its convergence is proved.
Journal ArticleDOI
Adaptive Critic Nonlinear Robust Control: A Survey
Ding Wang,Haibo He,Derong Liu +2 more
TL;DR: This survey reviews the recent main results of adaptive-critic-based robust control design of continuous-time nonlinear systems and promotes the development of adaptive critic control methods with robustness guarantee and the construction of higher level intelligent systems.
References
More filters
Book
Dynamic Programming
TL;DR: The more the authors study the information processing aspects of the mind, the more perplexed and impressed they become, and it will be a very long time before they understand these processes sufficiently to reproduce them.
Journal ArticleDOI
What is dynamic programming
TL;DR: Sequence alignment methods often use something called a 'dynamic programming' algorithm, which can be a good idea or a bad idea, depending on the method used.
Book
Dynamic Noncooperative Game Theory
Tamer Basar,Geert Jan Olsder +1 more
TL;DR: In this paper, the authors present a general formulation of non-cooperative finite games: N-Person nonzero-sum games, Pursuit-Evasion games, and Stackelberg Equilibria of infinite dynamic games.
Neuro-Dynamic Programming.
TL;DR: In this article, the authors present the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.
Book
Neuro-dynamic programming
TL;DR: This is the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.
Related Papers (5)
Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
Murad Abu-Khalaf,Frank L. Lewis +1 more