Journal ArticleDOI
An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
Reads0
Chats0
TLDR
A new iterative adaptive dynamic programming (ADP) method is proposed to solve a class of continuous-time nonlinear two-person zero-sum differential games and the convergence property of the performance index function is proved.About:
This article is published in Automatica.The article was published on 2011-01-01. It has received 365 citations till now. The article focuses on the topics: Iterative method & Saddle point.read more
Citations
More filters
Journal ArticleDOI
Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
Yu Jiang,Zhong-Ping Jiang +1 more
TL;DR: This paper presents a novel policy iteration approach for finding online adaptive optimal controllers for continuous-time linear systems with completely unknown system dynamics, using the approximate/adaptive dynamic programming technique to iteratively solve the algebraic Riccati equation using the online information of state and input.
Journal ArticleDOI
Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems
Derong Liu,Qinglai Wei +1 more
TL;DR: It is shown that the iterative performance index function is nonincreasingly convergent to the optimal solution of the Hamilton-Jacobi-Bellman equation and it is proven that any of the iteratives control laws can stabilize the nonlinear systems.
Journal ArticleDOI
Data-Driven Robust Approximate Optimal Tracking Control for Unknown General Nonlinear Systems Using Adaptive Dynamic Programming Method
TL;DR: A novel data-driven robust approximate optimal tracking control scheme is proposed for unknown general nonlinear systems by using the adaptive dynamic programming (ADP) method and a robustifying term is developed to compensate for the NN approximation errors introduced by implementing the ADP method.
Journal ArticleDOI
Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
Bahare Kiumarsi,Frank L. Lewis,Hamidreza Modares,Ali Karimpour,Mohammad Bagher Naghibi-Sistani +4 more
TL;DR: A novel approach based on the Q -learning algorithm is proposed to solve the infinite-horizon linear quadratic tracker (LQT) for unknown discrete-time systems in a causal manner and the optimal control input is obtained by only solving an augmented ARE.
Journal ArticleDOI
Near-Optimal Control for Nonzero-Sum Differential Games of Continuous-Time Nonlinear Systems Using Single-Network ADP
TL;DR: The novel weight tuning laws for critic neural networks are proposed, which not only ensure the Nash equilibrium to be reached but also guarantee the system to be stable and demonstrate the uniform ultimate boundedness of the closed-loop system.
References
More filters
Book
Dynamic Noncooperative Game Theory
Tamer Basar,Geert Jan Olsder +1 more
TL;DR: In this paper, the authors present a general formulation of non-cooperative finite games: N-Person nonzero-sum games, Pursuit-Evasion games, and Stackelberg Equilibria of infinite dynamic games.
Book
Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations
TL;DR: In this paper, the main ideas on a model problem with continuous viscosity solutions of Hamilton-Jacobi equations are discussed. But the main idea of the main solutions is not discussed.
Stability in dynamical systems
Abstract: Stability in dynamical systems subject to some law of force is considered. This leads to a set of differential equations which govern the motion. (AIP)
Journal ArticleDOI
Adaptive Dynamic Programming: An Introduction
TL;DR: Some recent research trends within the field of adaptive/approximate dynamic programming (ADP), including the variations on the structure of ADP schemes, the development of ADPs algorithms and applications, and many recent papers have provided convergence analysis associated with the algorithms developed.
Related Papers (5)
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
Murad Abu-Khalaf,Frank L. Lewis +1 more