Parameterized value iteration for output reference model tracking of a high order nonlinear aerodynamic system
Citations
261 citations
8 citations
Cites background or methods from "Parameterized value iteration for o..."
...Let the discrete-time known open-loop stable minimum-phase (MP) state-space deterministic strictly causal ORM be [12,46]...
[...]
...A discrete-time nonlinear unknown open-loop stable state-space deterministic strictly causal process is defined as [12,46] P : {xk+1 = f(xk, uk), yk = g(xk)}, (1)...
[...]
...Therefore it fits with the recent data-driven control [35–43] and reinforcement learning [12,44,45] applications....
[...]
...) to be minimized starting with x0 be [6,12,46]...
[...]
...Consider next that the extended state-space model that consists of (1), (2), and the state-space generative model of the reference input signal is, in the most general form [12,46]:...
[...]
Cites background or result from "Parameterized value iteration for o..."
...The authors would like to mention that their paper is an extended version of the IEEE conference paper [1] from the same authors....
[...]
...The extended results contained in this article, with respect to the results in [1], are detailed at the end of the seventh paragraph of the Introduction Section, as follows: The main updates with respect to our paper [12] include the following: detailed IMF-AVI convergence proofs under general function approximators; a case study for a low order linear system in order to generalize the more complex ORM tracking validation on the TITOAS process; comparisons with an offline deep deterministic policy gradient solution; more implementation details and insightful discussions on the obtained results....
[...]
...Additionally, the references [1] and [2] (below) are better acknowledged throughout the revised manuscript as references [12] and [46], respectively....
[...]
References
23,074 citations
"Parameterized value iteration for o..." refers background in this paper
...Although successful stories on RL and ADP applied to large stateaction spaces are reported mainly with AI [6], in control theory, most approaches use low-order processes as representative case studies and mainly in linear quadratic regulator (LQR)-like settings....
[...]
1,163 citations
919 citations
901 citations
"Parameterized value iteration for o..." refers background in this paper
...The VRFT prefilter is chosen as )()( zz ML ....
[...]
...The IO data }~,~{ kk yu is collected with low-amplitude zero-mean inputs 2,1, , kk uu , to maintain the process linearity around the mechanical equilibrium, such that to fit the linear VRFT design framework....
[...]
...Nonlinear (in particular, linear) state-feedback controllers can also be found by VRFT as shown in [23] to serve as initializations for the IMF-AVI....
[...]
...However, such input-output (IO) or input-state feedback controllers were traditionally not to be designed without using a process model, until the advent of data-driven modelfree controller design techniques that have appeared from the field of control theory: Virtual Reference Feedback Tuning (VRFT) [12], Iterative Feedback Tuning [13], data-driven Iterative Learning Control [1], [14], Model Free (Adaptive) Control [15], [16]....
[...]
...The linear VRFT output feedback error diagonal controller is )1/()(),()...
[...]
841 citations