scispace - formally typeset
Search or ask a question

Showing papers by "Robert Babuska published in 2005"


Journal ArticleDOI
TL;DR: State-of-the-art techniques for identifying fuzzy models and designing model-based controllers are reviewed in this article and attention is paid to the role of fuzzy systems in higher levels of the control hierarchy.

397 citations


Proceedings Article
01 Jan 2005
TL;DR: Adaptive state focus Q-learning (ASFQ-learning) as mentioned in this paper is a class of methods derived from Q-Learning that start learning with only the state information that is strictly necessary for a single agent to perform the task, and monitor the convergence of learning.
Abstract: In realistic multiagent systems, learning on the basis of complete state information is not feasible We introduce adaptive state focus Q-learning, a class of methods derived from Q-learning that start learning with only the state information that is strictly necessary for a single agent to perform the task, and that monitor the convergence of learning If lack of convergence is detected, the learner dynamically expands its state space to incorporate more state information (eg, states of other agents) Learning is faster and takes less resources than if the complete state were considered from the start, while being able to handle situations where agents interfere in pursuing their goals We illustrate our approach by instantiating a simple version of such a method, and by showing that it outperforms learning with full state information without being hindered by the deficiencies of learning on the basis of a single agent’s state

24 citations


Journal ArticleDOI
TL;DR: The main contribution of this paper is to present an adaptive fuzzy controller with composite adaptive laws based on both tracking and prediction error that achieves smoother parameter adaptation, better accuracy and improved performance.

23 citations


Book ChapterDOI
01 Jan 2005
TL;DR: In this paper, the authors proposed a fuzzy approximation architecture similar to those previously used for Q-learning, but they combine it with the model-based Q-value iteration algorithm, and prove that the resulting algorithm converges.
Abstract: Reinforcement learning (RL) is a widely used learning paradigm for adaptive agents. There exist several convergent and consistent RL algorithms which have been intensively studied. In their original form, these algorithms require that the environment states and agent actions take values in a relatively small discrete set. Fuzzy representations for approximate, model-free RL have been proposed in the literature for the more difficult case where the state-action space is continuous. In this work, we propose a fuzzy approximation architecture similar to those previously used for Q-learning, but we combine it with the model-based Q-value iteration algorithm. We prove that the resulting algorithm converges. We also give a modified, asynchronous variant of the algorithm that converges at least as fast as the original version. An illustrative simulation example is provided.

19 citations


Journal ArticleDOI
TL;DR: An automated modeling environment for serial manipulators has been implemented in Matlab/Simulink and the modeling environment has been used in the design of a control system for a seven‐degree‐of‐freedom manipulator in a tunnel‐boring machine.
Abstract: – Resulting from the need for fast and insightful modeling combined with the drawbacks of available modeling environments, provides details of work developed on an automated modelling environment., – An automated modeling environment for serial manipulators has been implemented in Matlab/Simulink., – The manipulator configuration is defined by using a graphical user interface and the corresponding mathematical model is automatically generated. The model is exported to Matlab for analysis and control design, as well as to Simulink for simulation and verification purposes. Friction and stiction phenomena are included in the model. The simulation results can be visualized in standard plots and scopes as well as through virtual reality animations., – The modeling environment has been used in the design of a control system for a seven‐degree‐of‐freedom manipulator in a tunnel‐boring machine., – Information on the implementation of an automated modelling environment to facilitate the simultaneous design of the configuration and the corresponding control system of serial manipulators

11 citations


Journal ArticleDOI
TL;DR: An automated modelling and control design environment for serial manipulators has been implemented in Matlab/Simulink and has been used in the design of a control system for a seven-degree-of-freedom manipulator in a tunnel-boring machine.

8 citations


Proceedings ArticleDOI
19 Sep 2005
TL;DR: An improved reinforcement learning algorithm is proposed, based on linear programming method for finding the best-response policy, that has some properties, such as easy computation, simple operation procedure and can guarantee a good learning convergence.
Abstract: An improved reinforcement learning algorithm is proposed in this paper. This algorithm is based on linear programming method for finding the best-response policy. A pursuit example is tested and the results show that this algorithm has some properties, such as easy computation, simple operation procedure and can guarantee a good learning convergence.

8 citations


Book ChapterDOI
01 Jan 2005
TL;DR: This chapter gives an overview of system identification techniques for fuzzy models and some selected techniques for model-based fuzzy control, including gain-scheduling and state-feedback design, model-inverse control and predictive control.
Abstract: This chapter gives an overview of system identification techniques for fuzzy models and some selected techniques for model-based fuzzy control. It starts with a brief discussion of the position of fuzzy modelling within the general nonlinear identification setting. The two most commonly used fuzzy models are the Mamdani model and the Takagi-Sugeno model. An overview of techniques for the data-driven construction of the latter model is given. We discuss both structure selection (input variables, representation of dynamics, number and type of membership functions) and parameter estimation (local and global estimation techniques, weighted least squares, multi-objective optimisation). Further, we discuss control design based on a fuzzy model of the process. As the model is assumed to be obtained through identification from sampled data, we focus on discrete-time methods, including: gain-scheduling and state-feedback design, model-inverse control and predictive control. A real-world application example is given.

4 citations


Journal ArticleDOI
TL;DR: In this paper, an evolutionary algorithm is proposed to select the key variables of complex processes on the basis of a polynomial signal representation and a measured data set, which can cope with strongly correlated variables and non-linear relationships without the need for a priori process knowledge.

3 citations


Journal ArticleDOI
TL;DR: In this article, a new approach in control engineering (Information Processing for Action) is presented, in which control, computers, communication and cognition play equal roles in addressing real-life problems from very small-scale devices to very large-scale industrial processes and non-technical applications.

2 citations


Proceedings ArticleDOI
25 May 2005
TL;DR: This work study and compare direct and indirect adaptive control schemes by means of an experimental benchmark (two coupled DC machines) to gain insight in the benefits and drawbacks of the different variants of adaptive fuzzy controllers and to evaluate their potential for practical applications.
Abstract: Several stable adaptive fuzzy control schemes based on feedback linearization and Lyapunov synthesis were proposed in the literature. However, most of such controllers have been only tested on relatively simple simulations examples, in which the effects of noise, uncertainties, computational times and other fundamental problems related to hardware implementation are neglected. In this paper, we study and compare direct and indirect adaptive control schemes by means of an experimental benchmark (two coupled DC machines). For the indirect schemes, we consider both the standard adaptive laws based on tracking error and composite adaptive laws based on tracking and prediction error. The goal of this work is to gain insight in the benefits and drawbacks of the different variants of adaptive fuzzy controllers and to evaluate their potential for practical applications