scispace - formally typeset
Search or ask a question
Journal ArticleDOI

A comparison of classical and learning controllers

01 Jan 2011-IFAC Proceedings Volumes (Elsevier)-Vol. 44, Iss: 1, pp 1102-1107
Abstract: This paper focuses on evaluating Locally Weighted Projection Regression (LWPR) as an alternative control method to traditional model-based control schemes. LWPR is used to estimate the inverse dynamics function of a 6 degree of freedom (DOF) manipulator. The performance of the resulting controller is compared to that of the resolved acceleration and the adaptive computed torque (ACT) controller. Simulations are carried out in order to evaluate the position and orientation tracking performance of each controller while varying trajectory velocities, end effector loading and errors in the known parameters. Both the adaptive controller and LWPR controller have comparable performance in the presence of parametric uncertainty including friction. The ACT controller outperforms LWPR when the dynamic structure is accurately known and the trajectory is persistently exciting.

Summary (2 min read)

1. INTRODUCTION

  • The use of robotics worldwide is most prevalent in the industrial setting where the environment is highly controlled.
  • Furthermore, uncertainties in the physical parameters of a system may be introduced from discrepancies between the manufacturer data and the actual system (Ayusawa et al., 2008).
  • More recently, statistical regression approaches have been used to infer the optimal structure to describe the observed data, making it possible to encode nonlinearities whose structure may not be well-known.

2. OVERVIEW OF MODEL-BASED CONTROL

  • This term globally linearizes and decouples the system, and thus a linear controller can be applied for the feedback term, uFB, which provides stability and disturbance rejection.
  • Desirable performance of the computed torque approach is based on the assumption that values of the parameters in (1) match the actual parameters of the physical system.
  • Otherwise, imperfect cancelation of the nonlinearities and coupling occurs.

3. LEARNING INVERSE DYNAMICS

  • While the adaptive approach requires accurate knowledge of the structure of the dynamic model of the manipulator, the learning approach obtains a model using measured data (Nguyen-Tuong et al., 2009), allowing unknown or unmodeled nonlinearities such as friction and backlash to be accounted for.
  • In order to be practical for manipulator control, learning algorithms must process continuous streams of training data to update the model and predict outputs fast enough for real-time control.
  • K∑ k=1 wikŷik/ K∑ k=1 wik (8) where K is the number of linear models, ŷik is the prediction of the ikth local linear model given by (6) which is weighed by wik associated with its receptive field.
  • This prediction is repeated i times for each dimension of the output vector y.
  • To reduce computational effort, LWPR assumes that the data can be characterized by local low-dimensional distributions, and attempts to reduce the dimensionality of the input space X using Partial Least Squares regression (PLS).

4. SIMULATIONS

  • In order to evaluate the performance of the LWPR learning controller, two ‘classical’ controllers in the joint space were also implemented: the resolved acceleration (RA) controller (Sciavicco and Scicliano, 2000) and the adaptive computed torque (ACT) controller (Craig et al., 1986; Ortega and Spong, 1988) given by (3).
  • The LWPR controller was trained for 60s on the 0.25Hz trajectory, after which training was stopped and tracking performance was evaluated.
  • Training was stopped when the observed MSE had asymptotically decreased to a low value.
  • The same conditions were repeated on the ACT and RA controller.
  • The third simulation adds varying amounts of error in the inertia parameters of the model while observing the resulting performance of the three controllers when tracking the ‘figure 8’ trajectory.

4.1 Parameter Tuning and Initialization

  • The stability of the ACT controller was found to be highly sensitive to the adaptive gain parameter, γ (3).
  • The initial value for the distance parameter D (7) dictates how large a receptive field is upon initialization.
  • This parameter was generally tuned through a trial-and-error process which involved monitoring the MSE of the predicted values during the training phase.
  • The initial performance of the LWPR controller is also highly dependent upon the data sets that are used to train the LWPR model.
  • Because LWPR is a local learning approach, it must be trained in the region(s) of input space that the manipulator will be operating in.

4.2 Results

  • The LWPR model was trained on the ‘figure 8’ trajectory at 0.25Hz, enabling it to predict the necessary torques for tracking at frequencies near 0.25Hz.
  • Table 3 illustrates that the inaccurate knowledge of the dynamic parameters causes significant degradation in performance for the RA controller, due to the imperfect linearization of the system dynamics.
  • For the LWPR controller, similar findings to that of simulations one and two were observed in that inertia parameter perturbations greater than 10% were sufficient to prevent LWPR from predicting accurate joint torques.
  • The first test involved using the model that was learned in simulation three to attempt to track the PE trajectory.
  • Unlike the adaptive controller which relies on persistence of excitation for tracking performance, the LWPR approach can be trained on an arbitrary trajectory provided that it is given sufficient time to learn.

Did you find this useful? Give us your feedback

Citations
More filters
Journal ArticleDOI
TL;DR: In this article, the authors tried to read modelling and control of robot manipulators as one of the reading material to finish quickly, and they found that reading book can be a great choice when having no friends and activities.
Abstract: Feel lonely? What about reading books? Book is one of the greatest friends to accompany while in your lonely time. When you have no friends and activities somewhere and sometimes, reading book can be a great choice. This is not only for spending the time, it will increase the knowledge. Of course the b=benefits to take will relate to what kind of book that you are reading. And now, we will concern you to try reading modelling and control of robot manipulators as one of the reading material to finish quickly.

517 citations

Proceedings ArticleDOI
24 Dec 2012
TL;DR: The proposed approach for online learning of the inverse dynamics model using Gaussian Process Regression is compared to existing learning and fixed control algorithms and shown to be capable of fast initialization and learning rate.
Abstract: Model-based control strategies for robot manipulators can present numerous performance advantages when an accurate model of the system dynamics is available. In practice, obtaining such a model is a challenging task which involves modeling such physical processes as friction, which may not be well understood and difficult to model. This paper proposes an approach for online learning of the inverse dynamics model using Gaussian Process Regression. The Sparse Online Gaussian Process (SOGP) algorithm is modified to allow for incremental updates of the model and hyperparameters. The influence of initialization on the performance of the learning algorithms, based on any a-priori knowledge available, is also investigated. The proposed approach is compared to existing learning and fixed control algorithms and shown to be capable of fast initialization and learning rate.

21 citations


Cites background from "A comparison of classical and learn..."

  • ...While adaptive control can provide an online estimate of the dynamic parameters, is still reliant upon adequate knowledge of the structure of the dynamic model and is thus particularly susceptible to the effects of unmodeled dynamics [13]....

    [...]

  • ...A comparison between learning approaches such as LWPR and classical control techniques [13] shows that both the adaptive controller and LWPR controller have comparable performance in the presence of parametric uncertainty....

    [...]

Journal ArticleDOI
TL;DR: The proposed approach for online learning of the dynamic model of a robot manipulator is tested on an industrial robot, and shown to outperform independent joint and fixed model-based control.
Abstract: This paper proposes an approach for online learning of the dynamic model of a robot manipulator. The dynamic model is formulated as a weighted sum of locally linear models, and Locally Weighted Projection Regression (LWPR) is used to learn the models based on training data obtained during operation. The LWPR model can be initialized with partial knowledge of rigid body parameters to improve the initial performance. The resulting dynamic model is used to implement a model-based controller. Both feedforward and feedback configurations are investigated. The proposed approach is tested on an industrial robot, and shown to outperform independent joint and fixed model-based control.

12 citations


Cites background or methods from "A comparison of classical and learn..."

  • ...A technique for improving the initial learning performance by making use of full or partial knowledge of the rigid body model is applied (de la Cruz et al. (2011b))....

    [...]

  • ...However, both dynamic parameter estimation and adaptive control methods assume a known dynamic model structure, and can be sensitive to error when the structure is not modeled accurately (de la Cruz et al. (2011a))....

    [...]

  • ...In order to improve the generalization performance of LWPR, an algorithm for incorporating a-priori knowledge from the RBD model (1) into the LWPR algorithm is applied (de la Cruz et al. (2011b))....

    [...]

  • ...The same motor babbling strategy as in (de la Cruz et al. (2011a)) was applied for this experiment, resulting in roughly 50,000 initial training points from motor babbling....

    [...]

  • ...Although LWPR has the ability to learn in an online, incremental manner due to its local learning approach, performance deteriorates quickly as the system moves outside of the region of state space it has been trained in (de la Cruz et al. (2011a))....

    [...]

Proceedings ArticleDOI
21 May 2018
TL;DR: This work proposes a scheme for collecting a sample of correspondences from the robots for training transfer models, and demonstrates the benefit of knowledge transfer in accelerating online learning of the inverse dynamics model between several robots, including between a low-cost Interbotix PhantomX Pincher arm and a more expensive and relatively heavier Kuka youBot arm.
Abstract: Online learning of a robot's inverse dynamics model for trajectory tracking necessitates an interaction between the robot and its environment to collect training data This is challenging for physical robots in the real world, especially for humanoids and manipulators due to their large and high dimensional state and action spaces, as a large amount of data must be collected over time This can put the robot in danger when learning tabula rasa and can also be a time-intensive process especially in a multi-robot setting, where each robot is learning its model from scratch We propose accelerating learning of the inverse dynamics model for trajectory tracking tasks in this multi-robot setting using knowledge transfer, where robots share and re-use data collected by preexisting robots, in order to speed up learning for new robots We propose a scheme for collecting a sample of correspondences from the robots for training transfer models, and demonstrate, in simulations, the benefit of knowledge transfer in accelerating online learning of the inverse dynamics model between several robots, including between a low-cost Interbotix PhantomX Pincher arm, and a more expensive and relatively heavier Kuka youBot arm We show that knowledge transfer can save up to 63% of training time of the youBot arm compared to learning from scratch, and about 58% for the lighter Pincher arm

11 citations


Cites background or methods from "A comparison of classical and learn..."

  • ...In some cases it has been shown that learning can be accelerated by initializing the model with random data generated through a motor babbling process [28], [29]....

    [...]

  • ...Random initialization We also separately initialize learning with random data generated in the motor babbling session as a benchmark, denoted ‘random’, as it has previously been shown to accelerate learning [28], [29]....

    [...]

References
More filters
Book
23 Nov 2005
TL;DR: The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics, and deals with the supervised learning problem for both regression and classification.
Abstract: A comprehensive and self-contained introduction to Gaussian processes, which provide a principled, practical, probabilistic approach to learning in kernel machines. Gaussian processes (GPs) provide a principled, practical, probabilistic approach to learning in kernel machines. GPs have received increased attention in the machine-learning community over the past decade, and this book provides a long-needed systematic and unified treatment of theoretical and practical aspects of GPs in machine learning. The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics. The book deals with the supervised-learning problem for both regression and classification, and includes detailed algorithms. A wide variety of covariance (kernel) functions are presented and their properties discussed. Model selection is discussed both from a Bayesian and a classical perspective. Many connections to other well-known techniques from machine learning and statistics are discussed, including support-vector machines, neural networks, splines, regularization networks, relevance vector machines and others. Theoretical issues including learning curves and the PAC-Bayesian framework are treated, and several approximation methods for learning with large datasets are discussed. The book contains illustrative examples and exercises, and code and datasets are available on the Web. Appendixes provide mathematical background and a discussion of Gaussian Markov processes.

11,357 citations

Journal ArticleDOI
TL;DR: A betterment process for the operation of a mechanical robot in a sense that it betters the nextoperation of a robot by using the previous operation's data is proposed.
Abstract: This article proposes a betterment process for the operation of a mechanical robot in a sense that it betters the next operation of a robot by using the previous operation's data. The process has an iterative learning structure such that the (k + 1)th input to joint actuators consists of the kth input plus an error increment composed of the derivative difference between the kth motion trajectory and the given desired motion trajectory. The convergence of the process to the desired motion trajectory is assured under some reasonable conditions. Numerical results by computer simulation are presented to show the effectiveness of the proposed learning scheme.

3,222 citations


"A comparison of classical and learn..." refers background in this paper

  • ...Rather than learning the underlying model structure of the system, Iterative learning control (ILC) (Arimoto et al., 1984; Bristow et al., 2006) incorporates information from error signals in previous iterations to directly modify the control input for subsequent iterations....

    [...]

  • ...However, ILC is limited primarily to systems which track a specific repeating trajectory and are subject to repeating disturbances (Bristow et al., 2006), whereas the model learning approaches such as LWPR can be incrementally trained to deal with non-repeating trajectories....

    [...]

Journal ArticleDOI
TL;DR: This survey is the first to bring to the attention of the controls community the important contributions from the tribology, lubrication and physics literatures, and provides a set of models and tools for friction compensation which will be of value to both research and application engineers.
Abstract: While considerable progress has been made in friction compensation, this is, apparently, the first survey on the topic. In particular, it is the first to bring to the attention of the controls community the important contributions from the tribology, lubrication and physics literatures. By uniting these results with those of the controls community, a set of models and tools for friction compensation is provided which will be of value to both research and application engineers. The successful design and analysis of friction compensators depends heavily upon the quality of the friction model used, and the suitability of the analysis technique employed. Consequently, this survey first describes models of machine friction, followed by a discussion of relevant analysis techniques and concludes with a survey of friction compensation methods reported in the literature. An overview of techniques used by practising engineers and a bibliography of 280 papers is included.

2,658 citations


"A comparison of classical and learn..." refers background in this paper

  • ...In practice, obtaining such a model is a challenging task which involves modeling physical processes that are not well understood or difficult to model, such as friction (Armstrong-Hélouvry et al., 1994) and backlash....

    [...]

  • ...In practice, obtaining such a model is a challenging task which involves modeling physical processes that are not well understood or difficult to model, such as friction (Armstrong-Hélouvry et al., 1994) and backlash....

    [...]

Journal ArticleDOI
TL;DR: Though beginning its third decade of active research, the field of ILC shows no sign of slowing down and includes many results and learning algorithms beyond the scope of this survey.
Abstract: This article surveyed the major results in iterative learning control (ILC) analysis and design over the past two decades. Problems in stability, performance, learning transient behavior, and robustness were discussed along with four design techniques that have emerged as among the most popular. The content of this survey was selected to provide the reader with a broad perspective of the important ideas, potential, and limitations of ILC. Indeed, the maturing field of ILC includes many results and learning algorithms beyond the scope of this survey. Though beginning its third decade of active research, the field of ILC shows no sign of slowing down.

2,645 citations


"A comparison of classical and learn..." refers background in this paper

  • ...Rather than learning the underlying model structure of the system, Iterative learning control (ILC) (Arimoto et al., 1984; Bristow et al., 2006) incorporates information from error signals in previous iterations to directly modify the control input for subsequent iterations....

    [...]

  • ...However, ILC is limited primarily to systems which track a specific repeating trajectory and are subject to repeating disturbances (Bristow et al., 2006), whereas the model learning approaches such as LWPR can be incrementally trained to deal with non-repeating trajectories....

    [...]

  • ...limited primarily to systems which track a specific repeating trajectory and are subject to repeating disturbances (Bristow et al., 2006), whereas the model learning approaches such as LWPR can be incrementally trained to deal with non-repeating trajectories....

    [...]

Journal ArticleDOI
TL;DR: In this paper, an adaptive robot control algorithm is derived, which consists of a PD feedback part and a full dynamics feed for the compensation part, with the unknown manipulator and payload parameters being estimated online.
Abstract: A new adaptive robot control algorithm is derived, which consists of a PD feedback part and a full dynamics feedfor ward compensation part, with the unknown manipulator and payload parameters being estimated online. The algorithm is computationally simple, because of an effective exploitation of the structure of manipulator dynamics. In particular, it requires neither feedback of joint accelerations nor inversion of the estimated inertia matrix. The algorithm can also be applied directly in Cartesian space.

2,117 citations

Frequently Asked Questions (2)
Q1. What have the authors stated for future works in "A comparison of classical and learning controllers" ?

Future studies will involve experimental validation on physical robots as well as incorporating a-priori knowledge of the dynamic model of the system to improve performance of learning methods outside of the trained regions. 

This paper focuses on evaluating Locally Weighted Projection Regression ( LWPR ) as an alternative control method to traditional model-based control schemes.