Open AccessJournal ArticleDOI

Dynamical movement primitives: Learning attractor models for motor behaviors

- 01 Feb 2013 -

- Vol. 25, Iss: 2, pp 328-373

Chats0

TLDR

Dynamical movement primitives is presented, a line of research for modeling attractor behaviors of autonomous nonlinear dynamical systems with the help of statistical learning techniques, and its properties are evaluated in motor control and robotics.

Abstract:

Nonlinear dynamical systems have been used in many disciplines to model complex behaviors, including biological motor control, robotics, perception, economics, traffic prediction, and neuroscience. While often the unexpected emergent behavior of nonlinear systems is the focus of investigations, it is of equal importance to create goal-directed behavior e.g., stable locomotion from a system of coupled oscillators under perceptual guidance. Modeling goal-directed behavior with nonlinear systems is, however, rather difficult due to the parameter sensitivity of these systems, their complex phase transitions in response to subtle parameter changes, and the difficulty of analyzing and predicting their long-term behavior; intuition and time-consuming parameter tuning play a major role. This letter presents and reviews dynamical movement primitives, a line of research for modeling attractor behaviors of autonomous nonlinear dynamical systems with the help of statistical learning techniques. The essence of our approach is to start with a simple dynamical system, such as a set of linear differential equations, and transform those into a weakly nonlinear system with prescribed attractor dynamics by means of a learnable autonomous forcing term. Both point attractors and limit cycle attractors of almost arbitrary complexity can be generated. We explain the design principle of our approach and evaluate its properties in several example applications in motor control and robotics.

Content maybe subject to copyright Report

Edinburgh Research Explorer

Dynamical Movement Primitives: Learning Attractor Models for

Motor Behaviors

Citation for published version:

Ijspeert, AJ, Nakanishi, J, Hoffmann, H, Pastor, P & Schaal, S 2013, 'Dynamical Movement Primitives:

Learning Attractor Models for Motor Behaviors', Neural Computation, vol. 25, no. 2, pp. 328-373.

https://doi.org/10.1162/NECO_a_00393

Digital Object Identifier (DOI):

10.1162/NECO_a_00393

Link:

Link to publication record in Edinburgh Research Explorer

Document Version:

Publisher's PDF, also known as Version of record

Published In:

Neural Computation

General rights

and / or other copyright owners and it is a condition of accessing these publications that users recognise and

abide by the legal requirements associated with these rights.

Take down policy

The University of Edinburgh has made every reasonable effort to ensure that Edinburgh Research Explorer

content complies with UK legislation. If you believe that the public display of this file breaches copyright please

contact openaccess@ed.ac.uk providing details, and we will remove access to the work immediately and

investigate your claim.

Download date: 10. Aug. 2022

LETTER Communicated by Hirokazu Tanaka

Dynamical Movement Primitives: Learning Attractor

Models for Motor Behaviors

Auke Jan Ijspeert

auke.ijspeert@epﬂ.ch

Ecole Polytechnique F

erale de Lausanne, Lausanne CH-1015, Switzerland

Jun Nakanishi

jun.nakanishi@ed.ac.uk

School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, U.K.

Heiko Hoffmann

heikohof@usc.edu

Peter Pastor

pastorsa@usc.edu

Computer Science, Neuroscience, and Biomedical Engineering, University

of Southern California, Los Angeles, CA 90089, U.S.A.

Stefan Schaal

sschaal@usc.edu

Computer Science, Neuroscience, and Biomedical Engineering, University

of Southern California, Los Angeles, CA 90089, U.S.A.; Max-Planck-Institute

for Intelligent Systems, T

ubingen 72076, Germany; and ATR Computational

Neuroscience Laboratories, Kyoto 619-0288, Japan

Nonlinear dynamical systems have been used in many disciplines to

model complex behaviors, including biological motor control, robotics,

perception, economics, trafﬁc prediction, and neuroscience. While often

the unexpected emergent behavior of nonlinear systems is the focus of

investigations, it is of equal importance to create goal-directed behavior

(e.g., stable locomotion from a system of coupled oscillators under per-

ceptual guidance). Modeling goal-directed behavior with nonlinear sys-

tems is, however, rather difﬁcult due to the parameter sensitivity of these

systems, their complex phase transitions in response to subtle parameter

changes, and the difﬁculty of analyzing and predicting their long-term

behavior; intuition and time-consuming parameter tuning play a major

role. This letter presents and reviews dynamical movement primitives, a

line of research for modeling attractor behaviors of autonomous nonlin-

ear dynamical systems with the help of statistical learning techniques.

The essence of our approach is to start with a simple dynamical system,

such as a set of linear differential equations, and transform those into a

weakly nonlinear system with prescribed attractor dynamics by means

Neural Computation 25, 328–373 (2013)



2013 Massachusetts Institute of Technology

Dynamical Movement Primitives 329

of a learnable autonomous forcing term. Both point attractors and limit

cycle attractors of almost arbitrary complexity can be generated. We ex-

plain the design principle of our approach and evaluate its properties in

several example applications in motor control and robotics.

1 Introduction

In the wake of the development of nonlinear systems theory (Guckenheimer

& Holmes, 1983; Strogatz, 1994; Scott, 2005), it has become common practice

in several branches of science to model natural phenomena with systems of

coupled nonlinear differentialequations. Such approachesare motivated by

the insight that coupling effects of nonlinear systems exhibit rich abilities for

forming complex coordinated patterns without the need to explicitly plan

or supervise the details of such pattern formation. Among the many dif-

ferent forms of nonlinear systems (e.g., high-dimensional, weakly coupled,

strongly coupled, chaotic, Hamiltonian, dissipative), this letter addresses

low-dimensional nonlinear systems, for example, as typically used to model

phenomena of motor coordination or cognitive science (Kelso, 1995; Thelen

& Smith, 1994).

In this domain, there are often two modeling objectives.

First, a model of a baseline behavior is required, as in generating a basic

pattern for bipedal locomotion or reach-and-grasp in arm movement. Such

behaviors are goal oriented; the focus is less on emergent coordination phe-

nomena and more on achieving a task objective. After this baseline model

has been accomplished, the second objective is to use this model to account

for more complex phenomena with the help of the coupling dynamics of

nonlinear systems. For instance, a typical example is the modulation of lo-

comotion due to resonance entrainment of the pattern generator with the

dynamics of a physical body (Nakanishi et al., 2004; Hatsopoulos & War-

ren, 1996). Another example is the coupling between motor control and

perception (Dijkstra, Schoner, Giese, & Gielen, 1994; Kelso, 1995; Swinnen

et al., 2004). In order to allow investigations of such second objectives, a

dynamical systems model has to be found ﬁrst.

Finding an appropriate dynamical systems model for a given behavioral

phenomenon is nontrivial due to the parameter sensitivity of nonlinear

differential equations and their lack of analytical predictability. Thus, mod-

eling is often left to the intuition and the trial-and-error patience of the

researchers. Many impressive studies have been generated in this manner

(Schoner & Kelso, 1988; Sch

oner, 1990; Taga, Yamaguchi, & Shimizu, 1991;

Schaal & Sternad, 1998; Kelso, 1995), but the lack of a generic modeling tool

is unsatisfactory.

In this letter, we propose a generic modeling approach to generate

multidimensional systems of weakly nonlinear differential equations to

With low-dimensional, we refer to systems with less than about 100 degrees of

freedom.

330 Ijspeert et al.

capture an observed behavior in an attractor landscape. The essence of

our methodology is to transform well-understood simple attractor systems

with the help of a learnable forcing function term into a desired attractor

system. Both point attractor and limit cycle attractors of almost arbitrary

complexity can be achieved. Multiple degrees of freedom can be coordi-

nated with arbitrary phase relationships. Stability of the model equations

can be guaranteed. Our approach also provides a metric to compare differ-

ent dynamical systems in a scale-invariant and temporally invariant way.

We evaluate our approach in the domain of motor control for robotics,

where desired kinematic motor behaviors will be coded in attractor land-

scapes and then converted into control commands with inverse dynamics

controllers. Importantly, perceptual variables can be coupled back into the

dynamic equations, such that complex closed-loop motor behaviors are

created out of one relatively simple set of equations. Inspired by the bio-

logical concept of motor primitives (Giszter, Mussa-Ivaldi, & Bizzi, 1993;

Mussa-Ivaldi, 1999), we call our system dynamical movement primitives,as

we see them as building blocks that can used and modulated in real time

for generating complex movements.

The followingsections ﬁrst introduceourmodeling approach (see section

1), then, examine its theoretical properties (see section 2), and ﬁnally explore

our approach in the example domain of motor control in various scenarios

(see section 3). Matlab code is provided as supplemental material to allow

readers to explore properties of the system.

Early versions of the dynamical

system presented in this letter have been published elsewhere in short

format (Ijspeert, Nakanishi, & Schaal, 2002b, 2003) or some review articles

(Schaal, Mohajerian, & Ijspeert, 2007; Schaal, Ijspeert, & Billard, 2003). Here,

we review previous work and present our system in more detail, introduce

examples of spatial and temporal couplings, and discuss issues related to

generalization and coordinate systems. In the end, this letter presents a

comprehensive and mature account of our dynamic modeling approach

with discussions of related work, which will allow readers to apply or

improve research on this topic.

2 A Learnable Nonlinear Attractor Systems

Before developing our model equations, it will be useful to clarify the

speciﬁc goals pursued with this model:

1. Both learnable point attractor and limit cycle attractors need to be

represented. This is useful to encode both discrete (i.e., point to point)

and rhythmic (periodic) trajectories.

The code can be downloaded from http://www-clmc.usc.edu/Resources/Software.

Note that we borrowed the terminology discrete trajectories from the motor control

literature (Schaal, Sternad, Osu, & Kawato, 2004) to denote point-to-point (nonperiodic

Dynamical Movement Primitives 331

2. The model should be an autonomous system, without explicit time

dependence.

3. The model needs to be able to coordinate multidimensional dynam-

ical systems in a stable way.

4. Learning the open parameters of the system should be as simple as

possible, which essentially opts for a representation that is linear in

the open parameters.

5. The system needs to be able to incorporate coupling terms, for exam-

ple, as typically used in synchronization studies or phase resetting

studies and as needed to implement closed-loop perception-action

systems.

6. The system should allow real-time computation as well as arbitrary

modulation of control parameters for online trajectory modulation.

7. Scale and temporal invariance would be desirable; for example,

changing the amplitude or frequency of a periodic system should

not affect a change in geometry of the attractor landscape.

2.1 Model Development. The basic idea of our approach is to use an

analytically well-understood dynamical system with convenient stability

properties and modulate it with nonlinear terms such that it achieves a

desired attractor behavior (Ijspeert et al., 2003). As one of the simplest

possible systems, we chose a damped spring model,

y =α

(β

(g − y) −

y) + f,

which, throughout this letter, we write in ﬁrst-order notation,

z =α

(β

(g − y) −z) + f, (2.1)

y =z,

where τ isatimeconstantandα

and β

are positive constants. If the forcing

term f = 0, these equations represent a globally stable second-order linear

system with (z, y) = (0, g) as a unique point attractor. With appropriate val-

ues of α

and β

, the system can be made critically damped (with β

= α

/4)

in order for y to monotonically converge toward g. Such a system imple-

ments a stable but trivial pattern generator with g as single point attractor.

The choice of a second-order system in equation 2.1 was motivated

or episodic) trajectories—trajectories that are not repeating themselves, as rhythmic tra-

jectories do. This notation should not be confused with discrete dynamical systems,which

denotes difference equations—those that are time discretized.

As will be discussed below, many other choices are possible.

In early work (Ijspeert et al., 2002b, 2003), the forcing term f was applied to the second

y equation (instead of the

z equation), which is analytically less favorable. See section 2.1.8.

HTML Viewer

Figures

Figure 3: Exemplary time evolution of the rhythmic dynamical system (limit cycle behavior). The parameters wi have been adjusted to fit a trajectory ydemo(t) = sin(2πt)+ 0.25cos(4πt + 0.77)+ 0.1sin(6πt + 3.0). The upper plots show the desired position, velocity, and acceleration with dotted lines, but these are mostly covered by the time evolutions of y, ẏ, and ÿ. The bottom plots show the phase variable and its derivative and the basis functions of the forcing term over time (20 basis functions per period).

Figure 8: Illustration of obstacle avoidance with a coupling term. The obstacle is the large (red) sphere in the center of the plot. Various trajectories are shown, starting from different start positions and ending at the sphere labeled “goal.” Also shown is the nominal trajectory (green) that the discrete dynamical system creates when the obstacle is not present: it passes right through the sphere. Trajectories starting at points where the direct line to the goal does not intersect with the obstacle are only minimally curved around the obstacle, while other trajectories show strongly curved paths around the obstacle.

Figure 11: Subjecting the discrete dynamical system from Figure 1 to “holding” perturbation. At time t = 0.35 s, the actual movement system is blocked from its time evolution: its velocity and acceleration are zero, and its position (dashdot line in the top-left figure) remains constant until t = 0.9 s (see the shaded area). Due to the coupling terms, the time evolution of the dynamical system decays to zero and resumes after the actual system is released. For comparison, the unperturbed time evolution of the dynamics is shown in a dashed line. Essentially the perturbation simply delays the time evolution of the dynamical system without any large motor commands leading to possible harm.

Figure 5: Illustration of the significance of the invariance properties, exemplified in a two-dimensional discrete dynamical system to draw a cursive letter a. In all subfigures, the blue (thin) line denotes the letter “a” as taught from a human demonstration using a digitizing tablet. The start point for all figures is the same, while the goal is originally Target0, and, for the purpose of testing generalization, the goal is shifted to Target1. In a and b, the shift of the goal is small, while in c and d, the shift of the goal is much more significant. Subfigures a and c use equations 2.1 to 2.4, the proper formulation of the discrete dynamical system with invariance properties. As can be noted from the red (thick) lines, the generalized letter “a” is always a properly uniformly zoomed version of the original letter “a.” In contrast, in subfigures b and d, the scaling term g− y0 in equation 2.3 was left out, which destroys the invariance properties. While for a small shift of the goal in b the distortion of the letter “a” is insignificant, for a large shift of the goal in d, the distortion creates more a letter “u” than a letter “a.”

Figure 13: Correlation between the parameter vectors of different instantiations of the Graffiti characters (5 instances of each of the 26 alphabet characters). A grayscale value is used, with black corresponding to a correlation of 1.0 and white corresponding to a correlation of 0.0 or below.

Figure 14: Generalization of a 2 DOF discrete dynamical system under different choices of coordinate systems. The 2D movement is a point-to-point movement with a loop on the way to the goal. All movements start at the origin of the coordinate system and terminate at six different goal positions, distributed with 60 degree distance on a circle. The heavy (red) path in the first quadrant of the coordinate system was the originally learned movement. The generalization of this movement to six different targets is drawn with different line styles to make it easier to see the paths of these movements. The two plots on the right of each subfigure show the y1 and y2 trajectories of each original movement. (a) The original movement is in a benign part of the Cartesian coordinate system. (b) Again this is a Cartesian coordinate system, but the y2 coordinate of the original movement has the start and end point of the movement within a small distance. (c) Choosing a coordinate system that has as the first coordinate the line between start and end point, and the second coordinate is perpendicular.

Citations

PDF

Open Access

More filters

PDF

Open Access

More filters

Book

Pattern Recognition and Machine Learning

Christopher M. Bishop

TL;DR: Probability Distributions, linear models for Regression, Linear Models for Classification, Neural Networks, Graphical Models, Mixture Models and EM, Sampling Methods, Continuous Latent Variables, Sequential Data are studied.

...read moreread less

Journal ArticleDOI

Pattern Recognition and Machine Learning

Radford M. Neal

- 01 Aug 2007 -

Technometrics

TL;DR: This book covers a broad range of topics for regular factorial designs and presents all of the material in very mathematical fashion and will surely become an invaluable resource for researchers and graduate students doing research in the design of factorial experiments.

...read moreread less

Book

Applied Nonlinear Control

Jean-Jacques E. Slotine, +1 more

TL;DR: Covers in a progressive fashion a number of analysis tools and design techniques directly applicable to nonlinear control problems in high performance systems (in aerospace, robotics and automotive areas).

...read moreread less

Book

Pattern Recognition and Machine Learning (Information Science and Statistics)

Christopher M. Bishop

TL;DR: Looking for competent reading resources?

...read moreread less

Journal ArticleDOI

Real-time obstacle avoidance for manipulators and mobile robots

Oussama Khatib

- 01 Apr 1986 -

The International Journal of Robotics Re...

TL;DR: This paper reformulated the manipulator con trol problem as direct control of manipulator motion in operational space—the space in which the task is originally described—rather than as control of the task's corresponding joint space motion obtained only after geometric and geometric transformation.

...read moreread less

Collapse

Frequently Asked Questions (13)

Q1. What is the purpose of the synchronization of the canonical system?

An external signal with frequency ωext and phase angle φext is used to synchronize the canonical system with the external oscillation and to ensure that the final phase relationship is phaselocked at φd.

Q2. What have the authors contributed in "Dynamical movement primitives: learning attractor models for motor behaviors" ?

In this paper, the authors propose a generic modeling approach to generate multidimensional systems of weakly nonlinear differential equations.

Q3. What is the useful property of modeling behaviors in a dynamical systems framework?

A useful property of modeling behaviors in a dynamical systems framework comes from the scaling properties and invariance properties that can be designed into dynamical systems.

Q4. How can one influence the temporal evolution of their dynamical systems without affecting the transformation system?

By modulating the canonical system, one can influence the temporal evolution of their dynamical systems without affectingthe spatial pattern generated by the transformation system.

Q5. What is the definition of a nonlinear forcing term?

The nonlinear forcing term can be represented as an autonomous coupling term that can be learned with standard machine learning techniques that are linear in the open parameters.

Q6. What is the coupling term for obstacle avoidance?

the coupling term adds a movement perpendicular to the current movement direction as a function of the distance vector to the obstacle (see Hoffmann et al., 2009, for more details).

Q7. How did the authors use the model to generate movement to six different targets?

Then the authors applied the model to generate movement to six different targets, distributed with 60 degrees difference on a circle around the origin.

Q8. How many DOFs did the robot need to perform?

these tasks required the coordination and phase locking of 30 DOFs, which was easily and naturally accomplished in their approach.

Q9. What is the motivation to present in this letter?

The large variety of follow-up and related approaches to their initial work on dynamical movement primitives is one of the motivations to present in this letter the theory, insights, and a refined approach to learnable dynamical systems that the authors hope will continue to attract even more active research in the future.

Q10. What are the main differences with their approach?

The main differences with their approach is that the underlying dynamics is much more complex than ours (with several hundreds of state variables), that reservoir computing does not offer proof of stability of learned attractors, and that it is less easy to incorporate feedback terms for online trajectory modulation.

Q11. What is the way to model the dynamical systems?

From a practical point of view, one should first carefully investigate what properties a model requires in terms of temporal and spatial invariance and then realize these properties by choosing the most appropriate variant of the dynamical systems model and the most appropriate coordinate system for modeling.

Q12. What is the definition of a nonlinear dynamical system?

The explicit time dependence of this nonlinearity, however, creates a nonautonomous dynamical system or, in the current formulation, more precisely a linear time-variant dynamical system.

Q13. What could be used to exclude such cases?

Such cases could be excluded by more sophisticated classifiers that would employ, for example, confidence levels in decision making.

Dynamical movement primitives: Learning attractor models for motor behaviors

Figures

Citations

An Algorithmic Perspective on Imitation Learning

Imitation Learning: A Survey of Learning Methods

Recent Advances in Robot Learning from Demonstration

A tutorial on task-parameterized movement learning and retrieval

Learning Physical Collaborative Robot Behaviors From Human Demonstrations

References

Pattern Recognition and Machine Learning

Pattern Recognition and Machine Learning

Applied Nonlinear Control

Pattern Recognition and Machine Learning (Information Science and Statistics)

Real-time obstacle avoidance for manipulators and mobile robots

Related Papers (5)

A survey of robot learning from demonstration

Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models

Learning and generalization of motor skills by learning from demonstration

Movement imitation with nonlinear dynamical systems in humanoid robots

Robot Programming by Demonstration

Frequently Asked Questions (13)

Q1. What is the purpose of the synchronization of the canonical system?

Q2. What have the authors contributed in "Dynamical movement primitives: learning attractor models for motor behaviors" ?

Q3. What is the useful property of modeling behaviors in a dynamical systems framework?

Q4. How can one influence the temporal evolution of their dynamical systems without affecting the transformation system?

Q5. What is the definition of a nonlinear forcing term?

Q6. What is the coupling term for obstacle avoidance?

Q7. How did the authors use the model to generate movement to six different targets?

Q8. How many DOFs did the robot need to perform?

Q9. What is the motivation to present in this letter?

Q10. What are the main differences with their approach?

Q11. What is the way to model the dynamical systems?

Q12. What is the definition of a nonlinear dynamical system?

Q13. What could be used to exclude such cases?