What have the authors contributed in "A cerebellar model of timing and prediction in the control of reaching" ?

In this paper, a simplified version of the adjustable pattern generator ( APG ) model was developed to explore its potential for adaptive, predictive control based on delayed feedback information.

What are the future works mentioned in the paper "A cerebellar model of timing and prediction in the control of reaching" ?

In fact, the model makes no explicit predictions of any kind, if this is taken to mean the creation of representations of future events. However, this would be worthwhile to pursue in future research. Experimental data on the effects of CF discharge on PF-to-PC synapses suggest an instructive role for CF signals, as adopted by the model. Recently, however, Chen and Thompson ( 1995 ) demonstrated that delaying CF activation by 250 ms after a PF volley facilitates the appearance of LTD, suggesting that there may be a cellular mechanism that compensates for the time interval.

(Open Access) A cerebellar model of timing and prediction in the control of reaching (1999) | Andrew G. Barto

To Appear in Neural Computation 1

A Cerebellar Model of Timing and Prediction

in the Control of Reaching

Andrew G. Barto

†

, Andrew H. Fagg

†

Nathan Sitkoﬀ

†

, and James C. Houk

∗

†

Department of Computer Science, University of Massachusetts

∗

Department of Physiology, Northwestern University Medical School

May 1998

Abstract

A simpliﬁed model of the cerebellum was developed to explore

its potential for adaptive, predictive control based on delayed feed-

back information. An abstract representation of a single Purkinje cell

with multistable properties was interfaced, via a formalized premotor

network, with a simulated single degree-of-freedom limb. The limb

actuator was a nonlinear spring-mass system based on the nonlin-

ear velocity dependence of the stretch reﬂex. By including realistic

mossy ﬁber signals, as well as realistic conduction delays in aﬀerent

and eﬀerent pathways, the model allowed the investigation of tim-

ing and predictive processes relevant to cerebellar involvement in the

control of movement. The model regulates movement by learning to

react in an anticipatory fashion to sensory feedback. Learning de-

pends on training information generated from corrective movements

and uses a temporally-asymmetric form of plasticity for the parallel

ﬁber synapses on Purkinje cells.

1 Introduction

The neural commands that control rapid limb movements appear to be com-

prised of pulse components followed by smaller step components (Ghez 1979;

Ghez and Martin 1982), analogous to the pulse-step commands that control

rapid eye movements (Robinson 1975). In the case of eye movements, the

pulse component serves to overcome the internal viscosity of the muscles,

thus moving the eye rapidly to the target, whereupon the step component

holds the eye at its ﬁnal position. Limb movements involve more inertia than

eye movements, so the pulse activation of the agonist muscle must end part

way through the movement, and a braking pulse in the antagonist muscle is

needed to decelerate the mass of the limb. Ghez and Martin (1982) showed

that the braking pulse is produced by a stretch reﬂex in the antagonist mus-

cle. The central control problem, therefore, is to terminate the pulse phase

of the command sent to the agonist muscle at an appropriate time during

the movement. The dynamics of the stretch reﬂex should then bring the

movement to a halt at a desired endpoint. Since the pulse must terminate

well in advance of the achievement of the desired endpoint, this is a problem

of timing and prediction in control. In this article we present a model of how

the cerebellum may contribute to the predictive control of limb movements.

The model is a simpliﬁed version of the adjustable pattern generator (APG)

model being developed by Houk and colleagues (Berthier et al. 1993; Houk

et al. 1990; Sinkjær et al. 1990) to test the computational competence of

a conceptual framework for understanding the brain mechanisms of motor

control (Houk 1989; Houk and Barto 1992; Houk et al. 1993; Houk and

Wise 1995; Houk et al. 1996). The model has a modular architecture in

which single modules generate elemental motor commands with adjustable

time courses, and multiple modules cooperatively produce more complex

commands. The APG model is constrained by the modular anatomy of the

cerebellar cortex and its connections with the limb premotor network, by

the physiology of the neurons comprising this network, and by properties

of cerebellar Purkinje cells (PCs). However, it is purposefully abstract to

allow us to explore control and learning issues in a computationally feasible

manner. The model presented here corresponds to a single module of the

APG model consisting of a single unit representing a PC. This unit is modeled

as a collection of nonlinear switching elements, which we call dendritic zones,

representing segments of a PC dendritic tree.

Our previous modeling studies dealt mainly with two issues: (1) demonstra-

tion that a single module can learn to generate appropriate one-dimensional,

variable-duration velocity commands (Houk et al. 1990) and (2) a prelimi-

nary demonstration that an array of 48 modules can learn to function co-

operatively in the control of a simulated non-dynamic, two-joint planar limb

(Berthier et al. 1993). In these previous simulations, the input layer of the

cerebellum, the representation of PCs, and the complexity of the learning

problem were greatly simpliﬁed. In the present article, we employ a more re-

alistic input representation based on what is known about movement-related

mossy ﬁber (MF) signals in the intermediate cerebellum of the monkey (Van

Kan et al. 1993a) and the Marr-Albus architecture of the granular layer

(Tyrrell and Willshaw 1992). In addition, we use a more complex dynamic

spring-mass system (although it is still one-dimensional), and we include re-

alistic conduction delays in the relevant signal pathways. The model also

makes use of a trace mechanism in its learning rule. Preliminary results

appear in Buckingham et al. (1995) and Barto et al. (1996).

We ﬁrst describe the nonlinear spring-mass system and discuss some of its

properties from a control point of view. The following section presents the

details of the model. We then present simulation results demonstrating the

learning and control abilities of a single dendritic zone followed by similar

results for a model with multiple dendritic zones. We conclude with a dis-

cussion of these results.

2 Pulse-Step Control of a Nonlinear Plant

The limb motor plant has prominent nonlinearities that have a strong inﬂu-

ence on movement and its control. The plant model used in this study is

a spring-mass system with a form of nonlinear damping based on studies of

human wrist movement (Gielen and Houk 1984; Wu et al. 1990):

M ¨x + B( ˙x)

+ K(x − x

) = 0, (1)

where x is the position (in meters) of an object of mass M (kg) attached

to the spring, x

is the resting, or equilibrium, position, B is the damping

coeﬃcient, and K is the spring stiﬀness (Fig. 1A). This fractional-power

form of nonlinear damping is derived from a combination of nonlinear muscle

properties and spinal reﬂex mechanisms, the latter driven mainly by feedback

from muscle spindle receptors (Gielen and Houk 1987). Setting M = 1,

B = 3, and K = 30 produces trajectories that are qualitatively similar to

those observed in human wrist movement (Wu et al. 1990).

Nonlinear damping of this kind enables fast movements that terminate with

little oscillation. Fig. 1B is a graph of the damping force as a function of

Damping

Force

Velocity

A B

Xeq (cm)

Vel (cm/s)

0 100 200 300 400 500

Pos (cm)

Time (ms)

0 2 4 6 8 10

−5

Pos (cm)

Vel (cm/s)

C D

Figure 1: Pulse-Step Control of a Simpliﬁed Motor Plant. Panel A: Spring-

Mass System. M , mass; x, position; x

, resting, or equilibrium, position.

Panel B: Nonlinear Damping Force as a Function of Velocity. The plant’s

eﬀective damping coeﬃcient (the graph’s slope) increases rapidly as the ve-

locity magnitude decreases to zero. Panel C: Pulse-Step Control. Control

of a movement from initial position x

= 0 to target endpoint x

= 5 cm.

Top—The pulse-step command. Middle—Velocity as a function of time.

Bottom—Position as a function of time. Panel D: Phase-Plane Trajectory.

The bold line is the phase-plane trajectory of the movement of Panel C. The

dashed line is a plot of the states of the spring-mass system at which the

command should switch from pulse to step so that the mass will stick at the

endpoint x

= 5 cm starting from a variety of diﬀerent initial states.

velocity. As velocity decreases, the eﬀective damping coeﬃcient (the curve’s

slope) increases radically when the velocity gets suﬃciently close to zero.

This causes a decelerating mass generally to “stick” at a non-equilibrium

position, thereafter drifting extremely slowly toward x

. We call the position

at which the mass sticks (deﬁned here as the position at which the absolute

value of its velocity falls and remains below 0.9 cm/sec) the endpoint of a

movement, denoted x

. For all practical purposes, this is where the movement

stops.

The control signal in our model sets the equilibrium value x

, which rep-

resents a central motor command setting the threshold of the stretch reﬂex

(Feldman 1966; Houk and Rymer 1981). Pulse-step control is eﬀective in

producing rapid and well-controlled positioning of the mass in this system.

As shown in Fig. 1C, the control signal switches from a pulse level x

= x

to a smaller step level x

= x

. Also shown are the time courses of the veloc-

ity and position (middle and bottom) for the resulting movement. Inserting

a low-pass ﬁlter in the command pathway, a common feature of muscle mod-

els, would produce velocity proﬁles more closely matching those of actual

movements, but we have not been concerned with this issue.

Fig. 1D shows the phase-plane trajectory (velocity plotted against position)

followed by the state of the spring-mass system during pulse-step control.

When the pulse is being applied, the state follows a trajectory that would

end at the equilibrium position x

= 10 cm if the pulse were to continue.

When the step begins, the state switches to the trajectory that ends at the

equilibrium position x

= 4 cm, but the mass sticks at the target endpoint

= 5 cm before reaching this equilibrium position. Thus, simply setting the

equilibrium position to the target endpoint as suggested by the equilibrium-

point hypothesis (Bizzi et al. 1992; Feldman 1966, 1974) is not a practical

solution to the endpoint positioning task for this system. The dashed line in

Fig. 1D is an approximate plot of the states at which the switch from pulse to

step should occur so that movements starting from a variety of initial states

will stick at x

= 5 cm. This switching curve has to vary as a function of

the target endpoint. If the switch from pulse to step occurs too soon (late),

the mass will undershoot (overshoot) x

In developing a model of pulse-step control of the limb, one can proﬁt from

analogies, where appropriate, with the extensive literature on pulse-step con-

trol of saccadic eye movements. However, an important diﬀerence between

eye and limb control is the absence of a stretch reﬂex for regulating primate

eye muscle activity (Keller and Robinson 1971). As a consequence, models of

the eye motor plant do not contain the nonlinear damping mechanism present

in Eq. 1. As mentioned above, the stretch reﬂex is important in generating a

A cerebellar model of timing and prediction in the control of reaching

Figures

Citations

Neuropsychological mechanisms of interval timing behavior.

Electromyographic correlates of learning an internal model of reaching movements.

The cerebellar microcircuit as an adaptive filter: experimental and computational evidence

Forward models in visuomotor control.

The role of the basal ganglia and cerebellum in language processing

References

Reinforcement Learning: An Introduction

Introduction to Reinforcement Learning

Adaptive filtering prediction and control

A Theory of Cerebellar Cortex

The cerebellum and neural control

Related Papers (5)

A Theory of Cerebellar Cortex

Is the cerebellum a smith predictor

A Theory of Cerebellar Function

Internal models in the cerebellum

Forward models for physiological motor control

Frequently Asked Questions (2)

Q1. What have the authors contributed in "A cerebellar model of timing and prediction in the control of reaching" ?

Q2. What are the future works mentioned in the paper "A cerebellar model of timing and prediction in the control of reaching" ?