What is the purpose of the extended Choreographer platform?

The authors also extended Choreographer to communicate with the Matlab numerical computing platform, which the authors use for numerical integration of ODEs.

What is the strength of the PEPA process algebra?

One strength of the PEPA process algebra as an expressive and practical modelling language is its support for multi-way co-operation; the authors have made use of this expressive power in all of their modelling studies in systems biology.

What is the effect of each possible reaction on the concentration level?

Concentration levels are discretized and the local states of the PEPA component records the impact of each possible reaction on the concentration level.

What is the problem with the large time step used in the computation in [1]?

The large time step used in the computation in [1] is not a problem for the analysis of the long-lived species but gives misleading results for those species which are short-lived.

What is the meaning of a def=p?

Constant: A def= P Constants are components whose meaning is given by a defining equation: A def= P gives the constant A the behaviour of the component P .

What did the authors find out from the analysis?

Using the insights obtained from their analysis procedures the authors were able to return to the differential equation model, diagnose and correct the flaws in the analysis, and show agreement between the results obtained using continuous-space analysis and the results obtained using a discrete-state stochastic analysis.

How long does it take to solve the model?

The time taken to solve the model using the τ -leap method is longer than the time taken to solve the model using Matlab’s stiff ODE solver (ode15s) but shorter than the time required by a standard solver such as ode45.1

(Open Access) Stronger computational modelling of signalling pathways using both continuous and discrete-state methods (2006) | Muffy Calder

Q: What contributions have the authors mentioned in the paper "Stronger computational modelling of signalling pathways using both continuous and discrete-state methods" ?

Starting from a biochemical signalling pathway model expressed in a process algebra enriched with quantitative information the authors automatically derive both continuous-space and discrete-state representations suitable for numerical evaluation.

Q: What is the recent advancement in the field of SSA?

Recent advances in the field include the development of slow-scale SSA which produces a dramatic speed-up relative to SSA by prioritising rare events [12].

Q: What is the main reason for the malfunction of cellular signalling processes?

The malfunction of cellular signalling processes has significant detrimental effects, leading to uncontrolled cell proliferation, as in cancer; or leading to other cells in the body being attacked, as in auto-immune diseases.

Stronger Computational Modelling of Signalling

Pathways Using Both Continuous and

Discrete-State Methods

Muﬀy Calder

, Adam Duguid

, Stephen Gilmore

, and Jane Hillston

Department of Computing Science, University of Glasgow, Glasgow G12 8QQ,

Scotland

Laboratory for Foundations of Computer Science, The University of Edinburgh,

Edinburgh EH9 3JZ, Scotland

Abstract. Starting from a biochemical signalling pathway model ex-

pressed in a process algebra enriched with quantitative information we

automatically derive both continuous-space and discrete-state represen-

tations suitable for numerical evaluation. We compare results obtained

using implicit numerical diﬀerentiation formulae to those obtained using

approximate stochastic simulation thereby exposing a ﬂaw in the use of

the diﬀerentiation procedure producing misleading results.

1 Introduction

The malfunction of cellular signalling processes has signiﬁcant detrimental ef-

fects, leading to uncontrolled cell proliferation, as in cancer; or leading to other

cells in the body being attacked, as in auto-immune diseases. The dynamics of

cell signalling mechanisms are profoundly complex and at present are not fully

understood. Computational modelling of cell signal transduction is an important

intellectual tool in the scientiﬁc study of the biological processes which control

and regulate cellular function.

An example of an inﬂuential computational study of intracellular signal net-

works is [1]. The authors develop an ordinary diﬀerential equation (ODE) model

of epidermal growth factor (EGF) receptor signal pathways in order to give in-

sight into the activation of the MAP kinase cascade through the kinases Raf,

MEK and ERK-1/2. The ODE model is substantial, consisting of 94 state vari-

ables and 95 parameters. It is analysed using the numerical integration proce-

dures of the Matlab numerical computing platform and tested using sensitivity

analysis. The results increase our understanding of EGF receptor signal trans-

duction and suggest avenues for experimental work to test hypotheses generated

from the computational model. Published in 2002 the article is highly regarded

and has subsequently been cited by as many as 150 other research papers.

We have previously proposed a method of investigating cell signalling path-

ways using a process algebra enhanced with quantitative information, PEPA [2],

applied in [3] and [4]. Process algebras are well-known in theoretical computer

science but are still unfamiliar to most computational biologists so we wished to

C. Priami (Ed.): CMSB 2006, LNBI 4210, pp. 63–77, 2006.

 Springer-Verlag Berlin Heidelberg 2006

64 M. Calder et al.

PEPA











ODE

SSA

P(τ, μ)dτ





Fig. 1. A high-level model in the PEPA process algebra can be used to generate either

a system of ODEs or a stochastic simulation

help to establish their relevance by reproducing the results of [1], starting from

the published paper together with its supplementary material and the Matlab

ODE model made available by the authors.

We were able to reproduce the results from [1] starting from our model in

the PEPA process algebra but because we were starting from the vantage point

of modelling in process algebra we could apply other analysis procedures, un-

available to the authors of [1] (Figure 1). To our surprise when modelling in

process algebra we discovered that the computational simulation conducted by

ODEs in [1] contains a systematic ﬂaw in the analysis process which aﬀects many

of the results, some signiﬁcantly. To the best of our knowledge these errors are

presently unknown: at the very least they were unknown to us. Using the insights

obtained from our analysis procedures we were able to return to the diﬀeren-

tial equation model, diagnose and correct the ﬂaws in the analysis, and show

agreement between the results obtained using continuous-space analysis and the

results obtained using a discrete-state stochastic analysis.

Computational methods are well-understood to be complex and delicate so the

relevance of this ﬁnding is not that there is an error in one particularly rich and

valuable numerical study, or that modelling with ODEs is an unsatisfactory pro-

cedure, but rather that modelling in high-level languages (such as process algebras

or Petri nets) may give a methodological advantage which allows an entire class of

hard-to-detect errors and corner cases to be discovered and diagnosed before the

results are published and promulgated to the wider scientiﬁc community.

As original contributions the present paper contains the analysis of the process

used to detect the error in the earlier modelling study [1], a description of the new

software tool used for integrated continuous-space and discrete-state stochastic

analysis of PEPA process algebra models, and an overview of an extensive pro-

cess algebra modelling study comprising 188 process deﬁnitions describing the

dynamics of 95 of the reaction channels in the signalling cascade of the EGF

receptor-induced MAP kinase pathway.

Structure of this paper: In Section 2 we present background material on our

previous work. We follow this in Section 3 with a discussion of related work. In

Section 4 we present an introduction to quantitative process algebras, considering

Stronger Computational Modelling of Signalling Pathways 65

the expressive capabilities of these languages. In Section 5 we explain how these

languages are used in modelling. Section 6 presents a comparison of our analysis

results and the results of other authors. In Section 7 we discuss the software tool

used to perform the analysis. Finally, we present conclusions in Section 8.

2 Background

In an earlier study we made two distinct computational models of the Ras/Raf-

1/MEK/ERK signalling pathway, both expressed in the PEPA process algebra.

Our models were based on the deterministic model presented directly as a system

of coupled ordinary diﬀerential equations in [5].

Our process algebra models adhere to two distinct modelling styles—the

reagent-centric and pathway models from [3]. We interpreted these under the

continuous-time Markov chain semantics for the PEPA language, and thus these

gave rise to stochastic models of the pathway. We used well-known procedures

of numerical linear algebra to conduct a quantitative stochastic evaluation of

the pathway. We used the process algebraic reasoning apparatus of the PEPA

language to establish that these two models were strongly equivalent, meaning

that a timing-aware observer could not distinguish between them. In the exten-

sion of this work in [6] we presented automatic procedures for converting in both

directions between the reagent-centric and pathway views.

We revisited the reagent-centric model in [4], mapping it to a system of ODEs.

The model considered in [4] adds additional species to the model presented

in [5] in order to concentrate on a detail of the pathway not considered in [5].

We applied the mapping procedure from [4] to a reduced version of the model

without these additional species and were able to show that the model gave rise

to exactly the same system of ODEs as studied previously in [5] establishing

a precise formal equivalence between the process algebra model and the ODE

model.

The deterministic and stochastic approaches to computational modelling in

systems biology are often presented as alternatives; one should choose one ap-

proach or the other. Some authors have suggested that stochastic approaches are

technically superior because they can expose small-scale eﬀects which are caused

by some molecular species being present in the reaction volume in very low copy

numbers. We are instead in agreement with the authors of [7], who argue that

the principal challenge is choosing the appropriate framework for the modelling

study at hand. For some problems the inﬂuence of eﬀects such as intra-cellular

noise or circumstances such as low copy numbers is suﬃciently great that a

thorough stochastic treatment is essential. In other modelling problems no such

inﬂuences are manifest and a deterministic treatment based on reaction rate

equations is the correct approach.

The divergence between the stochastic behaviour exposed at low copy num-

bers of reactants and the deterministic approach based on reaction rate equations

is due to the reliance of the ODE-based analysis on the assumption of continuity

and the use of the law of mass action, essentially an empirical law derived from

66 M. Calder et al.

in vitro experimentation. Gillespie’s Stochastic Simulation Algorithm (SSA) [8]

makes no use of such an empirical law, and is instead grounded in the theory of

statistical thermodynamics. In consequence it is an exact procedure for numer-

ically simulating the dynamic evolution of a chemically reacting system, even

at low copy numbers. However, the SSA method converges, as the number of

reactants increases, to the solution computed by the ODEs so that the methods

are in agreement in the limit [9].

Gillespie’s exact algorithm models systems in which there are M possible

reactions represented by the indexed family R

(1 ≤ μ ≤ M ). It builds on a

reaction probability density function P (τ,μ | X) such that P (τ,μ | X)dτ is the

probability that given the state X at time t,thenext reaction in the volume will

occur in the inﬁnitesimal time interval (t + τ, t+ τ +dτ) and be an R

reaction.

Starting from an initial state, SSA randomly picks the time and type of the next

reaction to occur, updates the global state to record the fact that this reaction

has happened, and then repeats.

In practice, Gillespie’s SSA is eﬀective only for non-stiﬀ systems on short

time scales. An approximate acceleration procedure called “τ-leaping” was later

developed by Gillespie and Petzold [10]. The “implicit τ-leaping” method [11]

was developed to attack the orthogonal problem of stiﬀness, common in multi-

scale modelling, where diﬀerent time-scales are appropriate for reactions. Recent

advances in the ﬁeld include the development of slow-scale SSA which produces

a dramatic speed-up relative to SSA by prioritising rare events [12].

A recent survey paper on stochastic simulation is [13]. A comparison paper on

stochastic simulation methods and their relation to diﬀerential-equation based

analysis of reaction kinetics is [9].

3 Related Work

We are not the ﬁrst authors to investigate the model from [1] using stochastic

simulation methods. An earlier comparison using the binomial τ-leap method

appeared in [14]. However, the authors of [14] compare the solutions computed

by their binomial τ-leap method with the solutions computed by Gillespie’s

stochastic simulation algorithm and did not compare with the results from [1].

For this reason the authors of [14] did not ﬁnd the error which we uncovered

by comparing the results computed by stochastic simulation with the results

computed by the authors of [1] using ordinary diﬀerential equations.

In [15] the authors use the PRISM probabilistic model checker [16] to check

logical formulae of Continuous Stochastic Logic (CSL) [17] against models of sig-

nalling pathways expressed as state-machines in the PRISM modelling language,

comparing the result against an ODE model coded in the Matlab numerical

platform.

A recent technical note [18] uses modelling in a stochastic process calculus and

stochastic simulation to investigate the MAPK cascade previously studied in [19]

using ordinary diﬀerential equations. [18] uses synthetic values for rate constants

(all are set to 1.0) so comparison with the results of [19] is not meaningful.

Stronger Computational Modelling of Signalling Pathways 67

4ProcessAlgebras

Process algebras are concise formally-deﬁned modelling languages for the precise

description of concurrent, communicating systems. Our belief is that they are

well-suited to modelling cell signalling pathways and our interest here is exclu-

sively in process algebras which are decorated with quantitative information [20].

The PEPA process algebra [2] which we use beneﬁts from formal semantic de-

scriptions of diﬀerent characters which are appropriate for diﬀerent uses. The

structured operational semantics presented in [2] maps the PEPA language to

a Continuous-Time Markov Chain (CTMC) representation. A continuous-space

semantics maps PEPA models to a system of ordinary diﬀerential equations

(ODEs) [21], admitting diﬀerent solution procedures.

4.1 Expressiveness

Because we are modelling in a high-level language it is possible to apply these

very diﬀerent numerical evaluation procedures to compute diﬀerent kinds of

quantitative information from the same model. This is a freedom which we would

not have if we had coded a Markov chain or a diﬀerential equation-based rep-

resentation of the model directly in a numerical computing platform such as

Matlab. One freedom which the use of a high-level language gives the modeller

is the possibility to use either discrete-state or continuous-space analysis pro-

cedures. Another is the option of applying both types of analysis to the same

model, and that is the approach which we have used here.

One strength of the PEPA process algebra as an expressive and practical

modelling language is its support for multi-way co-operation;wehavemade

use of this expressive power in all of our modelling studies in systems biology.

Genuinely tri-molecular collisions occur only exceptionally rarely in dilute ﬂuids

so these do not normally arise in our modelling for this reason. Rather a collision

between, say, an enzyme and a substrate to produce a compound, is expressed

in PEPA as a three-way co-operation between the input enzyme and substrate

(whose molecular concentrations are reduced) and the output compound (whose

molecular concentration is increased). Similarly a reaction channel with two

input species and two output species is represented as a four-way co-operation

in PEPA. Some reaction channels may have more inputs or more outputs and

so having this expressive power available in our chosen process algebra seems

well-suited to the type of modelling which is undertaken in the area.

4.2 Combinators of the Language

We give only a brief introduction to the PEPA language here. The reader is

referred to [2] for the deﬁnitive description.

PEPA provides a set of combinators which allow expressions to be built which

deﬁne the behaviour of components via the activities that they engage in. These

combinators are presented below.

Stronger computational modelling of signalling pathways using both continuous and discrete-state methods

Figures

Citations

Executable cell biology.

Bio-PEPA: A framework for the modelling and analysis of biological systems

The Continuous π-Calculus: A Process Algebra for Biochemical Modelling

Structural analysis for stochastic process algebra models

A unifying framework for modelling and analysing biochemical pathways using Petri nets

References

Exact Stochastic Simulation of Coupled Chemical Reactions

The MATLAB ODE Suite

The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models.

Ultrasensitivity in the mitogen-activated protein kinase cascade

A Compositional Approach to Performance Modelling

Related Papers (5)

Exact Stochastic Simulation of Coupled Chemical Reactions

Application of a stochastic name-passing calculus to representation and simulation of molecular processes

A Compositional Approach to Performance Modelling

BioAmbients: an abstraction for biological compartments

Representation and simulation of biochemical processes using the pi-calculus process algebra.

Frequently Asked Questions (14)

Q1. What contributions have the authors mentioned in the paper "Stronger computational modelling of signalling pathways using both continuous and discrete-state methods" ?

Q2. What is the recent advancement in the field of SSA?

Q3. What is the purpose of the extended Choreographer platform?

Q4. What is the main reason for the malfunction of cellular signalling processes?

Q5. What is the strength of the PEPA process algebra?

Q6. What is the freedom of using a high-level language?

Q7. What is the effect of each possible reaction on the concentration level?

Q8. What is the problem with the large time step used in the computation in [1]?

Q9. What is the significance of this finding?

Q10. What is the meaning of a def=p?

Q11. What is the main purpose of this paper?

Q12. What did the authors find out from the analysis?

Q13. What is the definition of a collision between an enzyme and a substrate?

Q14. How long does it take to solve the model?