Journal Article•DOI•

Well-Posedness And Accuracy Of The Ensemble Kalman Filter In Discrete And Continuous Time

David T. Kelly¹, Kody J. H. Law², Andrew M. Stuart¹•Institutions (2)

University of Warwick¹, King Abdullah University of Science and Technology²

11 Oct 2013-arXiv: Probability-

TL;DR: In this paper, a systematic analysis of the ensemble Kalman filter (EnKF) is presented, in particular to do so in the small ensemble size limit, where the authors view the method as a state estimator, and not as an algorithm which approximates the true filtering distribution.

read less

Abstract: The ensemble Kalman filter (EnKF) is a method for combining a dynamical model with data in a sequential fashion Despite its widespread use, there has been little analysis of its theoretical properties Many of the algorithmic innovations associated with the filter, which are required to make a useable algorithm in practice, are derived in an ad hoc fashion The aim of this paper is to initiate the development of a systematic analysis of the EnKF, in particular to do so in the small ensemble size limit The perspective is to view the method as a state estimator, and not as an algorithm which approximates the true filtering distribution The perturbed observation version of the algorithm is studied, without and with variance inflation Without variance inflation well-posedness of the filter is established; with variance inflation accuracy of the filter, with resepct to the true signal underlying the data, is established The algorithm is considered in discrete time, and also for a continuous time limit arising when observations are frequent and subject to large noise The underlying dynamical model, and assumptions about it, is sufficiently general to include the Lorenz '63 and '96 models, together with the incompressible Navier-Stokes equation on a two-dimensional torus The analysis is limited to the case of complete observation of the signal with additive white noise Numerical results are presented for the Navier-Stokes equation on a two-dimensional torus for both complete and partial observations of the signal with additive white noise

...read moreread less

Summary (4 min read)

Jump to: [1 Introduction] – [2.1 Filtering Distribution] – [2.2 Assumptions] – [3.1 The Algorithm] – [3.3 Connection to Randomized Maximum Likelihood] – [3.5 Variance Inflation] – [4 Discrete-Time Estimates] – [4.1 Well-Posedness Without Variance Inflation] – [4.3 Accuracy With Variance Inflation] – [5 Derivation Of The Continuous Time Limit] – [6 Continuous-Time Estimates] – [7 Numerical Results] – [7.1 Setup] – [7.2.1 Full observations] – [7.2.2 Partial observations] – [7.3 Continuous Time] – [7.3.1 Full observations] and [8 Conclusions]

1 Introduction

The algorithm is used in oceanography, oil reservoir simulation and weather prediction [BVLE98, EVL00, Kal03, ORL08], for example.
Its behaviour is not well understood.
Section 4 contains theoretical analyses of the perturbed observation EnKF, without and with variance inflation.

2.1 Filtering Distribution

The authors assume that the observed dynamics are governed by an evolution equation du dt = F (u) (2.1) which generates a one-parameter semigroup.
The authors also assume that K ⊂ H is another Hilbert space, which acts as the observation space.
The authors assume that noisy observations are made in K every h time units and write Ψ = Ψh.
That is, given the distribution uj |Yj as well as the observation yj+1, find the distribution of uj+.
The authors refer to the sequence P(uj |Yj) as the filtering distribution.

2.2 Assumptions

To write down the EnKF as the authors do in section 3, and indeed to derive the continuum limit of the EnKF, as they do in section 5, they need make no further assumptions about the underlying dynamics and observation operator other than those made above.
In infinite dimensions the existence of a global attractor in V follows from the techniques in [Tem97] for the Navier-Stokes equation by application of more subtle inequalities relating to the bilinear operator B – see section 2.2 in Chapter III of [Tem97].
Other equations arising in dissipative fluid mechanics can be treated similarly.
Whilst the preceding assumptions on the underlying dynamics apply to a range of interesting models arising in applications, the following assumptions on the observation model are rather restrictive; however the authors have been unable to extend the analysis without making them.
The following consequence of Assumption 2.3 will be useful to us.

3.1 The Algorithm

The analysis step is achieved by performing a randomised version of the Kalman update formula, and using the empirical covariance of the prediction ensemble to compute the Kalman gain.
There are many variants on the basic EnKF idea and the authors will study the perturbed observation form of the method.
The sequence of minimisers vj+1 can be written down explicitly by simply solving the quadratic minimization problem.
This straightforward exercise yields the following result.

3.3 Connection to Randomized Maximum Likelihood

The analysis step of EnKF can be understood in terms of the Randomised Maximum Likelihood (RML) method widely used in oil reservoir history matching applications [ORL08].
The authors will now briefly describe this method.
There are several reasons that this update step only produces approximate samples from the filtering distribution.
The decision to use the empirical distribution instead of say the push-forward of the covariance uj |Yj gives a huge advantage to the EnKF in terms of computational efficiency.

3.5 Variance Inflation

The minimization step of the EnKF computes an update which is a compromise between the model predictions and the data.
This compromise is weighted by the empirical covariance on the model and the fixed noise covariance on the data.
The model typically allows for unstable divergence of trajectories, whilst the data tends to stabilize.
Variance inflation is a technique of adding stability to the algorithm by increasing the size of the model covariance in order to weight the data more heavily.
Furthermore, by adding a positive definite operator, one eliminates the null-space of Ĉj+1 (which will always be present if the number of ensemble members is smaller than the dimension of H) effectively preventing the ensemble from becoming degenerate.

4 Discrete-Time Estimates

The authors will derive long-time estimates for the discrete-time EnKF, under the Assumptions 2.3 and 2.5 on the dynamics and observation models respectively.
The authors study the algorithm without and then with variance inflation.
The technique is to consider evolution of the error between the filter and the true signal underlying the data.
(4.1) Throughout this section the authors use E to denote expectation with respect to the independent i.i.d. noise sequences {ξj} and {ξ(k)j } and initial conditions u0 and v (k) 0 .

4.1 Well-Posedness Without Variance Inflation

Note also that Ĉj+1 has rank K and let Pj+1 denote projection into the finite dimensional subspace orthogonal to the kernel of Ĉj+1.
Here the authors have used the fact that Pj+1 projects onto a space of dimension at most K.
The preceding result shows that the EnKF is well-posed and does not blow-up faster than exponentially.
The authors now show that, with the addition of variance inflation, a stronger result can be proved, implying accuracy of the EnKF.

4.3 Accuracy With Variance Inflation

The authors will focus on the variance inflation technique with A = α2I .
Again assuming H = I and Γ = γ2I , the EnKF ensemble is governed by the following update equations.
The authors will now show that with variance inflation, one obtains much stronger long-time estimates than without it.
The proof is almost identical to the proof of Theorem 4.2.
Choosing α large enough to ensure θ < 1 will result in filter boundedness; furthermore, if the observational noise standard deviation γ is small then choosing α large enough results in filter accuracy.

5 Derivation Of The Continuous Time Limit

In this section the authors formally derive the continuous time scaling limits of the EnKF. (5.1b) The final step is to find an SDE for which the above represents a reasonable numerical scheme.
Of course, this depends crucially on the choice of scaling parameter s.
In fact, it is not hard to show that the one non-trivial limiting SDEs corresponds to the choice s =.
The stabilizing term, which draws the ensemble member back towards the truth, and the noise, both act only orthogonal to the null-space of the empirical covariance of the set of particles.

6 Continuous-Time Estimates

Under Assumptions 2.3 and 2.5.the authors.
As can be seen in [DZ92, Theorem 4.17], these conditions are sufficient to utilise Itô’s formula.
The authors note that, in the case of (6.1), it is not unreasonable to assume the existence of strong solutions.
In finite dimensions this is in fact a consequence of the mean square estimate provided by the theorem; since the authors have been unable to prove this in the rather general infinite dimensional setting, however, they make it an assumption.
The above argument does not work, but nevertheless it is still informative to see why it doesn’t work.

7 Numerical Results

In this section the authors confirm the validity of the theorems derived in the previous sections for variants of the EnKF when applied to the dynamical system (2.2).
Furthermore, the authors extend their numerical explorations beyond the strict range of validity of the theory and, in particular, consider the case of partial observations.
The authors conduct all of their numerical experiments in the case of the incompressible Navier-Stokes equation on a two dimensional torus.
The filter is always inaccurate when used without inflation.
The authors thus turn to study the effect of inflation and note that their results indicate the filter can then always be made accurate, even in the case of partial observations, provided that sufficiently many low Fourier modes are observed.

7.1 Setup

J∇ with J the canonical skew-symmetric matrix.
The true initial condition u0 is randomly drawn from N(0, ν2A−2).
The authors use the notation m(t) to denote the mean of the ensemble.
The method used to approximate the forward model is a modification of a fourth-order Runge-Kutta method, ETD4RK [CM02], in which the Stokes semi-group is computed exactly by working in the incompressible Fourier basis {ψm(x)}m∈Z2\{0}, and Duhamel’s principle (variation of constants formula) is used to incorporate the nonlinear term.

7.2.1 Full observations

Here the authors consider observations made at all numerically resolved, and hence observable, wavenumbers in the system; hence K = 322, not including padding in the spectral domain which avoids aliasing.
Observations of the full-field are made every J = 20 time-steps.
In Figure 2 there is no variance inflation and, whilst typical ensemble members remain bounded on the timescales shown, the error between the ensemble mean and the truth is O(1); indeed comparison with Figure 1 shows that the error in the mean is in fact worse than that of an ensemble evolving without access to data.
Using variance inflation removes this problem and filter accuracy is obtained: see Figure 3.

7.2.2 Partial observations

The observations are again made every J = 20 time-steps, but the authors will now consider observing only projections inside and outside a ring of radius |kλ| = 5 in Fourier space.
Figure 4 shows that when observing all Fourier modes inside a ring of radius |kλ| = 5 the filter is accurate over long time-scales.
In contrast, Figure 5 shows that observing all Fourier modes outside a ring of radius |kλ| = 5 does not provide enough information to induce accurate filtering.

7.3 Continuous Time

And its relation to the underlying truth governed by (2.2), by means of numerical experiments.the authors.
The authors thereby illustrate and extend the results of section 6.
The Navier-Stokes equation 2.2 itself is solved by the method described in section 7.1.
The stochastic process is also diagonalized in the Fourier basis (7.1) and then time-approximated by the EulerMaruyama scheme [KP92].

7.3.1 Full observations

Here the authors consider observations made at all numerically resolved, and hence observable, wavenumbers in the system; hence K = 322, not including padding in the spectral domain which avoids aliasing.
Figure 6 shows that, without inflation, the ensemble remains bounded, but the mean is inaccurate, on the time-scales of interest.
In contrast Figure 7 demonstrates that inflation leads to accurate reconstruction of the truth via the ensemble mean.

8 Conclusions

The authors have developed a method for the analysis of the EnKF.
Instead of viewing it as an algorithm designed to accurately approximate the true filtering distribution, which it cannot do, in general, outside Gaussian scenarios and in the large ensemble limit, the authors study it as an algorithm for signal estimation in the finite (possibly small) ensemble limit.
These positive results about the EnKF are encouraging and serve to underpin its perceived effectiveness in applications.
On the other hand it is important to highlight that their analysis applies only to fully observed dynamics and interesting open questions remain concerning the partially observed case.
These numerical results demonstrate two interesting potential extensions of their theory: (i) to strengthen well-posedness to obtain boundedness of trajectories, at least in mean square; (ii) to extend well-posedness and accuracy results to certain partial observation scenarios.

Did you find this useful? Give us your feedback

Figures (9)

Figure 9: Continuous-time observations, with inflation. Trajectories of various modes of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |v(1) − u|/|u|, for H = Qλ, with |kλ| = 5.

Figure 4: Discrete-time observations, with inflation. Trajectories of various modes (at observation times) of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |m − u|/|u|, for H = Pλ, with |kλ| = 5, J = 20, and γ = 10−2.

Figure 5: Discrete-time observations, with inflation. Trajectories of various modes (at observation times) of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |m − u|/|u|, for H = Qλ, with |kλ| = 5, J = 20, and γ = 10−2.

Figure 6: Continuous-time observations, without inflation. Trajectories of various modes of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |v(1) − u|/|u|, for H = Pλ, with λ =∞.

Figure 7: Continuous-time observations, with inflation. Trajectories of various modes of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |v(1) − u|/|u|, for H = Pλ, with λ =∞.

Figure 3: Discrete-time observations, with inflation. Trajectories of various modes (at observation times) of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |m − u|/|u|, for H = Pλ, with λ =∞.

Figure 8: Continuous-time observations, with inflation. Trajectories of various modes of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |v(1) − u|/|u|, for H = Pλ, with |kλ| = 5.

Figure 1: Trajectories of various modes (at observation times) of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |m − u|/|u|, for H = Pλ, with λ = 0 – i.e. nothing is observed.

Figure 2: Discrete-time observations, without inflation. Trajectories of various modes (at observation times) of the estimators v(k) and the signal u are depicted above along with the relative error in the L2 norm, |m−u|/|u|, for H = Pλ, with λ =∞.

Content maybe subject to copyright Report

Well-Posedness And Accuracy Of The Ensemble Kalman Filter In

Discrete And Continuous Time

October 14, 2013

D.T.B. Kelly, K.J.H. Law, A.M. Stuart

The University of Warwick, Email: David.Kelly@Warwick.ac.uk

Abstract

The ensemble Kalman ﬁlter (EnKF) is a method for combining a dynamical model with data in a sequential

fashion. Despite its widespread use, there has been little analysis of its theoretical properties. Many of the

algorithmic innovations associated with the ﬁlter, which are required to make a useable algorithm in practice, are

derived in an ad hoc fashion. The aim of this paper is to initiate the development of a systematic analysis of the

EnKF, in particular to do so in the small ensemble size limit. The perspective is to view the method as a state

estimator, and not as an algorithm which approximates the true ﬁltering distribution. The perturbed observation

version of the algorithm is studied, without and with variance inﬂation. Without variance inﬂation well-posedness

of the ﬁlter is established; with variance inﬂation accuracy of the ﬁlter, with resepct to the true signal underlying the

data, is established. The algorithm is considered in discrete time, and also for a continuous time limit arising when

observations are frequent and subject to large noise. The underlying dynamical model, and assumptions about it,

is sufﬁciently general to include the Lorenz ’63 and ’96 models, together with the incompressible Navier-Stokes

equation on a two-dimensional torus. The analysis is limited to the case of complete observation of the signal with

additive white noise. Numerical results are presented for the Navier-Stokes equation on a two-dimensional torus

for both complete and partial observations of the signal with additive white noise.

1 Introduction

In recent years the ensemble Kalman ﬁlter (EnKF) [Eve06] has become a widely used methodology for combin-

ing dynamical models with data. The algorithm is used in oceanography, oil reservoir simulation and weather

prediction [BVLE98, EVL00, Kal03, ORL08], for example. The basic idea of the method is to propagate an

ensemble of particles to describe the distribution of the signal given data, employing empirical second order

statistics to update the distribution in a Kalman-like fashion when new data is acquired. Despite the widespread

use of the method, its behaviour is not well understood. In contrast with the ordinary Kalman ﬁlter, which ap-

plies to linear Gaussian problems, it is difﬁcult to ﬁnd a mathematical justiﬁcation for EnKF. The most notable

progress in this direction can be found in [LGMT

10, MCB11], where it is proved that, for linear dynamics,

the EnKF approximates the usual Kalman ﬁlter in the large ensemble limit. This analysis is however far from

being useful for practitioners who typically run the method with small ensemble size on nonlinear problems.

Furthermore there is an accumulation of numerical evidence showing that the EnKF, and related schemes such

as the extended Kalman ﬁlter, can “diverge” with the meaning of “diverge” ranging from simply loosing the

true signal through to blow-up [IKJ02, MH08, GM13]. The aim of our work is to try and build mathematical

foundations for the analysis of the EnKF, in particular with regards to well-posedness (lack of blow-up) and

accuracy (tracking the signal over arbitrarily long time-intervals). To make progress on such questions it is

necessary to impose structure on the underlying dynamics and we choose to work with dissipative quadratic

systems with energy-conserving nonlinearity, a class of problems which has wide applicability [MW06] and

which has proved to be useful in the development of ﬁlters [MH12].

In section 3 we derive the perturbed observation form of the EnKF and demonstrate how it links to the

randomized maximum likelihood method (RML) which is widely used in oil reservoir simulation [ORL08].

arXiv:1310.3167v1 [math.PR] 11 Oct 2013

We also introduce the idea of variance inﬂation, widely used in many practical implementations of the EnKF

[And07]. Section 4 contains theoretical analyses of the perturbed observation EnKF, without and with variance

inﬂation. Without variance inﬂation we are able only to prove bounds which grow exponentially in the discrete

time increment underlying the algorithm (Theorem 4.2); with variance inﬂation we are able to prove ﬁlter

accuracy and show that, in mean square with respect to the noise entering the algorithm, the ﬁlter is uniformly

close to the true signal for all large times, provided enough inﬂation is employed (Theorem 4.4). These results,

and in particular the one concerning variance inﬂation, are similar to the results developed in [BLL

12] for the

3DVAR ﬁlter applied to the Navier-Stokes equation and for the 3DVAR ﬁlter applied to the Lorenz ’63 model

in [LSS14], as well as the similar analysis developed in [MLPvL13] for the 3DVAR ﬁlter applied to globally

Lipschitz nonlinear dynamical systems. In section 5 we describe a continuous time limit in which data arrives

very frequently, but is subject to large noise. If these two effects are balanced appropriately a stochastic (partial)

differential equation limit is found and it is instructive to study this limiting continuous time process. This

idea was introduced in [BLSZ] for the 3DVAR ﬁlter and is here employed for the EnKF ﬁlter. The primary

motivation for the continuous time limit is to obtain insight into the mechanisms underlying the EnKF; some of

these mechanisms are more transparent in continuous time. In section 6 we analyze the well-posedness of the

continuous time EnKF (Theorem 6.2). Section 7 contains numerical experiments which illustrate and extend

the theory, and section 8 contains some brief concluding remarks.

Throughout the sequel we use the following notation. Let H be a separable Hilbert space with norm |·| and

inner product h·, ·i. For a linear operator C on H, we will write C ≥ 0 (resp. C > 0) when C is self-adjoint

and positive semi-deﬁnite (resp. positive deﬁnite). Given C > 0, we will denote

|·|

def



−1/2

(·)



2 Set-Up

2.1 Filtering Distribution

We assume that the observed dynamics are governed by an evolution equation

= F (u) (2.1)

which generates a one-parameter semigroup Ψ

: H → H. We also assume that K ⊂ H is another Hilbert

space, which acts as the observation space. We assume that noisy observations are made in K every h time

units and write Ψ = Ψ

. We deﬁne u

= u(jh) for j ∈ N and, assuming that u

is uncertain and modelled as

Gaussian distributed, we obtain

j+1

= Ψ(u

) , with u

∼ N (m

, C

)

for some initial mean m

and covariance C

. We are given the observations

j+1

= Hu

j+1

+ Γ

1/2

j+1

, with ξ

∼ N (0, I) i.i.d. ,

where H ∈ L(H, K) is the observation operator and Γ ∈ L(H, H) with Γ ≥ 0 is the covariance operator of

the observational noise; the i.i.d. noise sequence {ξ

} is assumed independent of u

. The aim of ﬁltering is to

approximate the distribution of u

given Y

= {y

}

`=1

using a sequential update algorithm. That is, given the

distribution u

as well as the observation y

j+1

, ﬁnd the distribution of u

j+1

. We refer to the sequence

P(u

) as the ﬁltering distribution.

2.2 Assumptions

To write down the EnKF as we do in section 3, and indeed to derive the continuum limit of the EnKF, as we do in

section 5, we need make no further assumptions about the underlying dynamics and observation operator other

than those made above. However, in order to analyze the properties of the EnKF, as we do in sections 4 and 6,

we will need to make structural assumptions and we detail these here. It is worth noting that the assumptions we

make on the underlying dynamics are met by several natural models used to test data assimilation algorithms.

In particular, the 2D Navier-Stokes equations on a torus, as well as both Lorenz ’63 and ’96 models, satisfy

Assumptions 2.3 [MW06, MH12, Tem97].

Assumption 2.3. (Dynamics Model) Suppose there is some Banach space V, equipped with norm k·k, that can

be continuously embedded into H. We assume that (2.1) has the form

+ Au + B(u, u) = f , (2.2)

where A : H → H is an unbounded linear operator satisfying

hAu, ui ≥ λ kuk

, (2.3)

for some λ > 0, B is a symmetric bilinear operator B : V × V → H and f : R

→ H. We furthermore assume

that B satisﬁes the identity

hB(u, u), ui = 0 , (2.4)

for all u ∈ H and also

hB(u, v), vi ≤ c kuk kvk |v| , (2.5)

for all u, v, w ∈ H, where c > 0 depends only on the bilinear form. We assume that the equation (2.2) has

a unique weak solution for all u(0) ∈ H, and generates a one-parameter semigroup Ψ

: V → V which may

be extended to act on H. Finally we assume that there exists a global attractor Λ ⊂ V for the dynamics, and

constant R > 0 such that for any initial condition u

∈ Λ, we have that sup

t≥0

ku(t)k ≤ R. 

Remark 2.4. In the ﬁnite dimensional case the ﬁnal assumption on the existence of a global attractor does not

need to be made as it is a consequence of the preceding assumptions made. To see this note that

d|u|

+ λkuk

≤ hf, ui. (2.6)

The continuous embedding of V, together with the Cauchy-Schwarz inequality, implies the existence of a strictly

positive constant  such that

d|u|

+ |u|

≤

2δ

|f|

|u|

(2.7)

for all δ > 0. Choosing δ =  gives the existence of an absorbing set and hence a global attractor by Theorem

1.1 in Chapter I of [Tem97]. In inﬁnite dimensions the existence of a global attractor in V follows from the

techniques in [Tem97] for the Navier-Stokes equation by application of more subtle inequalities relating to the

bilinear operator B – see section 2.2 in Chapter III of [Tem97]. Other equations arising in dissipative ﬂuid

mechanics can be treated similarly. 

Whilst the preceding assumptions on the underlying dynamics apply to a range of interesting models arising

in applications, the following assumptions on the observation model are rather restrictive; however we have been

unable to extend the analysis without making them. We will demonstrate, by means of numerical experiments,

that our results extend beyond the observation scenario employed in the theory

Assumption 2.5. (Observation Model) The system is completely observed so that K = H and H = I.

Furthermore the i.i.d. noise sequence {ξ

} is white so that ξ

∼ N (0, Γ) with Γ = γ

I. 

The following consequence of Assumption 2.3 will be useful to us.

Lemma 2.6. Let Assumptions 2.3 hold. Then there is β ∈ R such that, for any v

∈ Λ, h > 0 and w

∈ H,

|Ψ

) − Ψ

)| ≤ e

βh

− w

| .

Proof. Let v, w denote the solutions of (2.2) with initial conditions v

, w

respectively; deﬁne e = v − w. Then

+ Ae + 2B(v, e) − B(e, e) = 0 , (2.8)

with e(0) = v

− w

. Taking the inner-product with e, using (2.3), (2.4) and (2.5), and choosing δ = λ/(2cR),

gives

|e|

+ λkek

≤ 2ckvkkek|e|

≤ 2cRkek|e|

≤ cR(δkek

+ δ

−1

|e|

)

kek

(cR)

|e|

Thus

|e|

≤

(cR)

|e|

and the desired result follows from an application of the Gronwall inequality.

3 The Ensemble Kalman Filter

3.1 The Algorithm

The idea of the EnKF is to represent the ﬁltering distribution through an ensemble of particles, to propagate

this ensemble under the model to approximate the mapping P(u

) to P(u

j+1

) (refered to as prediction

in the applied literature), and to update the ensemble distribution to include the data point Y

j+1

by using a

Gaussian approximation based on the second order statistics of the ensemble (refered to as analysis in the

applied literature).

The prediction step is achieved by simply ﬂowing forward the ensemble under the model dynamics, that is

(k)

j+1

= Ψ(v

(k)

) , for k = 1 . . . K.

The analysis step is achieved by performing a randomised version of the Kalman update formula, and using the

empirical covariance of the prediction ensemble to compute the Kalman gain. There are many variants on the

basic EnKF idea and we will study the perturbed observation form of the method.

The algorithm proceeds as follows.

1. Set j = 0 and draw an independent set of samples {v

(k)

}

k=1

from N(m

, C

2. (Prediction) Let bv

(k)

j+1

= Ψ(v

(k)

) and deﬁne

j+1

as the empirical covariance of {bv

(k)

j+1

}

k=1

. That is,

j+1

k=1

(bv

(k)

j+1

− ¯v

j+1

) ⊗ (bv

(k)

j+1

− ¯v

j+1

) ,

where ¯v

j+1

k=1

j+1

denotes the ensemble mean.

3. (Observation) Make an observation y

j+1

= Hu

j+1

+ Γ

1/2

j+1

. Then, for each k = 1 . . . K, generate an

artiﬁcial observation

(k)

j+1

= y

j+1

+ Γ

1/2

(k)

j+1

where ξ

(k)

j+1

are N(0, I) distributed and pairwise independent.

4. (Analysis) Let v

(k)

j+1

be the minimiser of the functional

J(v) =

(k)

j+1

− v|

|bv

(k)

j+1

− v|

j+1

5. Set j 7→ j + 1 and return to step 2.

The name “perturbed observation EnKF” follows from the construction of the artiﬁcial observations y

(k)

j+1

which

are found by perturbing the given observation with additional noise. The sequence of minimisers v

j+1

can be

written down explicitly by simply solving the quadratic minimization problem. This straightforward exercise

yields the following result.

Proposition 3.2. The sequence {v

(k)

}

j≥0

is deﬁned by the equation

(I +

j+1

−1

H)v

(k)

j+1

= bv

(k)

j+1

−1

(k)

j+1

for each k = 1, . . . , K.

Hence, collecting the ingredients from the preceding, the deﬁning equations of the EnKF are given by

(I +

j+1

−1

H)v

(k)

j+1

= Ψ(v

(k)

j+1

) +

j+1

−1

(k)

j+1

(3.1a)

(k)

j+1

= y

j+1

+ Γ

1/2

(k)

j+1

(3.1b)

¯v

j+1

k=1

Ψ(v

(k)

j+1

) (3.1c)

j+1

k=1



Ψ(v

(k)

j+1

) − ¯v

j+1



⊗



Ψ(v

(k)

j+1

) − ¯v

j+1



. (3.1d)

There are other representations of the EnKF that are more algorithmically convenient, but the formulae (3.1)

are better suited to our analysis.

3.3 Connection to Randomized Maximum Likelihood

The analysis step of EnKF can be understood in terms of the Randomised Maximum Likelihood (RML) method

widely used in oil reservoir history matching applications [ORL08]. We will now brieﬂy describe this method.

Suppose that we have a random variable u and that u ∼ N( bm,

C). Moreover, let G be some linear operator and

suppose we observe

y = Gu + ξ where ξ ∼ N(0, Γ) .

One can use Bayes’ theorem to write down the conditional density P(u|y). In practice however, it is often

sufﬁcient (or sometimes even better) to simply have a collection of samples {u

(k)

}

k=1

from the conditional

distribution, rather than the density itself. RML is a method of taking samples from the prior N( bm,

C) and

turning them into samples from the posterior. This is achieved as follows, given bu

(k)

∼ N( bm,

C) (samples

from the prior), deﬁne u

(k)

for each k = 1 . . . K by u

(k)

= argmin

(k)

(u) where

(k)

(u) =

|y − Gu + Γ

1/2

(k)

|u − bu

(k)

where ξ

(k)

∼ N(0, I) and independent of ξ. The u

(k)

are then draws from the posterior distribution of u|y

which is a Gaussian with mean m and covariance C. Since one can explicitly write down (m, C), it may be

checked that the u

(k)

deﬁned as above are independent random variables of the form u

(k)

= m + C

1/2

(k)

where ζ

(k)

∼ N (0, I) i.i.d. and are hence draws from the desired posterior, as we know show.

HTML Viewer

Frequently Asked Questions (13)

Q1. What have the authors contributed in "Well-posedness and accuracy of the ensemble kalman filter in discrete and continuous time" ?

The aim of this paper is to initiate the development of a systematic analysis of the EnKF, in particular to do so in the small ensemble size limit. The perturbed observation version of the algorithm is studied, without and with variance inflation.

Q2. What future works have the authors mentioned in the paper "Well-posedness and accuracy of the ensemble kalman filter in discrete and continuous time" ?

These numerical results demonstrate two interesting potential extensions of their theory: ( i ) to strengthen well-posedness to obtain boundedness of trajectories, at least in mean square ; ( ii ) to extend well-posedness and accuracy results to certain partial observation scenarios. Furthermore the authors highlight the fact that their results have assumed exact solution of the underlying differential equation model ; understanding how filtering interacts with numerical approximations, and potentially induces numerical instabilities, is a subject which requires further investigation ; this issue is highlighted in [ GM13 ].

Q3. What is the method used to approximate the forward model?

The method used to approximate the forward model is a modification of a fourth-order Runge-Kutta method, ETD4RK [CM02], in which the Stokes semi-group is computed exactly by working in the incompressible Fourier basis {ψm(x)}m∈Z2\\{0}, and Duhamel’s principle (variation of constants formula) is used to incorporate the nonlinear term.

Q4. What is the purpose of the EnKF?

The idea of the EnKF is to represent the filtering distribution through an ensemble of particles, to propagate this ensemble under the model to approximate the mapping P(uj |Yj) to P(uj+1|Yj) (refered to as prediction in the applied literature), and to update the ensemble distribution to include the data point Yj+1 by using a Gaussian approximation based on the second order statistics of the ensemble (refered to as analysis in the applied literature).

Q5. What is the effect of the noise contribution on the particles?

The perturbed observations noise contribution will act to prevent the particles from synchronizing which, in their absence, could happen.

Q6. What is the reason why the RML method is not an approximation of samples?

1|Yj is certainly not Gaussian in general, unless the dynamics are linear, hence the RML method becomes an approximation of samples.

Q7. What does it mean that the noise is finite dimensional?

However the fact that the noise is effectively finite dimensional, due to the presence of the finite rank covariance operator C, does mean that existence of strong solutions may well be established on a case-by-case basis in some infinite dimensional settings.

Q8. What is the simplest way to achieve the prediction step?

The prediction step is achieved by simply flowing forward the ensemble under the model dynamics, that isv̂ (k) j+1 = Ψ(v (k) j ) , for k = 1 . . .K.The analysis step is achieved by performing a randomised version of the Kalman update formula, and using the empirical covariance of the prediction ensemble to compute the Kalman gain.

Q9. What is the aim of this work?

The aim of their work is to try and build mathematical foundations for the analysis of the EnKF, in particular with regards to well-posedness (lack of blow-up) and accuracy (tracking the signal over arbitrarily long time-intervals).

Q10. What is the effect of inflation on the filter?

The authors thus turn to study the effect of inflation and note that their results indicate the filter can then always be made accurate, even in the case of partial observations, provided that sufficiently many low Fourier modes are observed.

Q11. What is the definition of a global attractor?

In infinite dimensions the existence of a global attractor in V follows from the techniques in [Tem97] for the Navier-Stokes equation by application of more subtle inequalities relating to the bilinear operator B – see section 2.2 in Chapter III of [Tem97].

Q12. What is the effect of variance inflation on the EnKF?

v(k) j+1 = Ψ(v (k) j ) + (α2 γ2 The author+ 1 γ2 Ĉj+1)y (k) j+1 . (4.4)The authors will now show that with variance inflation, one obtains much stronger long-time estimates than without it.

Q13. What is the inverse of the central limit theorem?

from the second equation the authors must have 1 − s/2 ≥ 1/2, for otherwise the stochastic terms would diverge when summed up, in accordance with the central limit theorem.

Well-Posedness And Accuracy Of The Ensemble Kalman Filter In Discrete And Continuous Time

Summary (4 min read)

1 Introduction

2.1 Filtering Distribution

2.2 Assumptions

3.1 The Algorithm

3.3 Connection to Randomized Maximum Likelihood

3.5 Variance Inflation

4 Discrete-Time Estimates

4.1 Well-Posedness Without Variance Inflation

4.3 Accuracy With Variance Inflation

5 Derivation Of The Continuous Time Limit

6 Continuous-Time Estimates

7 Numerical Results

7.1 Setup

7.2.1 Full observations

7.2.2 Partial observations

7.3 Continuous Time

7.3.1 Full observations

8 Conclusions

Figures (9)

Citations

References

Related Papers (5)

Frequently Asked Questions (13)

Q1. What have the authors contributed in "Well-posedness and accuracy of the ensemble kalman filter in discrete and continuous time" ?

Q2. What future works have the authors mentioned in the paper "Well-posedness and accuracy of the ensemble kalman filter in discrete and continuous time" ?

Q3. What is the method used to approximate the forward model?

Q4. What is the purpose of the EnKF?

Q5. What is the effect of the noise contribution on the particles?

Q6. What is the reason why the RML method is not an approximation of samples?

Q7. What does it mean that the noise is finite dimensional?

Q8. What is the simplest way to achieve the prediction step?

Q9. What is the aim of this work?

Q10. What is the effect of inflation on the filter?

Q11. What is the definition of a global attractor?

Q12. What is the effect of variance inflation on the EnKF?

Q13. What is the inverse of the central limit theorem?