Weak Dynamic Programming Principle
for Viscosity Solutions
Bruno Bouchard
and Nizar Touzi
February 2009
Abstract
We prove a weak version of the dynamic programming principle for standard
stochastic control problems and mixed control-stopping problems, which avoids the
technical difficulties related to the measurable selection argument. In the Markov
case, our result is tailor-made for the derivation of the dynamic programming equation
in the sense of viscosity solutions.
Key words: Optimal control, Dynamic programming, discontinuous viscosity solutions.
AMS 1991 subject classifications: Primary 49L25, 60J60; secondary 49L20, 35K55.
1 Introduction
Consider the standard class of stochastic control problems in the Mayer form
\[
V(t,x) := \sup_{\nu \in \mathcal{U}} \mathbb{E}\left[ f(X^\nu_T) \,\middle|\, X^\nu_t = x \right],
\]
where $\mathcal{U}$ is the set of controls, $X^\nu$ is the controlled process, $f$ is some given function, $0 < T \le \infty$ is a given time horizon, $t \in [0,T)$ is the time origin, and $x \in \mathbb{R}^d$ is some given initial condition.
This framework includes the general class of stochastic control problems under the so-called
Bolza formulation, the corresponding singular versions, and optimal stopping problems.
The authors are grateful to Nicole El Karoui for fruitful comments. This research is part of the Chair
Financial Risks of the Risk Foundation sponsored by Société Générale, the Chair Derivatives of the Future
sponsored by the Fédération Bancaire Française, the Chair Finance and Sustainable Development sponsored
by EDF and Calyon, and the Chair Les particuliers face au risque sponsored by Groupama.
CEREMADE, Université Paris Dauphine and CREST-ENSAE, bouchard@ceremade.dauphine.fr
Ecole Polytechnique Paris, Centre de Mathématiques Appliquées, touzi@cmap.polytechnique.fr

A key tool for the analysis of such problems is the so-called dynamic programming principle
(DPP), which relates the time-$t$ value function $V(t,\cdot)$ to any later time-$\tau$ value $V(\tau,\cdot)$ for
any stopping time $\tau \in [t,T)$ a.s. A formal statement of the DPP is:
\[
V(t,x) = v(t,x) := \sup_{\nu \in \mathcal{U}} \mathbb{E}\left[ V(\tau, X^\nu_\tau) \,\middle|\, X^\nu_t = x \right]. \tag{1.1}
\]
In particular, this result is routinely used in the case of controlled Markov jump-diffusions in
order to derive the corresponding dynamic programming equation in the sense of viscosity
solutions, see Lions [6, 7], Fleming and Soner [5], and Touzi [9].
The statement (1.1) of the DPP is very intuitive and can be easily proved in the deterministic
framework, or in discrete time with a finite probability space. However, its proof is
in general not trivial, and requires, as a first step, that $V$ be measurable.
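In the discrete-time, finite-probability setting just mentioned, the DPP can indeed be checked directly by enumeration. The following sketch (the toy dynamics, horizon, and reward are our own illustrative choices, not from the paper) computes the value function by backward induction, which is exactly the DPP recursion, and confirms that it coincides with a brute-force maximization over all feedback policies:

```python
import itertools

# Toy problem: X_{t+1} = X_t + a_t + xi_t, controls a in {-1, +1},
# noise xi in {-1, +1} with probability 1/2 each, horizon T = 2,
# terminal reward f(x) = -(x - 2)^2. All choices are illustrative.
T, ACTIONS, NOISE, P = 2, (-1, 1), (-1, 1), 0.5
f = lambda x: -(x - 2) ** 2

def dp_value(t, x):
    """Backward induction: V(t,x) = max_a E[V(t+1, x+a+xi)] -- the DPP itself."""
    if t == T:
        return f(x)
    return max(sum(P * dp_value(t + 1, x + a + xi) for xi in NOISE)
               for a in ACTIONS)

# Brute force: enumerate every feedback policy pi(t, x) on the reachable states
# and every noise scenario, and average the terminal reward.
reach = {0: {0}, 1: {a + xi for a in ACTIONS for xi in NOISE}}
keys = [(t, x) for t in (0, 1) for x in sorted(reach[t])]

def policy_value(pi):
    total = 0.0
    for path in itertools.product(NOISE, repeat=T):  # all noise scenarios
        x = 0
        for t, xi in enumerate(path):
            x = x + pi[(t, x)] + xi
        total += f(x) * P ** T
    return total

best_bf = max(policy_value(dict(zip(keys, acts)))
              for acts in itertools.product(ACTIONS, repeat=len(keys)))
print(dp_value(0, 0), best_bf)  # the two values agree
```

Since the state is fully observed and the dynamics are Markov, the supremum over adapted controls is attained by a feedback policy, so the two computations return the same value.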
The inequality $V \le v$ is the easy one, but it still requires that $V$ be measurable. Our
weak formulation avoids this issue. Namely, under fairly general conditions on the set of controls
and the controlled process, it follows from an easy application of the tower property of
conditional expectations that
\[
V(t,x) \le \sup_{\nu \in \mathcal{U}} \mathbb{E}\left[ V^*(\tau, X^\nu_\tau) \,\middle|\, X^\nu_t = x \right],
\]
where $V^*$ is the upper semicontinuous envelope of the function $V$.
The proof of the converse inequality $V \ge v$ in a general probability space turns out to
be difficult when the function $V$ is not known a priori to satisfy some continuity condition.
See e.g. Bertsekas and Shreve [1], Borkar [2], and El Karoui [4].
Our weak version of the DPP avoids the non-trivial measurable selection argument needed
to prove the inequality $V \ge v$ in (1.1). Namely, in the context of a general control problem
presented in Section 2, we show in Section 3 that:
\[
V(t,x) \ge \sup_{\nu \in \mathcal{U}} \mathbb{E}\left[ \varphi(\tau, X^\nu_\tau) \,\middle|\, X_t = x \right]
\]
for every upper-semicontinuous minorant $\varphi$ of $V$.
We also show that an easy consequence of this result is that
\[
V(t,x) \ge \sup_{\nu \in \mathcal{U}} \mathbb{E}\left[ V_*\left( \tau^\nu_n, X^\nu_{\tau^\nu_n} \right) \,\middle|\, X_t = x \right],
\]
where $\tau^\nu_n := \tau \wedge \inf\{ s > t : |X^\nu_s - x| > n \}$, and $V_*$ is the lower semicontinuous envelope of
$V$.
This result is weaker than the classical DPP (1.1). However, in the controlled Markov
jump-diffusions case, it turns out to be tailor-made for the derivation of the dynamic programming
equation in the sense of viscosity solutions. Section 5 reports this derivation in the context
of controlled diffusions.
Finally, Section 4 provides an extension of our argument in order to obtain a weak dynamic
programming principle for mixed control-stopping problems.

2 The stochastic control problem
Let $(\Omega, \mathcal{F}, \mathbb{P})$ be a probability space, $T > 0$ a finite time horizon, and $\mathbb{F} := \{\mathcal{F}_t, 0 \le t \le T\}$
a given filtration of $\mathcal{F}$, satisfying the usual assumptions. For every $t \ge 0$, we denote by
$\mathbb{F}^t = (\mathcal{F}^t_s)_{s \ge 0}$ the right-continuous filtration generated by the $\mathcal{F}$-measurable processes that are
independent of $\mathcal{F}_t$.
We denote by $\mathcal{T}$ the collection of all $\mathbb{F}$-stopping times. For $\tau_1, \tau_2 \in \mathcal{T}$ with $\tau_1 \le \tau_2$ a.s.,
the subset $\mathcal{T}_{[\tau_1, \tau_2]}$ is the collection of all $\tau \in \mathcal{T}$ such that $\tau \in [\tau_1, \tau_2]$ a.s. When $\tau_1 = 0$, we
simply write $\mathcal{T}_{\tau_2}$. We use the notations $\mathcal{T}^t_{[\tau_1, \tau_2]}$ and $\mathcal{T}^t_{\tau_2}$ to denote the corresponding sets of
stopping times that are independent of $\mathcal{F}_t$.
For $\tau \in \mathcal{T}$ and a subset $A$ of a finite dimensional space, we denote by $L^0_\tau(A)$ the collection
of all $\mathcal{F}_\tau$-measurable random variables with values in $A$. $H^0(A)$ is the collection of all
$\mathbb{F}$-progressively measurable processes with values in $A$, and $H^0_{\mathrm{rcll}}(A)$ is the subset of all
processes in $H^0(A)$ which are right-continuous with finite left limits.
In the following, we denote by $B_r(z)$ (resp. $\partial B_r(z)$) the open ball (resp. its boundary) of
radius $r > 0$ and center $z \in \mathbb{R}^\ell$, $\ell \in \mathbb{N}$.
Throughout this note, we fix an integer $d \in \mathbb{N}$, and we introduce the sets:
\[
\mathbf{S} := [0,T] \times \mathbb{R}^d \quad \text{and} \quad \mathbf{S}_0 := \left\{ (\tau, \xi) : \tau \in \mathcal{T}_T \text{ and } \xi \in L^0_\tau(\mathbb{R}^d) \right\}.
\]
We also denote by $\mathrm{USC}(\mathbf{S})$ (resp. $\mathrm{LSC}(\mathbf{S})$) the collection of all upper-semicontinuous (resp.
lower-semicontinuous) functions from $\mathbf{S}$ to $\mathbb{R}$.
The set of control processes is a given subset $\mathcal{U}_0$ of $H^0(\mathbb{R}^k)$, for some integer $k \ge 1$, such that
the controlled state process, defined as the mapping
\[
(\tau, \xi; \nu) \in \tilde{\mathbf{S}} \times \mathcal{U}_0 \longmapsto X^\nu_{\tau,\xi} \in H^0_{\mathrm{rcll}}(\mathbb{R}^d) \quad \text{for some } \tilde{\mathbf{S}} \text{ with } \mathbf{S} \subset \tilde{\mathbf{S}} \subset \mathbf{S}_0,
\]
is well-defined and satisfies:
\[
\left( \theta, X^\nu_{\tau,\xi}(\theta) \right) \in \tilde{\mathbf{S}} \quad \text{for all } (\tau, \xi) \in \tilde{\mathbf{S}} \text{ and } \theta \in \mathcal{T}_{[\tau,T]}.
\]
Given a Borel function $f : \mathbb{R}^d \to \mathbb{R}$ and $(t,x) \in \mathbf{S}$, we introduce the reward function
$J : \mathbf{S} \times \mathcal{U} \to \mathbb{R}$:
\[
J(t,x;\nu) := \mathbb{E}\left[ f\left( X^\nu_{t,x}(T) \right) \right] \tag{2.1}
\]
which is well-defined for controls $\nu$ in
\[
\mathcal{U} := \left\{ \nu \in \mathcal{U}_0 : \mathbb{E}\left| f\left( X^\nu_{t,x}(T) \right) \right| < \infty \ \text{for all } (t,x) \in \mathbf{S} \right\}. \tag{2.2}
\]
We say that a control $\nu \in \mathcal{U}$ is $t$-admissible if it is independent of $\mathcal{F}_t$, and we denote by $\mathcal{U}_t$
the collection of such processes. The stochastic control problem is defined by:
\[
V(t,x) := \sup_{\nu \in \mathcal{U}_t} J(t,x;\nu) \quad \text{for } (t,x) \in \mathbf{S}. \tag{2.3}
\]
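For a concrete, if simplistic, instance of (2.1)–(2.3), one can take $X^\nu$ to be a drift-controlled Brownian motion and restrict attention to constant controls. The following sketch (the dynamics, the reward $f(x) = -x^2$, and the finite control grid are our own illustrative assumptions, not from the paper) estimates $J(t,x;\nu)$ by Monte Carlo and compares the resulting approximation of $V(t,x)$ with the closed-form value $\mathbb{E}[f(X_T)] = -\big( (x + \nu(T-t))^2 + \sigma^2 (T-t) \big)$:

```python
import math
import random

random.seed(0)

# Toy controlled dynamics: X_T = x + nu*(T-t) + sigma*sqrt(T-t)*Z, Z ~ N(0,1),
# i.e. a constant-drift control nu; reward f(x) = -x^2. Illustrative only.
T, sigma = 1.0, 1.0
f = lambda x: -x * x

def J(t, x, nu, n=100_000):
    """Monte Carlo estimate of J(t, x; nu) = E[f(X^nu_{t,x}(T))], cf. (2.1)."""
    h = T - t
    return sum(f(x + nu * h + sigma * math.sqrt(h) * random.gauss(0.0, 1.0))
               for _ in range(n)) / n

def J_exact(t, x, nu):
    h = T - t
    return -((x + nu * h) ** 2 + sigma ** 2 * h)

# Approximate V(t, x) = sup_nu J(t, x; nu) over a small grid of constant
# controls, as in (2.3) restricted to this toy control set.
t, x = 0.0, 0.5
controls = [-1.0, -0.5, 0.0, 0.5, 1.0]
V_mc = max(J(t, x, nu) for nu in controls)
V_ex = max(J_exact(t, x, nu) for nu in controls)
print(V_mc, V_ex)  # close for large n; nu = -0.5 drives the mean to 0
```

The optimal constant control here steers the terminal mean to zero, so the grid maximum is attained at $\nu = -0.5$ with value $-1$; the Monte Carlo estimate agrees up to sampling error.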

3 Dynamic programming for stochastic control problems
For the purpose of our weak dynamic programming principle, the following assumptions are
crucial.
Assumption A For all $(t,x) \in \mathbf{S}$ and $\nu \in \mathcal{U}_t$, the controlled state process satisfies:
A1 (Independence) The process $X^\nu_{t,x}$ is independent of $\mathcal{F}_t$.
A2 (Causality) For $\tilde{\nu} \in \mathcal{U}_t$ and $A \in \mathcal{F}$, if $\nu = \tilde{\nu}$ on $A$, then $X^\nu_{t,x} = X^{\tilde{\nu}}_{t,x}$ on $A$.
A3 (Stability under concatenation) For every $\tilde{\nu} \in \mathcal{U}_t$ and $\theta \in \mathcal{T}^t_{[t,T]}$:
\[
\nu \mathbf{1}_{[0,\theta]} + \tilde{\nu} \mathbf{1}_{(\theta,T]} \in \mathcal{U}_t.
\]
A4 (Consistency with deterministic initial data) For all $\theta \in \mathcal{T}^t_{[t,T]}$, we have:
a. For $\mathbb{P}$-a.e. $\omega \in \Omega$, there exists $\tilde{\nu}_\omega \in \mathcal{U}_{\theta(\omega)}$ such that
\[
\mathbb{E}\left[ f\left( X^\nu_{t,x}(T) \right) \,\middle|\, \mathcal{F}_\theta \right](\omega) \le J(\theta(\omega), X^\nu_{t,x}(\theta)(\omega); \tilde{\nu}_\omega).
\]
b. For $t \le s \le T$, $\theta \in \mathcal{T}^t_{[t,s]}$, $\tilde{\nu} \in \mathcal{U}_s$, and $\bar{\nu} := \nu \mathbf{1}_{[0,\theta]} + \tilde{\nu} \mathbf{1}_{(\theta,T]}$, we have:
\[
\mathbb{E}\left[ f\left( X^{\bar{\nu}}_{t,x}(T) \right) \,\middle|\, \mathcal{F}_\theta \right](\omega) = J(\theta(\omega), X^\nu_{t,x}(\theta)(\omega); \tilde{\nu}) \quad \text{for } \mathbb{P}\text{-a.e. } \omega \in \Omega.
\]
Remark 3.1 Assumption A3 above implies the following property of the set of controls, which
will be needed later:
A5 (Stability under bifurcation) For $\nu_1, \nu_2 \in \mathcal{U}_t$, $\tau \in \mathcal{T}^t_{[t,T]}$ and $A \in \mathcal{F}^t_\tau$, we have:
\[
\bar{\nu} := \nu_1 \mathbf{1}_{[0,\tau]} + \left( \nu_1 \mathbf{1}_A + \nu_2 \mathbf{1}_{A^c} \right) \mathbf{1}_{(\tau,T]} \in \mathcal{U}_t.
\]
To see this, observe that $\tau_A := T \mathbf{1}_A + \tau \mathbf{1}_{A^c}$ is a stopping time in $\mathcal{T}^t_{[t,T]}$, and
$\bar{\nu} = \nu_1 \mathbf{1}_{[0,\tau_A)} + \nu_2 \mathbf{1}_{[\tau_A,T]}$ is the concatenation of $\nu_1$ and $\nu_2$ at the stopping time $\tau_A$.
Iterating the above property, we see that for $0 \le t \le s \le T$ and $\tau \in \mathcal{T}^t_{[t,T]}$, we have the
following extension: for a finite sequence $(\nu_1, \ldots, \nu_n)$ of controls in $\mathcal{U}_t$ with $\nu_i = \nu_1$ on $[0,\tau)$,
and for a partition $(A_i)_{1 \le i \le n}$ of $\Omega$ with $A_i \in \mathcal{F}^t_\tau$ for every $i \le n$:
\[
\bar{\nu} := \nu_1 \mathbf{1}_{[0,\tau)} + \mathbf{1}_{[\tau,T]} \sum_{i=1}^n \nu_i \mathbf{1}_{A_i} \in \mathcal{U}_t.
\]
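The bifurcation construction of A5 can be made concrete in a discrete-time setting where controls are maps indexed by (scenario, time). The sketch below (the scenarios, horizon, stopping time, and event $A$ are invented for illustration; it is not part of the paper's framework) builds $\bar{\nu}$ from $\nu_1$, $\nu_2$ and checks both its defining property and its identification with the concatenation of $\nu_1$ and $\nu_2$ at $\tau_A$ away from the single switch instant (a $dt$-null set in continuous time):

```python
# Discrete-time illustration of A5 (stability under bifurcation).
# Controls are maps (scenario, time) -> value; scenarios stand in for omega.
T = 5
scenarios = range(4)
tau = {0: 1, 1: 2, 2: 3, 3: 1}   # a scenario-dependent stopping time tau(omega)
A = {0, 2}                        # the event A (assumed to lie in F^t_tau)
nu1 = {(w, s): w + s for w in scenarios for s in range(T + 1)}
nu2 = {(w, s): -(w + s) - 1 for w in scenarios for s in range(T + 1)}

# nu_bar := nu1 1_{[0,tau]} + (nu1 1_A + nu2 1_{A^c}) 1_{(tau,T]}
nu_bar = {(w, s): nu1[w, s] if (s <= tau[w] or w in A) else nu2[w, s]
          for w in scenarios for s in range(T + 1)}

# Defining property: follow nu1 up to tau, then branch according to A.
for w in scenarios:
    for s in range(T + 1):
        if s <= tau[w]:
            assert nu_bar[w, s] == nu1[w, s]
        else:
            assert nu_bar[w, s] == (nu1[w, s] if w in A else nu2[w, s])

# With tau_A := T 1_A + tau 1_{A^c}, nu_bar coincides (off the single switch
# instant s = tau_A(omega)) with the concatenation of nu1 and nu2 at tau_A.
tau_A = {w: (T if w in A else tau[w]) for w in scenarios}
for w in scenarios:
    for s in range(T + 1):
        if s < tau_A[w]:
            assert nu_bar[w, s] == nu1[w, s]
        elif s > tau_A[w]:
            assert nu_bar[w, s] == nu2[w, s]
print("A5 bifurcation checks passed")
```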
Our main result is the following weak version of the dynamic programming principle, which
uses the following notation:
\[
V_*(t,x) := \liminf_{(t',x') \to (t,x)} V(t',x'), \qquad V^*(t,x) := \limsup_{(t',x') \to (t,x)} V(t',x'), \qquad (t,x) \in \mathbf{S}.
\]

Theorem 3.1 Let Assumption A hold true. Then, for every $(t,x) \in \mathbf{S}$ and for every family
of stopping times $\{\theta^\nu, \nu \in \mathcal{U}_t\} \subset \mathcal{T}^t_{[t,T]}$,
\[
V(t,x) \le \sup_{\nu \in \mathcal{U}_t} \mathbb{E}\left[ V^*\left( \theta^\nu, X^\nu_{t,x}(\theta^\nu) \right) \right]. \tag{3.1}
\]
Assume further that $J(\cdot;\nu) \in \mathrm{LSC}(\mathbf{S})$ for every $\nu \in \mathcal{U}_0$. Then, for any function $\varphi : \mathbf{S} \to \mathbb{R}$:
\[
\varphi \in \mathrm{USC}(\mathbf{S}) \ \text{and} \ V \ge \varphi \ \Longrightarrow \ V(t,x) \ge \sup_{\nu \in \mathcal{U}^\varphi_t} \mathbb{E}\left[ \varphi\left( \theta^\nu, X^\nu_{t,x}(\theta^\nu) \right) \right], \tag{3.2}
\]
where $\mathcal{U}^\varphi_t = \left\{ \nu \in \mathcal{U}_t : \mathbb{E}\left[ \varphi\left( \theta^\nu, X^\nu_{t,x}(\theta^\nu) \right)^+ \right] < \infty \ \text{or} \ \mathbb{E}\left[ \varphi\left( \theta^\nu, X^\nu_{t,x}(\theta^\nu) \right)^- \right] < \infty \right\}$.
Before proceeding to the proof of this result, we report the following consequence.
Corollary 3.1 Let the conditions of Theorem 3.1 hold. For $(t,x) \in \mathbf{S}$, let $\{\theta^\nu, \nu \in \mathcal{U}_t\} \subset \mathcal{T}^t_{[t,T]}$ be a family of stopping times such that $X^\nu_{t,x} \mathbf{1}_{[t,\theta^\nu]}$ is $L^\infty$-bounded for all $\nu \in \mathcal{U}_t$. Then,
\[
\sup_{\nu \in \mathcal{U}_t} \mathbb{E}\left[ V_*\left( \theta^\nu, X^\nu_{t,x}(\theta^\nu) \right) \right] \le V(t,x) \le \sup_{\nu \in \mathcal{U}_t} \mathbb{E}\left[ V^*\left( \theta^\nu, X^\nu_{t,x}(\theta^\nu) \right) \right]. \tag{3.3}
\]
Proof The right-hand side inequality is already provided in Theorem 3.1. It follows from
standard arguments, see e.g. Lemma 3.5 in [8], that we can find a sequence of continuous
functions $(\varphi_n)_n$ such that $\varphi_n \le V_* \le V$ for all $n \ge 1$ and such that $\varphi_n$ converges pointwise
to $V_*$ on $[0,T] \times B_r(0)$. Set $\phi^N := \min_{n \ge N} \varphi_n$ for $N \ge 1$ and observe that the sequence
$(\phi^N)_N$ is non-decreasing and converges pointwise to $V_*$ on $[0,T] \times B_r(0)$. Applying (3.2) of
Theorem 3.1 and using the monotone convergence theorem, we then obtain:
\[
V(t,x) \ge \lim_{N \to \infty} \mathbb{E}\left[ \phi^N\left( \theta^\nu, X^\nu_{t,x}(\theta^\nu) \right) \right] = \mathbb{E}\left[ V_*\left( \theta^\nu, X^\nu_{t,x}(\theta^\nu) \right) \right].
\]
$\Box$
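The approximating sequence invoked in this proof can be produced explicitly by inf-convolution: for a bounded lower-semicontinuous $w$, the functions $\phi_n(x) := \inf_y \{ w(y) + n|x-y| \}$ are $n$-Lipschitz (hence continuous), lie below $w$, and increase pointwise to $w$. The grid-based sketch below illustrates this (the particular LSC function $w = \mathbf{1}_{\{x > 0\}}$ and the grid are our own choices; Lemma 3.5 of [8] is the reference for the general statement):

```python
# Inf-convolution approximation of an LSC function from below by Lipschitz
# (hence continuous) functions, on a one-dimensional grid.
# w(x) = 1_{x > 0} is lower semicontinuous; phi_n(x) = inf_y [w(y) + n|x - y|].
grid = [i / 100.0 for i in range(-200, 201)]   # grid on [-2, 2]
w = lambda x: 1.0 if x > 0 else 0.0

def phi(n, x):
    """Discretized inf-convolution: n-Lipschitz minorant of w."""
    return min(w(y) + n * abs(x - y) for y in grid)

# phi_n <= phi_{n+1} <= w pointwise: a monotone approximation from below.
for x in grid[::20]:
    assert phi(1, x) <= phi(2, x) + 1e-12 and phi(2, x) <= phi(4, x) + 1e-12
    assert phi(4, x) <= w(x) + 1e-12
print(phi(1, 0.5), phi(10, 0.5), phi(100, 0.5))  # -> 0.5 1.0 1.0, increasing to w(0.5) = 1
```

At the discontinuity point $x = 0$ every $\phi_n$ equals $w(0) = 0$, while at each $x > 0$ the values $\min(nx, 1)$ climb to $1$: pointwise, not uniform, convergence, which is exactly what the monotone convergence argument above needs.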
Proof of Theorem 3.1 1. Let $\nu \in \mathcal{U}_t$ be arbitrary and set $\theta := \theta^\nu$. The first assertion
is a direct consequence of Assumption A4-a. Indeed, it implies that, for $\mathbb{P}$-almost all $\omega \in \Omega$,
there exists $\tilde{\nu}_\omega \in \mathcal{U}_{\theta(\omega)}$ such that
\[
\mathbb{E}\left[ f\left( X^\nu_{t,x}(T) \right) \,\middle|\, \mathcal{F}_\theta \right](\omega) \le J(\theta(\omega), X^\nu_{t,x}(\theta)(\omega); \tilde{\nu}_\omega).
\]
Since, by definition, $J(\theta(\omega), X^\nu_{t,x}(\theta)(\omega); \tilde{\nu}_\omega) \le V^*(\theta(\omega), X^\nu_{t,x}(\theta)(\omega))$, it follows from the tower
property of conditional expectations that
\[
\mathbb{E}\left[ f\left( X^\nu_{t,x}(T) \right) \right] = \mathbb{E}\left[ \mathbb{E}\left[ f\left( X^\nu_{t,x}(T) \right) \,\middle|\, \mathcal{F}_\theta \right] \right] \le \mathbb{E}\left[ V^*\left( \theta, X^\nu_{t,x}(\theta) \right) \right].
\]
2. Let $\{(t_i, x_i), i \ge 1\} := \mathbb{Q}^{d+1} \cap \mathbf{S}$, and let $\varepsilon > 0$ be given. Then there is a sequence
$(\nu^{i,\varepsilon})_{i \ge 1} \subset \mathcal{U}_0$ such that:
\[
\nu^{i,\varepsilon} \in \mathcal{U}_{t_i} \ \text{ and } \ J(t_i, x_i; \nu^{i,\varepsilon}) \ge V(t_i, x_i) - \varepsilon, \quad \text{for every } i \ge 1. \tag{3.4}
\]

References
M. G. Crandall, H. Ishii and P.-L. Lions, User's guide to viscosity solutions of second order partial differential equations.
W. H. Fleming and H. M. Soner, Controlled Markov Processes and Viscosity Solutions.
J. Yong and X. Y. Zhou, Stochastic Controls: Hamiltonian Systems and HJB Equations.
D. P. Bertsekas and S. E. Shreve, Stochastic Optimal Control: The Discrete Time Case.
B. Øksendal and A. Sulem, Applied Stochastic Control of Jump Diffusions.