What is the simplest way to get the limit?

Since the authors have $\|d_c(q,t)\| \le \bar{b}_2\bar{d}$ and the differential dynamics is $\delta\dot{q} = (A(x,t) - B_1(x,t)B_1(x,t)^T M(x,t))\delta q$ (20) when $d_c = 0$, the authors get $\lim_{t\to\infty}\int_0^x \|\delta q\| \le \bar{b}_2\bar{d}\chi/\alpha$ by the same proof as for Theorem 3 with (13) replaced by (20), (14) by (18), and $R_e(t)$ by $R_c(t) = \int_0^x \|\Theta(x,t)\delta q(t)\|$, where $M = \Theta^T\Theta$.

What is the goal of the motion planning problem?

The goal of the motion planning problem is to find an optimal trajectory that avoids these obstacles and minimizes $\int_0^{50}\|u(t)\|^2\,dt$ subject to the input constraints $0 \le u_i(t) \le 1$, $\forall i, \forall t$, and the dynamics constraints.

How many layers are underfit to the training samples?

The authors can see that the models with more than 2 layers overfit and those with fewer than 32 hidden units underfit the training samples.

How do the authors get a contraction metric?

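Multiplying (3) by $\mu^{-1}$ from both sides gives $\underline{\omega} I \preceq W(x,t) \preceq \overline{\omega} I$, as the authors have $\underline{\omega}, \overline{\omega} > 0$ and $(\mu^{-1})^2 = W$. The authors get (7) by multiplying this inequality by $\nu = 1/\underline{\omega}$.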

What is the difference between the two definitenesses?

Instead of directly using sequential data of optimal contraction metrics $\{M(x(t_i), t_i)\}_{i=0}^N$ for neural network training, the positive definiteness of $M(x,t)$ is utilized to reduce the dimension of the target output $\{y_i\}_{i=0}^N$ defined in Section II.

(Open Access) Neural Contraction Metrics for Robust Estimation and Control: A Convex Optimization Approach (2021) | Hiroyasu Tsukamoto

Q: What have the authors contributed in "Neural contraction metrics for robust estimation and control: a convex optimization approach" ?

This letter presents a new deep learning-based framework for robust nonlinear estimation and control using the concept of a Neural Contraction Metric (NCM). The authors demonstrate how to exploit NCMs to design an online optimal estimator and controller for nonlinear systems with bounded disturbances utilizing their duality.

Q: What is the novelty of the NCM approach?

The novelty of the NCM approach lies in that: 1) data points of the optimal contraction metric are sampled offline by solving a convex optimization problem, which minimizes an upper bound of the steady-state Euclidean distance between the perturbed and unperturbed trajectories without assuming any hypothesis function space, and 2) the deep LSTM-RNN is constructed to model the sampled metrics with arbitrary accuracy.

Q: What is the result of Lemma 2?

As a result of Lemma 2, the authors can select $\Theta(x,t)$ defined in Theorem 1 as the unique Cholesky decomposition of $M(x,t)$ and train the deep LSTM-RNN using only the non-zero entries of the unique upper triangular matrices $\{\Theta(x(t_i), t_i)\}_{i=0}^N$.

Q: What is the simplest formulation for the NCM?

In this section, the authors propose its simpler formulation for nonlinear systems with bounded disturbances in order to be of practical use in engineering applications.

Q: How can α be found in the problem (8)?

Although α is fixed in Theorem 2, it can be found by a line search as will be demonstrated in Section V. Remark 2: The problem (8) can be solved as a finite-dimensional problem by using backward difference approximation, $\dot{\tilde{W}}(x(t_i), t_i) \simeq (\tilde{W}(x(t_i), t_i) - \tilde{W}(x(t_{i-1}), t_{i-1}))/\Delta t_i$, where $\Delta t_i = t_i - t_{i-1}$, $\forall i$ with $\Delta t_i \gg \Delta t_i^2 > 0$, and by discretizing it along a pre-computed system trajectory $\{x(t_i)\}_{i=0}^N$.

IEEE CONTROL SYSTEMS LETTERS, VOL. 5, NO. 1, JANUARY 2021 211

Neural Contraction Metrics for Robust Estimation and Control: A Convex Optimization Approach

Hiroyasu Tsukamoto, Graduate Student Member, IEEE, and Soon-Jo Chung, Senior Member, IEEE

Abstract: This letter presents a new deep learning-based framework for robust nonlinear estimation and control using the concept of a Neural Contraction Metric (NCM). The NCM uses a deep long short-term memory recurrent neural network for a global approximation of an optimal contraction metric, the existence of which is a necessary and sufficient condition for exponential stability of nonlinear systems. The optimality stems from the fact that the contraction metrics sampled offline are the solutions of a convex optimization problem to minimize an upper bound of the steady-state Euclidean distance between perturbed and unperturbed system trajectories. We demonstrate how to exploit NCMs to design an online optimal estimator and controller for nonlinear systems with bounded disturbances utilizing their duality. The performance of our framework is illustrated through Lorenz oscillator state estimation and spacecraft optimal motion planning problems.

Index Terms: Machine learning, observers for nonlinear systems, optimal control.

Manuscript received March 17, 2020; revised May 15, 2020; accepted June 4, 2020. Date of publication June 11, 2020; date of current version June 24, 2020. This work was supported in part by the Jet Propulsion Laboratory, California Institute of Technology and in part by the Raytheon Company. Recommended by Senior Editor G. Cherubini. (Corresponding author: Hiroyasu Tsukamoto.)

The authors are with the Graduate Aerospace Laboratories, California Institute of Technology, Pasadena, CA 91125 USA (e-mail: htsukamoto@caltech.edu; sjchung@caltech.edu).

Data is available on-line at https://github.com/astrohiro/ncm

Digital Object Identifier 10.1109/LCSYS.2020.3001646

I. INTRODUCTION

Provably stable and optimal state estimation and control algorithms for a class of nonlinear dynamical systems with external disturbances are essential to develop autonomous robotic explorers operating remotely on land, in water, and in deep space. In these next generation missions, these robots are supposed to intelligently perform complex tasks with their limited computational resources, which are not necessarily powerful enough to run optimization algorithms in real-time.

Our main contribution is to introduce a Neural Contraction Metric (NCM), a global representation of optimal contraction metrics sampled offline by using a deep Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) (see Fig. 1), and thereby propose a new framework for provably stable and optimal online estimation and control of nonlinear systems with bounded disturbances, which only requires one function evaluation at each time step. A deep LSTM-RNN [1], [2] is a recurrent neural network with an improved memory structure proposed to circumvent gradient vanishing [3] and is a universal approximator of continuous curves [4]. Contrary to previous works, the convex optimization-based sampling methodology in our framework allows us to obtain a large enough dataset of the optimal contraction metric without assuming any hypothesis function space. These sampled metrics, the existence of which is a necessary and sufficient condition for exponential convergence [5], can be approximated with arbitrary accuracy due to the high representational power of the deep LSTM-RNN. We remark that this approach can be used with learned dynamics [6] as a nominal model is assumed to be given. Also, this is distinct from Lyapunov neural networks designed to estimate a largest safe region for deterministic systems [7], [8]: the NCM provides provably stable estimation and control policies, which have a duality in their differential dynamics and are optimal in terms of disturbance attenuation. The NCM construction is summarized as follows.

In the offline phase, we sample contraction metrics by solving an optimization problem with exponential stability constraints, the objective of which is to minimize an upper bound of the steady-state Euclidean distance between perturbed and unperturbed system trajectories. In this letter, we present a convex optimization problem equivalent to this problem, thereby exploiting the differential nature of contraction analysis that enables Linear Time-Varying (LTV) systems-type approaches to Lyapunov function construction. For the sake of practical use, the sampling methodology is reduced to a much simpler formulation than those of [9]–[11] derived for Itô stochastic nonlinear systems. These optimal contraction metrics are sampled using the computationally efficient numerical methods for convex programming [12]–[14] and then modeled by the deep LSTM-RNN as depicted in Fig. 1. In the online phase, contraction metrics at each time instant are computed by the NCM to obtain the optimal feedback estimation and control gain or a bounded error tube for robust motion planning [15], [16].

We illustrate how to design an optimal NCM-based estimator and controller for nonlinear systems with bounded disturbances, utilizing the estimation and control duality in differential dynamics analogous to the one of the Kalman filter and Linear Quadratic Regulator (LQR) in LTV systems. Their performance is demonstrated using Lorenz oscillator state estimation and spacecraft optimal motion planning problems.

Related Work: Contraction analysis, as well as Lyapunov theory, is one of the most powerful tools in analyzing the stability of nonlinear systems [5]. It studies the differential (virtual) dynamics for the sake of incremental stability by means of a contraction metric, the existence of which leads to a necessary and sufficient characterization of exponential stability of nonlinear systems. Finding an optimal contraction metric for general nonlinear systems is, however, almost as difficult as finding an optimal Lyapunov function.

Fig. 1. Illustration of the NCM: $M(x,t)$ denotes the optimal contraction metric; $x(t)$ denotes the perturbed system trajectory, shown together with its unperturbed counterpart; $h_i$ and $c_i$ denote the hidden states of the deep LSTM-RNN.

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/

Several numerical methods have been developed for finding contraction metrics in a given hypothesis function space. A natural application of this concept is to represent their candidate as a linear combination of some basis functions [17]–[20]. In [21], [22], a tractable framework to construct contraction metrics for dynamics with polynomial vector fields is proposed by relaxing the stability conditions to the sum of squares conditions. Although it is ideal to use a larger number of basis functions in search of a more optimal solution, the downside of this approach is that the problem size grows exponentially with the number of variables and basis functions [23].

We could thus alternatively rely on numerical schemes for sampling data points of a Lyapunov function without assuming any hypothesis function space. This includes the state-dependent Riccati equation method [24], [25], and it is proposed in [9]–[11] that this framework can be improved to obtain an optimal contraction metric, which minimizes an upper bound of the steady-state mean squared tracking error for nonlinear stochastic systems. However, solving a nonlinear system of equations or an optimization problem at each time instant is not suitable for systems with limited online computational capacity. The NCM addresses this issue by approximating the sampled solutions by the LSTM-RNN.

II. PRELIMINARIES

We use the notation $\|x\|$ and $\|A\|$ for the Euclidean and induced 2-norm, and $A \succ 0$, $A \succeq 0$, $A \prec 0$, and $A \preceq 0$ for positive definite, positive semi-definite, negative definite, and negative semi-definite matrices, respectively. Also, $\mathrm{sym}(A) = A + A^T$ and $I$ denotes the identity matrix. We first introduce some preliminaries that will be used to construct an NCM.

A. Contraction Analysis for Incremental Stability

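Consider the following perturbed nonlinear system:

$$\dot{x}(t) = f(x(t), t) + d(t) \tag{1}$$

where $t \in \mathbb{R}_{\ge 0}$, $x : \mathbb{R}_{\ge 0} \to \mathbb{R}^n$, $f : \mathbb{R}^n \times \mathbb{R}_{\ge 0} \to \mathbb{R}^n$, and $d : \mathbb{R}_{\ge 0} \to \mathbb{R}^n$ with $\bar{d} = \sup_{t \ge 0}\|d(t)\| < +\infty$.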

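Theorem 1: Let $x_0(t)$ and $x_1(t)$ be the solution of (1) with $d(t) = 0$ and $d(t) \ne 0$, respectively. Suppose there exist $M(x,t) = \Theta(x,t)^T\Theta(x,t) \succ 0$, $\alpha > 0$, and $0 < \underline{\omega}, \overline{\omega} < \infty$ s.t.

$$\dot{M}(x,t) + \mathrm{sym}\left(M(x,t)\frac{\partial f(x,t)}{\partial x}\right) \preceq -2\alpha M(x,t), \ \forall x, t \tag{2}$$

$$\overline{\omega}^{-1} I \preceq M(x,t) \preceq \underline{\omega}^{-1} I, \ \forall x, t. \tag{3}$$

Then the smallest path integral is exponentially bounded, thereby yielding a bounded tube of states:

$$\int_{x_0}^{x_1}\|\delta x\| - R(0)\sqrt{\overline{\omega}}\,e^{-\alpha t} \le \frac{\bar{d}}{\alpha}\sqrt{\frac{\overline{\omega}}{\underline{\omega}}} \le \frac{\bar{d}\,\overline{\omega}}{\alpha\,\underline{\omega}} \tag{4}$$

where $R(t) = \int_{x_0}^{x_1}\|\Theta(x,t)\delta x(t)\|$, $\forall t$.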

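Proof: Differentiating $R(t)$ yields $\dot{R}(t) + \alpha R(t) \le \|\Theta(x(t),t)d(t)\|$ (see [5]). Since we have $\|\Theta(x(t),t)d(t)\| \le \bar{d}/\sqrt{\underline{\omega}}$, applying the comparison lemma [26] results in $R(t) \le R(0)e^{-\alpha t} + \bar{d}/(\alpha\sqrt{\underline{\omega}})$. Rewriting this inequality using the relations $\sqrt{\overline{\omega}}R(t) \ge \int_{x_0}^{x_1}\|\delta x\|$ and $1 \le \sqrt{\overline{\omega}/\underline{\omega}} \le \overline{\omega}/\underline{\omega}$ due to $0 < \underline{\omega} \le \overline{\omega}$ completes the proof. Input-to-state stability and finite-gain $L_p$ stability follow from (4) (see [15]).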
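The bound (4) is straightforward to evaluate numerically. The following is a minimal sketch, not taken from the letter, that computes the time-varying bound and the two steady-state bounds; the function and argument names (`path_length_bound`, `w_lo`, `w_hi`, and so on) are illustrative assumptions, with `w_lo` and `w_hi` standing for the lower and upper constants that bound $W = M^{-1}$.

```python
import math


def path_length_bound(t, r0, d_bar, alpha, w_lo, w_hi):
    """Upper bound on the path integral of ||delta x|| at time t, following (4).

    r0 is R(0), d_bar the disturbance bound, alpha the contraction rate,
    and 0 < w_lo <= w_hi the constants bounding W = M^{-1} (hypothetical names).
    """
    if not (0.0 < w_lo <= w_hi) or alpha <= 0.0:
        raise ValueError("need 0 < w_lo <= w_hi and alpha > 0")
    transient = r0 * math.sqrt(w_hi) * math.exp(-alpha * t)
    steady = (d_bar / alpha) * math.sqrt(w_hi / w_lo)
    return transient + steady


def steady_state_bounds(d_bar, alpha, w_lo, w_hi):
    """Return the tighter and the looser steady-state bound appearing in (4)."""
    chi = w_hi / w_lo  # condition-number-like ratio, always >= 1
    return (d_bar / alpha) * math.sqrt(chi), (d_bar / alpha) * chi
```

Because the ratio of the two constants is at least 1, the first returned value never exceeds the second; the looser one is the quantity that the convex formulation later minimizes.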

B. Deep LSTM-RNN

An LSTM-RNN is a neural network designed for processing sequential data with inputs $\{x_i\}_{i=0}^N$ and outputs $\{y_i\}_{i=0}^N$, and defined as $y_i = W_{hy}h_i + b_y$ where $h_i = \phi(x_i, h_{i-1}, c_{i-1})$. The activation function $\phi$ is given by the following relations: $h_i = o_i\tanh(c_i)$, $o_i = \sigma(W_{xo}x_i + W_{ho}h_{i-1} + W_{co}c_i + b_o)$, $c_i = f_i c_{i-1} + \iota_i\tanh(W_{xc}x_i + W_{hc}h_{i-1} + b_c)$, $f_i = \sigma(W_{xf}x_i + W_{hf}h_{i-1} + W_{cf}c_{i-1} + b_f)$, and $\iota_i = \sigma(W_{x\iota}x_i + W_{h\iota}h_{i-1} + W_{c\iota}c_{i-1} + b_\iota)$, where $\sigma$ is the logistic sigmoid function and $W$ and $b$ terms represent weight matrices and bias vectors to be optimized, respectively. The deep LSTM-RNN can be constructed by stacking multiple of these layers [1], [2].
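To make the gate relations concrete, here is a minimal scalar sketch of one LSTM step and of a forward pass over a sequence. It assumes the peephole form of the deep LSTM in [2] (the cell state entering the gates) and uses scalar weights purely for readability; the dictionary keys such as `W_xf` are hypothetical names, and a practical model would use matrices and a deep-learning library.

```python
import math


def sigmoid(z):
    """Logistic sigmoid."""
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x, h_prev, c_prev, p):
    """One step of a scalar peephole LSTM cell; p maps weight names to floats."""
    f = sigmoid(p["W_xf"] * x + p["W_hf"] * h_prev + p["W_cf"] * c_prev + p["b_f"])  # forget gate
    i = sigmoid(p["W_xi"] * x + p["W_hi"] * h_prev + p["W_ci"] * c_prev + p["b_i"])  # input gate
    c = f * c_prev + i * math.tanh(p["W_xc"] * x + p["W_hc"] * h_prev + p["b_c"])    # cell state
    o = sigmoid(p["W_xo"] * x + p["W_ho"] * h_prev + p["W_co"] * c + p["b_o"])       # output gate
    h = o * math.tanh(c)                                                            # hidden state
    return h, c


def lstm_run(xs, p):
    """Run the cell over a sequence and return the outputs y_i = W_hy * h_i + b_y."""
    h, c = 0.0, 0.0
    ys = []
    for x in xs:
        h, c = lstm_step(x, h, c, p)
        ys.append(p["W_hy"] * h + p["b_y"])
    return ys
```

Stacking layers amounts to feeding the hidden-state sequence of one such cell as the input sequence of the next.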

Since contraction analysis for discrete-time systems leads to similar results [5], [11], we define the inputs $x_i$ as discretized states $\{x_i = x(t_i)\}_{i=0}^N$, and the outputs $y_i$ as non-zero components of the unique Cholesky decomposition of the optimal contraction metric, as will be discussed in Section III.

III. NEURAL CONTRACTION METRIC (NCM)

This section presents an algorithm to obtain an NCM depicted in Fig. 1.

A. Convex Optimization-Based Sampling of Contraction Metrics (CV-STEM)

We derive one approach to sample contraction metrics by using the ConVex optimization-based Steady-state Tracking Error Minimization (CV-STEM) method [10], [11], which could handle the control design of Itô stochastic nonlinear systems. In this section, we propose its simpler formulation for nonlinear systems with bounded disturbances in order to be of practical use in engineering applications.

By Theorem 1, the problem to minimize an upper bound of the steady-state Euclidean distance between the trajectory $x_0(t)$ of the unperturbed system and $x_1(t)$ of the perturbed system (1) (i.e., (4) as $t \to \infty$) can be formulated as follows:

$$J^*_{NL} = \min_{\underline{\omega} > 0,\ \overline{\omega} > 0,\ W \succ 0} \frac{\bar{d}\,\overline{\omega}}{\alpha\,\underline{\omega}} \quad \text{s.t. (2) and (3)} \tag{5}$$

where $W(x,t) = M(x,t)^{-1}$ is used as a decision variable instead of $M(x,t)$. We assume that the contraction rate $\alpha$ and disturbance bound $\bar{d}$ are given in (5) (see Remark 1 on how to select $\alpha$). We need the following lemma to convexify this nonlinear optimization problem.

Lemma 1: The inequalities (2) and (3) are equivalent to

$$\dot{\tilde{W}}(x,t) - \mathrm{sym}\left(\frac{\partial f(x,t)}{\partial x}\tilde{W}(x,t)\right) \succeq 2\alpha\tilde{W}(x,t), \ \forall x, t \tag{6}$$

$$I \preceq \tilde{W}(x,t) \preceq \chi I, \ \forall x, t \tag{7}$$

respectively, where $\chi = \overline{\omega}/\underline{\omega}$, $\tilde{W} = \nu W$, and $\nu = 1/\underline{\omega}$.

Proof: Since $\nu = 1/\underline{\omega} > 0$ and $W(x,t) \succ 0$, multiplying (2) by $\nu$ and then by $W(x,t)$ from both sides preserves matrix definiteness and the resultant inequalities are equivalent to the original ones [27, p. 114]. These operations yield (6). Next, since $M(x,t) \succ 0$, there exists a unique $\mu(x,t) \succ 0$ s.t. $M = \mu^2$. Multiplying (3) by $\mu^{-1}$ from both sides gives $\underline{\omega} I \preceq W(x,t) \preceq \overline{\omega} I$ as we have $\underline{\omega}, \overline{\omega} > 0$ and $(\mu^{-1})^2 = W$. We get (7) by multiplying this inequality by $\nu = 1/\underline{\omega}$.

We are now ready to state and prove our main result on the convex optimization-based sampling.

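Theorem 2: Consider the convex optimization problem:

$$J^*_{CV} = \min_{\chi \in \mathbb{R},\ \tilde{W} \succ 0} \frac{\bar{d}\chi}{\alpha} \quad \text{s.t. (6) and (7)} \tag{8}$$

where $\chi$ and $\tilde{W}$ are defined in Lemma 1, and $\alpha > 0$ and $\bar{d} = \sup_{t \ge 0}\|d(t)\|$ are assumed to be given. Then $J^*_{NL} = J^*_{CV}$.

Proof: By definition, we have $\bar{d}\,\overline{\omega}/(\alpha\,\underline{\omega}) = \bar{d}\chi/\alpha$. Since (2) and (3) are equivalent to (6) and (7) by Lemma 1, rewriting the objective in the original problem (5) using this equality completes the proof.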
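As a quick sanity check of the convex formulation, consider a worked scalar example that is not part of the letter: the system $\dot{x} = -a x + d(t)$ with $a > 0$, and a constant scaled variable $w > 0$ in place of the matrix decision variable. These choices are illustrative assumptions.

```latex
\begin{align*}
&\text{(6) with zero time derivative and } \partial f/\partial x = -a:
  && 2a\,w \ \ge\ 2\alpha\,w \iff \alpha \le a,\\
&\text{(7):}
  && 1 \le w \le \chi \ \Longrightarrow\ \chi^* = 1 \ \text{(attained at } w = 1\text{)},\\
&\text{objective of (8):}
  && \frac{\bar{d}\,\chi^*}{\alpha} = \frac{\bar{d}}{\alpha} \ \ge\ \frac{\bar{d}}{a}.
\end{align*}
```

The largest admissible contraction rate, $\alpha = a$, gives the bound $\bar{d}/a$, which coincides with the exact steady-state bound on the difference between the perturbed and unperturbed solutions of this linear system. This also hints at why $\alpha$ is worth tuning by a line search.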

Remark 1: Since (6) and (7) are independent of $\nu = 1/\underline{\omega}$, the choice of $\nu$ does not affect the optimal value of the minimization problem in Theorem 2. In practice, as we have $\sup_{x,t}\|M(x,t)\| \le 1/\underline{\omega} = \nu$ by (3), it can be used as a penalty to optimally adjust the induced 2-norm of estimation and control gains when the problem explicitly depends on $\nu$ (see Section IV for details). Also, although $\alpha$ is fixed in Theorem 2, it can be found by a line search as will be demonstrated in Section V.

Remark 2: The problem (8) can be solved as a finite-dimensional problem by using backward difference approximation, $\dot{\tilde{W}}(x(t_i), t_i) \simeq (\tilde{W}(x(t_i), t_i) - \tilde{W}(x(t_{i-1}), t_{i-1}))/\Delta t_i$, where $\Delta t_i = t_i - t_{i-1}$, $\forall i$ with $\Delta t_i \gg \Delta t_i^2 > 0$, and by discretizing it along a pre-computed system trajectory $\{x(t_i)\}_{i=0}^N$.
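The backward difference in Remark 2 can be sketched as follows. This is an illustrative helper operating on plain nested lists, with hypothetical function names; in an actual solver the same expression would be written on the optimization variables so that the derivative term stays affine in them.

```python
def backward_difference(W_curr, W_prev, dt):
    """Entrywise approximation of dW/dt at t_i by (W(t_i) - W(t_{i-1})) / dt."""
    if dt <= 0.0:
        raise ValueError("dt must be positive")
    return [[(a - b) / dt for a, b in zip(row_c, row_p)]
            for row_c, row_p in zip(W_curr, W_prev)]


def difference_along_trajectory(Ws, ts):
    """Apply the backward difference at every sample i = 1, ..., N.

    Ws[i] is the matrix sampled at time ts[i]; the result has len(Ws) - 1 entries.
    """
    return [backward_difference(Ws[i], Ws[i - 1], ts[i] - ts[i - 1])
            for i in range(1, len(Ws))]
```

No derivative is produced for the first sample, since it has no predecessor along the trajectory.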

B. Deep LSTM-RNN Training

Instead of directly using sequential data of optimal contraction metrics $\{M(x(t_i), t_i)\}_{i=0}^N$ for neural network training, the positive definiteness of $M(x,t)$ is utilized to reduce the dimension of the target output $\{y_i\}_{i=0}^N$ defined in Section II.

Lemma 2: A matrix $A \succ 0$ has a unique Cholesky decomposition, i.e., there exists a unique upper triangular matrix $U \in \mathbb{R}^{n \times n}$ with strictly positive diagonal entries s.t. $A = U^T U$.

Proof: See [28, p. 441].

As a result of Lemma 2, we can select $\Theta(x,t)$ defined in Theorem 1 as the unique Cholesky decomposition of $M(x,t)$ and train the deep LSTM-RNN using only the non-zero entries of the unique upper triangular matrices $\{\Theta(x(t_i), t_i)\}_{i=0}^N$. We denote these nonzero entries as $\theta(x,t) \in \mathbb{R}^{n(n+1)/2}$. As a result, the dimension of the target data $\theta(x,t)$ is reduced by $n(n-1)/2$ without losing any information on $M(x,t)$.
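The reduction from a full metric to its Cholesky entries can be sketched in a few lines. The helpers below (`cholesky_upper`, `to_theta`, `from_theta`) are hypothetical names written with plain lists for self-containment; they show that the $n(n+1)/2$ upper-triangular entries are enough to rebuild the metric.

```python
import math


def cholesky_upper(M):
    """Return the upper-triangular U with positive diagonal such that M = U^T U."""
    n = len(M)
    U = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            s = M[i][j] - sum(U[k][i] * U[k][j] for k in range(i))
            if i == j:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                U[i][i] = math.sqrt(s)
            else:
                U[i][j] = s / U[i][i]
    return U


def to_theta(U):
    """Flatten the upper triangle of U (row by row) into a list of n(n+1)/2 entries."""
    n = len(U)
    return [U[i][j] for i in range(n) for j in range(i, n)]


def from_theta(theta, n):
    """Rebuild M = U^T U from the flattened upper-triangular entries."""
    U = [[0.0] * n for _ in range(n)]
    entries = iter(theta)
    for i in range(n):
        for j in range(i, n):
            U[i][j] = next(entries)
    return [[sum(U[k][i] * U[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]
```

A side benefit of predicting the triangular factor is that the rebuilt matrix is positive semi-definite by construction, and positive definite whenever the predicted diagonal entries are nonzero.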

The pseudocode to obtain an NCM depicted in Fig. 1 is presented in Algorithm 1. The deep LSTM-RNN in Section II is trained with the sequential state data $\{x(t_i)\}_{i=0}^N$ and the target data $\{\theta(x(t_i), t_i)\}_{i=0}^N$ using Stochastic Gradient Descent (SGD). We note these pairs will be sampled for multiple trajectories to increase sample size and to avoid overfitting.

Algorithm 1: NCM Algorithm

Inputs: Initial and terminal states $\{x_s(t_0), x_s(t_N)\}_{s=1}^S$
Outputs: NCM and steady-state bound $J^*_{CV}$ in (8)

A. Sampling of Optimal Contraction Metrics
    for $s \leftarrow 1$ to $S$ do
        Generate a trajectory $\{x_s(t_i)\}_{i=0}^N$ using $x_s(t_0)$ (could use $x_s(t_N)$ for motion planning problems)
        for $\alpha \in A_{\mathrm{linesearch}}$ do
            Find $J^*_{CV}(\alpha, x_s)$ and $\{\theta(x_s(t_i), t_i)\}_{i=0}^N$ by Th. 2
        Find $\alpha^*(x_s) = \arg\min_{\alpha \in A_{\mathrm{linesearch}}} J^*_{CV}(\alpha, x_s)$
        Save $\{\theta(x_s(t_i), t_i)\}_{i=0}^N$ for $\alpha = \alpha^*(x_s)$
    Obtain $J^*_{CV} = \max_s J^*_{CV}(\alpha^*(x_s), x_s)$

B. Deep LSTM-RNN Training
    Split data into a train set $S_{\mathrm{train}}$ and test set $S_{\mathrm{test}}$
    for epoch $\leftarrow 1$ to $N_{\mathrm{epochs}}$ do
        for $s \in S_{\mathrm{train}}$ do
            Train the deep LSTM-RNN with $\{x_s(t_i)\}_{i=0}^N$, $\{\theta(x_s(t_i), t_i)\}_{i=0}^N$ using SGD
        Compute the test error for data in $S_{\mathrm{test}}$
        if test error $< \epsilon$ then
            break
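The sampling stage of Algorithm 1 reduces to a line search over the contraction rate per trajectory, followed by a worst case over trajectories. The sketch below illustrates only that control flow; `solve_cvstem` is a hypothetical callback standing in for the convex program of Theorem 2 (returning the optimal objective, or `None` when the problem is infeasible for that rate), and no real solver API is implied.

```python
def line_search(alphas, solve_cvstem):
    """Return (alpha_star, J_star) minimizing solve_cvstem(alpha) over the candidates.

    solve_cvstem is a user-supplied callable (hypothetical here) that returns the
    optimal objective for a given alpha, or None if the problem is infeasible.
    """
    best = None
    for alpha in alphas:
        J = solve_cvstem(alpha)
        if J is None:
            continue  # skip infeasible contraction rates
        if best is None or J < best[1]:
            best = (alpha, J)
    if best is None:
        raise ValueError("no feasible alpha in the search set")
    return best


def worst_case_bound(per_trajectory):
    """Largest optimal objective over trajectories, given (alpha_star, J_star) pairs."""
    return max(J for _, J in per_trajectory)
```

As a toy usage example, `line_search([0.5, 1.0, 2.0, 3.0], lambda a: 0.15 / a if a <= 2.0 else None)` selects the rate 2.0, mimicking a scalar system whose constraint becomes infeasible beyond its own decay rate.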

IV. NCM-BASED OPTIMAL ESTIMATION AND CONTROL

This section delineates how to construct an NCM offline and utilize it online for state estimation and feedback control.

A. Problem Statement

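We apply an NCM to the state estimation problem for the following nonlinear system with bounded disturbances:

$$\dot{x} = f(x,t) + B(x,t)d_1(t), \quad y(t) = h(x,t) + G(x,t)d_2(t) \tag{9}$$

where $d_1 : \mathbb{R}_{\ge 0} \to \mathbb{R}^{k_1}$, $B : \mathbb{R}^n \times \mathbb{R}_{\ge 0} \to \mathbb{R}^{n \times k_1}$, $y : \mathbb{R}_{\ge 0} \to \mathbb{R}^m$, $d_2 : \mathbb{R}_{\ge 0} \to \mathbb{R}^{k_2}$, $h : \mathbb{R}^n \times \mathbb{R}_{\ge 0} \to \mathbb{R}^m$, and $G : \mathbb{R}^n \times \mathbb{R}_{\ge 0} \to \mathbb{R}^{m \times k_2}$ with $\bar{d}_1 = \sup_t\|d_1(t)\| < +\infty$ and $\bar{d}_2 = \sup_t\|d_2(t)\| < +\infty$. Let $W = M(\hat{x},t)^{-1} \succ 0$, $A(x,t) = (\partial f/\partial x)$, and $C(x,t) = (\partial h/\partial x)$. Let $\hat{x} : \mathbb{R}_{\ge 0} \to \mathbb{R}^n$. We design an estimator as

$$\dot{\hat{x}} = f(\hat{x},t) + M(\hat{x},t)C(\hat{x},t)^T(y - h(\hat{x},t)) \tag{10}$$

$$\dot{W} + WA(\hat{x},t) + A(\hat{x},t)^T W - 2C(\hat{x},t)^T C(\hat{x},t) \preceq -2\alpha W \tag{11}$$

where $\alpha > 0$. The virtual system of (9) and (10) is given as

$$\dot{q} = f(q,t) + M(\hat{x},t)C(\hat{x},t)^T(h(x,t) - h(q,t)) + d_e(q,t) \tag{12}$$

where $d_e(q,t)$ is defined as $d_e(x,t) = B(x,t)d_1(t)$ and $d_e(\hat{x},t) = M(\hat{x},t)C(\hat{x},t)^T G(x,t)d_2(t)$. Note that (12) has $q = x$ and $q = \hat{x}$ as its particular solutions. The differential dynamics of (12) with $d_e = 0$ is given as

$$\delta\dot{q} = (A(q,t) - M(\hat{x},t)C(\hat{x},t)^T C(q,t))\delta q. \tag{13}$$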

B. Nonlinear Stability Analysis

We have the following lemma for deriving a condition to guarantee the local contraction of (12) in Theorem 3.

Lemma 3: If (11) holds for $t \ge 0$, there exists $r(t) > 0$ s.t.

$$2\gamma W + \dot{W} + \mathrm{sym}(WA(q,t)) - \mathrm{sym}(C(\hat{x},t)^T C(q,t)) \preceq 0 \tag{14}$$

for all $q(t)$ with $\|q(t) - \hat{x}(t)\| \le r(t)$, where $0 < \gamma < \alpha$.

Proof: See [29, Lemma 2] or [9, Theorem 1].

The following theorem along with this lemma guarantees the exponential stability of the estimator (10).

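Theorem 3: Suppose that there exist positive constants $\underline{\omega}$, $\overline{\omega}$, $\bar{b}$, $\bar{c}$, $\bar{g}$, and $\rho$ s.t. $\underline{\omega} I \preceq W(\hat{x},t) \preceq \overline{\omega} I$, $\|B(x,t)\| \le \bar{b}$, $\|C(\hat{x},t)\| \le \bar{c}$, $\|G(x,t)\| \le \bar{g}$, and $r(t) \ge \rho$, $\forall \hat{x}, x, t$, where $r(t)$ is defined in Lemma 3. If (11) holds and $R_e(0) + D_e/\gamma \le \sqrt{\underline{\omega}}\rho$, where $R_e(t) = \int_{\hat{x}}^{x}\|\Theta(\hat{x},t)\delta q(t)\|$ with $W = \Theta^T\Theta$ and $D_e = \bar{d}_1\bar{b}\sqrt{\overline{\omega}} + \bar{d}_2\bar{c}\bar{g}/\sqrt{\underline{\omega}}$, then the distance between the trajectory of (9) and (10) is exponentially bounded as follows:

$$\int_{\hat{x}}^{x}\|\delta q\| \le \frac{R_e(0)}{\sqrt{\underline{\omega}}}e^{-\gamma t} + \frac{\bar{d}_1\bar{b}}{\gamma}\chi + \frac{\bar{d}_2\bar{c}\bar{g}}{\gamma}\nu \tag{15}$$

where $\chi = \overline{\omega}/\underline{\omega}$, $\nu = 1/\underline{\omega}$, and $0 < \gamma < \alpha$.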

Proof: Using (13), we have $d(\|\Theta\delta q\|^2)/dt = \delta q^T(\dot{W} + \mathrm{sym}(WA(q,t)) - \mathrm{sym}(C(\hat{x},t)^T C(q,t)))\delta q$ when $d_e = 0$. This along with (14) gives $\dot{R}_e(t) \le -\gamma R_e(t)$ in the region where Lemma 3 holds. Thus, using the bound $\|\Theta(\hat{x}(t),t)d_e(q,t)\| \le D_e$, we have $\sqrt{\underline{\omega}}\int_{\hat{x}}^{x}\|\delta q\| \le R_e(0)e^{-\gamma t} + D_e/\gamma$ by the same proof as for Theorem 1. Rewriting this with $\chi$, $\nu$, and $1 \le \sqrt{\chi} \le \chi$ yields (15). This also implies that $\sqrt{\underline{\omega}}\|x - \hat{x}\| \le R_e(0) + D_e/\gamma$, $\forall t$. Hence, the sufficient condition for $\|q - \hat{x}\|$ in Lemma 3 reduces to the one required in this theorem.

C. Convex Optimization-Based Sampling (CV-STEM)

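We have the following proposition to sample optimal contraction metrics for the NCM-based state estimation.

Proposition 1: $M(\hat{x},t)$ that minimizes an upper bound of $\lim_{t\to\infty}\int_{\hat{x}}^{x}\|\delta q\|$ is found by the convex optimization problem:

$$J^*_{CVe} = \min_{\nu > 0,\ \chi \in \mathbb{R},\ \tilde{W} \succ 0} \frac{\bar{d}_1\bar{b}}{\gamma}\chi + \frac{\bar{d}_2\bar{c}\bar{g}}{\gamma}\nu$$

$$\text{s.t. } \dot{\tilde{W}} + \tilde{W}A + A^T\tilde{W} - 2\nu C^T C \preceq -2\alpha\tilde{W} \ \text{ and } \ I \preceq \tilde{W} \preceq \chi I \tag{16}$$

where $\chi = \overline{\omega}/\underline{\omega}$, $\nu = 1/\underline{\omega}$, $\tilde{W} = \nu W$, and $0 < \gamma < \alpha$. The arguments of $A(\hat{x},t)$, $C(\hat{x},t)$, and $\tilde{W}(\hat{x},t)$ are omitted for notational simplicity.

Proof: Multiplying (11) and $\underline{\omega} I \preceq W(\hat{x},t) \preceq \overline{\omega} I$, $\forall \hat{x}, t$ by $\nu$ yields the constraints of (16). Then applying Theorem 2 with the objective function given in (15) of Theorem 3 as $t \to \infty$ yields (16).

We have an analogous result for state feedback control.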

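Corollary 1: Consider the following system and a state feedback controller $u(t)$ with the bounded disturbance $d(t)$:

$$\dot{x} = f(x,t) + B_1(x,t)u + B_2(x,t)d(t) \tag{17}$$

$$\dot{W} - A(x,t)W - WA(x,t)^T + 2B_1(x,t)B_1(x,t)^T \succeq 2\alpha W \tag{18}$$

where $u = -B_1(x,t)^T M(x,t)x$, $B_1 : \mathbb{R}^n \times \mathbb{R}_{\ge 0} \to \mathbb{R}^{n \times m}$, $B_2 : \mathbb{R}^n \times \mathbb{R}_{\ge 0} \to \mathbb{R}^{n \times k}$, $W = M^{-1} \succ 0$, $\alpha > 0$, and $A$ is a matrix defined as $A(x,t)x = f(x,t)$, assuming that $f(x,t) = 0$ at $x = 0$ [24], [25]. Suppose there exist positive constants $\underline{\omega}$, $\overline{\omega}$, and $\bar{b}_2$ s.t. $\underline{\omega} I \preceq W(x,t) \preceq \overline{\omega} I$ and $\|B_2(x,t)\| \le \bar{b}_2$, $\forall x, t$. Then $M(x,t)$ that minimizes an upper bound of $\lim_{t\to\infty}\int_0^x\|\delta q\|$ can be found by the following convex optimization problem:

$$J^*_{CVc} = \min_{\nu > 0,\ \chi \in \mathbb{R},\ \tilde{W} \succ 0} \frac{\bar{b}_2\bar{d}}{\alpha}\chi + \lambda\nu$$

$$\text{s.t. } -\dot{\tilde{W}} + A\tilde{W} + \tilde{W}A^T - 2\nu B_1 B_1^T \preceq -2\alpha\tilde{W} \ \text{ and } \ I \preceq \tilde{W} \preceq \chi I \tag{19}$$

where $\chi = \overline{\omega}/\underline{\omega}$, $\nu = 1/\underline{\omega}$, $\tilde{W} = \nu W$, and $\lambda > 0$ is a user-defined constant. The arguments of $A(x,t)$, $B_1(x,t)$, and $\tilde{W}(x,t)$ are omitted for notational simplicity.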

Proof: The system with $q = x, 0$ as its particular solutions is given by $\dot{q} = (A(x,t) - B_1(x,t)B_1(x,t)^T M(x,t))q + d_c(q,t)$, where $d_c(x,t) = B_2(x,t)d(t)$ and $d_c(0,t) = 0$. Since we have $\|d_c(q,t)\| \le \bar{b}_2\bar{d}$ and the differential dynamics is

$$\delta\dot{q} = (A(x,t) - B_1(x,t)B_1(x,t)^T M(x,t))\delta q \tag{20}$$

when $d_c = 0$, we get $\lim_{t\to\infty}\int_0^x\|\delta q\| \le \bar{b}_2\bar{d}\chi/\alpha$ by the same proof as for Theorem 3 with (13) replaced by (20), (14) by (18), and $R_e(t)$ by $R_c(t) = \int_0^x\|\Theta(x,t)\delta q(t)\|$, where $M = \Theta^T\Theta$. Equation (19) then follows as in the proof of Proposition 1, where $\lambda \ge 0$ is for penalizing excessively large control inputs through $\nu \ge \sup_{x,t}\|M(x,t)\|$ (see Remark 1).

D. NCM Construction and Interpretation

Algorithm 1 along with Proposition 1 and Corollary 1 returns NCMs to compute $\hat{x}(t)$ of (10) and $u(t)$ of (17) for state estimation and control in real-time. They also provide the bounded error tube (see Theorem 1, [15], [16]) for robust motion planning problems as will be seen in Section V.

The similarity of Corollary 1 to Proposition 1 stems from the estimation and control duality due to the differential nature of contraction analysis as is evident from (13) and (20). Analogously to the discussion of the Kalman filter and LQR duality in LTV systems, this leads to two different interpretations on the weight of $\nu$ (i.e., $\bar{d}_2\bar{c}\bar{g}/\gamma$ in (16) and $\lambda$ in (19)). As discussed in Remark 1, one way is to see it as a penalty on the induced 2-norm of feedback gains. Since $\bar{d}_2 = 0$ in (16) means no noise acts on $y(t)$, it can also be viewed as an indicator of how much we trust the measurement $y(t)$: the larger the weight of $\nu$, the less confident we are in $y(t)$. These agree with our intuition as smaller feedback gains are suitable for measurements with larger uncertainty.

V. SIMULATION

The NCM framework is demonstrated using Lorenz oscillator state estimation and spacecraft motion planning and control problems. CVXPY [13] with the MOSEK solver [14] is used to solve convex optimization problems. A Python implementation is available at https://github.com/astrohiro/ncm.

A. State Estimation of Lorenz Oscillator

We first consider state estimation of the Lorenz oscillator with bounded disturbances described as $\dot{x} = f(x) + d_1(t)$ and $y = Cx + d_2(t)$, where $f(x) = [\sigma(x_2 - x_1),\ x_1(\rho - x_3) - x_2,\ x_1 x_2 - \beta x_3]^T$, $x = [x_1, x_2, x_3]^T$, $\sigma = 10$, $\beta = 8/3$, $\rho = 28$, $C = [1\ 0\ 0]$, $\sup_t\|d_1(t)\| = \sqrt{3}$, and $\sup_t\|d_2(t)\| = 1$. We use $dt = 0.1$ for integration, with one measurement $y$ per $dt$.
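For illustration, the Lorenz vector field and one integration step of the estimator (10) can be sketched as below. The explicit Euler scheme, the function names, and the constant 3-by-3 matrix `M` passed in by the caller are assumptions made for this sketch; in the NCM framework the metric would be supplied at each step by the trained network, and the integrator used in the letter is not specified here.

```python
SIGMA, BETA, RHO = 10.0, 8.0 / 3.0, 28.0


def lorenz(x):
    """Lorenz oscillator vector field f(x) for x = [x1, x2, x3]."""
    x1, x2, x3 = x
    return [SIGMA * (x2 - x1), x1 * (RHO - x3) - x2, x1 * x2 - BETA * x3]


def estimator_step(x_hat, y, M, dt):
    """One explicit Euler step of the estimator with h(x) = C x and C = [1, 0, 0].

    M is a 3x3 metric given as nested lists (hypothetical input); since C selects
    the first state, the gain M C^T is simply the first column of M.
    """
    innovation = y - x_hat[0]        # y - C x_hat
    gain = [row[0] for row in M]     # M C^T
    fx = lorenz(x_hat)
    return [xi + dt * (fi + gi * innovation)
            for xi, fi, gi in zip(x_hat, fx, gain)]
```

With a measurement arriving once per step, calling `estimator_step` in a loop reproduces the predict-and-correct structure of (10); a smaller step or a higher-order integrator may be needed for accuracy on a chaotic system.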

1) Sampling of Optimal Contraction Metrics: Using Proposition 1, we sample the optimal contraction metric along 100 trajectories with uniformly distributed initial conditions ($-10 \le x_i \le 10$, $i = 1, 2, 3$). Figure 2 plots $J^*_{CVe}$ in (16) for 100 different trajectories and the optimal $\alpha$ is found to be $\alpha = 3.4970$. The optimal estimator parameters averaged over 100 trajectories for $\alpha = 3.4970$ are summarized in Table I.

2) Deep LSTM-RNN Training: A deep LSTM-RNN is trained using Algorithm 1 and Proposition 1 with the sequential data $\{\{(x_s(t_i), \theta(x_s(t_i)))\}_{i=0}^N\}_{s=1}^S$ sampled over the 100 different trajectories ($S = 100$). Note that $\theta(x_s(t_i))$ are standardized and normalized to make the SGD-based learning process stable. Figure 3 shows the test loss of the LSTM-RNN models with different numbers of layers and hidden units. We can see that the models with more than 2 layers overfit and those with fewer than 32 hidden units underfit the training samples. Thus, the numbers of layers and hidden units are selected as 2 and 64, respectively. The resultant MSE of the trained LSTM-RNN is shown in Table I.

Fig. 2. Upper bound of steady-state errors as a function of $\alpha$: each curve with a different color corresponds to a trajectory with a different initial condition.

TABLE I. NCM estimator parameters for $\alpha = 3.4970$ with $J^*_{CVe}$ and MSE averaged over 100 trajectories.

Fig. 3. LSTM-RNN test loss with 2 layers (left) and 32 hidden units (right).

3) State Estimation With an NCM: The estimation problem is solved using the NCM, sampling-based CV-STEM [10], [11], and Extended Kalman Filter (EKF) with $\sup_t\|d_1(t)\| = 20\sqrt{3}$ and $\sup_t\|d_2(t)\| = 20$. We use $x(0) = [-1.0, 2.0, 3.0]^T$ and $\hat{x}(0) = [150.1, -1.5, -6]^T$ for the actual and estimated initial conditions, respectively. The EKF weight matrices are selected as $R = 20I$ and $Q = 10I$.

Figure 4 shows the smoothed estimation error $\|x(t) - \hat{x}(t)\|$ using a 15-point moving average filter. The errors of the NCM and CV-STEM estimators are below the optimal upper bound while the EKF has a larger error compared to the other two. As expected from the small MSE of Table I, Figure 4 implies that the NCM estimator is able to approximate the sampling-based CV-STEM estimator without losing its estimation performance.
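The smoothing step mentioned above can be sketched as a simple moving average. Whether the original plots use a trailing or a centered window is not stated, so the trailing window below (with shorter averages during warm-up) is an assumption, as is the function name.

```python
from collections import deque


def moving_average(values, window=15):
    """Trailing moving average; the first window - 1 outputs average the samples seen so far."""
    if window < 1:
        raise ValueError("window must be at least 1")
    buf = deque(maxlen=window)
    total = 0.0
    out = []
    for v in values:
        if len(buf) == window:
            total -= buf[0]  # the oldest sample is about to be evicted
        buf.append(v)
        total += v
        out.append(total / len(buf))
    return out
```

For example, `moving_average([1, 2, 3, 4], window=2)` returns `[1.0, 1.5, 2.5, 3.5]`.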

B. Spacecraft Optimal Motion Planning

We consider an optimal motion planning problem of the planar spacecraft dynamical system, given as $\dot{x} = Ax + B(x)u + d(t)$, where $u \in \mathbb{R}^8$, $\sup_t\|d(t)\| = 0.15$, and $x = [p_x, p_y, \phi, \dot{p}_x, \dot{p}_y, \dot{\phi}]^T$ with $p_x$, $p_y$, and $\phi$ being the horizontal coordinate, vertical coordinate, and yaw angle of the spacecraft, respectively. The constant matrix $A$ and the state-dependent actuation matrix $B(x)$ are defined in [30]. All the parameters of the spacecraft are normalized to 1.

Fig. 4. Lorenz oscillator state estimation error smoothed using a 15-point moving average filter.

TABLE II. NCM control parameters for $\alpha = 0.58$ with $J^*_{CVc}$ and MSE averaged over 100 trajectories.

TABLE III. Control performances for spacecraft motion planning problem.

1) Problem Formulation: In the planar field, we have 6 circular obstacles with radius 3 located at $(p_x, p_y) = (0, 11)$, $(5, 3)$, $(8, 11)$, $(13, 3)$, $(16, 11)$, and $(21, 3)$. The goal of the motion planning problem is to find an optimal trajectory that avoids these obstacles and minimizes $\int_0^{50}\|u(t)\|^2\,dt$ subject to input constraints $0 \le u_i(t) \le 1$, $\forall i, \forall t$ and the dynamics constraints. The initial and terminal conditions are selected as $x(0) = [0, 0, \pi/12, 0, 0, 0]^T$ and $x(t_N) = [20, 18, 0, 0, 0, 0]^T$. Following the same procedure described in the state estimation problem, the optimal control parameters and the MSE of the LSTM-RNN trained using Algorithm 1 with Corollary 1 are determined as shown in Table II.
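The ingredients of this formulation (obstacle avoidance, input bounds, and the control-effort cost) can be checked on a candidate solution with a few helpers. The sketch below is illustrative only: the function names and the `margin` parameter are assumptions, the safety test looks at sampled points rather than the segments between them, and the cost is a plain Riemann sum. In a robust formulation, `margin` would be set to the radius of the error tube.

```python
import math

# Obstacle centers and common radius as listed in the problem formulation.
OBSTACLES = [(0.0, 11.0), (5.0, 3.0), (8.0, 11.0), (13.0, 3.0), (16.0, 11.0), (21.0, 3.0)]
RADIUS = 3.0


def clearance(p, centers=OBSTACLES, radius=RADIUS):
    """Distance from the point p = (p_x, p_y) to the nearest obstacle boundary (negative if inside)."""
    return min(math.hypot(p[0] - cx, p[1] - cy) for cx, cy in centers) - radius


def is_safe(path, margin=0.0):
    """True if every sampled point keeps at least `margin` from all obstacles."""
    return all(clearance(p) >= margin for p in path)


def inputs_admissible(us):
    """True if every input component lies in [0, 1] at every sample."""
    return all(0.0 <= ui <= 1.0 for u in us for ui in u)


def control_effort(us, dt):
    """Riemann-sum approximation of the integral of ||u(t)||^2 over the horizon."""
    return sum(sum(ui * ui for ui in u) for u in us) * dt
```

These checks do not solve the planning problem; they only verify a trajectory produced by some planner against the stated constraints.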

2) Motion Planning With an NCM: Given an NCM, we can solve a robust motion planning problem, where the state constraint is now described by the bounded error tube (see Theorem 1) with radius $R_{\mathrm{tube}} = \bar{d}(\sqrt{\chi}/\alpha) = 0.4488$. Figure 5 shows the spacecraft motion $(p_x, p_y)$ on a planar field, computed using the NCM, sampling-based CV-STEM [10], [11], and baseline LQR control with $Q = 2.4I$ and $R = I$ which does not account for the disturbance. As summarized in Table III, input constraints $0 \le u_i(t) \le 1$, $\forall i, \forall t$ are satisfied and the control efforts of these controllers are adjusted to be below 50 for the sake of fair comparison. All the controllers except the LQR keep their trajectories within the tube avoiding collision with the circular obstacles, even under the presence of disturbances as depicted in Fig. 5. Also, the NCM controller predicts
