
Regularized Jacobi Iteration for Decentralized Convex Quadratic Optimization with Separable Constraints
Luca Deori, Kostas Margellos and Maria Prandini
Abstract—We consider multi-agent, convex quadratic optimization programs subject to separable constraints, where the constraint function of each agent involves only its local decision vector, while the decision vectors of all agents are coupled via a common objective function. We focus on a regularized variant of the so-called Jacobi algorithm for decentralized computation in such problems. We provide a fixed-point theoretic analysis showing that the algorithm converges to a minimizer of the centralized problem under more relaxed conditions on the regularization coefficient than those available in the literature, and in particular with respect to scaled projected gradient algorithms. The efficacy of the proposed algorithm is illustrated by applying it to the problem of optimal charging of electric vehicles.
Index Terms—Decentralized optimization, Jacobi algorithm, iterative methods, optimal charging control, electric vehicles.
I. INTRODUCTION
OPTIMIZATION in multi-agent systems has attracted significant attention in the control and operations research communities, due to its applicability to different domains, e.g., energy [1], [2], mobility [3], [4], [5], robotic systems [6], etc. We focus on multi-agent optimization programs that are convex and subject to separable constraints. The agents' decisions are, however, coupled by means of a common objective function, which is considered to be quadratic. The considered structure, although specific, captures a wide class of problems, such as the electric vehicle charging problem studied in this paper, and is amenable to efficient numerical solvers tailored to quadratic optimization [7].
Solving such problems in a centralized fashion would require agents to share their local constraint functions; even if this were possible, it would unnecessarily increase the computational burden. To alleviate these issues we adopt an iterative, decentralized perspective, where agents perform local computations in parallel, and then exchange their new solutions with each other, or broadcast them to some central authority that sends an update to each agent. Admittedly, distributed optimization offers a more general communication setup; however, the fact that the agents' decision vectors are coupled via the objective function poses additional difficulties, preventing the use of distributed algorithms [8], [9]. Even upon an epigraphic reformulation, the resulting problem will not exhibit the structure typically encountered in distributed optimization, with the resulting coupling constraint not necessarily being of "budget" form [10]. A distributed gossip-based gradient algorithm has been proposed in [11], but with reference to a noncooperative counterpart of the problem under study here. As such, it does not lead to a social welfare solution. Moreover, it requires an iteration-varying step-size, as opposed to the constant step-size considered in this paper.

Research was supported by the European Commission, H2020, under the project UnCoVerCPS, grant number 643921, and by EPSRC UK under the grant EP/P03277X/1. The authors would like to thank one anonymous reviewer for suggesting a refinement in the calculations of Section III-C.
L. Deori and M. Prandini are with the Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Piazza Leonardo da Vinci 32, 20133 Milano, Italy, e-mail: {luca.deori, maria.prandini}@polimi.it.
K. Margellos is with the Department of Engineering Science, University of Oxford, Parks Road, Oxford, OX1 3PJ, United Kingdom, e-mail: kostas.margellos@eng.ox.ac.uk.
A. Related work
From a cooperative optimization point of view, algorithms for decentralized solutions to convex optimization problems with separable constraints can be found in [12], [13], and references therein. Two main algorithmic directions can be distinguished, both of them relying on an iterative process. The first one is based on each agent performing, at every iteration, a local gradient descent step, while keeping the decision variables of all other agents fixed to the values communicated at the previous iteration [14]–[16]. Under certain structural assumptions (differentiability of the objective function and Lipschitz continuity of its gradient), it is shown that this scheme converges to some minimizer of the centralized problem, for an appropriately chosen gradient step-size.
The second direction for decentralized optimization involves mainly the so-called Jacobi algorithm, which serves as a proximal-based alternative to gradient algorithms. The Gauss-Seidel algorithm exhibits similarities with the Jacobi one, but is not parallelizable in nature [17], unless a colouring scheme is adopted (see Section 1.2.4 in [12]). Under the Jacobi algorithmic setup, at every iteration, instead of performing a gradient step, each agent minimizes the common objective function subject to its local constraints in a best-response fashion, while keeping the decisions of all other agents fixed to their values at the previous iteration. In [12], it is shown that the Jacobi algorithm converges under certain contractiveness requirements, which are typically satisfied only under strong (or, in the case of quadratic objective functions, strict) convexity assumptions that are, however, not imposed in the current work. In [18], [19], a regularized version of the Jacobi algorithm is proposed; however, an explicit condition on the regularization coefficient ensuring convergence is not provided. A similar parallelizable, albeit different, scheme has been presented in [4], [20], [21], without employing regularization, while [22], [23] follow a randomized block coordinate descent approach and provide convergence results concerning the expected value of the objective functions. The results most closely related to our work appear in [24], [25]. In all aforementioned references, however, unlike our paper, convergence is limited to the optimal value and does not extend to the iterates.
From a non-cooperative perspective there has recently been notable research activity using tools from mean-field and aggregative game theory. Under a deterministic, discrete-time setting, [3], [5], [26] deal with the non-cooperative counterpart of our work, showing convergence not to a minimizer, but to an approximate Nash equilibrium of a related game, and to an exact Nash equilibrium in the limiting case where the number of agents tends to infinity. Using an approach similar to the regularized Jacobi algorithm, it is shown in [27] that convergence to an exact Nash equilibrium for a finite number of agents can be achieved. A similar result, using a gradient-based variant, was recently provided in [28].
B. Contributions of this work and organization of the paper
We adopt a cooperative point of view, and consider a regularized Jacobi algorithm similar to the one in [18], [19], [24]. Our contributions extend these results as follows:
1. Focusing on the case where the objective function is quadratic, we show that the iterates generated by the regularized Jacobi algorithm converge to an optimal solution of the centralized problem counterpart, as opposed to the weaker statement that the iterates sequence achieves the optimal value while allowing an oscillatory behaviour (i.e., all limit points are optimal solutions) [24]. To achieve this, we follow a fundamentally different analysis from [24], relying on an operator theoretic approach. Our result serves as the Jacobi counterpart of gradient methods, thus complementing the work of [12], [29], [30]. The recent paper [31] shows convergence to an optimal solution of the centralized problem counterpart as well. However, the convergence proof in [31, Theorem 1] strongly depends on results of this paper and relies on the agents' constraint sets being convex polyhedra, while our result requires these sets to be only compact and convex.
2. As opposed to [18], [19], we provide an explicit calculation of the regularization coefficient that ensures convergence, and show that the condition of Theorem 1 constitutes a relaxed version of that of [24] (see Theorem 3 and the discussion on constant step-sizes therein), as well as of that of unscaled projected gradient methods (see Proposition 3.3 in Chapter 3 of [12] for convergence in value, and Theorem 4.1 in [29] or Theorem 2 in [30] for convergence in iterates), which ends up being the same as that of [24]. We also show that the main Jacobi iteration can be written as a scaled projected gradient step and derive an improved convergence condition (concerning, however, convergence in optimal value, not in iterates) under a particular choice of the scaling matrix and projection norm. Notably, the condition of Theorem 1 is less conservative. This improvement can significantly affect how numerically well-behaved the underlying optimization programs are.
3. From an application point of view, we extend the results of [4] on electric vehicle charging, achieving convergence to an optimal charging solution as opposed to convergence in value.
The results obtained here extend significantly our earlier work in [32], where no formal comparison with gradient methods and [24] was provided. It should be noted that [12], [24] provide algorithms that are limited to convergence in optimal value under more restrictive choices of the step-size, but are applicable to general convex functions, not necessarily quadratic ones as is the focus of this paper. Our results can also be extended to the non-quadratic case (the proof is similar to [24]); in that case, one can show convergence as far as the optimal value is concerned using a less restrictive step-size condition. We refer to this condition in Remark 3, while the reader is referred to the technical memorandum [33] for more details and proofs.
Section II introduces the problem under study and states the proposed algorithm. In Section III we provide the main convergence result and a comparison with scaled projected gradient methods and the algorithm of [24]. Section IV provides an extensive simulation study for the electric vehicle charging control case study, while Section V concludes the paper and outlines some directions for future research.
II. DECENTRALIZED PROBLEM FORMULATION
A. Motivating example: Optimal charging of electric vehicles
We consider the problem of optimizing the charging strategy for a fleet of m plug-in electric vehicles (PEVs) over a finite horizon T. Following [3], [5], [26], the PEV charging problem is given by the following optimization program:

$$\min_{\{x^i(t)\}_{i=1,\,t=0}^{m,\,T}} \; \frac{1}{m} \sum_{t=0}^{T} p(t) \Big( d(t) + \sum_{i=1}^{m} x^i(t) \Big)^2 \quad (1)$$

subject to

$$\sum_{t=0}^{T} x^i(t) = \gamma^i, \text{ for all } i = 1, \ldots, m,$$
$$\underline{x}^i(t) \le x^i(t) \le \overline{x}^i(t), \text{ for all } i = 1, \ldots, m, \; t = 0, \ldots, T,$$
where p(t) ∈ R is an electricity price coefficient at time t, d(t) ∈ R represents the non-PEV demand at time t, x^i(t) ∈ R is the charging rate of vehicle i at time t, γ^i ∈ R represents a prescribed charging level to be reached by each vehicle i at the end of the considered time horizon, and \underline{x}^i(t), \overline{x}^i(t) ∈ R are bounds on the minimum and maximum value of x^i(t), respectively. The objective function in (1) encodes the total electricity cost, given by the demand (both PEV and non-PEV) multiplied by the price of electricity, which in turn depends linearly on the total demand through p(t), thus giving rise to the quadratic function in (1). This linear dependency of the price on the total demand models the fact that agents/vehicles are price-anticipating entities, anticipating their consumption to have an effect on the electricity price (see the introduction of [2] for further elaboration on price-anticipating agents). Problem (1) can be rewritten as
$$\min_{x \in \mathbb{R}^{m(T+1)}} \; (d + Ax)^{\top} P (d + Ax) \quad (2)$$
$$\text{subject to: } x^i \in X^i, \text{ for all } i = 1, \ldots, m,$$

where P = (1/m) diag(p) ∈ R^{(T+1)×(T+1)}, and diag(p) is a matrix with p = (p(0), . . . , p(T)) ∈ R^{T+1} on its diagonal. A = 1_{1×m} ⊗ I ∈ R^{(T+1)×m(T+1)}, where ⊗ denotes the Kronecker product, and I ∈ R^{(T+1)×(T+1)} the identity matrix. Moreover, d = (d(0), . . . , d(T)) ∈ R^{T+1}, x = (x^1, . . . , x^m) ∈ R^{m(T+1)}, x^i = (x^i(0), . . . , x^i(T)) ∈ R^{T+1}, and X^i is the constraint set of vehicle i, i = 1, . . . , m, in (1).
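To make the reformulation concrete, the following sketch assembles P, A and d, and maps (2) to the generic form x^⊤Qx + q^⊤x of Assumption 1 below: expanding (d + Ax)^⊤P(d + Ax) and dropping the constant d^⊤Pd gives Q = A^⊤PA and q = 2A^⊤Pd. All numerical values (horizon, prices, demand) are illustrative assumptions, not data from the paper.

```python
import numpy as np

# Assumed instance: horizon t = 0..T and m vehicles (values for illustration only).
T, m = 23, 4
p = 1.0 + 0.5 * np.sin(np.linspace(0.0, 2.0 * np.pi, T + 1))   # price coefficients p(t)
d = 10.0 + 2.0 * np.cos(np.linspace(0.0, 2.0 * np.pi, T + 1))  # non-PEV demand d(t)

P = np.diag(p) / m                             # P = (1/m) diag(p)
A = np.kron(np.ones((1, m)), np.eye(T + 1))    # A = 1_{1 x m} (Kronecker) I

# Data of the equivalent quadratic form x'Qx + q'x (constant d'Pd dropped):
Q = A.T @ P @ A                                # symmetric, positive semidefinite
q = 2.0 * A.T @ P @ d
```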

Algorithm 1 Decentralized algorithm
1: Initialization
2: k = 0.
3: Consider x^i_0 ∈ X^i, for all i = 1, . . . , m.
4: For i = 1, . . . , m repeat until convergence
5: Agent i receives x^{−i}_k from the central authority.
6: x^i_{k+1} = arg min_{z^i ∈ X^i} f(z^i, x^{−i}_k) + c ||z^i − x^i_k||^2.
7: k ← k + 1.
B. Problem statement
Motivated by the electric vehicle charging control problem in (2), we consider the following class of programs:

$$\mathcal{P}: \; \min_{\{x^i \in \mathbb{R}^{n_i}\}_{i=1}^{m}} \; f(x^1, \ldots, x^m) \quad (3)$$
$$\text{subject to: } x^i \in X^i, \text{ for all } i = 1, \ldots, m, \quad (4)$$
where each agent i, i = 1, 2, . . . , m, has a local decision vector x^i ∈ R^{n_i} and a local constraint set X^i ⊆ R^{n_i}, and cooperates to determine a minimizer of f : R^{n_1} × · · · × R^{n_m} → R, which couples its decision vector with those of the other agents.
Assumption 1. The objective function f : R^{n_1} × · · · × R^{n_m} → R is given by f(x^1, . . . , x^m) = x^⊤Qx + q^⊤x, where x = [(x^1)^⊤, . . . , (x^m)^⊤]^⊤ ∈ R^n with n = \sum_{i=1}^{m} n_i, Q ∈ R^{n×n} is symmetric and positive semi-definite (Q = Q^⊤ ⪰ 0), and q ∈ R^n. Moreover, the sets X^i ⊆ R^{n_i}, i = 1, . . . , m, are non-empty, compact and convex.
Note that Q is assumed to be symmetric without loss of generality; otherwise it could be split into a symmetric and an antisymmetric part, with the latter giving rise to terms that cancel each other.
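A quick numerical illustration of this point (random data, for illustration only): the antisymmetric part of any square matrix contributes nothing to the quadratic form.

```python
import numpy as np

rng = np.random.default_rng(0)
Q = rng.standard_normal((5, 5))     # a generic, non-symmetric matrix
K = 0.5 * (Q - Q.T)                 # antisymmetric part of Q
x = rng.standard_normal(5)

print(x @ K @ x)                                          # 0 up to round-off
print(np.isclose(x @ Q @ x, x @ (0.5 * (Q + Q.T)) @ x))   # True: only the symmetric part matters
```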
Remark 1 (Problem generalization). We also allow for objective functions of the form f(x^1, . . . , x^m) = x^⊤Qx + q^⊤x + \sum_{i=1}^{m} g_i(x^i), where the g_i(x^i), i = 1, . . . , m, are convex functions that could encode a utility function for each agent. In this case an epigraphic reformulation can be exploited to bring the cost back to a quadratic form. Letting y^i = [x^{i,⊤} h_i]^⊤ be the decision vector of agent i, the local constraint set can then be defined as Y^i = X^i ∩ {g_i(x^i) ≤ h_i}, while the objective function can be rewritten as x^⊤Qx + q^⊤x + \sum_{i=1}^{m} h_i, which is quadratic in y = [y^{1,⊤} . . . y^{m,⊤}]^⊤.
Under Assumption 1, the function f is convex and hence continuous, while the constraint set X = X^1 × · · · × X^m is non-empty and compact; as a result, by Weierstrass' theorem [12, Proposition A8, p. 625], P admits at least one optimal solution. However, P does not necessarily admit a unique minimizer.
With a slight abuse of notation, for each i, i = 1, . . . , m, let f(·, x^{−i}) : R^{n_i} → R be the objective function in (3) as a function of the decision vector x^i of agent i, when the decision vectors of all other agents are fixed to x^{−i} ∈ R^{n−n_i}. We will occasionally also write f(x) instead of f(x^1, . . . , x^m).
C. Regularized Jacobi algorithm
Solving problem P in a centralized fashion is not always possible, since agents may not be willing to share X^i, i = 1, . . . , m. Moreover, even if this were the case, solving P in one shot might be computationally challenging. To overcome this and account for information sharing issues, motivated by the separable structure of P, we follow a decentralized, iterative approach, as described in Algorithm 1.
Initially, each agent i, i = 1, . . . , m, starts with some value x^i_0 ∈ X^i, so that (x^1_0, . . . , x^m_0) is feasible (step 3, Algorithm 1). At iteration k + 1, each agent i receives x^{−i}_k (step 5, Algorithm 1) from the central authority, and updates its estimate for x^i by solving a local minimization problem (step 6, Algorithm 1). The performance criterion in this local problem is a linear combination of the objective f(z^i, x^{−i}_k), where the variables of all agents other than the i-th one are fixed to their values at iteration k, and a quadratic regularization term penalizing the difference between z^i and the value of agent i's own variable at iteration k, i.e., x^i_k. The relative importance of these two terms is dictated by the regularization coefficient c ∈ R_+, which plays a key role in determining the convergence properties of Algorithm 1. Note that, under Assumption 1, and due to the presence of the quadratic penalty term, the resulting problem is strictly convex with respect to z^i, and hence admits a unique minimizer.
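As an illustration of step 6, the sketch below implements the local update for the quadratic objective of Assumption 1, with X^i taken to be the PEV set of Section II-A (box bounds plus a total-charging equality). The use of scipy's SLSQP as a generic local solver, and all names and values, are illustrative assumptions, not the authors' implementation.

```python
import numpy as np
from scipy.optimize import minimize

def local_update(i, x_k, Q, q, c, lb, ub, gamma, n_i):
    """Step 6: solve  min_{z in X_i}  f(z, x_k^{-i}) + c ||z - x_k^i||^2  for agent i."""
    sl = slice(i * n_i, (i + 1) * n_i)       # indices of agent i's block in x

    def objective(z):
        x = x_k.copy()
        x[sl] = z                            # all other agents stay at their iterate-k values
        return x @ Q @ x + q @ x + c * np.sum((z - x_k[sl]) ** 2)

    cons = [{"type": "eq", "fun": lambda z: np.sum(z) - gamma}]  # charging requirement
    res = minimize(objective, x_k[sl], method="SLSQP",
                   bounds=list(zip(lb, ub)), constraints=cons)
    return res.x

def jacobi_sweep(x_k, Q, q, c, lb, ub, gammas, n_i, m):
    """One synchronous iteration: every agent updates from the same x_k."""
    x_next = x_k.copy()
    for i in range(m):
        x_next[i * n_i:(i + 1) * n_i] = local_update(i, x_k, Q, q, c,
                                                     lb, ub, gammas[i], n_i)
    return x_next
```

Repeating jacobi_sweep until ||x_{k+1} − x_k|| falls below a tolerance realizes the "repeat until convergence" loop of Algorithm 1.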
Remark 2 (Information exchange). To implement Algorithm 1, at iteration k + 1 some central authority needs to collect the current solution of each agent and broadcast it to all others, so that each of them can compute f(·, x^{−i}_k). However, in the case where the coupling in the objective function is only through the average of some of the agents' variables, as in the example of Section II-A, at every iteration k the central authority needs to broadcast only the average of the agents' decisions, or, in other words, the cumulative charging d + Ax_k with reference to the electric vehicle case study. Each agent is then able to compute f(·, x^{−i}_k) by subtracting from this quantity the value of its local decision vector x^i_k.
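A minimal sketch of this reduced exchange for the objective of Section II-A; names and shapes are assumptions for illustration (x_blocks[i] holds agent i's current profile x^i_k ∈ R^{T+1}).

```python
import numpy as np

def broadcast_aggregate(d, x_blocks):
    """Central authority: compute and broadcast the cumulative profile d + A x_k."""
    return d + np.sum(x_blocks, axis=0)

def local_cost(i, z_i, aggregate, x_blocks, p, m):
    """Agent i evaluates f(z_i, x_k^{-i}) from the broadcast aggregate alone."""
    others = aggregate - x_blocks[i]          # d + sum_{j != i} x_k^j
    return np.sum(p * (others + z_i) ** 2) / m
```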
III. MAIN CONVERGENCE RESULT
We start by defining some matrices that will be used in the following: for all i = 1, . . . , m, let Q_{i,i} denote the i-th diagonal block of Q, with row and column indices corresponding to x^i, where x = [x^{1,⊤} . . . x^{m,⊤}]^⊤. Denote then by Q_d the block diagonal matrix whose i-th block is Q_{i,i}, and let Q_z = Q − Q_d denote the off (block) diagonal part of Q. Since Q is assumed to be symmetric, Q_z is symmetric as well and its eigenvalues are all real. Since Q_z has zero trace, at least one of its eigenvalues will be non-negative. As a result, λ^{max}_{Q_z} ≥ 0, where λ^{max}_{Q_z} denotes the maximum eigenvalue of Q_z.
Theorem 1. Under Assumption 1, if c > λ^{max}_{Q_z}, then Algorithm 1 converges to a minimizer of P.
Theorem 1 provides an explicit bound on c that ensures convergence. Such a bound is derived via a fixed-point theoretic approach. Note that if the objective function f were strictly convex with respect to x, then the standard Jacobi iteration of [12] could be adopted instead of the regularized version. In that case, geometric convergence to some minimizer of P is guaranteed by means of Proposition 3.5 in [12].
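The bound of Theorem 1 is straightforward to compute from the problem data. A minimal sketch, assuming for brevity that all blocks have equal size n_i:

```python
import numpy as np

def block_diag_part(Q, n_i):
    """Extract Q_d, the block diagonal part of Q (equal block sizes assumed)."""
    Qd = np.zeros_like(Q)
    for s in range(0, Q.shape[0], n_i):
        Qd[s:s + n_i, s:s + n_i] = Q[s:s + n_i, s:s + n_i]
    return Qd

def theorem1_bound(Q, n_i):
    """Return lambda_max(Q_z) with Q_z = Q - Q_d; Theorem 1 requires c above this."""
    Qz = Q - block_diag_part(Q, n_i)
    return np.linalg.eigvalsh(Qz)[-1]        # eigvalsh sorts eigenvalues ascending

# usage: c = theorem1_bound(Q, n_i) + 1e-3   # any strictly positive margin works
```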

Fig. 1: Bar plot for the bound on c (from bottom to top): Algorithm of [33] (convergence in value), c > \frac{m−1}{2m−1} 2λ^{max}_{Q_z}; Theorem 1 (convergence in iterates), c > λ^{max}_{Q_z}; scaled projected gradient algorithm (convergence in value), c > λ^{max}_{Q} − λ^{min}_{Q_d}; Algorithm of [24] (convergence in value) and unscaled projected gradient algorithm (convergence in iterates), c > λ^{max}_{Q}.
Remark 3 (Connection with [24] and extensions). For the more general case of a convex objective function, by [24, Theorem 3] (see the constant step-size condition), it can be shown that Algorithm 1 converges to the optimal value of P for c greater than one half of the Lipschitz constant of the gradient of the objective function, which for the case of quadratic objective functions is 2λ^{max}_{Q}, thus leading to c > λ^{max}_{Q}. The sequence of iterates, however, may not converge and may exhibit an oscillatory behaviour. Under the same condition on c, it is shown that unscaled projected gradient algorithms with step-size 1/c (i.e., two over the Lipschitz constant of the gradient) converge not only in value, but also in iterates (see Proposition 3.3 in Chapter 3 of [12] for convergence in value, and Theorem 4.1 in [29] or Theorem 2 in [30] for convergence in iterates).
In Section III-C we write the Jacobi iteration as a scaled projected gradient step and show that it converges in value (but not necessarily in iterates) if c > λ^{max}_{Q} − λ^{min}_{Q_d}, which is less restrictive than the aforementioned conditions. This result is strengthened even further by the condition c > λ^{max}_{Q_z} of Theorem 1.
By Theorem 3 of [33] it can be shown that, as far as the optimal value is concerned, Algorithm 1 converges for c > \frac{m−1}{2m−1} 2λ^{max}_{Q_z}. The latter is a relaxed version of the condition c > λ^{max}_{Q_z} of Theorem 1, since \frac{1}{2} > \frac{m−1}{2m−1} for all m. However, Theorem 1 ensures convergence to some minimizer and not just convergence in value. This result is shown in [33] using an analysis similar to the proof of Theorem 3 in [24], which is based on Proposition 1 and sequence convergence properties (see Exercise 1.19 in [34, p. 18]). The relationship between the various conditions on c is pictorially shown in Figure 1.
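The ordering of these conditions can also be checked numerically. The sketch below evaluates all four bounds of Figure 1 on a randomly generated positive semidefinite instance (an assumption for illustration); they print from smallest to largest, matching the figure.

```python
import numpy as np

rng = np.random.default_rng(1)
m, n_i = 5, 3
B = rng.standard_normal((m * n_i, m * n_i))
Q = B.T @ B                                   # random symmetric PSD test matrix

Qd = np.zeros_like(Q)                         # block diagonal part Q_d
for s in range(0, m * n_i, n_i):
    Qd[s:s + n_i, s:s + n_i] = Q[s:s + n_i, s:s + n_i]
Qz = Q - Qd                                   # off (block) diagonal part Q_z

lam_max_Q = np.linalg.eigvalsh(Q)[-1]
lam_min_Qd = np.linalg.eigvalsh(Qd)[0]
lam_max_Qz = np.linalg.eigvalsh(Qz)[-1]

print("[33] (in value):         c >", (m - 1) / (2 * m - 1) * 2 * lam_max_Qz)
print("Theorem 1 (in iterates): c >", lam_max_Qz)
print("scaled gradient:         c >", lam_max_Q - lam_min_Qd)
print("[24] / unscaled:         c >", lam_max_Q)
```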
A. Preliminary results
The results of this section hold under Assumption 2.
Assumption 2. The function f : R^{n_1} × · · · × R^{n_m} → R is continuously differentiable, and jointly convex with respect to all arguments, i.e., convex with respect to x. The sets X^i ⊆ R^{n_i}, i = 1, . . . , m, are non-empty, compact and convex.
1) Minimizers and fixed-points definitions: By (3)-(4), the set of minimizers of P is given by

$$M = \arg\min_{\{z^i \in X^i\}_{i=1}^{m}} f(z^1, \ldots, z^m) \subseteq X. \quad (5)$$

Following the discussion below Assumption 1, M is non-empty. Note that M is not necessarily a singleton; this is the case if f is jointly strictly convex with respect to its arguments.
For each i, i = 1, . . . , m, consider the mappings T_i : X → X^i and \tilde{T}_i : X → X^i, defined such that, for any x = (x^1, . . . , x^m) ∈ X,

$$T_i(x) = \arg\min_{z^i \in X^i} \; \|z^i - x^i\|^2 \quad (6)$$
$$\text{subject to: } f(z^i, x^{-i}) \le \min_{\zeta^i \in X^i} f(\zeta^i, x^{-i}),$$

$$\tilde{T}_i(x) = \arg\min_{z^i \in X^i} \; f(z^i, x^{-i}) + c\|z^i - x^i\|^2. \quad (7)$$
The mapping in (6) serves as a tie-break rule to select, in case f(·, x^{−i}) admits multiple minimizers over X^i, the one closest to x^i with respect to the Euclidean norm. Note that in (6) and (7) we use equality instead of inclusion, since the corresponding minimizers T_i(x) and \tilde{T}_i(x), respectively, are unique. Note also that, with x_k in place of x, (7) implies that the update step 6 in Algorithm 1 can be equivalently represented by x^i_{k+1} = \tilde{T}_i(x_k).
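For illustration, the tie-break in (6) can be realized as a two-stage computation: first find the optimal local value, then pick the minimizer closest to x^i subject to (near-)optimality. The sketch below reuses the assumed PEV-style constraint set; f_local, the solver choice and the tolerance tol are illustrative assumptions (the optimality constraint is relaxed by tol for numerical feasibility).

```python
import numpy as np
from scipy.optimize import minimize

def T_i(i, x_k, f_local, lb, ub, gamma, n_i, tol=1e-9):
    """Two-stage evaluation of the tie-break mapping T_i in (6)."""
    sl = slice(i * n_i, (i + 1) * n_i)
    cons = [{"type": "eq", "fun": lambda z: np.sum(z) - gamma}]
    bounds = list(zip(lb, ub))

    # Stage 1: best achievable value of f(., x^{-i}) over X_i.
    stage1 = minimize(lambda z: f_local(i, z, x_k), x_k[sl],
                      method="SLSQP", bounds=bounds, constraints=cons)
    f_star = stage1.fun

    # Stage 2: among (near-)minimizers, pick the point closest to x^i.
    cons2 = cons + [{"type": "ineq",
                     "fun": lambda z: f_star + tol - f_local(i, z, x_k)}]
    stage2 = minimize(lambda z: np.sum((z - x_k[sl]) ** 2), stage1.x,
                      method="SLSQP", bounds=bounds, constraints=cons2)
    return stage2.x
```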
Define also the mappings T : X → X and \tilde{T} : X → X, whose components are given by T_i and \tilde{T}_i:

$$T(x) = \arg\min_{z \in X} \sum_{i=1}^{m} \|z^i - x^i\|^2 \quad (8)$$
$$\text{subject to: } f(z^i, x^{-i}) \le \min_{\zeta^i \in X^i} f(\zeta^i, x^{-i}), \; i = 1, \ldots, m,$$

$$\tilde{T}(x) = \arg\min_{z \in X} \sum_{i=1}^{m} f(z^i, x^{-i}) + c\|z^i - x^i\|^2, \quad (9)$$

where the terms inside the summations in (8) and (9) are decoupled. The sets of fixed-points of T and \tilde{T} are given, respectively, by

$$F_T = \{x \in X : x^i = T_i(x), \text{ for all } i = 1, \ldots, m\}, \quad (10)$$
$$F_{\tilde{T}} = \{x \in X : x^i = \tilde{T}_i(x), \text{ for all } i = 1, \ldots, m\}. \quad (11)$$
2) Connections between minimizers and fixed-points: We report here a fundamental optimality result.

Proposition 1 ([12, Proposition 3.1]). Assume that f is a continuously differentiable function and X is a non-empty, closed and convex set. We then have that:
1) if x ∈ X minimizes f over X, then (z − x)^⊤ ∇f(x) ≥ 0, for all z ∈ X;
2) if f is also convex on X, then the condition of the previous part is also sufficient for x ∈ arg min_{z ∈ X} f(z).
We show that the set of minimizers M of P in (5) and the set of fixed-points F_T of the mapping T in (8) coincide.

Proposition 2. Under Assumption 2, M = F_T.

Proof. 1) M ⊆ F_T: Fix any x ∈ M. For i = 1, . . . , m, denote x by (x^i, x^{−i}). The fact that x ∈ M implies that f(x^i, x^{−i}) will be no greater than f(ζ^i, x^{−i}), for all ζ^i ∈ X^i, i.e., f(x^i, x^{−i}) ≤ min_{ζ^i ∈ X^i} f(ζ^i, x^{−i}), which means that x satisfies the inequality in (8). Moreover, x is also optimal for the objective function in (8), since it results in zero cost. Hence, by (8), x is a fixed-point of T, i.e., x ∈ F_T.
2) F_T ⊆ M: Fix any x ∈ F_T. By the definition of F_T we have f(x^i, x^{−i}) ≤ min_{ζ^i ∈ X^i} f(ζ^i, x^{−i}), for all i = 1, . . . , m. The last statement implies that x^i is a minimizer of f(·, x^{−i}) over X^i. For all i = 1, . . . , m, by the first part of Proposition 1 (with f(·, x^{−i}) in place of f) we then have that

$$(z^i - x^i)^{\top} \nabla_i f(x^i, x^{-i}) \ge 0, \text{ for all } z^i \in X^i, \quad (12)$$

where ∇_i f(x^i, x^{−i}) is the gradient ∇f(·, x^{−i}) of f(·, x^{−i}) with respect to the i-th block, evaluated at x^i. By (12), we then have that \sum_{i=1}^{m} (z^i − x^i)^⊤ ∇_i f(x^i, x^{−i}) ≥ 0 for all z^i ∈ X^i, i = 1, . . . , m, which, by setting x = (x^1, . . . , x^m), z = (z^1, . . . , z^m), can be written as (z − x)^⊤ ∇f(x) ≥ 0, for all z ∈ X. By the second part of Proposition 1, and since f is jointly convex with respect to all elements of x, the last statement implies that x minimizes f over X, i.e., x ∈ M.
The connection between minimizers, fixed-points and variational inequalities similar to (12) has also been investigated in [35], in the context of non-cooperative games.
Proposition 3. Under Assumption 2, F_T = F_{\tilde{T}}.

Proof. 1) F_T ⊆ F_{\tilde{T}}: Fix any x ∈ F_T. By (10), this is equivalent to the fact that x^i = T_i(x), for all i = 1, . . . , m, which, due to the definition of T, implies that, for all i = 1, . . . , m, f(x^i, x^{−i}) ≤ min_{ζ^i ∈ X^i} f(ζ^i, x^{−i}). This implies that x^i minimizes f(·, x^{−i}) over X^i; hence, by the first part of Proposition 1 (with f(·, x^{−i}) in place of f), we have that (z^i − x^i)^⊤ ∇_i f(x^i, x^{−i}) ≥ 0, for all z^i ∈ X^i. Let f_c(z^i, x) = f(z^i, x^{−i}) + c‖z^i − x^i‖², for all z^i, i = 1, . . . , m, and notice that ∇f_c(x^i, x) = ∇f(x^i, x^{−i}), since the gradient of the quadratic penalty term vanishes at x^i. We then have that, for all i = 1, . . . , m,

$$(z^i - x^i)^{\top} \nabla_i f_c(x^i, x) \ge 0, \text{ for all } z^i \in X^i. \quad (13)$$

Since f_c(·, x) is strictly convex with respect to its first argument, by the second part of Proposition 1 (with f_c(·, x) in place of f), (13) implies that, for all i = 1, . . . , m, x^i is the unique minimizer of f_c(·, x) over X^i, i.e.,

$$x^i = \arg\min_{z^i \in X^i} f(z^i, x^{-i}) + c\|z^i - x^i\|^2. \quad (14)$$

By (7), (14) is equivalent to x^i = \tilde{T}_i(x), for all i = 1, . . . , m.
2) F_{\tilde{T}} ⊆ F_T: Fix any x ∈ F_{\tilde{T}}. By (11) this is equivalent to the fact that x^i = \tilde{T}_i(x), for all i = 1, . . . , m, which, by the definition of \tilde{T}_i in (7), implies that, for all i = 1, . . . , m,

$$x^i = \arg\min_{z^i \in X^i} f(z^i, x^{-i}) + c\|z^i - x^i\|^2. \quad (15)$$

Let again f_c(z^i, x) = f(z^i, x^{−i}) + c‖z^i − x^i‖². Equation (15) then implies that, for all i = 1, . . . , m, x^i minimizes f_c(·, x) over X^i, which, by the first part of Proposition 1 (with f_c(·, x) in place of f), leads to (z^i − x^i)^⊤ ∇_i f_c(x^i, x) ≥ 0, for all z^i ∈ X^i. Notice that ∇f_c(x^i, x) = ∇f(x^i, x^{−i}), since the gradient of c‖z^i − x^i‖² with respect to z^i vanishes at x^i. Therefore, for all i = 1, . . . , m, we have that

$$(z^i - x^i)^{\top} \nabla_i f(x^i, x^{-i}) \ge 0, \text{ for all } z^i \in X^i. \quad (16)$$

Since f(·, x^{−i}) is convex with respect to its first argument, by the second part of Proposition 1, (16) implies that x^i minimizes f(·, x^{−i}) over X^i. In other words, x^i ∈ arg min_{z^i ∈ X^i} f(z^i, x^{−i}), for all i = 1, . . . , m. This in turn implies that, for all i = 1, . . . , m, f(x^i, x^{−i}) ≤ f(z^i, x^{−i}), for all z^i ∈ X^i, i.e., f(x^i, x^{−i}) ≤ min_{z^i ∈ X^i} f(z^i, x^{−i}). The last inequality shows that x satisfies the inequality in (8). Moreover, it minimizes the objective function in (8), since it results in zero cost, so x = T(x).
By Propositions 2 and 3 we have that the set of minimizers M of P coincides with the set of fixed-points of the mapping \tilde{T}.

Corollary 1. Under Assumption 2, M = F_{\tilde{T}}.
B. Proof of Theorem 1
Step 6 of Algorithm 1 can be equivalently written as x^i_{k+1} = \tilde{T}_i(x_k), which entails that x_{k+1} = \tilde{T}(x_k), i.e., a Picard-Banach iteration of \tilde{T} (see [36, Chapter 1.2] for a definition). Since F_{\tilde{T}} is non-empty (it coincides with M by Corollary 1), we only need to prove that \tilde{T} is firmly non-expansive (see [37, Section 1] for a definition in general Hilbert spaces). If that is the case, then, by [37], [38], the Picard-Banach iteration converges to a fixed-point of \tilde{T}, for any initial condition x_0. By Corollary 1 this fixed-point will also be a minimizer of P. We next show that if c > λ^{max}_{Q_z}, then \tilde{T}(·) is indeed firmly non-expansive with respect to ‖·‖_{Q_d + I_c − Q} (I_c is the identity matrix I of appropriate dimensions weighted by c), i.e.,

$$\|\tilde{T}(x) - \tilde{T}(y)\|^{2}_{Q_d + I_c - Q} \le (x - y)^{\top} (Q_d + I_c - Q)\big(\tilde{T}(x) - \tilde{T}(y)\big), \quad (17)$$
thus establishing Theorem 1. To this end, by Assumption 1,

$$\tilde{T}(x) = \arg\min_{z \in X} \sum_{i=1}^{m} f(z^i, x^{-i}) + c\|z^i - x^i\|^2$$
$$= \arg\min_{z \in X} \sum_{i=1}^{m} (z^i)^{\top} (Q_{i,i} + I_c) z^i + \big(2(x^{-i})^{\top} Q_{-i,i} - 2(x^i)^{\top} I_c + q_i^{\top}\big) z^i$$
$$= \arg\min_{z \in X} z^{\top} (Q_d + I_c) z + (2x^{\top} Q_z - 2x^{\top} I_c + q^{\top}) z. \quad (18)$$

Notice the slight abuse of notation in (18), where the matrices I_c in the second and third equalities are not of the same dimension.
Let ξ(x) = (Q_d + I_c)^{−1}(I_c x − Q_z x − q/2) denote the unconstrained minimizer of (18). We then have that

$$\tilde{T}(x) = \arg\min_{z \in X} (z - \xi(x))^{\top} (Q_d + I_c)(z - \xi(x)) = [\xi(x)]^{X}_{Q_d + I_c}, \quad (19)$$

where [ξ(x)]^X_{Q_d+I_c} denotes the projection, with respect to ‖·‖_{Q_d+I_c}, of ξ(x) onto X. Note that Q_d + I_c is positive definite for c ∈ R_+, so its inverse exists and the projection is well defined. We then have that

$$\|\tilde{T}(x) - \tilde{T}(y)\|^{2}_{Q_d + I_c} = \big\| [\xi(x)]^{X}_{Q_d + I_c} - [\xi(y)]^{X}_{Q_d + I_c} \big\|^{2}_{Q_d + I_c}$$
$$\le (\xi(x) - \xi(y))^{\top} (Q_d + I_c)\big([\xi(x)]^{X}_{Q_d + I_c} - [\xi(y)]^{X}_{Q_d + I_c}\big)$$
$$= (x - y)^{\top} \big(I - Q(Q_d + I_c)^{-1}\big)(Q_d + I_c)\big([\xi(x)]^{X}_{Q_d + I_c} - [\xi(y)]^{X}_{Q_d + I_c}\big)$$
$$= (x - y)^{\top} (Q_d + I_c - Q)\big([\xi(x)]^{X}_{Q_d + I_c} - [\xi(y)]^{X}_{Q_d + I_c}\big), \quad (20)$$
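Although the remainder of the derivation continues beyond the excerpt above, the equivalence between (18) and (19) underpinning it is easy to verify numerically: on a small random instance with X a box, the regularized best response and the (Q_d + I_c)-weighted projection of ξ(x) coincide up to solver tolerance. The instance, box bounds and solver below are assumptions for illustration.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(2)
m, n_i = 3, 2
n = m * n_i
B = rng.standard_normal((n, n))
Q = B.T @ B                                   # random symmetric PSD objective matrix
q = rng.standard_normal(n)

Qd = np.zeros_like(Q)                         # block diagonal part Q_d
for s in range(0, n, n_i):
    Qd[s:s + n_i, s:s + n_i] = Q[s:s + n_i, s:s + n_i]
Qz = Q - Qd
c = np.linalg.eigvalsh(Qz)[-1] + 0.1          # Theorem 1 bound plus a margin
x = rng.uniform(-1.0, 1.0, n)                 # current iterate, inside X
bounds = [(-1.0, 1.0)] * n                    # X = [-1, 1]^n (assumed box)

def jacobi_objective(z):                      # last expression in (18)
    return z @ (Qd + c * np.eye(n)) @ z + (2 * x @ Qz - 2 * c * x + q) @ z

xi = np.linalg.solve(Qd + c * np.eye(n), c * x - Qz @ x - q / 2.0)  # xi(x)

def projection_objective(z):                  # objective in (19)
    return (z - xi) @ (Qd + c * np.eye(n)) @ (z - xi)

z1 = minimize(jacobi_objective, x, method="SLSQP", bounds=bounds).x
z2 = minimize(projection_objective, x, method="SLSQP", bounds=bounds).x
print(np.allclose(z1, z2, atol=1e-4))         # True: (18) and (19) share the same minimizer
```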

Citations
- "Price of anarchy in electric vehicle charging control games: When Nash equilibria achieve social welfare" (journal article): considers the problem of optimal charging of plug-in electric vehicles (PEVs) as a multi-agent game, where vehicles/agents are heterogeneous since they are subject to possibly different constraints.
- "On the connection between Nash equilibria and social optima in electric vehicle charging control games" (journal article): considers the problem of optimal charging of heterogeneous PEVs in the presence of constraints, and formulates an auxiliary minimization program whose solution is shown to be the unique Nash equilibrium of the PEV charging control game, for any finite number of possibly heterogeneous agents.
- "On the Convergence of a Regularized Jacobi Algorithm for Convex Optimization" (journal article): revisits the convergence analysis of the regularized Jacobi algorithm and shows that it also converges in iterates under very mild conditions on the objective function, and achieves a linear convergence rate.
- "On the probabilistic feasibility of solutions in multi-agent optimization problems under uncertainty" (preprint): investigates the probabilistic feasibility of randomized solutions to two distinct classes of uncertain multi-agent optimization programs and provides agent-independent robustness certificates for the optimal solution.
- "Synchronous Parallel Block Coordinate Descent Method for Nonsmooth Convex Function Minimization" (journal article): proposes a synchronous parallel block coordinate descent algorithm for minimizing a composite function consisting of a smooth convex part plus a non-smooth but separable convex part, together with a randomized variant that updates random blocks of coordinates at each round.
References
- Parallel and Distributed Computation: Numerical Methods (book): discusses parallel and distributed architectures, complexity measures, and communication and synchronization issues, and presents both the Jacobi and Gauss-Seidel iterations, which serve as reference algorithms for many of the computational approaches addressed later.
- Convex Analysis and Monotone Operator Theory in Hilbert Spaces (book): a largely self-contained account of the main results of convex analysis and optimization in Hilbert space, with a concise exposition of the related constructive fixed-point theory underlying algorithms for optimization, equilibrium theory, monotone inclusions, variational inequalities, and convex feasibility.
- Proximal Algorithms (book): discusses the many interpretations of proximal operators and algorithms, describes their connections to other topics in optimization and applied mathematics, surveys popular proximal algorithms, and provides numerous examples of proximal operators arising in practice.
- Constrained Consensus and Optimization in Multi-Agent Networks (journal article): presents a distributed algorithm that can be used by multiple agents to align their estimates with a particular value over a network with time-varying connectivity.
Frequently Asked Questions
Q1. What have the authors contributed in "Regularized Jacobi iteration for decentralized convex quadratic optimization with separable constraints"?

The authors consider multi-agent, convex quadratic optimization programs subject to separable constraints, where the constraint function of each agent involves only its local decision vector, while the decision vectors of all agents are coupled via a common objective function. They provide a fixed-point theoretic analysis showing that the proposed regularized Jacobi algorithm converges to a minimizer of the centralized problem under more relaxed conditions on the regularization coefficient than those available in the literature, and in particular with respect to scaled projected gradient algorithms.

Further extracted excerpts (from parts of the paper not reproduced above):

From Section IV: "Fig. 4: Evolution of the iterates x^i_k(t) generated by Algorithm 1 at t = 12 as a function of the iteration index k, for i = 1, . . . , 10, i.e., the first 10 vehicles of the 1000-vehicle fleet."

From Section III: the condition Q_d + I_c − Q ⪰ 0 can be satisfied by choosing c > λ^{max}_{Q_z}. Moreover (Section III-C, Connection with gradient algorithms), recalling the formulation in (18) and (19), x^i_{k+1} = \tilde{T}_i(x_k), i = 1, . . . , m, in step 6 of Algorithm 1 can be equivalently written as a scaled projected gradient step: x_{k+1} = [\xi(x_k)]^{X}_{Q_d + I_c}.

From Remark 2 (continued): at iteration k + 1 of Algorithm 1, the central authority needs to collect the solution of each agent, but it only has to broadcast x̄_k = d + Ax_k.

From Section III-C: this implies that

$$\lambda^{max}_{Q_z} \le \frac{v^{\top} Q v}{v^{\top} v} - \lambda^{min}_{Q_d} \le \max_{z \ne 0} \frac{z^{\top} Q z}{z^{\top} z} - \lambda^{min}_{Q_d} = \lambda^{max}_{Q} - \lambda^{min}_{Q_d}, \quad (28)$$

where the last equality follows by recalling the definition of the induced 2-norm of a symmetric square matrix.