Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward-backward splitting, and regularized Gauss-Seidel methods
Summary (3 min read)
1 Introduction
- In Section 2, the authors consider functions satisfying the Kurdyka-Łojasiewicz inequality.
- The authors recover and improve previous results on gradient methods (Section 3) and proximal algorithms (Section 4).
- The convergence results the authors obtained involve different assumptions on the linear operator A: they either assume that ‖A‖ < 1 [11, Theorem 3] or that A satisfies the restricted isometry property [12, Theorem 4].
2.1 Some definitions from variational analysis
- The notion of subdifferential plays a central role in the theoretical and algorithmic developments that follow.
- The limiting processes used in an algorithmic context necessitate the introduction of the more stable notion of limiting subdifferential ([47]) (or simply subdifferential) of f.
- These generalized notions of differentiation give rise to a generalized notion of critical point.
- The authors end this section with a few words on an important class of functions intimately linked to projection mappings: the indicator functions.
2.2 Kurdyka-Łojasiewicz inequality: the nonsmooth case
- The authors begin this section with a brief discussion of real semi-algebraic sets and functions, which provide a very rich class of functions satisfying the Kurdyka-Łojasiewicz inequality.
- One easily sees that the class of semi-algebraic sets is stable under finite unions, finite intersections, Cartesian products, and complementation, and that polynomial functions are, of course, semi-algebraic.
- Of course, this result also holds when replacing sup by inf.
- Proper lower semicontinuous functions which satisfy the Kurdyka-Łojasiewicz inequality at each point of dom ∂f are called KL functions; the inequality itself is recalled after this list.
- Such examples are discussed at length in [5], and they strongly motivate the present study.
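For reference, the nonsmooth Kurdyka-Łojasiewicz inequality alluded to above takes the following standard form (this is the usual statement from the literature, with the desingularizing function denoted φ; it is a paraphrase, not a verbatim quote of the paper's definition):

```latex
% Kurdyka-Lojasiewicz inequality at \bar{x} \in \operatorname{dom}\partial f:
% there exist \eta \in (0,+\infty], a neighborhood U of \bar{x}, and a
% continuous concave function \varphi : [0,\eta) \to [0,+\infty) with
% \varphi(0)=0, \varphi \in C^1 on (0,\eta), and \varphi' > 0, such that
% for all x \in U with f(\bar{x}) < f(x) < f(\bar{x}) + \eta,
\varphi'\bigl(f(x)-f(\bar{x})\bigr)\,\operatorname{dist}\bigl(0,\partial f(x)\bigr)\;\ge\;1.
```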
2.3 An inexact descent convergence result for KL functions
- In the sequel, the authors consider sequences (xk)k∈N which satisfy the following conditions, subsequently referred to as H1, H2, H3. H1 (sufficient-decrease condition): f(xk+1) + a‖xk+1 − xk‖² ≤ f(xk) for some a > 0; H2 (relative-error condition): there exists wk+1 ∈ ∂f(xk+1) with ‖wk+1‖ ≤ b‖xk+1 − xk‖ for some b > 0; H3 (continuity condition): there exist a subsequence (xkj)j∈N and a point x̃ such that xkj → x̃ and f(xkj) → f(x̃) as j → ∞. (A numerical illustration of H1 and H2 follows this list.)
- Consider a sequence (xk)k∈N which satisfies conditions H1, H2.
- Simply reproduce the beginning of the proof of the previous lemma.
- Theorem 2.12 (Local convergence to global minima).
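To make H1 and H2 concrete, here is a small numerical check (not taken from the paper): plain gradient descent on a smooth polynomial objective satisfies an H1-type sufficient decrease and an H2-type relative-error bound along its trajectory. The objective, the step size t, and the constants a and b are illustrative assumptions.

```python
import numpy as np

# Gradient descent x_{k+1} = x_k - t * grad_f(x_k): since grad_f(x_{k+1}) is
# the (sub)gradient at x_{k+1} and x_{k+1} - x_k = -t * grad_f(x_k), H1 and H2
# can be verified directly along the iterates.

def f(x):            # a polynomial, hence semi-algebraic (and KL), objective
    return 0.25 * np.sum(x**4) + 0.5 * np.sum(x**2)

def grad_f(x):
    return x**3 + x

t, a, b = 0.05, 1.0, 20.0          # illustrative constants, not tuned
x = np.array([2.0, -1.5])
for k in range(100):
    x_new = x - t * grad_f(x)
    step = np.linalg.norm(x_new - x)
    # H1: f(x_{k+1}) + a * ||x_{k+1} - x_k||^2 <= f(x_k)
    assert f(x_new) + a * step**2 <= f(x) + 1e-12
    # H2: ||grad_f(x_{k+1})|| <= b * ||x_{k+1} - x_k||
    assert np.linalg.norm(grad_f(x_new)) <= b * step + 1e-12
    x = x_new
print("H1/H2 hold along the trajectory; last iterate:", x)
```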
3 Inexact gradient methods
- The first natural domain of application of their previous results concerns the simplest first-order methods, namely gradient methods.
- As the authors show, their abstract framework (Theorem 2.9) makes it possible to recover some of the results of [1].
- To illustrate the versatility of their algorithmic framework, the authors also consider a fairly general semi-algebraic feasibility problem and provide, in the spirit of [42], a local convergence proof for an inexact averaged projection method.
3.1 General convergence result
- To illustrate the variety of dynamics covered by Algorithm 1, let us show how variable metric gradient algorithms can be cast in this framework.
- This type of quadratic model arises, for instance, in trust-region methods (see [1], which is also connected to the Łojasiewicz inequality).
- For the convergence analysis of Algorithm 1, the authors of course use the elementary but important descent lemma (see, for example, [50, 3.2.12]).
- The authors then have the following result: Theorem 3.2.
- The sequence (xk)k∈N is assumed to be bounded. (A schematic sketch of such an inexact gradient iteration follows this list.)
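The following Python sketch shows one way an inexact gradient iteration in the spirit of Algorithm 1 can be organized: the gradient is perturbed by a relative error, and a sufficient-decrease test safeguards each step. The error model, the constant sigma, and the backtracking rule are illustrative assumptions, not the paper's exact scheme.

```python
import numpy as np

def inexact_gradient(f, grad_f, x0, t=0.05, sigma=1e-3, rel_err=0.2,
                     tol=1e-8, max_iter=1000, rng=np.random.default_rng(0)):
    """Gradient method with a relative error on the direction and an
    H1-type sufficient-decrease safeguard (illustrative sketch)."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < tol:
            break
        # perturbation with norm at most rel_err * ||g|| (relative error)
        e = rng.standard_normal(x.shape)
        e *= rel_err * np.linalg.norm(g) / (np.linalg.norm(e) + 1e-16)
        x_new = x - t * (g + e)
        # sufficient-decrease test; halve the step until it holds
        while f(x_new) > f(x) - sigma * np.linalg.norm(x_new - x)**2:
            t *= 0.5
            x_new = x - t * (g + e)
        x = x_new
    return x

# usage on a simple polynomial (semi-algebraic) objective
x_star = inexact_gradient(lambda x: 0.25 * np.sum(x**4) + 0.5 * np.sum(x**2),
                          lambda x: x**3 + x, x0=[2.0, -1.5])
```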
3.2 Prox-regularity
- When considering nonconvex feasibility problems, the authors are led to consider squared distance functions to nonconvex sets.
- Contrary to what happens in the standard convex setting, such functions may fail to be differentiable.
- The key concept of prox-regularity provides a characterization of the local differentiability of these functions and, as the authors show in the next section, it in turn allows the design of averaged projection methods with interesting convergence properties.
- Let us gather the following definitions and properties concerning F that are fundamental for their purposes.
3.3 Averaged projections for feasibility problems
- Moreover, this sequence has finite length and converges to a feasible point x̄, i.e., a point such that x̄ ∈ F1 ∩ · · · ∩ Fp.
- Let us first observe that the function f (given by (23)) is semi-algebraic, because the distance function to any nonempty semi-algebraic set is semi-algebraic (see Lemma 2.3 or [30, 15]).
- Applying Corollary 2.7, the authors get xk+1 ∈ B(x∗, ρ), and their induction proof is complete.
- The sets Fi are assumed to have a linearly regular intersection at some point x̄, an important concept that originates from [47, Theorem 2.8]. (A sketch of the averaged projection iteration follows this list.)
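A hedged sketch of the (exact) averaged projection iteration: with f(x) = ½ Σi dist(x, Fi)², one has ∇f(x) = Σi (x − PFi(x)) wherever the projections are single-valued, so the gradient step xk+1 = xk − (θ/p)∇f(xk) averages the projections. The sets and the relaxation parameter θ below are illustrative assumptions.

```python
import numpy as np

def averaged_projections(projections, x0, theta=1.0, max_iter=500, tol=1e-10):
    """x+ = (1 - theta) * x + theta * average of P_i(x): the gradient step
    on f(x) = 0.5 * sum_i dist(x, F_i)^2 with step theta/p."""
    x = np.asarray(x0, dtype=float)
    p = len(projections)
    for _ in range(max_iter):
        avg = sum(P(x) for P in projections) / p
        x_new = (1 - theta) * x + theta * avg
        if np.linalg.norm(x_new - x) < tol:
            return x_new
        x = x_new
    return x

# Example: intersect the unit circle (nonconvex, prox-regular) with the
# line {x : x[0] = 0.5} in R^2; the limit lies in both sets.
P_circle = lambda x: x / np.linalg.norm(x)
P_line   = lambda x: np.array([0.5, x[1]])
print(averaged_projections([P_circle, P_line], x0=[1.0, 1.0]))
```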
4 Inexact proximal algorithm
- Let us first recall the exact version of the proximal algorithm for nonconvex functions [36, 3].
- In view of the assumption inf f > −∞, the lower semicontinuity of f and the coercivity of the squared norm imply that proxλf has nonempty values.
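A minimal numerical sketch of this proximal iteration, assuming SciPy is available for the inner minimization: xk+1 minimizes y ↦ f(y) + ‖y − xk‖²/(2λ). Because the inner problem is solved only approximately by a generic routine, each step is in fact inexact, which is precisely the situation analyzed in Section 4.1. The objective and λ are illustrative choices.

```python
import numpy as np
from scipy.optimize import minimize

def prox_point(f, x0, lam=0.5, n_iter=50):
    """Proximal point iteration x_{k+1} in prox_{lam f}(x_k), with the inner
    minimization delegated to a generic (hence inexact) numerical solver."""
    x = np.asarray(x0, dtype=float)
    for _ in range(n_iter):
        moreau = lambda y, xc=x: f(y) + np.sum((y - xc)**2) / (2 * lam)
        x = minimize(moreau, x).x      # inner solve, warm-started at x_k
    return x

f = lambda y: np.sum(y**4 - y**2)      # nonconvex, bounded below
print(prox_point(f, x0=[0.3, -0.2]))   # converges to a critical point of f
```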
4.1 Convergence of an inexact proximal algorithm for KL functions
- Let us introduce an inexact version of the proximal point method.
- The following elementary lemma is useful for the convergence analysis of the algorithm.
- Direct algebraic manipulation of the above inequality yields the first inequality.
4.2 A variant for convex functions
- When the function under consideration is convex and satisfies the Kurdyka-Łojasiewicz property, Algorithm 2 can be simplified while its convergence properties are maintained.
- Consider the sequence (xk)k∈N generated by the following algorithm.
- Many convex functions are KL functions in particular: this fact was a strong motivation for the above result.
5 Inexact forward-backward algorithm
- This kind of structured problem occurs frequently, see for instance [25, 6] and Example 5.4.
- In the first part, the authors recall the classical forward-backward algorithm and explain how Algorithm 3 provides an inexact version of it; the special case of projection methods is also discussed.
- The authors end this section by providing illustrations of their results through problems coming from compressive sensing and hard-constrained feasibility problems.
5.1 The forward-backward splitting algorithm for nonconvex functions
- With step sizes γk satisfying 0 < γ ≤ γk ≤ λ < 1/L, where γ and λ are given thresholds and L is a Lipschitz constant of ∇h, the forward-backward splitting algorithm reads xk+1 ∈ proxγkg(xk − γk∇h(xk)). (50)
- An important observation here is that the sequence is not uniquely defined, since proxγkg may be multivalued; a surprising fact is that this freedom in the choice of the sequence does not impact the convergence properties of the algorithm (see Theorem 5.1).
- Let us show how this algorithm fits into the general framework of Algorithm 3.
- As for the proximal algorithm, the inexact version offers some flexibility in the choice of xk+1 by relaxing both the descent condition and the optimality conditions.
- The authors thus obtain the nonconvex nonsmooth gradient-projection method xk+1 ∈ PC(xk − γk∇h(xk)). (51) A sketch of this iteration follows.
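A sketch of the gradient-projection iteration (51): when g is the indicator function of a closed set C, the prox reduces to the (possibly multivalued) projection PC. The set C (a sphere, hence nonconvex) and the function h below are illustrative assumptions.

```python
import numpy as np

def gradient_projection(grad_h, proj_C, x0, gamma=0.1, n_iter=200):
    """x+ = P_C(x - gamma * grad_h(x)); any selection of P_C is allowed."""
    x = np.asarray(x0, dtype=float)
    for _ in range(n_iter):
        x = proj_C(x - gamma * grad_h(x))
    return x

# minimize h(x) = 0.5 * ||x - b||^2 over the unit sphere (nonconvex C)
b = np.array([3.0, 4.0])
x_star = gradient_projection(grad_h=lambda x: x - b,
                             proj_C=lambda x: x / np.linalg.norm(x),
                             x0=[1.0, 0.0])
print(x_star)    # approaches b / ||b|| = [0.6, 0.8]
```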
5.2 Convergence of an inexact forward-backward splitting algorithm
- Let us now return to the general inexact forward-backward splitting Algorithm 3, and show the following convergence result.
- The authors are precisely in the case which has been examined in Theorem 4.2 (continuous functions on their domain).
- Remark 5.2. (a) For the exact forward-backward splitting algorithm, the continuity assumption concerning g is unnecessary.
5.3 Examples
- Example 5.4 (Forward-backward splitting for compressive sensing).
- At the same time, they provide a very general convergence result which can be immediately generalized to compressive sensing problems involving semi-algebraic or real-analytic nonlinear measurements (a sketch of the specialized scheme appears after this list).
- By applying the forward-backward splitting algorithm to this problem, the authors aim at finding a point which satisfies the hard constraints modelled by F , while the other constraints are satisfied in a possibly weaker sense (see [25] and references therein).
- Let us now consider the KL analysis in the regular intersection case (see definition in Remark 3.6).
- To this end, the authors will use the following result [42, Proposition 8.5] (based itself on a characterization given in [37]).
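For the compressive sensing model of Example 5.4 with g(x) = λ‖x‖0 and h(x) = ½‖Ax − b‖², the forward-backward iteration specializes to componentwise hard thresholding (see the prox formula in Q14 below). The following sketch uses randomly generated illustrative data; it is not the paper's experiment.

```python
import numpy as np

def hard_threshold(u, tau):
    # a selection of the multivalued prox of tau*||.||_0: threshold sqrt(2*tau)
    return np.where(np.abs(u) > np.sqrt(2 * tau), u, 0.0)

def fbs_l0(A, b, lam, x0, n_iter=500):
    """Forward-backward splitting for lam*||x||_0 + 0.5*||Ax - b||^2."""
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of grad h
    gamma = 0.9 / L                        # step size below the 1/L threshold
    x = np.asarray(x0, dtype=float)
    for _ in range(n_iter):
        grad = A.T @ (A @ x - b)           # forward (gradient) step on h
        x = hard_threshold(x - gamma * grad, gamma * lam)  # backward step on g
    return x

rng = np.random.default_rng(0)
A = rng.standard_normal((30, 100))
x_true = np.zeros(100)
x_true[[5, 17, 42]] = [1.5, -2.0, 1.0]
b = A @ x_true
print(np.nonzero(fbs_l0(A, b, lam=0.05, x0=np.zeros(100)))[0])  # support found
```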
7 Conclusion
- Very often, iterative minimization algorithms rely on the inexact solution of minimization subproblems, whose exact solution may be almost as difficult to obtain as that of the original problem.
- Even when a minimization subproblem can be solved with high accuracy, its solutions are mere approximations of the solution of the original problem.
- In these cases, over-solving the minimization subproblems would increase the computational burden of the method and may slow down the final computation of a good approximation of the solution.
- In particular, their abstract scheme was designed to handle relative errors, because practical methods always involve numerical approximation, e.g., the representation of real numbers as floating-point numbers of fixed byte length.
- Moreover, the authors also supplied general stopping criteria for the solution of the minimization subproblems.
Frequently Asked Questions (14)
Q2. What have the authors stated for future work in "Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward-backward splitting, and regularized Gauss-Seidel methods"?
The computational implementation of the methods analyzed in this paper, as well as these stopping rules are topics for future research.
Q3. What is the result of the inexact averaged projection algorithm?
If x0 is sufficiently close to F1 ∩ · · · ∩ Fp, then the inexact averaged projection algorithm reduces to the gradient method xk+1 = xk − (θ/p)∇f(xk) + εk, with f being given by (23), which therefore defines a unique sequence.
Q4. Why does the algorithm converge to a global minimizer?
Due to the fact that a convex function has at most one critical value, the bounded sequences generated by the above algorithms converge to a global minimizer.
Q5. What is the first condition intended to model?
The first condition is intended to model a descent property: since it involves a measure of the quality of the descent, the authors call it a sufficient-decrease condition (see [20] for an early paper on this subject, [7] for an interpretation of this condition in decision sciences, and [43] for a discussion of this type of condition in a particular nonconvex nonsmooth optimization setting).
Q6. What is the main assumption for the study of such algorithms?
In this paper, their central assumption for the study of such algorithms is that the function f satisfies the (nonsmooth) Kurdyka-Łojasiewicz inequality, which means, roughly speaking, that the functions under consideration are sharp up to a reparametrization (see Section 2.2).
Q7. How does the sequence of values (f(xk))k∈N behave?
If (xk)k∈N is bounded, then it converges to a minimizer of f, and the sequence of values f(xk) converges to the program value min f.
Q8. What is the importance of the Kurdyka-Łojasiewicz inequality?
In the context of optimization, the importance of the Kurdyka-Łojasiewicz inequality is due to the fact that many problems involve functions satisfying such inequalities, and it is often elementary to check that such an inequality is satisfied; real semi-algebraic functions provide a very rich class of functions satisfying the Kurdyka-Łojasiewicz inequality. See [5] for a thorough discussion of these aspects, and also Section 2.2 for a simple illustration.
Q9. What is assumed about the eigenvalues of the matrices A_i^k?
For each i in {1, . . . , p}, take a sequence of symmetric positive definite matrices (A_i^k)k∈N of size ni such that the eigenvalues of each A_i^k (k ∈ N, i ∈ {1, . . . , p}) lie in a fixed interval [λ, λ̄].
Q10. How can the authors solve the minimization subproblems?
In these cases, over-solving the minimization subproblems would increase the computational burden of the method, and may slow down the final computation of a good approximation of the solution.
Q11. Why is g + h a KL function?
To see that g + h is a KL function, the authors simply note that h is a polynomial function and that ‖ · ‖0 has a piecewise linear graph, hence the sum g + h is semi-algebraic.
Q12. What is the effect of under-solving the minimization subproblems?
On the other hand, under-solving the minimization subproblems may result in a breakdown of the algorithm, and convergence to a solution may be lost.
Q13. What is the subdifferential of f at x ∈ dom f?
The subdifferential of f at x ∈ dom f, written ∂f(x), is defined as follows: ∂f(x) := {v ∈ Rn : ∃ xk → x, f(xk) → f(x), vk ∈ ∂̂f(xk), vk → v}.
Q14. What is the proximal map of the counting norm when n = 1?
When n = 1, the counting norm is denoted by |·|0; in that case one easily establishes that proxγλ|·|0(u) = u if |u| > √(2γλ), {0, u} if |u| = √(2γλ), and 0 otherwise.
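A tiny sketch of this scalar prox, returning the full (possibly two-element) solution set at the tie point; the function name and interface are chosen for illustration.

```python
import numpy as np

def prox_counting(u, gamma, lam):
    """Prox of y -> gamma*lam*|y|_0 at u: compare the cost gamma*lam of
    keeping u with the cost 0.5*u^2 of setting it to zero."""
    t = np.sqrt(2 * gamma * lam)   # tie point where both costs are equal
    if abs(u) > t:
        return {u}
    if abs(u) == t:
        return {0.0, u}
    return {0.0}

print(prox_counting(2.0, 0.5, 1.0))   # threshold t = 1.0 -> {2.0}
print(prox_counting(1.0, 0.5, 1.0))   # |u| == t -> {0.0, 1.0}
```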