Journal ArticleDOI

The Tortoise and the Hare restart GMRES

Mark Embree1
01 Jan 2003-Siam Review (Society for Industrial and Applied Mathematics)-Vol. 45, Iss: 2, pp 259-266
TL;DR: Two simple examples are presented where GMRES(1) converges exactly in three iterations, while GMRES(2) stagnates, revealing that GMRES(1) convergence can be extremely sensitive to small changes in the initial residual.
Abstract: When solving large nonsymmetric systems of linear equations with the restarted GMRES algorithm, one is inclined to select a relatively large restart parameter in the hope of mimicking the full GMRES process. Surprisingly, cases exist where small values of the restart parameter yield convergence in fewer iterations than larger values. Here, two simple examples are presented where GMRES(1) converges exactly in three iterations, while GMRES(2) stagnates. One of these examples reveals that GMRES(1) convergence can be extremely sensitive to small changes in the initial residual.


THE TORTOISE AND THE HARE RESTART GMRES
MARK EMBREE
Abstrat.
When solving large nonsymmetri systems of linear equations with the restarted
GMRES algorithm, one is inlined to selet a relatively large restart parameter in the hop e of
mimiking the full GMRES pro ess. Surprisingly, ases exist where small values of the restart
parameter yield onvergene in fewer iterations than larger values. Here, two simple examples are
presented where GMRES(1) onverges exatly in three iterations, while GMRES(2) stagnates. One
of these examples reveals that GMRES(1) onvergene an b e extremely sensitive to small hanges
in the initial residual.
Key words.
Restarted GMRES, Krylov subspae methods.
AMS sub jet lassiations.
65F10, 37N30
1. Introduction. GMRES is an iterative method for solving large nonsymmetric systems of linear equations, Ax = b [8]. Throughout science and engineering, this algorithm and its variants routinely solve problems with millions of degrees of freedom. Its popularity is rooted in an optimality condition: At the kth iteration, GMRES computes the solution estimate x_k that minimizes the Euclidean norm of the residual r_k = Ax_k - b over a subspace of dimension k,

    \|r_k\| = \min_{p \in P_k, \, p(0) = 1} \|p(A) r_0\|,                    (1.1)

where P_k denotes those polynomials with degree not exceeding k, and r_0 = b - Ax_0 is the initial residual. As each iteration enlarges the minimizing subspace, the residual norm decreases monotonically.
GMRES optimality comes at a cost, however, since each iteration demands both more arithmetic and memory than the one before it. A standard work-around is to restart the process after some fixed number of iterations, m. The resulting algorithm, GMRES(m), uses the approximate solution x_m as the initial guess for a new run of GMRES, continuing this process until convergence. The global optimality of the original algorithm is lost, so although the residual norms remain monotonic, the restarted process can stagnate with a non-zero residual, failing to ever converge [8]. Since GMRES(m) enforces local optimality on m-dimensional spaces, one anticipates that increasing m will yield convergence in fewer iterations. Many practical examples confirm this intuition.
We denote the kth residual of GMRES(m) by r_k^{(m)}. To be precise, one cycle between restarts of GMRES(m) is counted as m individual iterations. Conventionally, then, one expects \|r_k^{(m)}\| \le \|r_k^{(\ell)}\| for \ell < m. Indeed, this must be true when k \le m.
Surprisingly, inreasing the restart parameter sometimes leads to
slower
onver-
gene:
k
r
(
m
)
k
k
>
k
r
(
`
)
k
k
for
` < m < k
. The author enountered this phenomenon
while solving a disretized onvetion-diusion equation desribed in [4℄. In unpub-
lished exp eriments, de Sturler [1 and Walker and Watson [11 observed similar b e-
havior arising in pratial appliations. One wonders, how muh smaller than
k
r
(
m
)
k
k
might
k
r
(
`
)
k
k
be? The smallest possible ases ompare GMRES(1) to GMRES(2) for
3-by-3 matries. Eiermann, Ernst, and Shneider present suh an example for whih
Oxford University Computing Laboratory, Wolfson Building, Parks Road, Oxford OX1 3QD,
United Kingdom (mark.embreeomlab.ox.a.uk). Supp orted by UK Engineering and Physial Si-
enes Researh Counil Grant GR/M12414.
1

2
MARK EMBREE
k
r
(1)
4
k
=
k
r
(2)
4
k
= 0
:
2154
: : :
[2, pp. 284{285℄. Otherwise, the phenomenon we desrib e
has apparently reeived little attention in the literature.
The purpose of this article is twofold. First, we describe a pair of extreme examples where GMRES(1) converges exactly at the third iteration, while GMRES(2) seems to never converge. The second example leads to our second point: Small perturbations to the initial residual can dramatically alter the convergence behavior of GMRES(1).
2. First Example. Consider using restarted GMRES to solve Ax = b for

    A = \begin{pmatrix} 1 & 1 & 1 \\ 0 & 1 & 3 \\ 0 & 0 & 1 \end{pmatrix},
    \qquad
    b = \begin{pmatrix} 2 \\ -4 \\ 1 \end{pmatrix}.                    (2.1)
Taking x_0 = 0 yields the initial residual r_0 = b. Using the fact that A and r_0 are real, we can derive explicit formulas for GMRES(1) and GMRES(2) directly from the GMRES optimality condition (1.1). The recurrence for GMRES(1),

    r_{k+1}^{(1)} = r_k^{(1)} - \frac{r_k^{(1)T} A r_k^{(1)}}{r_k^{(1)T} A^T A r_k^{(1)}} \, A r_k^{(1)},                    (2.2)

was studied as early as the 1950s [3, §71], [7]. For the A and r_0 = b defined in (2.1), this iteration converges exactly at the third step:
    r_1^{(1)} = \begin{pmatrix} 3 \\ -3 \\ 0 \end{pmatrix}, \qquad
    r_2^{(1)} = \begin{pmatrix} 3 \\ 0 \\ 0 \end{pmatrix}, \qquad
    r_3^{(1)} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}.
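These three residuals can be checked directly from the recurrence (2.2). The short sketch below is my own illustration (exact rational arithmetic via Python's Fraction type, not the author's code); the helper names are arbitrary.

```python
# Sketch: the GMRES(1) recurrence (2.2) applied to example (2.1), carried out
# in exact rational arithmetic with Python's Fraction type.
from fractions import Fraction

A = [[1, 1, 1],
     [0, 1, 3],
     [0, 0, 1]]
r = [Fraction(2), Fraction(-4), Fraction(1)]       # r0 = b, since x0 = 0

def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

for k in (1, 2, 3):
    Ar = matvec(A, r)
    step = dot(r, Ar) / dot(Ar, Ar)                # (r^T A r) / (r^T A^T A r)
    r = [ri - step * Ari for ri, Ari in zip(r, Ar)]
    print(k, r)
# expected: (3, -3, 0), then (3, 0, 0), then (0, 0, 0): exact convergence at k = 3
```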
Expressions for one GMRES(2) cycle can likewise be derived using elementary calculus. The updated residual takes the form r_{k+2}^{(2)} = p(A) r_k^{(2)}, where p(z) = 1 + \alpha z + \beta z^2 is a quadratic whose coefficients \alpha = \alpha(A, r_k^{(2)}) and \beta = \beta(A, r_k^{(2)}) are given by

    \alpha = \frac{(r_k^{(2)T} A A r_k^{(2)})(r_k^{(2)T} A^T A A r_k^{(2)}) - (r_k^{(2)T} A r_k^{(2)})(r_k^{(2)T} A^T A^T A A r_k^{(2)})}
                  {(r_k^{(2)T} A^T A r_k^{(2)})(r_k^{(2)T} A^T A^T A A r_k^{(2)}) - (r_k^{(2)T} A^T A A r_k^{(2)})(r_k^{(2)T} A^T A A r_k^{(2)})},

    \beta = \frac{(r_k^{(2)T} A r_k^{(2)})(r_k^{(2)T} A^T A A r_k^{(2)}) - (r_k^{(2)T} A A r_k^{(2)})(r_k^{(2)T} A^T A r_k^{(2)})}
                 {(r_k^{(2)T} A^T A r_k^{(2)})(r_k^{(2)T} A^T A^T A A r_k^{(2)}) - (r_k^{(2)T} A^T A A r_k^{(2)})(r_k^{(2)T} A^T A A r_k^{(2)})}.
Exeuting GMRES(2) on the matrix and right hand side (2.1) reveals
r
(2)
1
=
0
3
3
0
1
A
;
r
(2)
2
=
1
2
0
3
0
3
1
A
;
r
(2)
3
=
1
28
0
24
27
33
1
A
;
r
(2)
4
=
1
122
0
81
108
162
1
A
:
The inferiority of GMRES(2) continues well beyond the fourth iteration. For example:

    k     \|r_k^{(2)}\| / \|r_0\|
    5     0.376888290025532...
    10    0.376502488858910...
    15    0.376496927936533...
    20    0.376496055944867...
    25    0.376495995285626...
    30    0.376495984909087...
Fig. 1. Convergence curves for GMRES(1) and GMRES(2) applied to (2.1) with x_0 = 0. [Plot: \|r_k^{(m)}\| / \|r_0\| versus iteration k = 0, ..., 30, logarithmic vertical scale from 10^0 down to 10^{-15}.]
The entire onvergene urve for the rst thirty iterations is shown in Figure 1, based
on p erforming GMRES(2) in exat arithmeti using Mathematia.
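Readers without Mathematica can repeat the computation exactly with rational arithmetic; the sketch below (my own illustration, not the author's code) iterates one GMRES(2) cycle at a time using the coefficient formulas for α and β displayed above, and should reproduce the residual vectors and the even-numbered rows of the table.

```python
# Sketch: GMRES(2) applied to (2.1) in exact arithmetic, one two-step cycle at
# a time, using the closed-form coefficients alpha and beta displayed above.
# (Residuals at odd iteration numbers fall mid-cycle and are not recorded here.)
from fractions import Fraction

A = [[1, 1, 1],
     [0, 1, 3],
     [0, 0, 1]]

def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def gmres2_cycle(r):
    """One GMRES(2) cycle: r -> r + alpha*A*r + beta*A*A*r."""
    Ar, AAr = matvec(A, r), matvec(A, matvec(A, r))
    den = dot(Ar, Ar) * dot(AAr, AAr) - dot(Ar, AAr) ** 2
    alpha = (dot(r, AAr) * dot(Ar, AAr) - dot(r, Ar) * dot(AAr, AAr)) / den
    beta  = (dot(r, Ar) * dot(Ar, AAr) - dot(r, AAr) * dot(Ar, Ar)) / den
    return [ri + alpha * ai + beta * aai for ri, ai, aai in zip(r, Ar, AAr)]

r0 = [Fraction(2), Fraction(-4), Fraction(1)]
r = r0
for cycle in range(1, 16):                         # 15 cycles = 30 iterations
    r = gmres2_cycle(r)
    k = 2 * cycle
    if k in (2, 4):
        print("r_%d =" % k, r)                     # expect (3/2, 0, 3/2), then (81, -108, 162)/122
    if k % 10 == 0:
        ratio = float(dot(r, r) / dot(r0, r0)) ** 0.5
        print("k = %2d   ||r_k|| / ||r_0|| = %.15f" % (k, ratio))
```

The k = 10, 20, 30 lines should match the corresponding rows of the table above.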
The partiular value of
b
(and thus
r
0
) studied ab ove is exeptional, as it is
unusual for GMRES(1) to onverge exatly in three iterations. Remarkably, though,
GMRES(1) maintains sup eriority over GMRES(2) for a wide range of initial residuals.
For this matrix
A
, GMRES(2) onverges exatly in one yle for any initial residual
with zero in the third omp onent, so we restrit attention to residuals normalized to
the form
r
0
= (
; ;
1)
T
. Figure 2 indiates that GMRES(2) makes little progress for
most suh residuals, while GMRES(1) onverges to high auray for the vast ma jor-
ity of these
r
0
values. The olor in eah plot reets the magnitude of
k
r
(
m
)
100
k
=
k
r
0
k
:
Blue indiates satisfatory onvergene, while red signals little progress in one hun-
dred iterations. (To ensure this data's delity, we performed these omputations in
both double and quadruple preision arithmeti; dierenes b etween the two were
negligible.)
To gain an appreciation for the dynamics behind Figure 2, we first examine the action of a single GMRES(1) step. From (2.2) it is clear that GMRES(1) will completely stagnate only when r_0^T A r_0 = 0. For the matrix A specified in (2.1) and r_0 = (\alpha, \beta, 1)^T, this condition reduces to

    \alpha^2 + \alpha\beta + \beta^2 + \alpha + 3\beta + 1 = 0,                    (2.3)

the equation for an oblique ellipse in the (\alpha, \beta) plane.
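As a quick illustration (my own, not from the original article): the point (\alpha, \beta) = (-1, -1) satisfies (2.3), so GMRES(1) makes no progress at all from r_0 = (-1, -1, 1)^T.

```python
# Sketch: the point (alpha, beta) = (-1, -1) satisfies (2.3), so GMRES(1)
# stagnates completely for the initial residual r0 = (-1, -1, 1)^T.
import numpy as np

A = np.array([[1., 1., 1.],
              [0., 1., 3.],
              [0., 0., 1.]])
alpha, beta = -1.0, -1.0
print(alpha**2 + alpha*beta + beta**2 + alpha + 3*beta + 1)   # 0.0: on the ellipse

r = np.array([alpha, beta, 1.0])
Ar = A @ r
print(r @ Ar)                                 # 0.0: the step in (2.2) vanishes
print(r - (r @ Ar) / (Ar @ Ar) * Ar)          # identical to r: no progress
```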
Now writing r_k^{(1)} = (\alpha, \beta, 1)^T, consider the map r_k^{(1)} \mapsto s_{k+1}^{(1)} that projects r_{k+1}^{(1)} into the (\alpha, \beta) plane,

    s_{k+1}^{(1)} = \frac{1}{(r_{k+1}^{(1)})_3} \begin{pmatrix} (r_{k+1}^{(1)})_1 \\ (r_{k+1}^{(1)})_2 \end{pmatrix},
Fig. 2. Convergence of GMRES(1) (left) and GMRES(2) (right) for the matrix in (2.1) over a range of initial residuals of the form r_0 = (\alpha, \beta, 1)^T. The color indicates \|r_{100}^{(m)}\| / \|r_0\| on a logarithmic scale: blue regions correspond to initial residuals that converge satisfactorily, while the red regions show residuals that stagnate or converge very slowly. [Plots over -10 <= \alpha, \beta <= 10; color scale from 10^0 down to 10^{-15}.]
where (r_{k+1}^{(1)})_j denotes the jth entry of r_{k+1}^{(1)}, which itself is derived from r_k^{(1)} via (2.2).
For the present example, we have

    s_{k+1}^{(1)} = \frac{1}{\beta^2 + \alpha\beta + \alpha + 5\beta + 10}
    \begin{pmatrix} 3\alpha\beta - \beta^3 - 4\beta^2 + 9\alpha - 4\beta - 1 \\
                    \beta^3 + \alpha\beta^2 - 3\alpha^2 + 2\beta^2 - 2\alpha\beta - 3\alpha + \beta - 3 \end{pmatrix}.                    (2.4)
We an lassify the xed p oints (
;
) satisfying (2.3) by investigating the Jaobian
of (2.4). One of its eigenvalues is always one, while the other eigenvalue varies ab ove
and b elow one in magnitude. In the left plot of Figure 2, we show the stable portion
of the ellipse (2.3) in blak and the unstable part in white.
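This classification can be checked numerically without writing out (2.4): apply one GMRES(1) step to r = (\alpha, \beta, 1)^T, project back to the (\alpha, \beta) plane, and differentiate by finite differences. The sketch below is my own check; the sample points and step size are arbitrary choices. One Jacobian eigenvalue should be close to one, while the magnitude of the other indicates whether the corresponding portion of the ellipse is stable or unstable.

```python
# Sketch: classify points of the ellipse (2.3) numerically by estimating the
# Jacobian of the projected GMRES(1) map with central finite differences.
import numpy as np

A = np.array([[1., 1., 1.],
              [0., 1., 3.],
              [0., 0., 1.]])

def proj_map(s):
    """One GMRES(1) step (2.2) on r = (alpha, beta, 1), projected to (alpha, beta)."""
    r = np.array([s[0], s[1], 1.0])
    Ar = A @ r
    r_next = r - (r @ Ar) / (Ar @ Ar) * Ar
    return r_next[:2] / r_next[2]

def jacobian(s, h=1e-6):
    cols = [(proj_map(s + h * e) - proj_map(s - h * e)) / (2 * h) for e in np.eye(2)]
    return np.column_stack(cols)

for alpha in np.linspace(-0.9, 1.6, 6):
    # beta chosen so that (alpha, beta) solves the quadratic (2.3)
    beta = (-(alpha + 3) + np.sqrt((alpha + 3)**2 - 4*(alpha**2 + alpha + 1))) / 2
    eigs = np.linalg.eigvals(jacobian(np.array([alpha, beta])))
    print(f"alpha = {alpha:+.2f}  beta = {beta:+.2f}  |eigenvalues| = {np.abs(eigs).round(3)}")
```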
We an similarly analyze GMRES(2). This iteration will never progress when, in
addition to the stagnation ondition for GMRES(1),
r
0
also satises
r
T
0
AAr
0
= 0.
For the present example, this requirement implies
2
+ 2
+
2
+ 5
+ 6
+ 1 = 0
;
the equation for an oblique parab ola. This urve intersets the ellipse (2.3) at two
points, drawn as dots in the right plot of Figure 2, the only stagnating residuals
(
; ;
1)
T
for GMRES(2). We an analyze their stability as done above for GMRES(1).
The pro jeted map for this iteration,
r
(2)
k
7!
s
(2)
k
+2
, takes the form
s
(2)
k
+2
=
0
B
B
3
2
3
+ 4
+ 9
4
2
3
+ 4
+ 9
1
C
C
A
:
(2.5)
Analyzing the Jaobian for this GMRES(2) map at the pair of xed p oints, we nd one
to b e unstable (shown in blak in the right plot of Figure 2) while the other is stable
(shown in white). This stable xed p oint is an attrator for stagnating residuals.

RESTARTED GMRES
5
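One of the two stagnation points can be written down exactly: (\alpha, \beta) = (1, -1) satisfies both (2.3) and the parabola equation above, as direct substitution shows. The sketch below is my own numerical check (not the author's code): it verifies that a full GMRES(2) cycle leaves r_0 = (1, -1, 1)^T unchanged and estimates the Jacobian of the projected two-step map there by finite differences; the dominant eigenvalue exceeds one in magnitude, identifying this as the unstable fixed point.

```python
# Sketch: (alpha, beta) = (1, -1) satisfies both stagnation conditions, so a
# full GMRES(2) cycle leaves r0 = (1, -1, 1)^T unchanged.  A finite-difference
# Jacobian of the projected two-step map at this point has an eigenvalue
# larger than one in magnitude, so this is the unstable fixed point.
import numpy as np

A = np.array([[1., 1., 1.],
              [0., 1., 3.],
              [0., 0., 1.]])

def gmres2_map(s):
    """One full GMRES(2) cycle on r = (alpha, beta, 1), projected to (alpha, beta)."""
    r = np.array([s[0], s[1], 1.0])
    B = np.column_stack([A @ r, A @ A @ r])
    y, *_ = np.linalg.lstsq(B, r, rcond=None)
    r2 = r - B @ y
    return r2[:2] / r2[2]

r0 = np.array([1., -1., 1.])
print(r0 @ A @ r0, r0 @ A @ A @ r0)        # both 0: GMRES(2) makes no progress

s, h = np.array([1., -1.]), 1e-6
J = np.column_stack([(gmres2_map(s + h * e) - gmres2_map(s - h * e)) / (2 * h)
                     for e in np.eye(2)])
print(np.abs(np.linalg.eigvals(J)))        # dominant eigenvalue exceeds 1
```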
Fig. 3. Convergence curves for GMRES(1) and GMRES(2) applied to (3.1) with x_0 = 0. [Plot: \|r_k^{(m)}\| / \|r_0\| versus iteration k = 0, ..., 30, logarithmic vertical scale from 10^0 down to 10^{-15}.]
We return briey to the initial residual
r
0
= (2
;
4
;
1)
T
. After the rst few itera-
tions, the angle between
r
(2)
k
and the xed vetor steadily onverges to zero at the
rate 0
:
6452
: : :
suggested by the Jaobian's dominant eigenvalue. We onlude with
high ondene that GMRES(2) never onverges for this initial residual. (If one yle
of GMRES(
m
) pro dues a residual parallel to
r
0
, then either
r
(
m
)
m
=
r
0
or
r
(
m
)
m
=
0
.
Thus a residual an't remain xed in the nite (
;
) plane, but still onverge to
0
.)
3. Seond Example.
The matrix
A
in (2.1) is nondiagonalizable, and one might
be tempted to blame its surprising onvergene behavior on this fat. To demonstrate
that nondiagonalizablity is not an essential requirement, we exhibit a diagonalizable
matrix with eigenvalues
f
1
;
2
;
3
g
for whih restarted GMRES also pro dues extreme
behavior. Take
A
=
0
B
1 2
2
0 2 4
0 0 3
1
C
A
;
b
=
0
3
1
1
1
A
;
(3.1)
with
x
0
=
0
. Again, we onstrut the rst few residuals. For GMRES(1),
    r_1^{(1)} = \begin{pmatrix} 2 \\ -1 \\ 0 \end{pmatrix}, \qquad
    r_2^{(1)} = \begin{pmatrix} 2 \\ 0 \\ 0 \end{pmatrix}, \qquad
    r_3^{(1)} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix},

while GMRES(2) yields

    r_1^{(2)} = \begin{pmatrix} 2 \\ -1 \\ 0 \end{pmatrix}, \qquad
    r_2^{(2)} = \begin{pmatrix} 1 \\ 0 \\ -1 \end{pmatrix}, \qquad
    r_3^{(2)} = \frac{1}{17} \begin{pmatrix} 8 \\ 12 \\ -8 \end{pmatrix}, \qquad
    r_4^{(2)} = \frac{1}{67} \begin{pmatrix} -12 \\ 12 \\ -28 \end{pmatrix}.
Figure 3 illustrates the convergence curve for thirty iterations, again computed using exact arithmetic.
Citations
Journal ArticleDOI
TL;DR: Many advances in the development of Krylov subspace methods for the iterative solution of linear systems during the last decade and a half are reviewed.
Abstract: Many advances in the development of Krylov subspace methods for the iterative solution of linear systems during the last decade and a half are reviewed. These new developments include different versions of restarted, augmented, deflated, flexible, nested, and inexact methods. Also reviewed are methods specifically tailored to systems with special properties such as special forms of symmetry and those depending on one or more parameters. Copyright © 2006 John Wiley & Sons, Ltd.

408 citations


Cites background from "The Tortoise and the Hare restart G..."

  • ...We stress however that enlarging the subspace dimension does not always ensure faster convergence; see Eiermann, Ernst, and Schneider [94], Embree [102], for some critical examples and for pointers to further numerical evidence....


Journal ArticleDOI
TL;DR: A new technique is presented for accelerating the convergence of restarted GMRES by disrupting this alternating pattern of residual vectors, which resembles a full conjugate gradient method with polynomial preconditioning.
Abstract: We have observed that the residual vectors at the end of each restart cycle of restarted GMRES often alternate direction in a cyclic fashion, thereby slowing convergence. We present a new technique for accelerating the convergence of restarted GMRES by disrupting this alternating pattern. The new algorithm resembles a full conjugate gradient method with polynomial preconditioning, and its implementation requires minimal changes to the standard restarted GMRES algorithm.

194 citations


Additional excerpts

  • ..., [17, 13])....


Journal ArticleDOI
TL;DR: The approach presented here can apply not only to conventional processors but also to exotic technologies such as Field Programmable Gate Arrays (FPGA), Graphical Processing Units (GPU), and the Cell BE processor.
Abstract: By using a combination of 32-bit and 64-bit floating point arithmetic, the performance of many sparse linear algebra algorithms can be significantly enhanced while maintaining the 64-bit accuracy of the resulting solution. These ideas can be applied to sparse multifrontal and supernodal direct techniques and sparse iterative techniques such as Krylov subspace methods. The approach presented here can apply not only to conventional processors but also to exotic technologies such as Field Programmable Gate Arrays (FPGA), Graphical Processing Units (GPU), and the Cell BE processor.

100 citations


Cites background from "The Tortoise and the Hare restart G..."

  • ...Assuming for example that the bigger m the better does not guarantee better execution time, and sometimes the convergence can get even worse [20]....


  • ...Assuming, for example, that the bigger m the better does not guarantee better execution time, and sometimes the convergence can get even worse [Embree 2003]....


Journal ArticleDOI
TL;DR: This work studies the use of Krylov subspace recycling for the solution of a sequence of slowly-changing families of linear systems, where each family consists of shifted linear systems that differ in the coefficient matrix only by multiples of the identity.

70 citations


Cites methods from "The Tortoise and the Hare restart G..."

  • ...families of shifted linear systems, but it is based on restarted GMRES, meaning the method inherits the properties of stagnation and unpredictable convergence exhibited by restarted GMRES, see, e.g., [12, 17, 29]. An attractive idea would be to combine Frommer and Glässner's method with a subspace augmentation or deflation technology. Morgan's GMRES-DR [22] is one candidate, and in [8], this method has been e...


Journal ArticleDOI
TL;DR: A simple strategy for varying the restart parameter is proposed and some heuristic explanations for its effectiveness are provided based on analysis of the symmetric case.

61 citations

References
Journal ArticleDOI
TL;DR: An iterative method for solving linear systems, which has the property of minimizing at every step the norm of the residual vector over a Krylov subspace.
Abstract: We present an iterative method for solving linear systems, which has the property of minimizing at every step the norm of the residual vector over a Krylov subspace. The algorithm is derived from t...

10,907 citations


"The Tortoise and the Hare restart G..." refers background or methods in this paper

  • ...GMRES is an iterative method for solving large nonsymmetric systems of linear equations, Ax = b [8]....


  • ...The global optimality of the original algorithm is lost, so although the residual norms remain monotonic, the restarted process can stagnate with a nonzero residual, failing to ever converge [8]....


Journal ArticleDOI
TL;DR: The iterative scheme is shown to be a truncation of the standard implicitly shifted QR-iteration for dense problems and it avoids the need to explicitly restart the Arnoldi sequence.
Abstract: The Arnoldi process is a well-known technique for approximating a few eigenvalues and corresponding eigenvectors of a general square matrix. Numerical difficulties such as loss of orthogonality and assessment of the numerical quality of the approximations, as well as a potential for unbounded growth in storage, have limited the applicability of the method. These issues are addressed by fixing the number of steps in the Arnoldi process at a prescribed value k and then treating the residual vector as a function of the initial Arnoldi vector. This starting vector is then updated through an iterative scheme that is designed to force convergence of the residual to zero. The iterative scheme is shown to be a truncation of the standard implicitly shifted QR-iteration for dense problems and it avoids the need to explicitly restart the Arnoldi sequence. The main emphasis of this paper is on the derivation and analysis of this scheme. However, there are obvious ways to exploit parallelism through the matrix-vector ...

1,146 citations


"The Tortoise and the Hare restart G..." refers methods in this paper

  • ...Such effects might also arise from automatic shift-selection strategies in the restarted Arnoldi algorithm for calculating eigenvalues [10]....


Journal ArticleDOI
TL;DR: A survey of computational methods in linear algebra can be found in this article, where the authors discuss the means and methods of estimating the quality of numerical solution of computational problems, the generalized inverse of a matrix, the solution of systems with rectangular and poorly conditioned matrices, and more traditional questions such as algebraic eigenvalue problems and systems with a square matrix.
Abstract: The authors' survey paper is devoted to the present state of computational methods in linear algebra. Questions discussed are the means and methods of estimating the quality of numerical solution of computational problems, the generalized inverse of a matrix, the solution of systems with rectangular and poorly conditioned matrices, the inverse eigenvalue problem, and more traditional questions such as algebraic eigenvalue problems and the solution of systems with a square matrix (by direct and iterative methods).

667 citations

Journal Article
TL;DR: In this paper, the authors generalize the Bi-CGSTAB algorithm further, and overcome some shortcomings of BiCGStab2 by combining GMRES(l) and BiCG and profits from both.
Abstract: For a number of linear systems of equations arising from realistic problems, using the Bi-CGSTAB algorithm of van der Vorst [17] to solve these equations is very attractive. Unfortunately, for a large class of equations, where, for instance, Bi-CG performs well, the convergence of BiCGSTAB stagnates. This was observed specifically in case of discretized advection dominated PDE’s. The stagnation is due to the fact that for this type of equations the matrix has almost pure imaginary eigenvalues. With his BiCGStab2 algorithm Gutknecht [5] attempted to avoid this stagnation. Here, we generalize the Bi-CGSTAB algorithm further, and overcome some shortcomings of BiCGStab2. In some sense, the new algorithm combines GMRES(l) and Bi-CG and profits from both.

566 citations


"The Tortoise and the Hare restart G..." refers background in this paper

  • ...One also wonders if related algorithms, including GMRES restarted with an augmented subspace [2] and BiCGSTAB( ) [9], exhibit similarly unusual behavior....


Frequently Asked Questions (2)
Q1. What does the color indicate in the restarted GMRES convergence plots?

The color indicates \|r_{100}^{(m)}\| / \|r_0\| on a logarithmic scale: blue regions correspond to initial residuals that converge satisfactorily, while the red regions show residuals that stagnate or converge very slowly.

For the convection-diffusion discretization described in [4], GMRES(1) or GMRES(5) can outperform GMRES(20) on moderately refined grids.