
The Tortoise and the Hare restart GMRES

Mark Embree
01 Jan 2003 - Vol. 45, Iss. 2, pp. 259-266

THE TORTOISE AND THE HARE RESTART GMRES
MARK EMBREE
Abstrat.
When solving large nonsymmetri systems of linear equations with the restarted
GMRES algorithm, one is inlined to selet a relatively large restart parameter in the hop e of
mimiking the full GMRES pro ess. Surprisingly, ases exist where small values of the restart
parameter yield onvergene in fewer iterations than larger values. Here, two simple examples are
presented where GMRES(1) onverges exatly in three iterations, while GMRES(2) stagnates. One
of these examples reveals that GMRES(1) onvergene an b e extremely sensitive to small hanges
in the initial residual.
Key words.
Restarted GMRES, Krylov subspae methods.
AMS sub jet lassiations.
65F10, 37N30
1. Introduction. GMRES is an iterative method for solving large nonsymmetric systems of linear equations, Ax = b [8]. Throughout science and engineering, this algorithm and its variants routinely solve problems with millions of degrees of freedom. Its popularity is rooted in an optimality condition: At the kth iteration, GMRES computes the solution estimate x_k that minimizes the Euclidean norm of the residual r_k = b - Ax_k over a subspace of dimension k,

\[
\|r_k\| = \min_{\substack{p \in P_k \\ p(0)=1}} \|p(A)\, r_0\|,
\tag{1.1}
\]

where P_k denotes those polynomials with degree not exceeding k, and r_0 = b - Ax_0 is the initial residual. As each iteration enlarges the minimizing subspace, the residual norm decreases monotonically.
GMRES optimality comes at a cost, however, since each iteration demands both more arithmetic and memory than the one before it. A standard work-around is to restart the process after some fixed number of iterations, m. The resulting algorithm, GMRES(m), uses the approximate solution x_m as the initial guess for a new run of GMRES, continuing this process until convergence. The global optimality of the original algorithm is lost, so although the residual norms remain monotonic, the restarted process can stagnate with a non-zero residual, failing to ever converge [8]. Since GMRES(m) enforces local optimality on m-dimensional spaces, one anticipates that increasing m will yield convergence in fewer iterations. Many practical examples confirm this intuition.
We denote the kth residual of GMRES(m) by r_k^{(m)}. To be precise, one cycle between restarts of GMRES(m) is counted as m individual iterations. Conventionally, then, one expects ||r_k^{(m)}|| ≤ ||r_k^{(ℓ)}|| for ℓ < m. Indeed, this must be true when k ≤ m.
Surprisingly, increasing the restart parameter sometimes leads to slower convergence: ||r_k^{(m)}|| > ||r_k^{(ℓ)}|| for ℓ < m < k. The author encountered this phenomenon while solving a discretized convection-diffusion equation described in [4]. In unpublished experiments, de Sturler [1] and Walker and Watson [11] observed similar behavior arising in practical applications. One wonders, how much smaller than ||r_k^{(m)}|| might ||r_k^{(ℓ)}|| be? The smallest possible cases compare GMRES(1) to GMRES(2) for 3-by-3 matrices. Eiermann, Ernst, and Schneider present such an example for which ||r_4^{(1)}|| = ||r_4^{(2)}|| = 0.2154... [2, pp. 284-285]. Otherwise, the phenomenon we describe has apparently received little attention in the literature.

* Oxford University Computing Laboratory, Wolfson Building, Parks Road, Oxford OX1 3QD, United Kingdom (mark.embree@comlab.ox.ac.uk). Supported by UK Engineering and Physical Sciences Research Council Grant GR/M12414.
The purpose of this article is twofold. First, we describe a pair of extreme examples where GMRES(1) converges exactly at the third iteration, while GMRES(2) seems to never converge. The second example leads to our second point: small perturbations to the initial residual can dramatically alter the convergence behavior of GMRES(1).
2. First Example. Consider using restarted GMRES to solve Ax = b for

\[
A = \begin{pmatrix} 1 & 1 & 1 \\ 0 & 1 & 3 \\ 0 & 0 & 1 \end{pmatrix},
\qquad
b = \begin{pmatrix} 2 \\ -4 \\ 1 \end{pmatrix}.
\tag{2.1}
\]
Taking x_0 = 0 yields the initial residual r_0 = b. Using the fact that A and r_0 are real, we can derive explicit formulas for GMRES(1) and GMRES(2) directly from the GMRES optimality condition (1.1). The recurrence for GMRES(1),

\[
r_{k+1}^{(1)} = r_k^{(1)} - \frac{r_k^{(1)T} A\, r_k^{(1)}}{r_k^{(1)T} A^T A\, r_k^{(1)}}\, A\, r_k^{(1)},
\tag{2.2}
\]

was studied as early as the 1950s [3, §71], [7]. For the A and r_0 = b defined in (2.1), this iteration converges exactly at the third step:

\[
r_1^{(1)} = \begin{pmatrix} 3 \\ -3 \\ 0 \end{pmatrix}, \qquad
r_2^{(1)} = \begin{pmatrix} 3 \\ 0 \\ 0 \end{pmatrix}, \qquad
r_3^{(1)} = \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix}.
\]
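These three steps are easy to verify numerically; the following sketch (ours, not from the paper) simply iterates the recurrence (2.2):

```python
# Sketch (not from the paper): iterate the GMRES(1) recurrence (2.2)
# on example (2.1) and watch the residual vanish at step 3.
import numpy as np

A = np.array([[1.0, 1.0, 1.0],
              [0.0, 1.0, 3.0],
              [0.0, 0.0, 1.0]])
b = np.array([2.0, -4.0, 1.0])

r = b.copy()  # x0 = 0, so r0 = b
for k in range(1, 4):
    Ar = A @ r
    r = r - (r @ Ar) / (Ar @ Ar) * Ar   # one GMRES(1) step, per (2.2)
    print(k, r)
# Expected output: (3, -3, 0), (3, 0, 0), (0, 0, 0)
```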
Expressions for one GMRES(2) cycle can likewise be derived using elementary calculus. The updated residual takes the form r_{k+2}^{(2)} = p(A) r_k^{(2)}, where p(z) = 1 + αz + βz^2 is a quadratic whose coefficients α = α(A, r_k^{(2)}) and β = β(A, r_k^{(2)}) are given by

\[
\alpha = \frac{(r_k^{(2)T} AA\, r_k^{(2)})(r_k^{(2)T} A^T AA\, r_k^{(2)}) - (r_k^{(2)T} A\, r_k^{(2)})(r_k^{(2)T} A^T A^T AA\, r_k^{(2)})}{(r_k^{(2)T} A^T A\, r_k^{(2)})(r_k^{(2)T} A^T A^T AA\, r_k^{(2)}) - (r_k^{(2)T} A^T AA\, r_k^{(2)})^2},
\]

\[
\beta = \frac{(r_k^{(2)T} A\, r_k^{(2)})(r_k^{(2)T} A^T AA\, r_k^{(2)}) - (r_k^{(2)T} AA\, r_k^{(2)})(r_k^{(2)T} A^T A\, r_k^{(2)})}{(r_k^{(2)T} A^T A\, r_k^{(2)})(r_k^{(2)T} A^T A^T AA\, r_k^{(2)}) - (r_k^{(2)T} A^T AA\, r_k^{(2)})^2}.
\]
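As a sanity check (ours, not from the paper), one can evaluate these closed-form coefficients for r_0 = b from (2.1) and compare them against a direct least-squares fit; both should give α = -1 and β = 3/2 for the first cycle:

```python
# Sketch (not from the paper): evaluate the closed-form GMRES(2) coefficients
# and cross-check them against a least-squares minimization of ||r + a*Ar + c*AAr||.
import numpy as np

A = np.array([[1.0, 1.0, 1.0],
              [0.0, 1.0, 3.0],
              [0.0, 0.0, 1.0]])
r = np.array([2.0, -4.0, 1.0])

Ar, AAr = A @ r, A @ A @ r
c1, c2, c3 = Ar @ Ar, Ar @ AAr, AAr @ AAr   # Gram-matrix entries
d1, d2 = r @ Ar, r @ AAr
den = c1 * c3 - c2**2
alpha = (d2 * c2 - d1 * c3) / den
beta = (d1 * c2 - d2 * c1) / den
print(alpha, beta)                # expect -1.0 and 1.5 for this r

y, *_ = np.linalg.lstsq(np.column_stack([Ar, AAr]), -r, rcond=None)
print(y)                          # should match [alpha, beta]
```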
Exeuting GMRES(2) on the matrix and right hand side (2.1) reveals
r
(2)
1
=
0
3
3
0
1
A
;
r
(2)
2
=
1
2
0
3
0
3
1
A
;
r
(2)
3
=
1
28
0
24
27
33
1
A
;
r
(2)
4
=
1
122
0
81
108
162
1
A
:
The inferiority of GMRES(2) continues well beyond the fourth iteration. For example:

    k    ||r_k^{(2)}|| / ||r_0||
    5    0.376888290025532...
   10    0.376502488858910...
   15    0.376496927936533...
   20    0.376496055944867...
   25    0.376495995285626...
   30    0.376495984909087...
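These values are straightforward to reproduce in floating point (a minimal sketch, ours rather than the paper's exact-arithmetic Mathematica computation):

```python
# Sketch (not from the paper): run GMRES(2) on (2.1) by solving the
# 2-dimensional least-squares problem min ||r + a*Ar + c*AAr|| each cycle.
import numpy as np

A = np.array([[1.0, 1.0, 1.0],
              [0.0, 1.0, 3.0],
              [0.0, 0.0, 1.0]])
b = np.array([2.0, -4.0, 1.0])

r = b.copy()                       # x0 = 0
norms = {}
for cycle in range(15):            # 15 cycles = 30 iterations
    K = np.column_stack([A @ r, A @ A @ r])   # Krylov directions
    coeffs, *_ = np.linalg.lstsq(K, -r, rcond=None)
    r = r + K @ coeffs             # residual after one GMRES(2) cycle
    norms[2 * (cycle + 1)] = np.linalg.norm(r) / np.linalg.norm(b)

for k in (10, 20, 30):
    print(k, norms[k])             # should approach 0.3764959...
```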

Fig. 1. Convergence curves for GMRES(1) and GMRES(2) applied to (2.1) with x_0 = 0. [Plot: ||r_k^{(m)}||/||r_0|| versus iteration k = 0, ..., 30, logarithmic vertical axis from 10^0 down to 10^{-15}.]

The entire convergence curve for the first thirty iterations is shown in Figure 1, based on performing GMRES(2) in exact arithmetic using Mathematica.
The particular value of b (and thus r_0) studied above is exceptional, as it is unusual for GMRES(1) to converge exactly in three iterations. Remarkably, though, GMRES(1) maintains superiority over GMRES(2) for a wide range of initial residuals. For this matrix A, GMRES(2) converges exactly in one cycle for any initial residual with zero in the third component, so we restrict attention to residuals normalized to the form r_0 = (α, β, 1)^T. Figure 2 indicates that GMRES(2) makes little progress for most such residuals, while GMRES(1) converges to high accuracy for the vast majority of these r_0 values. The color in each plot reflects the magnitude of ||r_100^{(m)}||/||r_0||: blue indicates satisfactory convergence, while red signals little progress in one hundred iterations. (To ensure this data's fidelity, we performed these computations in both double and quadruple precision arithmetic; differences between the two were negligible.)
To gain an appreciation for the dynamics behind Figure 2, we first examine the action of a single GMRES(1) step. From (2.2) it is clear that GMRES(1) will completely stagnate only when r_0^T A r_0 = 0. For the matrix A specified in (2.1) and r_0 = (α, β, 1)^T, this condition reduces to

\[
\alpha^2 + \alpha\beta + \beta^2 + \alpha + 3\beta + 1 = 0,
\tag{2.3}
\]

the equation for an oblique ellipse in the (α, β) plane.
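A small numerical check (ours, not from the paper): pick any point on the ellipse (2.3) and the coefficient r_0^T A r_0 appearing in (2.2) vanishes, so the GMRES(1) step makes no progress.

```python
# Sketch (not from the paper): verify that a point on the ellipse (2.3)
# stagnates, i.e. r0^T A r0 = 0 so the step size in (2.2) is zero.
import numpy as np

A = np.array([[1.0, 1.0, 1.0],
              [0.0, 1.0, 3.0],
              [0.0, 0.0, 1.0]])

alpha = 0.0
# (2.3) rearranged in beta: beta^2 + (alpha + 3) beta + (alpha^2 + alpha + 1) = 0
beta = np.roots([1.0, alpha + 3.0, alpha**2 + alpha + 1.0])[0]  # a real root
r = np.array([alpha, beta, 1.0])
print(r @ (A @ r))                # ~ 0: the GMRES(1) step (2.2) makes no progress
```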
Now writing r_k^{(1)} = (α, β, 1)^T, consider the map r_k^{(1)} ↦ s_{k+1}^{(1)} that projects r_{k+1}^{(1)} into the (α, β) plane,

\[
s_{k+1}^{(1)} = \bigl(r_{k+1}^{(1)}\bigr)_3^{-1}
\begin{pmatrix} \bigl(r_{k+1}^{(1)}\bigr)_1 \\ \bigl(r_{k+1}^{(1)}\bigr)_2 \end{pmatrix},
\]

where (r_{k+1}^{(1)})_j denotes the jth entry of r_{k+1}^{(1)}, which itself is derived from r_k^{(1)} via (2.2).

Fig. 2. Convergence of GMRES(1) (left) and GMRES(2) (right) for the matrix in (2.1) over a range of initial residuals of the form r_0 = (α, β, 1)^T. The color indicates ||r_100^{(m)}||/||r_0|| on a logarithmic scale: blue regions correspond to initial residuals that converge satisfactorily, while the red regions show residuals that stagnate or converge very slowly. [Plots: -10 ≤ α ≤ 10 horizontally, -10 ≤ β ≤ 10 vertically; color bar from 10^{-15} to 10^0.]
For the present example, we have

\[
s_{k+1}^{(1)} = \frac{1}{\beta^2 + \alpha\beta + \alpha + 5\beta + 10}
\begin{pmatrix}
-\beta^3 - 4\beta^2 + 3\alpha\beta + 9\alpha - 4\beta - 1 \\
\beta^3 + \alpha\beta^2 - 3\alpha^2 + 2\beta^2 - 2\alpha\beta - 3\alpha + \beta - 3
\end{pmatrix}.
\tag{2.4}
\]
We an lassify the xed p oints (
;
) satisfying (2.3) by investigating the Jaobian
of (2.4). One of its eigenvalues is always one, while the other eigenvalue varies ab ove
and b elow one in magnitude. In the left plot of Figure 2, we show the stable portion
of the ellipse (2.3) in blak and the unstable part in white.
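This classification can be checked numerically. The sketch below (ours, not from the paper) differentiates the map (2.4) at one point of the ellipse; since every ellipse point is fixed under (2.4), one Jacobian eigenvalue should be close to 1, and the magnitude of the other decides stability there.

```python
# Sketch (not from the paper): finite-difference Jacobian of the map (2.4)
# at a stagnation point on the ellipse (2.3); one eigenvalue should be ~1.
import numpy as np

def s1(p):
    a, b = p
    den = b**2 + a*b + a + 5*b + 10
    return np.array([(-b**3 - 4*b**2 + 3*a*b + 9*a - 4*b - 1) / den,
                     (b**3 + a*b**2 - 3*a**2 + 2*b**2 - 2*a*b - 3*a + b - 3) / den])

# a point on the ellipse (2.3): alpha = 0, beta = (-3 + sqrt(5))/2
p = np.array([0.0, (-3.0 + np.sqrt(5.0)) / 2.0])
h, J = 1e-6, np.zeros((2, 2))
for j in range(2):
    e = np.zeros(2); e[j] = h
    J[:, j] = (s1(p + e) - s1(p - e)) / (2 * h)   # central differences
print(np.linalg.eigvals(J))       # one eigenvalue should be close to 1
```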
We an similarly analyze GMRES(2). This iteration will never progress when, in
addition to the stagnation ondition for GMRES(1),
r
0
also satises
r
T
0
AAr
0
= 0.
For the present example, this requirement implies
2
+ 2
+
2
+ 5
+ 6
+ 1 = 0
;
the equation for an oblique parab ola. This urve intersets the ellipse (2.3) at two
points, drawn as dots in the right plot of Figure 2, the only stagnating residuals
(
; ;
1)
T
for GMRES(2). We an analyze their stability as done above for GMRES(1).
The projected map for this iteration, r_k^{(2)} ↦ s_{k+2}^{(2)}, takes the form

\[
s_{k+2}^{(2)} = \frac{1}{\beta^2 - 3\alpha + 4\beta + 9}
\begin{pmatrix} 3 \\ -\beta - 4 \end{pmatrix}.
\tag{2.5}
\]
Analyzing the Jacobian for this GMRES(2) map at the pair of fixed points, we find one to be unstable (shown in black in the right plot of Figure 2) while the other is stable (shown in white). This stable fixed point is an attractor for stagnating residuals.
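Numerically (our sketch, not from the paper), the two stagnation points can be found by intersecting the two conics, and their stability tested by differentiating (2.5); the initial guesses below were read off Figure 2 and are our own assumptions.

```python
# Sketch (not from the paper): locate the two GMRES(2) stagnation points as
# intersections of the ellipse (2.3) and the parabola, then test stability
# of each under the projected map (2.5).
import numpy as np
from scipy.optimize import fsolve

def conics(p):
    a, b = p
    return [a**2 + a*b + b**2 + a + 3*b + 1,        # ellipse (2.3)
            a**2 + 2*a*b + b**2 + 5*a + 6*b + 1]    # parabola

def s2(p):
    a, b = p
    d = b**2 - 3*a + 4*b + 9
    return np.array([3.0 / d, -(b + 4.0) / d])      # the map (2.5)

for guess in ([0.6, -0.7], [1.1, -0.9]):            # guesses: our assumption
    fp = fsolve(conics, guess)
    h, J = 1e-6, np.zeros((2, 2))
    for j in range(2):
        e = np.zeros(2); e[j] = h
        J[:, j] = (s2(fp + e) - s2(fp - e)) / (2 * h)
    rho = max(abs(np.linalg.eigvals(J)))
    print(fp, "spectral radius ~", rho)   # rho < 1: stable; rho > 1: unstable
```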

We return briefly to the initial residual r_0 = (2, -4, 1)^T. After the first few iterations, the angle between r_k^{(2)} and the fixed vector steadily converges to zero at the rate 0.6452... suggested by the Jacobian's dominant eigenvalue. We conclude with high confidence that GMRES(2) never converges for this initial residual. (If one cycle of GMRES(m) produces a residual parallel to r_0, then either r_m^{(m)} = r_0 or r_m^{(m)} = 0. Thus a residual cannot remain fixed in the finite (α, β) plane and yet still converge to 0.)
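As a further check (ours, not from the paper), iterating the projected map (2.5) from (α, β) = (2, -4) converges to the stable fixed point, and the observed contraction factor should approach the quoted rate of about 0.6452.

```python
# Sketch (not from the paper): iterate the projected GMRES(2) map (2.5)
# from (alpha, beta) = (2, -4); the step-size ratio estimates the linear
# convergence factor, which should approach roughly 0.6452.
import numpy as np

def s2(p):
    a, b = p
    d = b**2 - 3*a + 4*b + 9
    return np.array([3.0 / d, -(b + 4.0) / d])

p = np.array([2.0, -4.0])
prev_step, ratio = None, None
for k in range(40):
    q = s2(p)
    step = np.linalg.norm(q - p)
    if prev_step is not None and step > 0:
        ratio = step / prev_step          # contraction factor estimate
    prev_step, p = step, q
print("fixed point ~", p, " contraction factor ~", ratio)
```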
3. Seond Example.
The matrix
A
in (2.1) is nondiagonalizable, and one might
be tempted to blame its surprising onvergene behavior on this fat. To demonstrate
that nondiagonalizablity is not an essential requirement, we exhibit a diagonalizable
matrix with eigenvalues
f
1
;
2
;
3
g
for whih restarted GMRES also pro dues extreme
behavior. Take
A
=
0
B
1 2
2
0 2 4
0 0 3
1
C
A
;
b
=
0
3
1
1
1
A
;
(3.1)
with
x
0
=
0
. Again, we onstrut the rst few residuals. For GMRES(1),
r
(1)
1
=
0
2
1
0
1
A
;
r
(1)
2
=
0
2
0
0
1
A
;
r
(1)
3
=
0
0
0
0
1
A
;
while GMRES(2) yields

\[
r_1^{(2)} = \begin{pmatrix} 2 \\ -1 \\ 0 \end{pmatrix}, \quad
r_2^{(2)} = \begin{pmatrix} 1 \\ 0 \\ -1 \end{pmatrix}, \quad
r_3^{(2)} = \frac{1}{17}\begin{pmatrix} 8 \\ 12 \\ -8 \end{pmatrix}, \quad
r_4^{(2)} = \frac{1}{67}\begin{pmatrix} -12 \\ 12 \\ -28 \end{pmatrix}.
\]
Figure 3 illustrates the convergence curve for thirty iterations, again computed using exact arithmetic.

Fig. 3. Convergence curves for GMRES(1) and GMRES(2) applied to (3.1) with x_0 = 0. [Plot: ||r_k^{(m)}||/||r_0|| versus iteration k = 0, ..., 30, logarithmic vertical axis from 10^0 down to 10^{-15}.]

References (partial list)

Y. Saad and M. H. Schultz, GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems, SIAM J. Sci. Statist. Comput., 7 (1986), pp. 856-869.

D. C. Sorensen, Implicit application of polynomial filters in a k-step Arnoldi method, SIAM J. Matrix Anal. Appl., 13 (1992), pp. 357-385.

D. K. Faddeev and V. N. Faddeeva, Computational Methods of Linear Algebra.

G. L. G. Sleijpen and D. R. Fokkema, BiCGstab(ℓ) for linear equations involving unsymmetric matrices with complex spectrum, Electron. Trans. Numer. Anal., 1 (1993), pp. 11-32.