ETH Library
A block Newton method for nonlinear eigenvalue problems
Journal Article
Author(s): Kressner, Daniel
Publication date: 2009-12
Permanent link: https://doi.org/10.3929/ethz-b-000019530
Rights / license: In Copyright - Non-Commercial Use Permitted
Originally published in: Numerische Mathematik 114(2), https://doi.org/10.1007/s00211-009-0259-x

Numer. Math. (2009) 114:355–372
DOI 10.1007/s00211-009-0259-x
Numerische Mathematik

A block Newton method for nonlinear eigenvalue problems

Daniel Kressner

Received: 2 February 2009 / Revised: 20 July 2009 / Published online: 15 September 2009
© Springer-Verlag 2009
Abstract We consider matrix eigenvalue problems that are nonlinear in the eigenvalue parameter. One of the most fundamental differences from the linear case is that distinct eigenvalues may have linearly dependent eigenvectors or even share the same eigenvector. This has been a severe hindrance in the development of general numerical schemes for computing several eigenvalues of a nonlinear eigenvalue problem, either simultaneously or subsequently. The purpose of this work is to show that the concept of invariant pairs offers a way of representing eigenvalues and eigenvectors that is insensitive to this phenomenon. To demonstrate the use of this concept in the development of numerical methods, we have developed a novel block Newton method for computing such invariant pairs. Algorithmic aspects of this method are considered and a few academic examples demonstrate its viability.

Mathematics Subject Classification (2000) Primary 65F15; Secondary 15A18 · 47A56
1 Introduction
Given a function T : Ω → C^{n×n} holomorphic on an open set Ω ⊆ C, we consider the nonlinear eigenvalue problem of finding pairs (x, λ) ∈ C^n × Ω with x ≠ 0 such that

    T(λ)x = 0.    (1)

For any such pair (x, λ), we call x an eigenvector and λ an eigenvalue. This formulation includes linear eigenvalue problems, for which T(λ) = A - λI with A ∈ C^{n×n}, as well as polynomial eigenvalue problems, for which T is a matrix polynomial in λ.
D. Kressner
Seminar für Angewandte Mathematik, HG G 57.1, Rämistrasse 101, 8092 Zurich, Switzerland
e-mail: kressner@math.ethz.ch

To avoid degenerate situations, we assume throughout this paper that T is regular, i.e., det(T(·)) ≢ 0 on any of the components of Ω. For a recent overview of the numerics and numerous applications of such nonlinear eigenvalue problems, we refer to [20].
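To make the formulation (1) concrete, the following minimal sketch (ours, not from the paper; the matrix A is random illustrative data) represents T(λ) as a matrix-valued callable and checks a candidate eigenpair by its residual norm ‖T(λ)x‖.

```python
# Minimal sketch (illustrative, not from the paper): T(lambda) as a callable,
# using the linear special case T(lambda) = A - lambda*I as a sanity check.
import numpy as np

n = 4
rng = np.random.default_rng(0)
A = rng.standard_normal((n, n))

def T(lam):
    """Linear special case of (1): T(lambda) = A - lambda*I."""
    return A - lam * np.eye(n)

# Any eigenpair (x, lambda) of A satisfies T(lambda) x = 0.
lams, vecs = np.linalg.eig(A)
lam, x = lams[0], vecs[:, 0]
print(np.linalg.norm(T(lam) @ x))  # ~1e-15, i.e. zero up to roundoff
```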
In contrast to the linear case, there may be eigenvector/eigenvalue pairs (λ_1, x_1), ..., (λ_k, x_k) of (1), for which the eigenvalues λ_1, ..., λ_k are pairwise distinct but {x_1, ..., x_k} is linearly dependent. This possibility is already evident from the fact that k can be larger than n. Another example [12] is given by

    T(λ) = \begin{bmatrix} 0 & 12 \\ -2 & 14 \end{bmatrix} + λ \begin{bmatrix} -1 & -6 \\ 2 & -9 \end{bmatrix} + λ^2 \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix},    (2)

for which the eigenvalues 3 and 4 share the same eigenvector [1, 1]^T. The occurrence of such linear dependencies is an annoyance when attempting to develop numerical methods for computing more than one eigenvalue of (1). For example, standard Newton methods [10,11] for the simultaneous computation of several eigenvalues crucially depend on the existence of a basis for the invariant subspace belonging to the eigenvalues of interest. In methods that determine several eigenvalues subsequently, such as Krylov subspace or Jacobi–Davidson methods [2], repeated convergence towards an eigenvalue is usually avoided by reorthogonalization against converged eigenvectors. If such an idea were directly applied to nonlinear eigenvalue problems, eigenvalues could be missed due to linear dependencies among eigenvectors.
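A quick numerical check of example (2) (our illustration, not code from the paper) confirms that λ = 3 and λ = 4 are both eigenvalues with the common eigenvector (1, 1)^T:

```python
# Verify example (2): the distinct eigenvalues 3 and 4 share the eigenvector (1,1)^T.
import numpy as np

A0 = np.array([[0.0, 12.0], [-2.0, 14.0]])
A1 = np.array([[-1.0, -6.0], [2.0, -9.0]])
T = lambda lam: A0 + lam * A1 + lam**2 * np.eye(2)

x = np.array([1.0, 1.0])
print(T(3.0) @ x, T(4.0) @ x)  # both residuals are [0. 0.]
# Hence the set of eigenvectors {x, x} is linearly dependent although 3 != 4.
```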
In the case that the nonlinear eigenvalue problem admits a minimum–maximum characterization [26,31], its eigenvalues can be ordered and numbered. Voss and his co-authors [4,5,7,27–30] have developed Arnoldi-type and Jacobi–Davidson-type methods that employ this numbering as a safety scheme for avoiding repeated convergence towards the same eigenvalue. Unfortunately, for many applications such minimum–maximum characterizations do not exist or are difficult to verify.
In this work, we will propose a different approach for dealing with several eigenvalues, very much inspired by the work of Beyn and Thümmler [9] on continuation methods for quadratic eigenvalue problems. For this purpose, it will be more convenient to assume that the nonlinear eigenvalue problem (1) takes the form

    ( f_1(λ)A_1 + f_2(λ)A_2 + ··· + f_m(λ)A_m ) x = 0    (3)

for holomorphic functions f_1, ..., f_m : Ω → C and constant matrices A_1, ..., A_m ∈ C^{n×n}. This is no restriction, as we could turn (1) into (3) by choosing m = n^2, f_{(i-1)n+j}(λ) = t_{ij}(λ) and A_{(i-1)n+j} = e_i e_j^T, with e_i and e_j denoting the ith and jth unit vectors of length n, respectively. However, many applications of nonlinear eigenvalue problems already come in the form (3) and such a reformulation is not needed. For example, in eigenvalue problems related to the stability of time-delay systems [21], the functions f_j are exponentials or polynomials. In applications related to vibrating mechanical structures [30], the functions f_j are rational and model different material properties.
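As an illustration of the form (3), the sketch below sets up a delay-type problem T(λ) = -λI + B0 + e^{-τλ}B1, i.e., f_1(λ) = -λ, f_2(λ) ≡ 1, f_3(λ) = e^{-τλ}; the matrices B0, B1 and the delay τ are made-up illustrative data, not taken from the paper.

```python
# Sketch of the form (3) for a time-delay problem (illustrative data only):
# T(lambda) = f1(lambda)*A1 + f2(lambda)*A2 + f3(lambda)*A3
#           = -lambda*I     + B0            + exp(-tau*lambda)*B1.
import numpy as np

n, tau = 3, 1.0
rng = np.random.default_rng(1)
B0 = rng.standard_normal((n, n))
B1 = rng.standard_normal((n, n))

As = [np.eye(n), B0, B1]                                   # coefficient matrices A_j
fs = [lambda lam: -lam, lambda lam: 1.0,
      lambda lam: np.exp(-tau * lam)]                      # scalar functions f_j

def T(lam):
    return sum(f(lam) * Aj for f, Aj in zip(fs, As))

# T can now be evaluated anywhere in the complex plane:
print(np.linalg.norm(T(0.5 + 0.2j)))
```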
The rest of this paper is organized as follows. In "Invariant pairs", the concept of invariant pairs for the nonlinear eigenvalue problem (3) is introduced. We believe this to be the most suitable extension of an eigenvalue/eigenvector pair to several eigenvalues. Several useful properties are shown to substantiate this belief. In "A Newton method for simple invariant pairs", a Newton method for computing such invariant pairs is developed, along with some algorithmic details and numerical experiments.
2 Invariant pairs
Definition 1 Let the eigenvalues of S ∈ C^{k×k} be contained in Ω and let X ∈ C^{n×k}. Then (X, S) ∈ C^{n×k} × C^{k×k} is called an invariant pair of the nonlinear eigenvalue problem (3) if

    A_1 X f_1(S) + A_2 X f_2(S) + ··· + A_m X f_m(S) = 0.    (4)
Note that the matrix functions f_1(S), ..., f_m(S) are well defined under the given assumptions [16]. As an example, let (x_1, λ_1) and (x_2, λ_2) be eigenvector/eigenvalue pairs of (3). Then (X, S) with X = [x_1, x_2] and S = diag(λ_1, λ_2) is an invariant pair.
To avoid trivial invariant pairs, such as X = 0, an additional property needs to be
imposed. However, we have already seen that requiring X to have full column rank is
not reasonable in the context of nonlinear eigenvalue problems. Instead, we use the
concept of minimal invariant pairs from [6,9].
Definition 2 A pair (X, S) ∈ C^{n×k} × C^{k×k} is called minimal if there is l ∈ N such that the matrix

    V_l(X, S) = \begin{bmatrix} X \\ XS \\ \vdots \\ XS^{l-1} \end{bmatrix}    (5)

has rank k. The smallest such l is called the minimality index of (X, S).
Example 3 For the example (2), the pair (X, S) with X = \begin{bmatrix} 1 & 1 \\ 1 & 1 \end{bmatrix} and S = diag(3, 4) is invariant and minimal with minimality index 2.
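The claims of Example 3 are easy to verify numerically. In this sketch (ours, not from the paper), the quadratic problem (2) has f_1(λ) = 1, f_2(λ) = λ, f_3(λ) = λ^2, so f_j(S) = I, S, S^2 in (4); the residual vanishes, rank V_1(X, S) = 1, and rank V_2(X, S) = 2.

```python
# Check Example 3: (X, S) is an invariant pair of (2) with minimality index 2.
import numpy as np

A0 = np.array([[0.0, 12.0], [-2.0, 14.0]])
A1 = np.array([[-1.0, -6.0], [2.0, -9.0]])
A2 = np.eye(2)

X = np.array([[1.0, 1.0], [1.0, 1.0]])  # rank 1: both columns equal (1,1)^T
S = np.diag([3.0, 4.0])

# Invariance (4) with f_j(S) = I, S, S^2 for the quadratic problem (2):
print(np.linalg.norm(A0 @ X + A1 @ X @ S + A2 @ X @ S @ S))  # 0.0

def V(X, S, l):
    """Stacked matrix V_l(X, S) from (5)."""
    return np.vstack([X @ np.linalg.matrix_power(S, i) for i in range(l)])

print(np.linalg.matrix_rank(V(X, S, 1)))  # 1: X alone is rank deficient
print(np.linalg.matrix_rank(V(X, S, 2)))  # 2: minimality index is 2
```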
It has been shown in [6, Theorem 3] that any non-minimal pair can be turned into a minimal one in the following sense: if V_l(X, S) has rank k̃ < k then there is a minimal pair (X̃, S̃) ∈ C^{n×k̃} × C^{k̃×k̃} such that span X̃ = span X and span V_l(X̃, S̃) = span V_l(X, S). The following lemma reveals the connection of minimal invariant pairs to the nonlinear eigenvalue problem (3).
Lemma 4 Let (X, S) ∈ C^{n×k} × C^{k×k} be a minimal invariant pair of (3). Then the following statements hold.
1. For any invertible matrix Z ∈ C^{k×k}, (XZ, Z^{-1}SZ) is also a minimal invariant pair of (3).
2. The eigenvalues of S are eigenvalues of (3).
Proof 1. Using f_j(Z^{-1}SZ) = Z^{-1} f_j(S) Z, the relation (4) can be written as

    A_1 XZ f_1(Z^{-1}SZ) Z^{-1} + A_2 XZ f_2(Z^{-1}SZ) Z^{-1} + ··· + A_m XZ f_m(Z^{-1}SZ) Z^{-1} = 0,

which is equivalent to

    A_1 XZ f_1(Z^{-1}SZ) + A_2 XZ f_2(Z^{-1}SZ) + ··· + A_m XZ f_m(Z^{-1}SZ) = 0,    (6)

and shows that (XZ, Z^{-1}SZ) is an invariant pair. Its minimality follows from V_l(XZ, Z^{-1}SZ) = V_l(X, S) Z.

2. By the Schur decomposition, we can choose Z unitary such that S̃ = Z^{-1}SZ is upper triangular with any eigenvalue λ of S appearing in the (1, 1) position of S̃. Setting x = XZe_1, the first column of V_l(XZ, Z^{-1}SZ) has the entries x, xλ, ..., xλ^{l-1}. Hence, x ≠ 0, since otherwise V_l(XZ, Z^{-1}SZ) would be rank deficient for any l. Moreover,

    XZ f_j(Z^{-1}SZ) e_1 = f_j(λ) x,

and thus the first column of (6) implies that (x, λ) is an eigenvector/eigenvalue pair.
Let us briefly discuss the practical consequences of Lemma 4. Once a minimal invariant pair is computed, we can extract the corresponding eigenvalues of T(·) by computing the eigenvalues of S. Moreover, if S admits a diagonalization Z^{-1}SZ then the columns of XZ contain the corresponding eigenvectors.
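The following sketch (ours, reusing the pair of Example 3 rather than code from the paper) illustrates both points: by Lemma 4, (XZ, Z^{-1}SZ) is again a minimal invariant pair for any invertible Z, and diagonalizing the S-part recovers the individual eigenvalues and eigenvectors.

```python
# Extract eigenpairs from a (similarity-transformed) minimal invariant pair.
import numpy as np

X = np.array([[1.0, 1.0], [1.0, 1.0]])
S = np.diag([3.0, 4.0])

rng = np.random.default_rng(2)
Z = rng.standard_normal((2, 2))            # a generic Z is invertible
Xt = X @ Z                                 # transformed pair (Lemma 4, part 1):
St = np.linalg.solve(Z, S @ Z)             # St = Z^{-1} S Z

lams, W = np.linalg.eig(St)                # eigenvalues of St are still 3 and 4
print(np.sort(lams.real))                  # [3. 4.]
print(Xt @ W)                              # each column is a multiple of (1,1)^T
```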
The following lemma shows that, for checking minimality, it is sufficient to check the rank of V_k(X, S).
Lemma 5 If a pair (X, S) ∈ C^{n×k} × C^{k×k} is minimal then its minimality index cannot exceed k.

Proof Since (X, S) is minimal, there is l ∈ N such that rank(V_l(X, S)) = k. For l ≤ k there is nothing to prove. For l > k, the Cayley–Hamilton theorem yields the existence of coefficients α_{ij} ∈ C such that

    XS^{k+i} = α_{i0} X + α_{i1} XS + ··· + α_{i,k-1} XS^{k-1},    i ≥ 0.

Hence, there is a square invertible matrix W such that

    W V_l(X, S) = \begin{bmatrix} V_k(X, S) \\ 0 \end{bmatrix},

implying rank(V_k(X, S)) = k.
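Lemma 5 thus turns minimality into a finite test: (X, S) with S ∈ C^{k×k} is minimal if and only if V_k(X, S) has rank k. A small utility along these lines (our sketch, with the numerical rank delegated to an SVD-based routine) is shown below; the second call exhibits a non-minimal invariant pair of the kind that [6, Theorem 3] would compress to a single eigenpair.

```python
# Minimality test suggested by Lemma 5 (a sketch, not code from the paper):
# (X, S) is minimal iff rank V_k(X, S) = k, so only V_k needs to be formed.
import numpy as np

def is_minimal(X, S):
    k = S.shape[0]
    Vk = np.vstack([X @ np.linalg.matrix_power(S, i) for i in range(k)])
    return np.linalg.matrix_rank(Vk) == k  # SVD-based numerical rank

# The pair of Example 3 is minimal; listing one eigenpair twice is not.
x = np.array([[1.0], [1.0]])
print(is_minimal(np.hstack([x, x]), np.diag([3.0, 4.0])))  # True
print(is_minimal(np.hstack([x, x]), np.diag([3.0, 3.0])))  # False
```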