A Block-Preconditioned Inexact Linear Solver for Computing the Complex Eigenpairs of a Large Sparse Matrix
1. Introduction
Let $A$ be a large sparse, real $n \times n$ nonsymmetric matrix and $M$ a symmetric positive definite matrix. In this paper, we consider the problem of computing the eigenpair $(x, \lambda)$ from the following generalized complex eigenvalue problem

(1) $Ax = \lambda M x,$

where $\lambda \in \mathbb{C}$ is the eigenvalue of the pencil $(A, M)$ and $x \in \mathbb{C}^n$ its corresponding complex eigenvector. We assume that the eigenpair of interest $(x, \lambda)$ is algebraically simple, with $\psi$ the corresponding left eigenvector such that [1]

(2) $\psi^H M x \neq 0.$

By adding the normalization

(3) $x^H M x = 1$

to (1), the combined system of equations can be expressed in the form $F(x, \lambda) = 0$ as

(4) $F(x, \lambda) = \begin{bmatrix} Ax - \lambda M x \\ \tfrac{1}{2}\big(1 - x^H M x\big) \end{bmatrix} = 0.$

Note that $x^H M x$ is real since $M$ is symmetric and positive definite. This results in solving a system of $n$ complex and one real nonlinear equations for the $n + 1$ complex unknowns $(x, \lambda)$. The reason we cannot use Newton's method to solve (4) is that the conjugate transpose $x^H$ in the normalization $x^H M x = 1$ is not differentiable.
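For small dense problems, the eigenpair in (1) subject to (3) can be computed directly, which is useful for checking the methods that follow. The sketch below is our own illustration, not from the original paper; the random test pencil and all names are ours. It uses scipy.linalg.eig on the pencil $(A, M)$ and then applies the normalization (3).

```python
import numpy as np
from scipy.linalg import eig

rng = np.random.default_rng(0)
n = 8
A = rng.standard_normal((n, n))      # real nonsymmetric
Q = rng.standard_normal((n, n))
M = Q @ Q.T + n * np.eye(n)          # symmetric positive definite

# Solve A x = lam M x directly (dense reference solution).
lam, X = eig(A, M)
k = np.argmax(lam.real)              # e.g. the rightmost eigenvalue
x = X[:, k]
x /= np.sqrt((x.conj() @ M @ x).real)   # normalization (3): x^H M x = 1

print(lam[k])
print(np.linalg.norm(A @ x - lam[k] * (M @ x)))  # ~0: eigenpair residual
print(x.conj() @ M @ x)                           # ~1: normalized
```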
Recall that for a real eigenpair $(x, \lambda)$, (4) is $n + 1$ real equations for $n + 1$ real unknowns, and Newton's method for solving (4) involves the solution of the $(n+1) \times (n+1)$ square linear systems

(5) $\begin{bmatrix} A - \lambda^{(k)} M & -M x^{(k)} \\ -\big(x^{(k)}\big)^T M & 0 \end{bmatrix} \begin{bmatrix} \Delta x^{(k)} \\ \Delta \lambda^{(k)} \end{bmatrix} = -F\big(x^{(k)}, \lambda^{(k)}\big)$

for the $n + 1$ real unknowns $\big(\Delta x^{(k)}, \Delta\lambda^{(k)}\big)$, and updating $x^{(k+1)} = x^{(k)} + \Delta x^{(k)}$, $\lambda^{(k+1)} = \lambda^{(k)} + \Delta\lambda^{(k)}$. Secondly, if instead of the normalization (3) we add $c^H x = 1$, where $c$ is a fixed complex vector (see, for example, [2]), then (1) and $c^H x = 1$ provide $n + 1$ complex equations for $n + 1$ complex unknowns, and the Jacobian of this new system is

$\begin{bmatrix} A - \lambda M & -M x \\ c^H & 0 \end{bmatrix}.$

The above Jacobian is square and can easily be shown to be nonsingular, using the ABCD Lemma [3], if the eigenvalue of interest is algebraically simple and $c^H x \neq 0$. Thirdly, if $x$ is complex, then, as stated earlier, we have $n$ complex and one real equation. Also, if $(x, \lambda)$ solves (4), then so does $\big(e^{i\theta}x, \lambda\big)$ for any $\theta$ such that $0 \le \theta < 2\pi$.
Our approach for analyzing the solution of (4) for $(x, \lambda)$ begins by splitting the eigenpair into real and imaginary parts: $x = x_1 + i x_2$, $\lambda = \alpha + i\beta$, where $x_1, x_2 \in \mathbb{R}^n$ and $\alpha, \beta \in \mathbb{R}$. After expanding (4), we obtain a real under-determined system of $2n + 1$ nonlinear equations in the $2n + 2$ real unknowns $(x_1, x_2, \alpha, \beta)$, and it is natural to use the Gauss-Newton method (see, for example, Deuflhard ([4], pp. 222-223)) to obtain a solution (see also [5] [6] [7] [8]). By linearizing the under-determined system of nonlinear equations, we obtain an under-determined system of linear equations involving the Jacobian. This paper is structured as follows: in Section 2, we show that the Jacobian has a unique nullvector at the root. In Section 3, we present two orthogonality results and give Algorithm 1. In Section 4, we present an inexact inverse iteration algorithm with preconditioning for solving the large system of equations encountered.
Algorithm 1. Computing the complex eigenvalues of the pencil $(A, M)$.
The main mathematical tools used in this paper are the LU factorization, inexact inverse iteration and the preconditioned Generalized Minimal Residual method (GMRES) [9]. The main reason for using inexact inverse iteration is that, as mentioned in an earlier paper, we do not solve the square system (16) involving the exact nullvector in practice, but we use it to show that the solution is possible. Theorem 2.1 shows that the Jacobian has a single nullvector at the root, while Theorem 3.1 gives an important orthogonality result. Algorithms 1-3 are presented. We remark that in the limit, the approximate nullvector converges to the exact one. A numerical example is given which supports the validity of the algorithms presented, though, as usual, they rely on good initial guesses to the desired eigenpair. The classical inverse iteration for the matrix pencil converges slowly for some eigenvalue problems, while we present algorithms that converge quadratically. Throughout this paper, unless otherwise stated, all norms are the 2-norm.
The following result underpins the validity of the results in this paper.

Lemma 1.1: [10] Let $J \in \mathbb{R}^{m \times n}$, $m < n$, be of full rank. If $Jz = b$ is an under-determined linear system of equations, then its minimum-norm least squares solution $z = J^T\big(JJ^T\big)^{-1}b$ is orthogonal to the nullspace of $J$.
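As a quick numerical illustration of Lemma 1.1 (a sketch of ours, not from the paper; the random data is illustrative): the minimum-norm solution returned by numpy.linalg.lstsq for a full-rank under-determined system is orthogonal to a basis of the nullspace computed via the SVD.

```python
import numpy as np

rng = np.random.default_rng(0)
J = rng.standard_normal((5, 8))   # full rank, under-determined (m < n)
b = rng.standard_normal(5)

# Minimum-norm least squares solution of J z = b.
z, *_ = np.linalg.lstsq(J, b, rcond=None)

# Orthonormal basis of the nullspace of J from the SVD.
_, _, Vt = np.linalg.svd(J)
N = Vt[5:].T                      # columns span null(J)

print(np.linalg.norm(J @ z - b))  # ~0: z solves the system
print(np.linalg.norm(N.T @ z))    # ~0: z is orthogonal to null(J)
```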
Algorithm 2. Inexact inverse iteration algorithm.

Algorithm 3. Computing the complex eigenvalues of the pencil $(A, M)$ using inexact inverse iteration with preconditioning.
In the next section, we will express both $x$ and $\lambda$ as $x = x_1 + ix_2$ and $\lambda = \alpha + i\beta$, convert the nonlinear system (4) to a real under-determined system of nonlinear equations and prove some important results.
2. Computation of Complex Eigenpairs by Solving an Under-Determined System of Nonlinear Equations
In this section, we expand the system of $n$ complex and one real nonlinear equations in $n + 1$ complex unknowns (4) by writing $x$ and $\lambda$ as $x = x_1 + ix_2$ and $\lambda = \alpha + i\beta$, respectively. The reason for having an under-determined system of equations instead of a square system is that expanding $x^H M x = 1$ gives only one real equation, since $M$ is symmetric positive definite, while $Ax = \lambda Mx$ results in $2n$ real equations. This yields a real under-determined system of $2n + 1$ nonlinear equations in $2n + 2$ real unknowns. This will then be followed by presenting the under-determined system of nonlinear equations and an explicit expression for its Jacobian. Furthermore, we will show in the main result of this section, Theorem 2.1, that if the eigenvalue of interest is algebraically simple, then the Jacobian has linearly independent rows. We will find the right nullvector of the Jacobian at the root and prove that it is unique.
If we let $x = x_1 + ix_2$ and $\lambda = \alpha + i\beta$, then the square nonlinear system of Equations (4) can be written as

(6) $\big(A - (\alpha + i\beta)M\big)(x_1 + ix_2) = 0$

and

(7) $\tfrac{1}{2}\big(1 - (x_1 + ix_2)^H M (x_1 + ix_2)\big) = 0.$

Hence,

$(A - \alpha M)x_1 + \beta M x_2 + i\big[(A - \alpha M)x_2 - \beta M x_1\big] = 0.$

Since the left hand side must vanish, we equate the real and imaginary parts of (6) to zero and obtain the $2n$ real equations $(A - \alpha M)x_1 + \beta M x_2 = 0$ and $(A - \alpha M)x_2 - \beta M x_1 = 0$. This means $F$ consists of the $2n$ real equations arising from (6) and one real equation from (7):

(8) $F(z) = \begin{bmatrix} (A - \alpha M)x_1 + \beta M x_2 \\ (A - \alpha M)x_2 - \beta M x_1 \\ \tfrac{1}{2}\big(1 - x_1^T M x_1 - x_2^T M x_2\big) \end{bmatrix} = 0,$

where $z = \big(x_1^T, x_2^T, \alpha, \beta\big)^T$. The Jacobian $J(z)$ of $F(z)$, which has the following explicit expression

(9) $J(z) = \begin{bmatrix} A - \alpha M & \beta M & -M x_1 & M x_2 \\ -\beta M & A - \alpha M & -M x_2 & -M x_1 \\ -x_1^T M & -x_2^T M & 0 & 0 \end{bmatrix},$

is a $(2n + 1)$ by $(2n + 2)$ real matrix. From the Jacobian (9) above, we define the real $2n$ by $2n$ matrix $\mathbf{A}$ as

(10) $\mathbf{A} = \begin{bmatrix} A - \alpha M & \beta M \\ -\beta M & A - \alpha M \end{bmatrix}.$
Also, we form the $2n$ by $2$ real matrix

(11) $\mathbf{B} = -\mathbf{M}\begin{bmatrix} x_1 & -x_2 \\ x_2 & x_1 \end{bmatrix},$

consisting of the product of $-\mathbf{M}$ and the matrix of right nullvectors of $\mathbf{A}$ at the root, where

(12) $\mathbf{M} = \begin{bmatrix} M & 0 \\ 0 & M \end{bmatrix}$

and $0$ is the $n$ by $n$ zero matrix. The Jacobian (9) can be rewritten in the following partitioned form

(13) $J = \begin{bmatrix} \mathbf{A} & \mathbf{B} \\ -w^T\mathbf{M} & 0^T \end{bmatrix},$

with $\mathbf{A}$, $\mathbf{B}$ as defined in (10) and (11) respectively, and $w = \big(x_1^T, x_2^T\big)^T$. Note that because at the root $\mathbf{A}w = 0$, this implies that $w$ or its nonzero scalar multiple is a right nullvector of $\mathbf{A}$. In the same vein, we find $\mathbf{A}\big(-x_2^T, x_1^T\big)^T = 0$, so $\big(-x_2^T, x_1^T\big)^T$ or its nonzero scalar multiple is also a right nullvector of $\mathbf{A}$ at the root. Since the eigenvalue $\lambda$ of $(A, M)$ is algebraically simple by assumption, then by (2) we need to give explicit expressions for the left nullvectors of $\mathbf{A}$ in order to prove that the Jacobian has full row rank at the root. Observe that if we define $\psi = \psi_1 + i\psi_2$, where $\psi^H(A - \lambda M) = 0$ and $\psi_1, \psi_2 \in \mathbb{R}^n$, then this implies

$\psi_1^T(A - \alpha M) - \beta\,\psi_2^T M = 0 \quad\text{and}\quad \psi_2^T(A - \alpha M) + \beta\,\psi_1^T M = 0.$

Hence, $\big(\psi_1^T, \psi_2^T\big)\mathbf{A} = 0$, which means $\big(\psi_1^T, \psi_2^T\big)^T$ or its nonzero scalar multiple is a left nullvector of $\mathbf{A}$. Similarly, $\big(-\psi_2^T, \psi_1^T\big)\mathbf{A} = 0$, and it shows that $\big(-\psi_2^T, \psi_1^T\big)^T$ is also a left nullvector of $\mathbf{A}$.
So we form the matrix $\Psi$ consisting of the 2-dimensional left nullvectors of $\mathbf{A}$ at the root (in practice $\Psi$ is not computed), as

(14) $\Psi = \begin{bmatrix} \psi_1 & -\psi_2 \\ \psi_2 & \psi_1 \end{bmatrix}.$

Now, observe that the condition (2) implies

$\psi^H M x = \big(\psi_1^T M x_1 + \psi_2^T M x_2\big) + i\big(\psi_1^T M x_2 - \psi_2^T M x_1\big) \neq 0.$

Therefore, at the root, either $\psi_1^T M x_1 + \psi_2^T M x_2 \neq 0$ or $\psi_1^T M x_2 - \psi_2^T M x_1 \neq 0$, or both are nonzero. It excludes the possibility that they are both zero.
Before we continue with the rest of the analysis, we pause a little to present the main result of this section, which shows that the Jacobian (9) has a one-dimensional nullspace at the root.
Theorem 2.1 Assume that the eigenpair $(x, \lambda)$ of the pencil $(A, M)$ is algebraically simple. If $x_1$ and $x_2$ are nonzero vectors, then $\Phi = \big(-x_2^T, x_1^T, 0, 0\big)^T$ is, up to a scalar multiple, the unique nonzero nullvector of $J$ at the root.

Proof: See [5].
After linearizing $F(z) = 0$, we have the following under-determined linear system of equations

(15) $J\big(z^{(k)}\big)\Delta z^{(k)} = -F\big(z^{(k)}\big).$

Let $\Phi$ be the exact nullvector of the Jacobian $J$. By adding the extra condition $\Phi^T\Delta z^{(k)} = 0$, which stems from Lemma 1.1, to the under-determined linear system of Equations (15), we obtain the following square linear system of equations

(16) $\begin{bmatrix} J\big(z^{(k)}\big) \\ \Phi^T \end{bmatrix}\Delta z^{(k)} = \begin{bmatrix} -F\big(z^{(k)}\big) \\ 0 \end{bmatrix}.$
3. Square System of Equations for the Numerical Computation of the Complex Eigenvalues of a Matrix
In the preceding section, we saw that by adding an extra equation to the under-determined linear system of Equations (15), we obtained a square system of equations (16). However, in practice we would never compute the exact nullvector $\Phi$, but Theorem 2.1 guarantees the existence of a unique nullvector at the root. This is the motivation for the discussion in this section. In this section, we will use $\phi^{(k)}$ defined by $\phi^{(k)} = \big((Dw^{(k)})^T, 0, 0\big)^T$ as an approximation to $\Phi$ in (16) and show that the solution obtained by solving (15) is equivalent to the solution obtained by solving

(17) $\begin{bmatrix} J\big(z^{(k)}\big) \\ \big(\phi^{(k)}\big)^T \end{bmatrix}\Delta z^{(k)} = \begin{bmatrix} -F\big(z^{(k)}\big) \\ 0 \end{bmatrix}$

in the absence of round-off errors. To do this, we will show that $\big(\phi^{(k)}\big)^T\Delta z^{(k)} = 0$ for each $k$, and this is presented in the main result of this section: Theorem 3.1. Algorithm 1 is given for computing an algebraically simple eigenpair of the pencil $(A, M)$. Note that since $\mathbf{A}$ has been shown to be singular at the root in Section 2, this section is anchored on the assumption that when $z^{(k)}$ is not at the root, $\mathbf{A}$ is nonsingular.
First, we define the $2n$ by $2n$ matrix $D$ as (see, also, [11])

(18) $D = \begin{bmatrix} 0 & -I \\ I & 0 \end{bmatrix}$

and

(19) $w = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix}.$

The matrix $D$ satisfies the following properties:

1) $D^T = -D$.

2) $D^T D = I_{2n}$, where $I_{2n}$ is the $2n$ by $2n$ identity matrix.

3) $D\mathbf{M} = \mathbf{M}D$, with $\mathbf{M}$ as defined in (12).

4) The matrix $D$ commutes with $\mathbf{A}$, i.e., $D\mathbf{A} = \mathbf{A}D$.

5) For $w \in \mathbb{R}^{2n}$, $w^T D w = 0$.

6) Let $y$ be an unknown vector that solves $\mathbf{A}y = \mathbf{M}w$. By premultiplying both sides by $D$ we obtain $D\mathbf{A}y = D\mathbf{M}w$, and hence $\mathbf{A}(Dy) = \mathbf{M}(Dw)$ because of the commutativity of $D$ with $\mathbf{A}$ and $\mathbf{M}$. Therefore, if $y$ solves $\mathbf{A}y = \mathbf{M}w$, then $Dy$ solves $\mathbf{A}(Dy) = \mathbf{M}(Dw)$.
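These properties are easy to confirm numerically. The following check (ours; the random data is illustrative) exercises properties 1)-6), with 6) showing that the single solve in (28) below also delivers the solution of the rotated right-hand side.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 5
I, Z = np.eye(n), np.zeros((n, n))
D = np.block([[Z, -I], [I, Z]])

A, M = rng.standard_normal((n, n)), rng.standard_normal((n, n))
alpha, beta = 0.3, 1.7
Abig = np.block([[A - alpha * M, beta * M], [-beta * M, A - alpha * M]])
Mbig = np.block([[M, Z], [Z, M]])
w = rng.standard_normal(2 * n)

print(np.allclose(D.T, -D))                   # 1) D^T = -D
print(np.allclose(D.T @ D, np.eye(2 * n)))    # 2) D^T D = I
print(np.allclose(D @ Mbig, Mbig @ D))        # 3) D commutes with M
print(np.allclose(D @ Abig, Abig @ D))        # 4) D commutes with A
print(np.isclose(w @ D @ w, 0.0))             # 5) w^T D w = 0

# 6) If y solves Abig y = Mbig w, then D y solves Abig (D y) = Mbig (D w).
y = np.linalg.solve(Abig, Mbig @ w)
print(np.allclose(Abig @ (D @ y), Mbig @ (D @ w)))
```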
We begin by writing the linear system of Equations (15) explicitly. For ease of notation, we shall drop the superscripts and define $\Delta z = \big(\Delta w^T, \Delta\alpha, \Delta\beta\big)^T$, where $\Delta w = \big(\Delta x_1^T, \Delta x_2^T\big)^T$, and replace $\alpha^{(k)}$ and $\beta^{(k)}$ with $\alpha$ and $\beta$ respectively. This means that (15) can now be rewritten as:

(20) $\begin{bmatrix} \mathbf{A} & -\mathbf{M}w & -\mathbf{M}Dw \\ -w^T\mathbf{M} & 0 & 0 \end{bmatrix}\begin{bmatrix} \Delta w \\ \Delta\alpha \\ \Delta\beta \end{bmatrix} = \begin{bmatrix} -\mathbf{A}w \\ -\tfrac{1}{2}\big(1 - w^T\mathbf{M}w\big) \end{bmatrix},$

which is equivalent to the following system of equations

$\mathbf{A}\Delta w - \Delta\alpha\,\mathbf{M}w - \Delta\beta\,\mathbf{M}Dw = -\mathbf{A}w, \qquad w^T\mathbf{M}\Delta w = \tfrac{1}{2}\big(1 - w^T\mathbf{M}w\big).$

After rearrangement, the first $2n$ equations reduce to

(21) $\mathbf{A}(w + \Delta w) = \Delta\alpha\,\mathbf{M}w + \Delta\beta\,\mathbf{M}Dw.$

By multiplying both sides of the last equation by 2, we obtain:

$2w^T\mathbf{M}\Delta w = 1 - w^T\mathbf{M}w.$

This in turn reduces to

(22) $w^T\mathbf{M}(w + 2\Delta w) = 1.$

Observe that since $W = w + \Delta w$, $\Delta w = W - w$ and $2w^T\mathbf{M}\Delta w = 2w^T\mathbf{M}W - 2w^T\mathbf{M}w$. Now, $2w^T\mathbf{M}\Delta w = 1 - w^T\mathbf{M}w$. Consequently,

(23) $w^T\mathbf{M}W = \tfrac{1}{2}\big(1 + w^T\mathbf{M}w\big).$

The combined set of Equations (21) and (23), which is the simplified form of (20), can be expressed as:

(24) $\begin{bmatrix} \mathbf{A} & -\mathbf{M}w & -\mathbf{M}Dw \\ w^T\mathbf{M} & 0 & 0 \end{bmatrix}\begin{bmatrix} W \\ \Delta\alpha \\ \Delta\beta \end{bmatrix} = \begin{bmatrix} 0 \\ \tfrac{1}{2}\big(1 + w^T\mathbf{M}w\big) \end{bmatrix}.$
We assume that the $2n$ by $2n$ matrix $\mathbf{A}$ is nonsingular except at the root. This is what forms the basis for the following discussion. That is to say, we want to show that when not at the root, $\big(\phi^{(k)}\big)^T\Delta z^{(k)} = 0$.

First of all, let the exact nullvector $\Phi$ of $J$ be defined as $\Phi = \big(\phi^T, \gamma, \delta\big)^T$, where $\phi \in \mathbb{R}^{2n}$, $\gamma$, $\delta$ are real scalars, and $w$ and $\mathbf{A}$ are defined respectively by (19) and (10). Hence $J\Phi = 0$; then, after expanding the matrix-vector multiplication, we obtain

(25) $\mathbf{A}\phi = \gamma\,\mathbf{M}w + \delta\,\mathbf{M}Dw,$

(26) $w^T\mathbf{M}\phi = 0.$

We make distinctly clear at this juncture that the nullvector $\Phi$ is not exactly the same as $\phi^{(k)}$ because the latter has the form of the exact nullvector at the root, but is evaluated at the $k$th iterate, while the former is the nullvector even when not at the root.
Another way of writing (24) is as follows

(27) $\mathbf{A}W = \Delta\alpha\,\mathbf{M}w + \Delta\beta\,\mathbf{M}Dw.$

This means that we could solve (24) by solving

(28) $\mathbf{A}y = \mathbf{M}w$

for $y$. After which the solution of (27) is given by

(29) $W = \Delta\alpha\, y + \Delta\beta\, Dy.$

With this expression for $W$, it can be observed that

$\mathbf{A}W = \Delta\alpha\,\mathbf{A}y + \Delta\beta\,\mathbf{A}Dy = \Delta\alpha\,\mathbf{M}w + \Delta\beta\,\mathbf{M}Dw,$

which means that $W$ is well defined. Furthermore, from (25), using the fact that $D$ commutes with $\mathbf{A}$ and (28) gives

(30) $\phi = \gamma\, y + \delta\, Dy.$

Since $\phi$ is $\mathbf{M}$-orthogonal to $w$ by virtue of Equation (26), taking the $\mathbf{M}$-inner product of both sides of the above with $w$ yields

$0 = \gamma\, w^T\mathbf{M}y + \delta\, w^T\mathbf{M}Dy = \gamma\sigma + \delta\tau, \qquad \sigma = w^T\mathbf{M}y, \quad \tau = w^T\mathbf{M}Dy.$

From which we deduce

(31) $\gamma = -\tau, \qquad \delta = \sigma,$

up to a common nonzero scalar multiple.
Consider the problem of solving the under-determined linear system of Equations (20) for the $2n + 2$ real unknowns $\big(\Delta w, \Delta\alpha, \Delta\beta\big)$. It was stated in Lemma 1.1 that the minimum norm solution to an under-determined linear system of equations is orthogonal to the nullspace. It is an application of this result that yields the following important relationship:

(32) $\Phi^T\Delta z = \phi^T\Delta w + \gamma\,\Delta\alpha + \delta\,\Delta\beta = 0.$

If we add the nullvector $\Phi^T$ to the last row of (24), then

(33) $\begin{bmatrix} \mathbf{A} & -\mathbf{M}w & -\mathbf{M}Dw \\ w^T\mathbf{M} & 0 & 0 \\ \phi^T & \gamma & \delta \end{bmatrix}\begin{bmatrix} W \\ \Delta\alpha \\ \Delta\beta \end{bmatrix} = \begin{bmatrix} 0 \\ \tfrac{1}{2}\big(1 + w^T\mathbf{M}w\big) \\ \phi^T w \end{bmatrix}.$

By expanding the second to the last row, $w^T\mathbf{M}W = \tfrac{1}{2}\big(1 + w^T\mathbf{M}w\big)$. But from (29), $W = \Delta\alpha\, y + \Delta\beta\, Dy$. This implies that taking the $\mathbf{M}$-inner product of both sides with $w$ yields

$w^T\mathbf{M}W = \Delta\alpha\, w^T\mathbf{M}y + \Delta\beta\, w^T\mathbf{M}Dy.$

Using the definition (31) of $\sigma$ and $\tau$, we obtain

(34) $\sigma\,\Delta\alpha + \tau\,\Delta\beta = \tfrac{1}{2}\big(1 + w^T\mathbf{M}w\big),$

where the unknown quantities $\Delta\alpha$ and $\Delta\beta$ are to be determined, so we need an extra equation to be able to do so. The last row of the matrix-vector multiplication (cf. (33)) above comes from (32), since

$\phi^T\Delta w + \gamma\,\Delta\alpha + \delta\,\Delta\beta = 0 \iff \phi^T W + \gamma\,\Delta\alpha + \delta\,\Delta\beta = \phi^T w.$

If we substitute the expression (30) for $\phi$ and (29) for $W$ into the left hand side, then one obtains

$\big(\gamma y + \delta Dy\big)^T\big(\Delta\alpha\, y + \Delta\beta\, Dy\big) + \gamma\,\Delta\alpha + \delta\,\Delta\beta = \phi^T w.$

Furthermore, by expanding the first term on the left hand side, using the properties of $D$ (in particular $y^T D y = 0$ and $(Dy)^T Dy = y^T y$), then

$\big(\gamma\,\Delta\alpha + \delta\,\Delta\beta\big)\,y^T y + \gamma\,\Delta\alpha + \delta\,\Delta\beta = \phi^T w.$

Consequently,

$\big(\gamma\,\Delta\alpha + \delta\,\Delta\beta\big)\big(1 + y^T y\big) = \phi^T w.$

Observe that because $y$ is real, $1 + y^T y$ is nonzero. Accordingly, after dividing both sides by $1 + y^T y$,

(35) $\gamma\,\Delta\alpha + \delta\,\Delta\beta = \frac{\phi^T w}{1 + y^T y}.$
We combine the two equations (34) and (35) below

(36) $\begin{bmatrix} \sigma & \tau \\ \gamma & \delta \end{bmatrix}\begin{bmatrix} \Delta\alpha \\ \Delta\beta \end{bmatrix} = \begin{bmatrix} \tfrac{1}{2}\big(1 + w^T\mathbf{M}w\big) \\ \dfrac{\phi^T w}{1 + y^T y} \end{bmatrix}$

to compute $\Delta\alpha$ and $\Delta\beta$ simultaneously. The matrix on the left hand side is always nonsingular except at the root (in which case all entries are zero); this is because, with $\gamma = -\tau$ and $\delta = \sigma$, its determinant is $\sigma^2 + \tau^2$. Equation (35) can now be applied to simplify

(37) $\phi^T W = \big(\gamma\,\Delta\alpha + \delta\,\Delta\beta\big)\,y^T y = \frac{y^T y}{1 + y^T y}\,\phi^T w.$

Notice that we have used the property $y^T D y = 0$ to arrive at the second step above and the definition (29) for $W$.
Next, we want to establish the orthogonality of $\phi^{(k)}$ and $\Delta z^{(k)}$ in the next key result. Before we do that, notice from Theorem 2.1 that $\phi$, at the root, is a scalar multiple of $Dw$, and by the definition of $D$ in (18) we can also write $\phi^{(k)} = \big((Dw^{(k)})^T, 0, 0\big)^T$, with $w^{(k)} = \big((x_1^{(k)})^T, (x_2^{(k)})^T\big)^T$. This form holds whether $z^{(k)}$ is at the root or not, but the analysis used to establish the orthogonality is based on the assumption that $\mathbf{A}$ is nonsingular when not at the root. As a result of this, after presenting Algorithm 1 (to follow shortly), we then show that the same result holds when $\mathbf{A}$ is singular at the root.
Theorem 3.1 Let $\phi^{(k)} = \big((Dw^{(k)})^T, 0, 0\big)^T$ be an approximation to the exact nullvector $\Phi$ of $J$. Then, $\phi^{(k)}$ is orthogonal to $\Delta z^{(k)}$.

Proof: To prove this, recall that $\Delta z = \big(\Delta w^T, \Delta\alpha, \Delta\beta\big)^T$, $\Delta w = W - w$ and $(Dw)^T w = 0$. This implies

(38) $\big(\phi^{(k)}\big)^T\Delta z^{(k)} = (Dw)^T\Delta w = (Dw)^T W - (Dw)^T w = (Dw)^T W = 0,$

showing that $\phi^{(k)}$ is orthogonal to $\Delta z^{(k)}$. In arriving at the last step above, we have used the properties of $D$ and a special case of (37) where $\phi = Dw$.
We present Algorithm 1, which involves the solution of two linear systems. The first is the $2n$ by $2n$ linear system of equations in (28), while the second is the 2 by 2 linear system (36). A sketch of one step is given below.
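The following sketch is our own condensation of one step, assuming the formulas (28)-(31) and (34)-(36) as reconstructed above; the function name is ours. It uses a direct dense solve for (28); Section 4 replaces this solve by preconditioned GMRES.

```python
import numpy as np

def algorithm1_step(A, M, x1, x2, alpha, beta):
    """One Gauss-Newton step: the 2n x 2n solve (28) and the 2 x 2 solve (36).
    A sketch under the reconstructed formulas; Abig is nearly singular close
    to the root, which is where the preconditioning of Section 4 matters."""
    n = A.shape[0]
    I, Z = np.eye(n), np.zeros((n, n))
    D = np.block([[Z, -I], [I, Z]])
    Abig = np.block([[A - alpha * M, beta * M], [-beta * M, A - alpha * M]])
    Mbig = np.block([[M, Z], [Z, M]])
    w = np.concatenate([x1, x2])

    # (28): the only large solve; property 6 supplies A(Dy) = M(Dw) for free.
    y = np.linalg.solve(Abig, Mbig @ w)
    Dy = D @ y

    sigma, tau = w @ (Mbig @ y), w @ (Mbig @ Dy)   # M-inner products
    gamma, delta = -tau, sigma                     # (31): gamma*sigma + delta*tau = 0
    phi = gamma * y + delta * Dy                   # (30): exact nullvector part

    # (36): 2 x 2 system for the eigenvalue corrections; det = sigma^2 + tau^2.
    K = np.array([[sigma, tau], [gamma, delta]])
    rhs = np.array([0.5 * (1.0 + w @ (Mbig @ w)),
                    (phi @ w) / (1.0 + y @ y)])
    d_alpha, d_beta = np.linalg.solve(K, rhs)

    W = d_alpha * y + d_beta * Dy                  # (29): W = w + Delta w
    return W[:n], W[n:], alpha + d_alpha, beta + d_beta
```

The step is repeated until the stopping rule below is met.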
Stop Algorithm 1 as soon as $\big\|F\big(z^{(k)}\big)\big\| \le tol$, where $tol$ is a user-prescribed tolerance. The above analysis shows that $\big(\phi^{(k)}\big)^T\Delta z^{(k)} = 0$ when $z^{(k)}$ is not at the root. Next, we want to show that the same result holds at the root.
In a manner analogous to the proof of Theorem 2.1, we premultiply both sides of (25) by $\Psi^T$, where $\Psi$ is the $2n$ by $2$ real matrix defined by (14), consisting of the left nullvectors of $\mathbf{A}$. If $\mathbf{A}$ and $\mathbf{B}$ are as defined respectively in (10) and (11), then this is the same as

$\Psi^T\mathbf{A}\phi = \Psi^T\mathbf{M}\big[w,\ Dw\big]\begin{bmatrix} \gamma \\ \delta \end{bmatrix}.$

But by the definition of $\Psi$, the left hand side of the equation above is zero, since $\Psi^T\mathbf{A} = 0$. It can be recalled from the proof of Theorem 2.1 that the 2 by 2 real matrix $\Psi^T\mathbf{M}\big[w,\ Dw\big]$ is nonsingular at the root. This implies

$\Psi^T\mathbf{M}\big[w,\ Dw\big]\begin{bmatrix} \gamma \\ \delta \end{bmatrix} = 0.$

Accordingly, $\gamma = \delta = 0$ because of the nonsingularity of $\Psi^T\mathbf{M}\big[w,\ Dw\big]$. Therefore,

(39) $\mathbf{A}\phi = 0.$

From the property of $\mathbf{A}$ at the root, it has two nonzero nullvectors and is hence singular. The implication of this fact and the above is that $\phi$ is in the nullspace of $\mathbf{A}$. But we have already established that the nullspace of $\mathbf{A}$ consists of $w$ and $Dw$. Hence, we can write

$\phi = c_1\, w + c_2\, Dw.$

Now, from (26),

$w^T\mathbf{M}\phi = c_1\, w^T\mathbf{M}w + c_2\, w^T\mathbf{M}Dw = c_1\, w^T\mathbf{M}w = 0.$

But at the root, the last equation in (24) gives $w^T\mathbf{M}w = 1$. Therefore, $c_1 = 0$, $\phi = c_2\, Dw$, and $\Phi = c_2\big((Dw)^T, 0, 0\big)^T$. This will now be used to deduce the following corollary at the root.
Corollary 3.1 Let $\phi^{(k)} = \big((Dw^{(k)})^T, 0, 0\big)^T$. Let $\Phi = c_2\big((Dw)^T, 0, 0\big)^T$ at the root. Then, $\phi^{(k)}$ is orthogonal to $\Delta z^{(k)}$ at the root.

Proof: This follows from the second to the last line of the proof of Theorem 3.1 (cf. Equation (38)), where $(Dw)^T W = 0$. Hence,

$\big(\phi^{(k)}\big)^T\Delta z^{(k)} = (Dw)^T\Delta w = (Dw)^T W - (Dw)^T w = 0.$
4. Inexact Inverse Iteration with Preconditioning for Solving (28)
In Section 2, we found two nonzero nullvectors for $\mathbf{A}$ at the root. As a result of this property of $\mathbf{A}$ at the root, in this section, we will describe an inexact inverse iteration technique for solving the large sparse system of Equations (28) in step 2 of Algorithm 1 and present Algorithm 2 and Algorithm 3. Results of a numerical experiment which support the theory are given in Section 5.

We give the following version of inexact inverse iteration in Algorithm 2. We will use a fixed tolerance. Note that because of the special nature of $\mathbf{A}$ at the root, the choice of a preconditioner is crucial for convergence to the desired accuracy to be achieved. The inexact linear solver that we use is preconditioned GMRES [9], where we define the following block triangular preconditioner $\mathbb{P}$:
(40)
Next, we present Algorithm 3, which is the inexact inverse iteration equivalent of Algorithm 1. The stopping criterion for the outer iteration in Algorithm 3 depends on the norms of the eigenvalue residuals, that is,

$\big\|\big(A - \alpha^{(k)} M\big)x_1^{(k)} + \beta^{(k)} M x_2^{(k)}\big\| \quad\text{and}\quad \big\|\big(A - \alpha^{(k)} M\big)x_2^{(k)} - \beta^{(k)} M x_1^{(k)}\big\|.$
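Because the explicit entries of (40) are not reproduced above, the sketch below (ours) illustrates the idea with one plausible block upper triangular preconditioner assembled from the diagonal block $A - \alpha M$; in practice an incomplete LU of that block would be used, and the paper's $\mathbb{P}$ may differ in detail.

```python
import numpy as np
from scipy.sparse import bmat, csc_matrix
from scipy.sparse.linalg import LinearOperator, gmres, splu

def solve_28_inexactly(A, M, alpha, beta, w, tol=1e-8):
    """Solve the 2n x 2n system (28), Abig y = Mbig w, by GMRES with a
    block triangular preconditioner (one plausible choice; the paper's
    exact preconditioner (40) may differ)."""
    n = A.shape[0]
    A, M = csc_matrix(A), csc_matrix(M)
    P = csc_matrix(A - alpha * M)
    Abig = bmat([[P, beta * M], [-beta * M, P]], format="csc")
    Mbig = bmat([[M, None], [None, M]], format="csc")

    lu = splu(P)  # in practice: an incomplete LU of A - alpha*M

    def apply_prec(r):
        # Back-substitution with the block upper triangular matrix
        # [[P, beta*M], [0, P]].
        z2 = lu.solve(r[n:])
        z1 = lu.solve(r[:n] - beta * (M @ z2))
        return np.concatenate([z1, z2])

    Prec = LinearOperator((2 * n, 2 * n), matvec=apply_prec)
    y, info = gmres(Abig, Mbig @ w, M=Prec, rtol=tol)  # 'tol=' in older SciPy
    return y, info
```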
5. Numerical Experiments
As mentioned earlier, the sparse linear system of equations in step 2 of Algorithm 1 is solved with an LU-type factorization of $\mathbf{A}$, which is expensive; besides, the L and U factors may have more nonzero elements than $\mathbf{A}$. In this section, our main goal is to use preconditioned GMRES with the block triangular preconditioner $\mathbb{P}$ in (40) to solve the system $\mathbf{A}y = \mathbf{M}w$ inexactly. We do this by considering a single numerical experiment with a fixed and a decreasing tolerance. Results are presented by means of tables and figures.
Example 5.1
Consider the 200 by 200 matrix $A$ = bwm200.mtx from the Matrix Market library [12]. It is the discretised Jacobian of the Brusselator wave model for a chemical reaction. The resulting eigenvalue problem with $M = I$ was also studied in [13], and we are interested in finding the rightmost eigenvalue of $A$, which is closest to the imaginary axis, and its corresponding eigenvector.

In this example, we take the initial eigenvalue guess in line with [13] and took the initial eigenvector guess proportional to $e$, where $e$ is the vector of all ones.
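As an illustration only (ours, not the paper's experiment), the driver below applies the algorithm1_step sketch from Section 3 to bwm200.mtx; the starting values are hypothetical stand-ins for the initial guesses of [13], and the matrix file must first be downloaded from the Matrix Market.

```python
import numpy as np
from scipy.io import mmread

A = mmread("bwm200.mtx").toarray()   # 200 x 200 Brusselator Jacobian
n = A.shape[0]
M = np.eye(n)                        # standard problem, M = I

e = np.ones(n)
x1 = x2 = e / np.linalg.norm(e)      # illustrative eigenvector guess
alpha, beta = 0.0, 2.0               # hypothetical guess near the imaginary axis

for k in range(15):
    x1, x2, alpha, beta = algorithm1_step(A, M, x1, x2, alpha, beta)
    r1 = np.linalg.norm((A - alpha * M) @ x1 + beta * (M @ x2))
    r2 = np.linalg.norm((A - alpha * M) @ x2 - beta * (M @ x1))
    print(k, alpha, beta, r1, r2)    # eigenvalue residuals per outer iteration
    if max(r1, r2) < 1e-10:
        break
```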
In Table 1 and Table 2, we present results for the computation of the eigenpair for the matrix in Example 5.1, and stop once the norm of $F$ and the eigenvalue residuals are smaller than the outer tolerance. In Table 1, f represents the number of inner iterations used by preconditioned GMRES to satisfy the fixed inner tolerance, while d in Table 2 represents the number of inner iterations used by preconditioned GMRES to satisfy the decreasing inner tolerance. Quadratic convergence to the eigenvalue is easily observed from the second up to the seventh iterate in columns five and six of Table 2. However, this quadratic convergence is lost in the last iterate. This could be due to the fact that we are solving a nearly singular system as we approach the root. As shown in the last columns of Table 1 and Table 2, as we approach the root, more inner iterations were needed to satisfy the decreasing inner tolerance than the fixed one.
Table 1. Convergence to the eigenvalue with a fixed inner tolerance for Example 5.1. The last column shows the number of inner iterations it took to satisfy the fixed inner tolerance.

Table 2. Quadratic convergence to the eigenvalue with a decreasing tolerance for Example 5.1. The last column shows the number of inner iterations it took to satisfy the decreasing inner tolerance.

Figure 1. Convergence history of the eigenvalue residuals on Example 5.1 with a fixed tolerance.
From Table 1, we observed superlinear convergence in columns five and six.
Figure 1 and Figure 2 show plots of the norms of the eigenvalue residuals against the outer iterations with fixed and decreasing inner tolerances respectively.
Figure 2. Convergence history of the eigenvalue residuals on Example 5.1 with a decreasing tolerance.
While the norms of the eigenvalue residuals decayed almost superlinearly in Figure 1, we observed a quadratic decrease in Figure 2. It is quite surprising that Algorithm 3 works, because $\mathbf{A}$ is singular at the root, which means we solved a singular system at the root.
6. Conclusions
While Ruhe ([2], Section 3) used the normalisation $c^H x = 1$ and solved the resulting $(n+1)$ by $(n+1)$ nonlinear system of equations to obtain the eigenpair, we have been able to show that, with the addition of the non-differentiable normalisation $x^H M x = 1$, it is still possible to convert the resulting under-determined system of nonlinear equations into a square one.

Nevertheless, in this work, we obtained Algorithm 1, which consists of a combination of a $2n$ by $2n$ system of equations (the same as those in [13]) and a 2 by 2 system. A numerical example shows that using an LU solver on the one hand and preconditioned GMRES as an inexact solver on the other hand to solve the large sparse $2n$ by $2n$ system of Equations (28), followed by the 2 by 2 system in each case, gives similar results in the limit. By and large, the algorithms presented in this paper rely on good initial guesses to the desired eigenpair of interest.
Acknowledgements
The first author acknowledges funding provided by the University of Bath, United Kingdom, during his PhD, and thanks an anonymous referee for useful comments.