Preprint
Article

This version is not peer-reviewed.

Eigenvalue Bounds for Symmetric, Multiple Saddle-Point Matrices with SPD Preconditioners

Submitted:

02 February 2026

Posted:

06 February 2026


Abstract
We derive eigenvalue bounds for symmetric block-tridiagonal multiple saddle-point systems preconditioned with the symmetric positive definite (SPD) preconditioner proposed by J. Pearson and A. Potschka in 2024, and further studied by L. Bergamaschi and coauthors, for double saddle point problems, with inexact Schur complement matrices. The analysis applies to an arbitrary number of blocks. Numerical experiments are carried out to validate the proposed estimates.

1. Introduction

We consider the iterative solution of a block tridiagonal multiple saddle-point linear system A x = b , where
\[
A =
\begin{pmatrix}
A_0 & B_1^\top & & & \\
B_1 & -A_1 & B_2^\top & & \\
 & B_2 & A_2 & \ddots & \\
 & & \ddots & \ddots & B_N^\top \\
 & & & B_N & (-1)^N A_N
\end{pmatrix}.
\]
We assume that A_0 ∈ R^{n_0 × n_0} is symmetric positive definite, that all other square diagonal blocks A_k ∈ R^{n_k × n_k} are symmetric positive semi-definite, and that the B_k ∈ R^{n_k × n_{k-1}} have full rank (for k = 1, …, N). We assume also that n_k ≤ n_{k-1} for all k. These conditions are sufficient to ensure the invertibility of A.
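For small dense test problems, the block structure above can be assembled directly; the following NumPy sketch (the helper name and block sizes are our own, not from the paper) builds the matrix from lists of A_k and B_k blocks:

```python
import numpy as np

def assemble_saddle_point(A_blocks, B_blocks):
    """Assemble the block-tridiagonal matrix with diagonal blocks
    (-1)^k A_k and couplings B_k (below diagonal) / B_k^T (above)."""
    sizes = [A.shape[0] for A in A_blocks]
    off = np.concatenate(([0], np.cumsum(sizes)))   # block offsets
    M = np.zeros((off[-1], off[-1]))
    for k, Ak in enumerate(A_blocks):
        M[off[k]:off[k+1], off[k]:off[k+1]] = (-1) ** k * Ak
    for k, Bk in enumerate(B_blocks, start=1):      # B_k maps block k-1 -> block k
        M[off[k]:off[k+1], off[k-1]:off[k]] = Bk
        M[off[k-1]:off[k], off[k]:off[k+1]] = Bk.T
    return M
```

The result is symmetric by construction; for the assumptions above one would feed an SPD A_0, positive semi-definite A_k, and full-rank B_k with n_k ≤ n_{k-1}.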
Linear systems involving matrix A arise mostly with N = 2 (double saddle–point systems) in many scientific applications including magma–mantle dynamics [3], liquid crystal director modeling [4] or in the coupled Stokes–Darcy problem [5,6,7,8], and the preconditioning of such linear systems has been considered in [9,10,11]. In particular, block diagonal preconditioners for matrix A have been studied in [12,13,14,15,16].
Multiple saddle-point linear systems with N > 2 have recently attracted the attention of a number of researchers. Such systems often arise from modeling multiphysics processes, i.e. the simultaneous simulation of different aspects of physical systems and the interactions among them. Preconditioning of the 4 × 4 saddle-point linear system ( N = 3 ), although with a different block structure, has been addressed in [17] to solve a class of optimal control problems. A practical preconditioning strategy for multiple saddle-point linear systems, based on sparse approximate inverses of the diagonal blocks of the block diagonal Schur complement preconditioning matrix, is proposed in [18] for the solution of coupled poromechanical models and the mechanics of fractured media. Theoretical analysis of such preconditioners has been carried out in [19]. In [17], two 5 × 5 multiple saddle-point systems arising from optimal control problems constrained with either the heat or the wave equation are addressed, and iteratively solved by robust block diagonal preconditioners.
In this work we consider the preconditioner proposed in [1] (PP preconditioner, for short) and develop a spectral analysis of the preconditioned matrix for an arbitrary number of blocks and in the presence of inexact Schur complements, along the lines of [15], in which the block diagonal preconditioner with exact Schur complements is considered. We will define a sequence of polynomials by a three-term recurrence, whose extremal roots characterize the eigenvalues of the preconditioned matrix in terms of the nonnegative eigenvalues of the preconditioned Schur complements. Using a similar technique, we will extend results, obtained in [2] for double saddle-point problems, to multiple saddle-point linear systems.
Arguably the most prominent Krylov subspace methods for solving the linear system A x = b are preconditioned variants of MINRES [20] and GMRES [21]. In contrast to GMRES, the earlier MINRES algorithm can explicitly exploit the symmetry of A. As a consequence, MINRES features a three-term recurrence relation, which is beneficial for its implementation (low memory requirements, because subspace bases need not be stored) and allows a purely eigenvalue-based convergence analysis (via the well-known connection to orthogonal polynomials; see [22,23]). Specifically, if the eigenvalues of the preconditioned matrix are contained within [ρ_l^−, ρ_u^−] ∪ [ρ_l^+, ρ_u^+], for ρ_l^− ≤ ρ_u^− < 0 < ρ_l^+ ≤ ρ_u^+ such that ρ_u^+ − ρ_l^+ = ρ_u^− − ρ_l^−, then at iteration k the Euclidean norm of the preconditioned residual r_k satisfies the bound
\[
\frac{\| r_k \|}{\| r_0 \|} \le 2 \left( \frac{\sqrt{|\rho_l^- \rho_u^+|} - \sqrt{|\rho_u^- \rho_l^+|}}{\sqrt{|\rho_l^- \rho_u^+|} + \sqrt{|\rho_u^- \rho_l^+|}} \right)^{\lfloor k/2 \rfloor} .
\]
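For quick reference, the bound can be evaluated numerically; a small sketch (the function name and argument order are ours):

```python
import math

def minres_residual_bound(k, rho_lm, rho_um, rho_lp, rho_up):
    """MINRES residual bound at iteration k for a spectrum contained in
    [rho_lm, rho_um] U [rho_lp, rho_up] with equal interval lengths."""
    a = math.sqrt(abs(rho_lm * rho_up))
    b = math.sqrt(abs(rho_um * rho_lp))
    return 2.0 * ((a - b) / (a + b)) ** (k // 2)
```

Tighter clustering of the two intervals around ±1 makes the two square roots nearly equal, driving the contraction factor toward zero.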
The paper is structured as follows: In Section 2 we describe the preconditioner and write the eigenvalue problem for the preconditioned matrix. In Section 3, a sequence of polynomials is defined by recurrence, and the connection of these with the eigenvalues of the preconditioned matrix is stated. Then, in Section 4, we characterize the extremal roots of such polynomials and the dependence of these roots on two vectors of parameters containing information on the inexact Schur complements. Section 5 is devoted to comparing the bounds with the exactly computed eigenvalues on a large set of synthetic experiments, with the number of blocks ranging from 3 to 5. In Section 6 we present, as a realistic test case, the 3D Mixed Finite Element discretization of the Biot model, which gives rise to a double saddle-point linear system, to again validate the bounds and to show the good performance of the PP preconditioner, also in comparison with the best-known block diagonal preconditioner. Section 7 draws some conclusions.

2. The Preconditioner

Consider the exact block factorization of matrix A = U^⊤ D^{-1} U, where
\[
D = \operatorname{blkdiag}\big( A_0,\, -S_1,\, S_2,\, \dots,\, (-1)^N S_N \big),
\qquad
U = \begin{pmatrix}
A_0 & B_1^\top & & \\
 & -S_1 & B_2^\top & \\
 & & \ddots & \ddots \\
 & & & (-1)^N S_N
\end{pmatrix},
\]
with S_k = A_k + B_k S_{k-1}^{-1} B_k^⊤, k = 1, …, N, and S_0 = A_0. The preconditioner is initially obtained by removing the minus signs in D, thus obtaining a symmetric positive definite matrix P = U^⊤ |D|^{-1} U, and then, in view of its practical application, by approximating the Schur complements with their inexact counterparts. The final expression of the preconditioner is therefore P = P_U^⊤ P_D^{-1} P_U, where
\[
P_D = \operatorname{blkdiag}\big( \hat S_0,\, \hat S_1,\, \dots,\, \hat S_N \big),
\qquad
P_U = \begin{pmatrix}
\hat S_0 & B_1^\top & & \\
 & -\hat S_1 & B_2^\top & \\
 & & \ddots & \ddots \\
 & & & (-1)^N \hat S_N
\end{pmatrix},
\]
with
\[
\hat S_0 \approx A_0, \qquad \tilde S_k = A_k + B_k \hat S_{k-1}^{-1} B_k^\top, \qquad \hat S_k \approx \tilde S_k .
\]
This preconditioner was proposed by Pearson and Potschka in [1], in the framework of PDE-constrained optimization. Since P is symmetric positive definite, it allows the use of the MINRES iterative method.
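The construction of P_D, P_U, and the inexact Schur complements can be mirrored directly in code. Below is a dense NumPy sketch for small test problems; the `approx` hook is our own stand-in for whatever cheap approximation Ŝ_k ≈ S̃_k is used in practice (the identity map reproduces the exact Schur complements):

```python
import numpy as np

def pp_preconditioner(A_blocks, B_blocks, approx=lambda S: S):
    """Build P = P_U^T P_D^{-1} P_U with S_hat_k = approx(S_tilde_k)."""
    N = len(B_blocks)
    S_hat = [approx(A_blocks[0])]                 # S_hat_0 ~ A_0
    for k in range(1, N + 1):
        # S_tilde_k = A_k + B_k S_hat_{k-1}^{-1} B_k^T
        St = A_blocks[k] + B_blocks[k-1] @ np.linalg.solve(S_hat[k-1], B_blocks[k-1].T)
        S_hat.append(approx(St))
    sizes = [A.shape[0] for A in A_blocks]
    off = np.concatenate(([0], np.cumsum(sizes)))
    n = off[-1]
    PD = np.zeros((n, n))
    PU = np.zeros((n, n))
    for k in range(N + 1):
        PD[off[k]:off[k+1], off[k]:off[k+1]] = S_hat[k]
        PU[off[k]:off[k+1], off[k]:off[k+1]] = (-1) ** k * S_hat[k]
        if k < N:                                 # superdiagonal coupling B_{k+1}^T
            PU[off[k]:off[k+1], off[k+1]:off[k+2]] = B_blocks[k].T
    return PU.T @ np.linalg.solve(PD, PU)         # symmetric positive definite
```

Since P_D is SPD and P_U is invertible (its diagonal blocks are ±Ŝ_k), the returned matrix is SPD, as required by MINRES.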
Finding the eigenvalues of P^{-1}A is equivalent to solving
\[
P_D^{-1/2} A P_D^{-1/2} u
= \lambda \left( P_D^{-1/2} P_L P_D^{-1/2} \right) \left( P_D^{-1/2} P_U P_D^{-1/2} \right) u,
\qquad
u = \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_{N+1} \end{pmatrix},
\]
with P_L = P_U^⊤.
Exploiting the block components of this generalized eigenvalue problem, we obtain
\[
\begin{pmatrix}
E_0 & R_1^\top & & & \\
R_1 & -E_1 & R_2^\top & & \\
 & R_2 & E_2 & \ddots & \\
 & & \ddots & (-1)^{N-1} E_{N-1} & R_N^\top \\
 & & & R_N & (-1)^N E_N
\end{pmatrix}
\begin{pmatrix} u_1 \\ u_2 \\ u_3 \\ \vdots \\ u_{N+1} \end{pmatrix}
= \lambda
\begin{pmatrix}
I & R_1^\top & & & \\
R_1 & I + R_1 R_1^\top & -R_2^\top & & \\
 & -R_2 & I + R_2 R_2^\top & \ddots & \\
 & & \ddots & I + R_{N-1} R_{N-1}^\top & (-1)^{N+1} R_N^\top \\
 & & & (-1)^{N+1} R_N & I + R_N R_N^\top
\end{pmatrix}
\begin{pmatrix} u_1 \\ u_2 \\ u_3 \\ \vdots \\ u_{N+1} \end{pmatrix},
\]
where
\[
R_k = \hat S_k^{-1/2} B_k \hat S_{k-1}^{-1/2}, \quad k = 1, \dots, N; \qquad
E_k = \hat S_k^{-1/2} A_k \hat S_k^{-1/2}, \quad k = 0, \dots, N; \qquad
R_k R_k^\top + E_k = \hat S_k^{-1/2} \tilde S_k \hat S_k^{-1/2} \equiv \bar S_k .
\]
Componentwise, we write the eigenvalue problem (3) as
\[
\begin{aligned}
0 &= (E_0 - \lambda I) u_1 + (1 - \lambda) R_1^\top u_2, \\
0 &= (1 - \lambda) R_1 u_1 + \big( {-E_1} - \lambda (I + R_1 R_1^\top) \big) u_2 + (1 + \lambda) R_2^\top u_3, \\
0 &= (1 + \lambda) R_2 u_2 + \big( E_2 - \lambda (I + R_2 R_2^\top) \big) u_3 + (1 - \lambda) R_3^\top u_4, \\
&\;\;\vdots \\
0 &= \big( 1 + (-1)^{N-1} \lambda \big) R_{N-1} u_{N-1} + \big( (-1)^{N-1} E_{N-1} - \lambda (I + R_{N-1} R_{N-1}^\top) \big) u_N + \big( 1 + (-1)^N \lambda \big) R_N^\top u_{N+1}, \\
0 &= \big( 1 + (-1)^N \lambda \big) R_N u_N + \big( (-1)^N E_N - \lambda (I + R_N R_N^\top) \big) u_{N+1} .
\end{aligned}
\]
The matrices R_k R_k^⊤ are all symmetric positive definite, since each B_k has full (row) rank and n_k ≤ n_{k-1}. We define two families of indicators, γ_E^{(k)} and γ_R^{(k)}, through Rayleigh quotients:
\[
\begin{aligned}
\alpha_E^{(k)} &\equiv \lambda_{\min}(E_k), & \beta_E^{(k)} &\equiv \lambda_{\max}(E_k), & \gamma_E^{(k)}(w) &= \frac{w^\top E_k w}{w^\top w} \in \big[ \alpha_E^{(k)}, \beta_E^{(k)} \big] \equiv I_{E_k}, & k &= 0, \dots, N, \\
\alpha_R^{(k)} &\equiv \lambda_{\min}(R_k R_k^\top), & \beta_R^{(k)} &\equiv \lambda_{\max}(R_k R_k^\top), & \gamma_R^{(k)}(w) &= \frac{w^\top R_k R_k^\top w}{w^\top w} \in \big[ \alpha_R^{(k)}, \beta_R^{(k)} \big] \equiv I_{R_k}, & k &= 1, \dots, N.
\end{aligned}
\]
A third set of indicators, γ_S^{(k)}, linked to the previous ones by
\[
\gamma_S^{(k)}(w) = \gamma_E^{(k)}(w) + \gamma_R^{(k)}(w), \qquad w \in \mathbb{R}^{n_k},
\]
describes the field of values of each preconditioned Schur complement S̄_k:
\[
\alpha_S^{(k)} \equiv \lambda_{\min}(\bar S_k), \qquad \beta_S^{(k)} \equiv \lambda_{\max}(\bar S_k), \qquad
\gamma_S^{(k)}(w) = \frac{w^\top \bar S_k w}{w^\top w} \in \big[ \alpha_S^{(k)}, \beta_S^{(k)} \big] \equiv I_{S_k}, \qquad k = 1, \dots, N .
\]
If the Schur complements are exactly inverted, then E_0 = I and R_k R_k^⊤ + E_k = I. In such a case, setting γ_R^{(0)} = 0, we have γ_S^{(k)}(w) = γ_E^{(k)}(w) + γ_R^{(k)}(w) ≡ 1 for k = 0, …, N and every w, and the eigenvalues of (3) are either −1 or 1, with multiplicity, respectively, n_1 + n_3 + ⋯ and n_0 + n_2 + ⋯ (see [1], Theorem 2.1). In the following, we will often omit the argument w from the previously defined indicators. We define the two vectors of parameters:
\[
\gamma_E = \big( \gamma_E^{(0)}, \dots, \gamma_E^{(N)} \big), \qquad \gamma_R = \big( \gamma_R^{(1)}, \dots, \gamma_R^{(N)} \big) .
\]
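The ±1 clustering in the exact-Schur-complement case is easy to verify numerically. The following self-contained NumPy check (block sizes and random data are our own) builds a double saddle-point matrix, the preconditioner with exact Schur complements, and confirms the eigenvalue multiplicities:

```python
import numpy as np

rng = np.random.default_rng(0)
n0, n1, n2 = 5, 4, 3                          # n0 >= n1 >= n2

def spd(n):                                   # random SPD block
    M = rng.standard_normal((n, n))
    return M @ M.T + n * np.eye(n)

A0, A1, A2 = spd(n0), spd(n1), spd(n2)
B1 = rng.standard_normal((n1, n0))            # full row rank (generically)
B2 = rng.standard_normal((n2, n1))
Z = lambda m, n: np.zeros((m, n))

A = np.block([[A0, B1.T, Z(n0, n2)],
              [B1, -A1, B2.T],
              [Z(n2, n0), B2, A2]])

S1 = A1 + B1 @ np.linalg.solve(A0, B1.T)      # exact Schur complements
S2 = A2 + B2 @ np.linalg.solve(S1, B2.T)

PD = np.block([[A0, Z(n0, n1), Z(n0, n2)],
               [Z(n1, n0), S1, Z(n1, n2)],
               [Z(n2, n0), Z(n2, n1), S2]])
PU = np.block([[A0, B1.T, Z(n0, n2)],
               [Z(n1, n0), -S1, B2.T],
               [Z(n2, n0), Z(n2, n1), S2]])
P = PU.T @ np.linalg.solve(PD, PU)

lam = np.sort(np.linalg.eigvals(np.linalg.solve(P, A)).real)
# n1 eigenvalues equal to -1, n0 + n2 equal to +1 ([1], Theorem 2.1)
```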
We will now make some very mild assumptions on two of these indicators.
Assumption 1.
The value 1 is strictly included in both I_{E_0} and I_{S_1}, namely
1. α_E^{(0)} < 1 < β_E^{(0)},
2. α_S^{(1)} < 1 < β_S^{(1)}.
These assumptions are very commonly satisfied in practice, meaning that 1 lies strictly between the extreme eigenvalues of both the preconditioned (1,1) block and the preconditioned Schur complement S̄_1.

3. Characterization of the Eigenvalues of the Preconditioned Matrix

We now recursively define a sequence of polynomials, with γ_E and γ_R as parameters, whose roots are strictly related to the eigenvalues of P^{-1}A.
Definition 1.
\[
\begin{aligned}
U_0(\lambda, \gamma_R, \gamma_E) &= 1, \qquad U_1(\lambda, \gamma_R, \gamma_E) = \lambda - \gamma_E^{(0)}, \\
U_{k+1}(\lambda, \gamma_R, \gamma_E) &= \Big( \lambda \big( 1 + \gamma_R^{(k)} \big) + (-1)^{k+1} \gamma_E^{(k)} \Big) U_k(\lambda, \gamma_R, \gamma_E) - \gamma_R^{(k)} \big( \lambda + (-1)^k \big)^2 U_{k-1}(\lambda, \gamma_R, \gamma_E), \quad k \ge 1 .
\end{aligned}
\]
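The recurrence is straightforward to implement with NumPy's polynomial arithmetic (coefficients in ascending order; the helper name is ours). As a sanity check, in the exact-Schur case γ_E^{(0)} = 1 and γ_E^{(k)} + γ_R^{(k)} = 1, the roots of the U_k collapse onto ±1:

```python
import numpy as np
from numpy.polynomial import polynomial as P

def U_sequence(gE, gR):
    """Polynomials U_0..U_{N+1} of Definition 1 as coefficient arrays
    (ascending powers); gE[k] = gamma_E^(k), gR[k] = gamma_R^(k) (gR[0] unused)."""
    U = [np.array([1.0]), np.array([-gE[0], 1.0])]      # U_0 = 1, U_1 = lambda - gE0
    for k in range(1, len(gE)):
        lin = np.array([(-1) ** (k + 1) * gE[k], 1.0 + gR[k]])
        sq = np.array([1.0, 2.0 * (-1) ** k, 1.0])      # (lambda + (-1)^k)^2
        U.append(P.polysub(P.polymul(lin, U[k]), gR[k] * P.polymul(sq, U[k - 1])))
    return U
```

For instance, with gE = [1, 0.5, 0.5] and gR = [0, 0.5, 0.5] (N = 2) one finds U_2(λ) = λ² − 1 and U_3(λ) = (λ − 1)²(λ + 1), whose roots are exactly ±1.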
Then a sequence of matrix-valued functions is also defined by recurrence:
Definition 2.
\[
\begin{aligned}
Y_1(\lambda) &= \lambda I - E_0, \\
Y_{k+1}(\lambda) &= (-1)^{k+1} E_k + \lambda \big( I + R_k R_k^\top \big) - \big( \lambda + (-1)^k \big)^2 R_k Y_k(\lambda)^{-1} R_k^\top, \quad k \ge 1, \ \text{and } \lambda \ \text{s.t.}\ 0 \notin \sigma(Y_k(\lambda)) .
\end{aligned}
\]
Notation. We use I_k to denote the union of the intervals bounding the roots of U_k as γ_E, γ_R vary over their admissible ranges.
We recall a technical lemma, based on an idea in [24], whose proof can be found in [2].
Lemma 1.
Let Y be a symmetric matrix-valued function defined on F ⊂ R such that
\[
0 \notin \big[ \min \{ \sigma(Y(\xi)) \}, \max \{ \sigma(Y(\xi)) \} \big] \quad \text{for all } \xi \in F .
\]
Then, for arbitrary s ≠ 0, there exists a vector v ≠ 0 such that
\[
\frac{s^\top Y(\xi)^{-1} s}{s^\top s} = \frac{1}{\gamma_Z}, \qquad \text{with } \gamma_Z = \frac{v^\top Y(\xi) v}{v^\top v} .
\]
The next lemma, which will be used in the proof of the subsequent Theorem 1, links together the two sequences previously defined.
Lemma 2.
For every u ≠ 0, there is a choice of the parameters γ_E, γ_R for which
\[
\frac{u^\top Y_{k+1}(\lambda) u}{u^\top u} = \frac{U_{k+1}(\lambda)}{U_k(\lambda)} \qquad \text{for all } \lambda \notin \bigcup_{j=1}^{k} I_j .
\]
Proof. 
This is shown by induction. We first define
\[
\eta_k(\lambda) = (-1)^{k+1} \gamma_E^{(k)} + \lambda \big( 1 + \gamma_R^{(k)} \big), \qquad
\bar\mu_k(\lambda) = \big( \lambda + (-1)^k \big)^2 .
\]
For k = 0 we have u^⊤Y_1(λ)u / (u^⊤u) = λ − γ_E^{(0)} = U_1(λ)/U_0(λ) for all λ ∈ R. If k ≥ 1, the condition λ ∉ I_k, together with the inductive hypothesis u^⊤Y_k(λ)u / (u^⊤u) = U_k(λ)/U_{k−1}(λ), implies the invertibility of Y_k(λ). Moreover, this is equivalent to the condition 0 ∉ [min{σ(Y_k(λ))}, max{σ(Y_k(λ))}] that guarantees the applicability of Lemma 1. Therefore, we can write
\[
\frac{u^\top Y_{k+1}(\lambda) u}{u^\top u}
= \eta_k(\lambda) - \bar\mu_k(\lambda) \, \frac{u^\top R_k Y_k(\lambda)^{-1} R_k^\top u}{u^\top u}
\overset{w = R_k^\top u}{=} \eta_k(\lambda) - \bar\mu_k(\lambda) \, \frac{w^\top Y_k(\lambda)^{-1} w}{w^\top w} \, \gamma_R^{(k)} .
\]
We then apply Lemma 1 and the inductive hypothesis to write
\[
\frac{w^\top Y_k(\lambda)^{-1} w}{w^\top w} = \frac{U_{k-1}(\lambda)}{U_k(\lambda)} .
\]
Substituting into (10) and using relation (8) we obtain
\[
\eta_k(\lambda) - \bar\mu_k(\lambda) \, \gamma_R^{(k)} \frac{U_{k-1}(\lambda)}{U_k(\lambda)}
= \frac{\eta_k(\lambda) U_k(\lambda) - \bar\mu_k(\lambda) \gamma_R^{(k)} U_{k-1}(\lambda)}{U_k(\lambda)}
= \frac{U_{k+1}(\lambda)}{U_k(\lambda)} .
\]
   □
We now state the main result of this section:
Theorem 1.
The eigenvalues of P^{-1}A are located in I = ⋃_{k=1}^{N+1} I_k.
Proof. 
The proof is carried out by induction on k, showing that for every k ≤ N + 1 either
\[
(\mathrm{i})\ \lambda \in I_k \qquad \text{or} \qquad (\mathrm{ii})\ u_k = \big( 1 + (-1)^k \lambda \big) Y_k(\lambda)^{-1} R_k^\top u_{k+1},
\]
and that for k = N + 1 only condition (i) can hold. Let u = (u_1^⊤, …, u_{N+1}^⊤)^⊤ be an eigenvector of (4). Assume that λ ∉ I_{E_0} (case k = 0); then Y_1(λ) is invertible. From the first equation of (4) we obtain
\[
(\lambda I - E_0) u_1 = (1 - \lambda) R_1^\top u_2 \quad \Longleftrightarrow \quad Y_1(\lambda) u_1 = (1 - \lambda) R_1^\top u_2,
\]
whereupon inserting (11) into the second row of (4) yields
\[
\Big( E_1 + \lambda \big( I + R_1 R_1^\top \big) - (\lambda - 1)^2 R_1 (\lambda I - E_0)^{-1} R_1^\top \Big) u_2 \equiv Y_2(\lambda) u_2 = (1 + \lambda) R_2^\top u_3 .
\]
Pre-multiplying the left-hand side of (12) by u_2^⊤/(u_2^⊤u_2), we obtain
\[
\frac{u_2^\top Y_2(\lambda) u_2}{u_2^\top u_2} = \frac{U_2(\lambda)}{U_1(\lambda)} .
\]
If λ is a zero of U_2 then λ ∈ I_2. Otherwise Y_2(λ) is invertible and definite, and we can write u_2 = (1 + λ) Y_2(λ)^{-1} R_2^⊤ u_3. Assume now that the inductive hypothesis holds for k − 1. If λ ∉ I_{k−1}, then Y_{k−1}(λ) is definite and invertible. We can write
\[
Y_k(\lambda) u_k \equiv \Big( (-1)^k E_{k-1} + \lambda \big( I + R_{k-1} R_{k-1}^\top \big) - \big( \lambda + (-1)^{k-1} \big)^2 R_{k-1} Y_{k-1}(\lambda)^{-1} R_{k-1}^\top \Big) u_k = \big( 1 + (-1)^k \lambda \big) R_k^\top u_{k+1} .
\]
Since
\[
\frac{u_k^\top Y_k(\lambda) u_k}{u_k^\top u_k} = \frac{U_k(\lambda)}{U_{k-1}(\lambda)},
\]
then either λ is a zero of U_k(λ), and hence λ ∈ I_k, or Y_k(λ) is definite, in which case we can write u_k = (1 + (−1)^k λ) Y_k(λ)^{-1} R_k^⊤ u_{k+1}. The induction ends at k = N + 1. In this case we have
\[
Y_{N+1}(\lambda) u_{N+1} = 0,
\]
and the condition λ ∈ I_{N+1} must hold, since u_{N+1} = 0 would imply u_N = ⋯ = u_1 = 0, contradicting the definition of an eigenvector.    □

4. Bounds on the Roots of { U k }

In this section we first characterize the set I = ⋃_k I_k. Then we specify the dependence of the zeros of U_k(λ) on the parameters γ_E, γ_R.
Proposition 1.
Since the polynomials of the sequence (8) have positive leading coefficients,
\[
\lim_{\lambda \to +\infty} U_k(\lambda) = +\infty, \qquad \lim_{\lambda \to -\infty} U_k(\lambda) = (-1)^k \, \infty .
\]
Lemma 3.
Given the sequence of polynomials (8), for all k ≥ 1
\[
\operatorname{sgn}\big( U_{2k-1}(0) \big) = \operatorname{sgn}\big( U_{2k}(0) \big) = (-1)^k .
\]
In other words, the signs of U_k(0) follow the repeating pattern −, −, +, +, −, −, +, +, … starting from k = 1. Equivalently, sgn(U_k(0)) = (−1)^{⌈k/2⌉}.
Proof. 
We prove this by induction on k. The claim holds for k = 1:
\[
U_1(0) = -\gamma_E^{(0)} < 0, \qquad U_2(0) = -\gamma_E^{(1)} \gamma_E^{(0)} - \gamma_R^{(1)} < 0 .
\]
Assume that the claim holds for some k ≥ 1, that is, the sign of both U_{2k−1}(0) and U_{2k}(0) is s = (−1)^k. Then, evaluating the recurrence (8) at λ = 0,
\[
\begin{aligned}
\operatorname{sgn}\big( U_{2k+1}(0) \big) &= \operatorname{sgn}\big( (-1)^{2k+1} \gamma_E^{(2k)} U_{2k}(0) - \gamma_R^{(2k)} U_{2k-1}(0) \big) = -s = (-1)^{k+1}, \\
\operatorname{sgn}\big( U_{2k+2}(0) \big) &= \operatorname{sgn}\big( (-1)^{2k+2} \gamma_E^{(2k+1)} U_{2k+1}(0) - \gamma_R^{(2k+1)} U_{2k}(0) \big) = -s = (-1)^{k+1} .
\end{aligned}
\]
   □
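Since the parameters enter only through their (positive) values, and the two terms of the recurrence at λ = 0 share the same sign by the induction above, the sign pattern can be checked numerically for arbitrary positive indicators (a small sketch; the random parameter draws are our own):

```python
import math
import random

random.seed(1)
gE = [random.uniform(0.1, 2.0) for _ in range(9)]          # gamma_E^(0..8)
gR = [0.0] + [random.uniform(0.1, 2.0) for _ in range(8)]  # gamma_R^(1..8)

# values U_k(0): at lambda = 0 the recurrence reduces to
# U_{k+1}(0) = (-1)^{k+1} gE[k] U_k(0) - gR[k] U_{k-1}(0)
vals = [1.0, -gE[0]]
for k in range(1, 8):
    vals.append((-1) ** (k + 1) * gE[k] * vals[k] - gR[k] * vals[k - 1])

for k in range(1, 9):
    assert math.copysign(1.0, vals[k]) == (-1) ** math.ceil(k / 2)
```

Because both terms of each sum have the same sign, no cancellation occurs and the assertion is exact in floating point.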
We now show that, under Assumption 1, both −1 and 1 are strictly included in I: indeed, α_E^{(0)} < 1 < β_E^{(0)} implies that 1 is strictly included in I_1, and α_S^{(1)} < 1 < β_S^{(1)} implies that −1 is strictly included in I_2. In fact,
\[
U_2\big( {-1}; \gamma_E^{(0)}, \gamma_E^{(1)}, \gamma_R^{(1)} \big) = 1 + \gamma_E^{(0)} - 3 \gamma_R^{(1)} + \gamma_E^{(0)} \big( \gamma_R^{(1)} - \gamma_E^{(1)} \big) - \gamma_E^{(1)} .
\]
Using γ_E^{(0)} = 1, we have U_2(−1; 1, γ_E^{(1)}, γ_R^{(1)}) = 2(1 − γ_E^{(1)} − γ_R^{(1)}) = 2(1 − γ_S^{(1)}) ≡ U_2(−1; 1, γ_S^{(1)}). That −1 strictly belongs to I_2 then follows from U_2(−1; 1, α_S^{(1)}) > 0 and U_2(−1; 1, β_S^{(1)}) < 0.
We now state a result regarding the extremal roots of the polynomials {U_k}.
Theorem 2.
Assume that the polynomial U_k(λ) has s_k distinct roots, denoted ξ_1^{(k)} < ξ_2^{(k)} < ⋯ < ξ_{s_k}^{(k)}. If ξ_1^{(k)} < −1 and ξ_{s_k}^{(k)} > 1 for all k, then the extremal roots of the polynomials U_k satisfy
\[
\xi_1^{(k)} < \xi_1^{(k-1)} < \dots < \xi_1^{(2)}, \qquad \text{and} \qquad \xi_{s_1}^{(1)} < \dots < \xi_{s_{k-1}}^{(k-1)} < \xi_{s_k}^{(k)} .
\]
Proof. 
We prove the claim by induction, starting from l = 2. Consider first the positive roots. The basis of the induction is
\[
1 < \xi_1^{(1)} < \xi_{s_2}^{(2)},
\]
which is readily proved by taking ξ_1^{(1)} = γ_E^{(0)} > 1 and observing that U_2(γ_E^{(0)}) < 0. Assume now that the claim holds for l − 1, that is, ξ_{s_{l-1}}^{(l-1)} < ξ_{s_l}^{(l)}. This implies that U_{l−1}(ξ_{s_l}^{(l)}) > 0. Then, from the recursion (8),
\[
U_{l+1}\big( \xi_{s_l}^{(l)} \big) = -\gamma_R^{(l)} \big( \xi_{s_l}^{(l)} + (-1)^l \big)^2 U_{l-1}\big( \xi_{s_l}^{(l)} \big) < 0,
\]
which implies that ξ_{s_l}^{(l)} < ξ_{s_{l+1}}^{(l+1)}.
Consider now the negative roots. If γ_E^{(0)} < 1 and γ_S^{(1)} > 1, direct computation shows that U_2(−1) < 0, providing ξ_1^{(2)} < −1. Moreover, U_3(ξ_1^{(2)}) = −γ_R^{(2)} (ξ_1^{(2)} + 1)^2 (ξ_1^{(2)} − γ_E^{(0)}) > 0, which implies ξ_1^{(3)} < ξ_1^{(2)}. Assume now that the claim holds for l − 1, that is, ξ_1^{(l−1)} > ξ_1^{(l)}, which implies that sgn(U_{l−1}(ξ_1^{(l)})) = (−1)^{l−1}. Then, from the recursion (8),
\[
\operatorname{sgn}\big( U_{l+1}(\xi_1^{(l)}) \big) = -\operatorname{sgn}\big( U_{l-1}(\xi_1^{(l)}) \big) = -(-1)^{l-1} = (-1)^l,
\]
which finally shows that ξ_1^{(l+1)} < ξ_1^{(l)}, as desired.    □
The results of this section allow us to characterize the set I of Theorem 1 as
\[
I = \big[ \lambda_-^{LB}, \lambda_-^{UB} \big] \cup \big[ \lambda_+^{LB}, \lambda_+^{UB} \big],
\]
where
\[
\lambda_-^{LB} = \xi_1^{(N+1)}, \qquad
\lambda_-^{UB} = \max_j \big\{ \max \{ \xi_l^{(j)} : \xi_l^{(j)} < 0 \} \big\}, \qquad
\lambda_+^{LB} = \min_j \big\{ \min \{ \xi_l^{(j)} : \xi_l^{(j)} > 0 \} \big\}, \qquad
\lambda_+^{UB} = \xi_{s_{N+1}}^{(N+1)} .
\]

4.1. How Zeros of {U_k} Move Depending on γ_E^{(j)}, γ_R^{(j)}

The question is now: for which values of the parameters γ_E^{(i)}, γ_R^{(i)} are the extremal values of the roots attained? The following results will allow us to state that the extremal zeros of U_{k+1}(λ, γ_E, γ_R) are obtained at the extremal values of γ_E and γ_R, namely {α_E^{(i)}, β_E^{(i)}; α_R^{(i)}, β_R^{(i)}}.
Let ξ be an extremal zero of the polynomial U_{k+1} (smallest/largest negative, smallest/largest positive) for a particular combination of the parameters. Taking one parameter γ_* at a time, we can write γ_* as a convex combination of its extremal values, γ_* = α γ_*^{min} + (1 − α) γ_*^{max}, and write
\[
\begin{aligned}
U_{k+1}(\lambda, \gamma_*) &= s_1(\lambda) + \gamma_* s_2(\lambda)
= s_1(\lambda) + \big( \alpha \gamma_*^{\min} + (1 - \alpha) \gamma_*^{\max} \big) s_2(\lambda) \\
&= \alpha \big( s_1(\lambda) + \gamma_*^{\min} s_2(\lambda) \big) + (1 - \alpha) \big( s_1(\lambda) + \gamma_*^{\max} s_2(\lambda) \big)
= \alpha U_{k+1}\big( \lambda, \gamma_*^{\min} \big) + (1 - \alpha) U_{k+1}\big( \lambda, \gamma_*^{\max} \big) .
\end{aligned}
\]
Hence,
\[
0 = U_{k+1}(\xi, \gamma_*) = \alpha U_{k+1}\big( \xi, \gamma_*^{\min} \big) + (1 - \alpha) U_{k+1}\big( \xi, \gamma_*^{\max} \big),
\]
so that ρ_1 = U_{k+1}(ξ, γ_*^{min}) and ρ_2 = U_{k+1}(ξ, γ_*^{max}) either take opposite signs or are both zero. In the first case it is clear that one between ρ_1 and ρ_2 improves the sought root; in particular, we select the γ ∈ {γ_*^{min}, γ_*^{max}} corresponding to the ρ_* ∈ {ρ_1, ρ_2} such that:
1. (Smallest negative root): the sign of ρ_* is opposite to the sign of U_{k+1} at −∞.
2. (Largest negative root): the sign of ρ_* is opposite to the sign of U_{k+1} at 0.
3. (Smallest positive root): the sign of ρ_* is opposite to the sign of U_{k+1} at 0.
4. (Largest positive root): the sign of ρ_* is opposite to the sign of U_{k+1} at +∞.
If, finally, both ρ_1 and ρ_2 are zero, then by linearity the root is independent of this parameter, and we can choose either of its extremal values.
We will now prove (in Theorem 3, and in Section 4.2) that we can predict whether γ E ( i ) = α E ( i ) or γ E ( i ) = β E ( i ) in order to obtain the intervals bounding the roots of U k .
Lemma 4.
Given a fixed k, there exist polynomials W_l, 0 ≤ l ≤ k, such that for any j
\[
U_{k+1} = U_j W_{k-j+1} - U_{j-1} W_{k-j} \mu_j, \qquad \text{with } \mu_j = \gamma_R^{(j)} \big( \lambda + (-1)^j \big)^2 .
\]
Proof. 
The statement holds for j = k, by setting W_1 = η_k and W_0 = 1. Let the statement be true for some j ≥ 2; then
\[
\begin{aligned}
U_{k+1} &= U_j W_{k-j+1} - U_{j-1} W_{k-j} \mu_j \\
&= W_{k-j+1} \big( \eta_{j-1} U_{j-1} - \mu_{j-1} U_{j-2} \big) - U_{j-1} W_{k-j} \mu_j \\
&= \underbrace{\big( W_{k-j+1} \eta_{j-1} - W_{k-j} \mu_j \big)}_{W_{k-j+2}} U_{j-1} - W_{k-j+1} \mu_{j-1} U_{j-2} .
\end{aligned}
\]
It is also clear by induction that W_{k−j} depends only on the parameters with index greater than j.    □
We denote by e_j ∈ {α_E^{(j)}, β_E^{(j)}} one choice of the extremal values of γ_E^{(j)} and, analogously, r_j ∈ {α_R^{(j)}, β_R^{(j)}}; moreover, we denote by e_j^* and r_j^* the other extremal value of the respective parameter. We now consider the polynomial U_k with γ_E^{(j)} = e_j, γ_R^{(j)} = r_j, and define U^{E,j}_k as the polynomial U_k with γ_E^{(j)} = e_j^* instead. Similarly, we define U^{R,j}_k as the polynomial with γ_R^{(j)} = r_j^*. Then it is immediate from the recursion of U_k that
\[
U^{E,j}_{j+1} = U_{j+1} + (-1)^{j+1} \big( e_j^* - e_j \big) U_j,
\]
\[
U^{R,j}_{j+1} = U_{j+1} + \Big[ \lambda U_j - \big( \lambda + (-1)^j \big)^2 U_{j-1} \Big] \big( r_j^* - r_j \big) .
\]
Setting z_j(λ) = λ U_j − (λ + (−1)^j)^2 U_{j−1}, the second equality can be rewritten simply as
\[
U^{R,j}_{j+1} = U_{j+1} + z_j \big( r_j^* - r_j \big) .
\]
Lemma 5.
We have the following relations:
\[
U^{E,j}_{k+1} = U_{k+1} + (-1)^{j+1} W_{k-j} U_j \big( e_j^* - e_j \big),
\]
\[
U^{R,j}_{k+1} = U_{k+1} + W_{k-j} z_j \big( r_j^* - r_j \big) .
\]
Proof. 
We write
\[
U_{k+1} = U_{j+1} W_{k-j} - U_j W_{k-j-1} \mu_{j+1},
\]
and notice that the only term on the right-hand side that depends on γ_E^{(j)} is U_{j+1}. Thus, using (13),
\[
U^{E,j}_{k+1} - U_{k+1} = W_{k-j} \big( U^{E,j}_{j+1} - U_{j+1} \big) = (-1)^{j+1} W_{k-j} U_j \big( e_j^* - e_j \big) .
\]
This proves the first formula. The second one is proved in the same way, using (14).    □
Theorem 3.
Let ξ be the largest (resp. smallest) positive or negative root of U_{k+1}, and let e_j, r_j be the parameter values that realize this root. Then ξ attains its largest (resp. smallest) value when the γ_E^{(j)} are alternately at their maximum or minimum values.
Proof. 
It is enough to show that (e_j^* − e_j)(e_{j−1}^* − e_{j−1}) is negative. First we observe that if W_{k−j}(ξ) is zero, then ξ does not depend on γ_E^{(j)}. Thus we can assume W_{k−j}(ξ), W_{k−j+1}(ξ) ≠ 0. We also assume, without loss of generality, that U_{j+1}(ξ), U_j(ξ) ≠ 0. With these assumptions, evaluating Lemma 4 at the root ξ and recalling that μ_j > 0, we have
\[
\operatorname{sgn}\big( U_j W_{k-j+1} \big) = \operatorname{sgn}\big( U_{j-1} W_{k-j} \big) .
\]
Now we observe that sgn(U^{E,j}_{k+1}(ξ)) must be fixed as j varies, to guarantee that the root ξ is either maximal or minimal (this sign depends on the type of the root – smallest/largest, positive/negative – and on k, but cannot depend on j). We call this sign s. Then, using (15) and recalling that U_{k+1}(ξ) = 0,
\[
\operatorname{sgn}\Big( \big( e_j^* - e_j \big) \big( e_{j-1}^* - e_{j-1} \big) \Big)
= \operatorname{sgn}\Big( U^{E,j}_{k+1}(\xi) \, U^{E,j-1}_{k+1}(\xi) \Big) \, (-1)^{j+1} (-1)^j \operatorname{sgn}\big( W_{k-j} U_j \big) \operatorname{sgn}\big( W_{k-j+1} U_{j-1} \big)
= s^2 \, (-1)^{2j+1} = -1 .
\]
   □

4.2. Choice of γ_E^{(k)}

In the previous section we have shown that the γ_E^{(k)} alternately assume their maximum or minimum values; it is therefore enough to fix the value of γ_E^{(k)} to determine all the γ_E^{(j)}. We know from (13) that sign(e_k^* − e_k) is equal to (−1)^{k+1} sign(U_k(ξ) U^{E,k}_{k+1}(ξ)). We will consider four cases.
  • Let ξ = λ_+^{UB} = ξ_{s_{k+1}}^{(k+1)}. We know that ξ is larger than all the roots of U_k and U^{E,k}_{k+1}, and so U_k(ξ), U^{E,k}_{k+1}(ξ) > 0. Thus
\[
\operatorname{sign}\big( e_k^* - e_k \big) = (-1)^{k+1},
\]
    and so
\[
\operatorname{sign}\big( e_j^* - e_j \big) = \operatorname{sign}\big( e_k^* - e_k \big) (-1)^{k-j} = (-1)^{k+1} (-1)^{k-j} = (-1)^{j+1} .
\]
  • Let ξ = λ_−^{LB} = ξ_1^{(k+1)}. The root ξ is negative and smaller than all the roots of U_k and U^{E,k}_{k+1}, and so
\[
\operatorname{sign}\big( U_k(\xi) \big) = (-1)^k, \qquad \operatorname{sign}\big( U^{E,k}_{k+1}(\xi) \big) = (-1)^{k+1} .
\]
    Thus
\[
\operatorname{sign}\big( e_k^* - e_k \big) = (-1)^{k+1} (-1)^k (-1)^{k+1} = (-1)^k,
\]
    and we have
\[
\operatorname{sign}\big( e_j^* - e_j \big) = \operatorname{sign}\big( e_k^* - e_k \big) (-1)^{k-j} = (-1)^k (-1)^{k-j} = (-1)^j .
\]
  • Let now ξ = λ_+^{LB} and let m be the (smallest) index such that U_{m+1}(ξ) = 0. Then the signs of U_m(ξ) and U^{E,m}_{m+1}(ξ) must be the signs that these polynomials assume at 0. Thus
\[
\operatorname{sign}\big( e_m^* - e_m \big)
= (-1)^{m+1} \operatorname{sign}\big( U_m(0) \big) \operatorname{sign}\big( U^{E,m}_{m+1}(0) \big)
= (-1)^{m+1} (-1)^{\lceil (m+1)/2 \rceil} (-1)^{\lceil m/2 \rceil}
= (-1)^{m+1} (-1)^{m+1} = +1,
\]
    and
\[
\operatorname{sign}\big( e_j^* - e_j \big) = \operatorname{sign}\big( e_m^* - e_m \big) (-1)^{m-j} = (-1)^{m-j} .
\]
  • Let finally ξ = λ_−^{UB} and let m be the (smallest) index such that U_{m+1}(ξ) = 0. Reasoning as above shows that
\[
\operatorname{sign}\big( e_m^* - e_m \big) = +1 \qquad \text{and} \qquad \operatorname{sign}\big( e_j^* - e_j \big) = (-1)^{m-j} .
\]

4.3. Sign of r_j^* − r_j in Terms of the Sign of z_j(ξ)

Recall that, at a root ξ of U_{k+1},
\[
U^{E,j}_{k+1}(\xi) = (-1)^{j+1} W_{k-j}(\xi) U_j(\xi) \big( e_j^* - e_j \big), \qquad
U^{R,j}_{k+1}(\xi) = W_{k-j}(\xi) z_j(\xi) \big( r_j^* - r_j \big) .
\]
If W_{k−j}(ξ) = 0, then the root is independent of γ_R^{(j)}, so we can assume W_{k−j}(ξ) ≠ 0. We also assume, without loss of generality, that U_j(ξ) ≠ 0. Then we have
\[
\operatorname{sign}\big( r_j^* - r_j \big)
= \operatorname{sign}\big( z_j(\xi) \big) \operatorname{sign}\big( U^{R,j}_{k+1}(\xi) \big) \operatorname{sign}\big( W_{k-j} \big)
= (-1)^{j+1} \operatorname{sign}\big( U^{E,j}_{k+1}(\xi) U_j(\xi) \big) \operatorname{sign}\big( e_j^* - e_j \big) \operatorname{sign}\big( z_j(\xi) \big) \operatorname{sign}\big( U^{R,j}_{k+1}(\xi) \big)
= (-1)^{j+1} \operatorname{sign}\big( U_j(\xi) \big) \operatorname{sign}\big( e_j^* - e_j \big) \operatorname{sign}\big( z_j(\xi) \big),
\]
where the last step follows from the fact that U^{E,j}_{k+1}(ξ) and U^{R,j}_{k+1}(ξ) must have the same sign. Again, we distinguish four cases depending on the type of root we are considering.
  • Let ξ = λ_+^{UB} = ξ_{s_{k+1}}^{(k+1)}. Then U_j(ξ) > 0 and sign(e_j^* − e_j) = (−1)^{j+1}, so
\[
\operatorname{sign}\big( r_j^* - r_j \big) = (-1)^{j+1} (-1)^{j+1} \operatorname{sign}\big( z_j(\xi) \big) = \operatorname{sign}\big( z_j(\xi) \big) .
\]
  • Let ξ = λ_−^{LB} = ξ_1^{(k+1)}. Then
\[
\operatorname{sign}\big( U_j(\xi) \big) = (-1)^j \qquad \text{and} \qquad \operatorname{sign}\big( e_j^* - e_j \big) = (-1)^j,
\]
    so
\[
\operatorname{sign}\big( r_j^* - r_j \big) = (-1)^{j+1} (-1)^j (-1)^j \operatorname{sign}\big( z_j(\xi) \big) = (-1)^{j+1} \operatorname{sign}\big( z_j(\xi) \big) .
\]
  • Let now ξ = λ_+^{LB} and m ≤ k be the (smallest) index such that U_{m+1}(ξ) = 0. Then
\[
\operatorname{sign}\big( U_j(\xi) \big) = (-1)^{\lceil j/2 \rceil} \qquad \text{and} \qquad \operatorname{sign}\big( e_j^* - e_j \big) = (-1)^{m-j},
\]
    so
\[
\operatorname{sign}\big( r_j^* - r_j \big) = (-1)^{j+1} (-1)^{\lceil j/2 \rceil} (-1)^{m-j} \operatorname{sign}\big( z_j(\xi) \big) = (-1)^{m+1} (-1)^{\lceil j/2 \rceil} \operatorname{sign}\big( z_j(\xi) \big) .
\]
  • Let finally ξ = λ_−^{UB} and m ≤ k be the (smallest) index such that U_{m+1}(ξ) = 0. Reasoning as above shows that
\[
\operatorname{sign}\big( r_j^* - r_j \big) = (-1)^{m+1} (-1)^{\lceil j/2 \rceil} \operatorname{sign}\big( z_j(\xi) \big) .
\]
The sign of z j ( ξ ) cannot be computed in general.

4.4. Bounds for the Eigenvalues of the Preconditioned Matrix

A procedure to compute the intervals bounding the eigenvalues of P 1 A with N + 1 blocks can be described as follows:
  • Upper bound for the positive eigenvalues. In view of Theorem 2, this is given by the largest root of the polynomial U_{N+1}(λ). The sign of (17) is (−1)^{j+1}. When the sign of e_j^* − e_j is positive, the discarded endpoint is larger than the selected one, and therefore we must set γ_E^{(j)} = α_E^{(j)}. Hence, as sgn(e_0^* − e_0) = −1, we should start with γ_E^{(0)} = β_E^{(0)}, and then alternate between the two extremal values: β_E^{(0)}, α_E^{(1)}, β_E^{(2)}, ….
  • Lower bound for the negative eigenvalues. In view of Theorem 2, this is given by the smallest root of the polynomial U_{N+1}(λ). The sign of (18) is (−1)^j. This means that we should start with γ_E^{(0)} = α_E^{(0)} (since e_0^* − e_0 > 0) and then alternate between the two extremal values: α_E^{(0)}, β_E^{(1)}, α_E^{(2)}, ….
  • Upper bound for the negative eigenvalues. We evaluate the largest negative root ξ_−^{(m)} of every polynomial U_m, 2 ≤ m ≤ N + 1. The sign in (19) is (−1)^{m−j}, suggesting that we have to start with α_E^{(0)} if m is even, and with β_E^{(0)} if m is odd, and proceed as before. Finally we compute
\[
\lambda_-^{UB} = \max_{m \le N+1} \big\{ \xi_-^{(m)} \big\} .
\]
  • Lower bound for the positive eigenvalues. We evaluate the smallest positive root ξ_+^{(m)} of every polynomial U_m, 1 ≤ m ≤ N + 1. The sign in (20) is again (−1)^{m−j}, suggesting that we have to start with α_E^{(0)} if m is even, and with β_E^{(0)} if m is odd, and proceed as before. Finally we compute
\[
\lambda_+^{LB} = \min_{m \le N+1} \big\{ \xi_+^{(m)} \big\} .
\]
In each of the four cases, the choice of γ_R^{(j)} ∈ {α_R^{(j)}, β_R^{(j)}} must in general be performed by taking all the combinations of these values. Some insight on how to select the extremal values of the γ_R indicators, under additional assumptions, is given in Lemma 6, whose proof is deferred to Appendix A, and in Theorem 4.
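Since the γ_R^{(j)} endpoints must in general be swept exhaustively, a simple cross-check of the whole procedure is to brute-force all endpoint combinations of both γ_E and γ_R and collect the extremal real roots of U_2, …, U_{N+1}. The following sketch (helper names ours) re-uses the recurrence of Definition 1 and does exactly that for small N:

```python
import itertools
import numpy as np
from numpy.polynomial import polynomial as P

def U_sequence(gE, gR):
    """Polynomials U_0..U_{N+1} of Definition 1 (ascending coefficients)."""
    U = [np.array([1.0]), np.array([-gE[0], 1.0])]
    for k in range(1, len(gE)):
        lin = np.array([(-1) ** (k + 1) * gE[k], 1.0 + gR[k]])
        sq = np.array([1.0, 2.0 * (-1) ** k, 1.0])      # (lambda + (-1)^k)^2
        U.append(P.polysub(P.polymul(lin, U[k]), gR[k] * P.polymul(sq, U[k - 1])))
    return U

def brute_force_bounds(aE, bE, aR, bR):
    """Extremal negative/positive roots over all endpoint combinations.
    aE, bE: alpha_E/beta_E (length N+1); aR, bR: alpha_R/beta_R with
    aR[0] = bR[0] = 0 as placeholders."""
    neg, pos = [], []
    for eE in itertools.product(*zip(aE, bE)):
        for eR in itertools.product(*zip(aR, bR)):
            U = U_sequence(list(eE), list(eR))
            for m in range(2, len(U)):
                r = P.polyroots(U[m])
                r = r[np.abs(r.imag) < 1e-6].real       # keep real roots
                neg.extend(r[r < 0])
                pos.extend(r[r > 0])
    return min(neg), max(neg), min(pos), max(pos)
```

This reproduces λ_−^{LB}, λ_−^{UB}, λ_+^{LB}, λ_+^{UB} without exploiting the sign rules above, at a cost exponential in N; in the exact-Schur limit the four values collapse onto −1 and 1.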
Lemma 6.
Let ρ be a positive number and assume that, for every j = 0, …, k, α_E^{(j)} ≥ 2ρ and β_E^{(j)} ≤ 2 + ρ. Assume further that |ξ| > 1 + ρ. Then z_j(ξ_{s_{k+1}}^{(k+1)}) < 0, and sgn(z_j(ξ_1^{(k+1)})) = (−1)^j.
Under the assumptions of Lemma 6 we are now able to determine the monotonicity of the extremal roots of U_{k+1} depending on γ_R^{(j)}.
Theorem 4.
Under the assumptions of Lemma 6, the largest (respectively, the smallest) root of U_{k+1} attains its largest (respectively, smallest) value in combination with γ_R^{(j)} = β_R^{(j)}, j = 1, …, k.
Proof. 
Let ξ be the largest positive root. Then the previous lemma shows that sign(z_j(ξ)) = −1, and (21) implies
\[
\operatorname{sign}\big( r_j^* - r_j \big) = -1,
\]
and so r_j = β_R^{(j)}. Now let ξ be the smallest negative root. The previous lemma shows that sign(z_j(ξ)) = (−1)^j, and then (22) implies
\[
\operatorname{sign}\big( r_j^* - r_j \big) = (-1)^{j+1} (-1)^j = -1,
\]
and so again r_j = β_R^{(j)}.    □

5. Numerical Results. Randomly Generated Matrices

We now undertake numerical tests to validate the theoretical bounds of Theorem 1 and the subsequent characterizations. We determine the extremal eigenvalues of P^{-1}A on randomly generated linear systems. Specifically, we considered a simplified case with E_i ≡ 0 for i > 0, and ran a number of different test cases combining the values of the extremal eigenvalues (Table 1) of the symmetric positive definite matrices involved. In particular:
  • Case N = 2. We have three parameters with 6 different endpoints of the corresponding intervals. Overall we ran 3^6 = 729 test cases.
  • Case N = 3. We have 4 parameters with 4 different endpoints of the corresponding intervals. Overall we ran 4^4 = 256 test cases.
  • Case N = 4. We have 5 parameters with 4 different endpoints of the corresponding intervals. Overall we ran 4^5 = 1024 test cases.
From Figure 1, Figure 2 and Figure 3 we notice that three out of four bounds perfectly capture the eigenvalues of the preconditioned matrix, while the upper bounds for negative eigenvalues are not as tight for N = 2 , 4 and lower bounds for positive eigenvalues are less effective for N = 3 . All in all, our technique is able to predict the behavior of the MINRES iterative solver applied to the preconditioned saddle-point linear system.

5.1. Comparisons Against the Block Diagonal Preconditioner

The preconditioner we have studied so far, being SPD, allows the use of the MINRES iteration for the solution of multiple saddle-point linear systems. Another well-known preconditioner sharing the same property is the block diagonal preconditioner P_D, based on inexact Schur complements, defined in (1).
We compare the MINRES number of iterations in all the previously described test cases for N = 3 . In Figure 4 we plot the number of iterations with the PP preconditioner and the block diagonal preconditioner, together with the square root of κ , an indicator of the ill-conditioning of the overall saddle-point system, namely
\[
\kappa = \frac{\beta_E^{(0)}}{\alpha_E^{(0)}} \cdot \frac{\beta_R^{(1)} \beta_R^{(2)} \beta_R^{(3)}}{\alpha_R^{(1)} \alpha_R^{(2)} \alpha_R^{(3)}} .
\]
From the figure, we notice that the two preconditioners are both affected by the κ indicator, with the PP preconditioner being more effective than P_D for small to moderate values of κ. When the Schur complements are not optimally approximated, instead, the block diagonal preconditioner is to be preferred.

6. A Realistic Example: The 3-D Biot’s Consolidation Model

A numerical experiment, concerning the mixed form of Biot's poroelasticity equations, is used to investigate the quality of the bounds for the eigenvalues of the preconditioned matrix and the effectiveness of the proposed preconditioner on a realistic problem. For the PDEs and boundary conditions governing this model, we refer to the description in [19]. As a reference test case for the experiments, we considered the porous cantilever beam problem originally introduced in [25] and already used in [26]. The domain is the unit cube with no-flow boundary conditions along all sides, zero displacements along the left edge, and a uniform load applied on top. The material properties are described in ([19], Table 5.1). We considered the Mixed-Hybrid Finite Element discretization of the coupled Biot equations, which gives rise to a double saddle-point linear system. The test case refers to a unit cubic domain, uniformly discretized with mesh size h = 0.05 in all the spatial dimensions. The sizes of the block matrices and the overall number of nonzeros in the double saddle-point system are reported in Table 2.

6.1. Handling the Case n_2 > n_1

The theory previously developed is based on the assumption n_0 ≥ n_1 ≥ n_2, which is not verified in this problem, where n_2 = 25200 > n_1 = 8000. The main consequence of this is that R_2 R_2^⊤ is singular and α_R^{(2)} = 0. This drawback can be circumvented by attacking the eigenvalue problem (3) in a different way. Let us write (3) for a double saddle-point linear system:
$$\begin{aligned}
(E_0 - \lambda I)\,u_1 + (1-\lambda)\,R_1^{\top}u_2 &= 0,\\
(1-\lambda)\,R_1 u_1 - \bigl(E_1 + \lambda(I + R_1 R_1^{\top})\bigr)u_2 + (1+\lambda)\,R_2^{\top}u_3 &= 0,\\
(1+\lambda)\,R_2 u_2 + \bigl(E_2 - \lambda(I + R_2 R_2^{\top})\bigr)u_3 &= 0.
\end{aligned}\tag{23}$$
As in the proof of Theorem 1 we have, for all $\lambda \notin [\alpha_E^{(0)}, \beta_E^{(0)}]$,
$$\Bigl(E_1 + \lambda\bigl(I + R_1R_1^{\top}\bigr) - (\lambda-1)^2 R_1(\lambda I - E_0)^{-1}R_1^{\top}\Bigr)u_2 = (1+\lambda)\,R_2^{\top}u_3. \tag{24}$$
Assuming now that $\lambda \notin \Bigl[\dfrac{\alpha_E^{(2)}}{1+\beta_R^{(2)}},\, \dfrac{\beta_E^{(2)}}{1+\alpha_R^{(2)}}\Bigr] \equiv \bar I$, which ensures that $E_2 - \lambda\bigl(I + R_2R_2^{\top}\bigr)$ is nonsingular, we obtain $u_3$ from the third equation of (23):
$$u_3 = -(\lambda+1)\bigl(E_2 - \lambda(I + R_2R_2^{\top})\bigr)^{-1}R_2\,u_2.$$
Substituting this into (24) yields
$$\Bigl(E_1 + \lambda\bigl(I + R_1R_1^{\top}\bigr) - (\lambda-1)^2 R_1(\lambda I - E_0)^{-1}R_1^{\top} + (1+\lambda)^2 R_2^{\top}\bigl(E_2 - \lambda(I + R_2R_2^{\top})\bigr)^{-1}R_2\Bigr)u_2 = 0.$$
Premultiplying now by $u_2^{\top}$, dividing by $u_2^{\top}u_2$, and defining $w = R_1^{\top}u_2$ and $z = R_2 u_2$, yields
$$(\lambda-1)^2\,\frac{w^{\top}(E_0-\lambda I)^{-1}w}{w^{\top}w}\,\gamma_R^{(1)} + \lambda\bigl(1+\gamma_R^{(1)}\bigr) + \gamma_E^{(1)} + (1+\lambda)^2\,\frac{z^{\top}\bigl(E_2-\lambda(I+R_2R_2^{\top})\bigr)^{-1}z}{z^{\top}z}\,\bar\gamma_R^{(2)} = 0.$$
Before proceeding, we notice that $\bar\gamma_R^{(2)}$, differently from definition (5), is the Rayleigh quotient of $R_2^{\top}R_2$. Since $R_2$ has full rank, we can bound it as $0 < \alpha_R^{(2)} \le \bar\gamma_R^{(2)} \le \beta_R^{(2)}$; hence, from now on, we will refer to $\bar\gamma_R^{(2)}$ simply as $\gamma_R^{(2)}$. Applying Lemma 1 to both $E_0 - \lambda I$ and $E_2 - \lambda(I + R_2R_2^{\top})$, we rewrite the previous equation as
$$0 = -\frac{\gamma_R^{(1)}(\lambda-1)^2}{\lambda-\gamma_E^{(0)}} + \lambda\bigl(1+\gamma_R^{(1)}\bigr) + \gamma_E^{(1)} - \frac{(1+\lambda)^2\gamma_R^{(2)}}{\lambda\bigl(1+\gamma_R^{(2)}\bigr)-\gamma_E^{(2)}} = \frac{U_2(\lambda)}{U_1(\lambda)} - \frac{(1+\lambda)^2\gamma_R^{(2)}}{\lambda\bigl(1+\gamma_R^{(2)}\bigr)-\gamma_E^{(2)}} = \frac{U_3(\lambda)}{U_1(\lambda)\bigl(\lambda(1+\gamma_R^{(2)})-\gamma_E^{(2)}\bigr)},$$
which tells us that any eigenvalue of $P^{-1}A$ outside $[\alpha_E^{(0)}, \beta_E^{(0)}] \cup \bar I$ is a zero of $U_3(\lambda)$, meaning that any eigenvalue lies in $I_1 \cup \bar I \cup I_3$.
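Numerically, the zeros of $U_3$ are easy to obtain once the polynomials are built from the three-term recurrence, with $U_0 = 1$ and $U_1 = \lambda - \gamma_E^{(0)}$ as used above. The sketch below is our own Python illustration, not the authors' code; `U_sequence` and its argument layout are hypothetical names:

```python
import numpy as np
from numpy.polynomial import Polynomial as P

def U_sequence(gE, gR):
    """Builds U_0, ..., U_N from the three-term recurrence
       U_j = (lam*(1 + gR^{(j-1)}) + (-1)**j * gE^{(j-1)}) * U_{j-1}
             - gR^{(j-1)} * (lam + (-1)**(j-1))**2 * U_{j-2},
    where gE = [gE0, ..., gEN] and gR = [gR1, ..., gRN] are sample
    values of the indicators."""
    lam = P([0.0, 1.0])                      # the monomial lambda
    U = [P([1.0]), lam - gE[0]]              # U_0 = 1, U_1 = lambda - gE0
    for j in range(2, len(gE) + 1):
        U.append((lam * (1 + gR[j - 2]) + (-1) ** j * gE[j - 1]) * U[-1]
                 - gR[j - 2] * (lam + (-1) ** (j - 1)) ** 2 * U[-2])
    return U

# With all indicators equal to 1, U_3 factors as (lam - 1)(lam^2 + lam - 3):
U = U_sequence([1.0, 1.0, 1.0], [1.0, 1.0])
print(np.sort(U[3].roots().real))            # one negative and two positive zeros
```

Any eigenvalue of $P^{-1}A$ escaping the two explicit intervals must be one of these zeros, which is what makes the bounds computable in practice.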

6.2. Verification of the Bounds

To approximate the $(1,1)$ block we employed the classical incomplete Cholesky factorization (IC) with fill-in based on a drop tolerance $\delta$. The approximations $\widehat S_1$ and $\widehat S_2$ of the Schur complements are as in [19], and prove to be completely scalable with the problem size. Even if the IC preconditioner does not scale with the discretization parameter, it is useful for conducting a spectral analysis as a function of the drop tolerance.
In Table 3 we report the extremal values of the five indicators for this problem.
We computed the relevant eigenvalues of the preconditioned matrix $P^{-1}A$ using the Matlab function eigs with a function handle for the application of the preconditioned matrix (which we could not compute explicitly, due to its size and density), as well as the four endpoints of $I_1 \cup I_3$ predicted by the theory, and reported all of them in Table 4.
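The paper carries this out in Matlab; for completeness, here is how the same matrix-free computation could be organized in Python with SciPy (our sketch; `extremal_eigs` is a hypothetical helper and the toy matrices merely stand in for the Biot blocks):

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import LinearOperator, eigs, splu

def extremal_eigs(A, Pmat, k=4):
    """Largest-magnitude eigenvalues of P^{-1} A without forming it:
    the Krylov eigensolver only needs the action v -> P^{-1}(A v)."""
    lu = splu(sp.csc_matrix(Pmat))       # factor the SPD preconditioner once
    op = LinearOperator(A.shape, matvec=lambda v: lu.solve(A @ v))
    lam = eigs(op, k=k, which='LM', return_eigenvectors=False)
    return np.sort(lam.real)             # spectrum is real: Pmat SPD, A symmetric

# Toy stand-in: a random symmetric A and a random SPD preconditioner.
rng = np.random.default_rng(0)
n = 40
M = rng.standard_normal((n, n)); A = sp.csr_matrix((M + M.T) / 2)
Q = rng.standard_normal((n, n)); Pmat = Q @ Q.T + n * np.eye(n)
print(extremal_eigs(A, Pmat, k=3))
```

Passing a function handle (here a `LinearOperator`) is what avoids forming the dense product $P^{-1}A$, exactly as in the Matlab computation described above.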

6.3. Improving the Upper Bounds for the Negative Eigenvalues

From Table 4 we see that the bounds capture the extremal eigenvalues very well, except for the upper bound for the negative eigenvalues, which is still loose. However, this bound can be improved by taking into account the indicator $\gamma_S^{(k)}(w)$ defined in (7). The endpoints of this indicator are, for this test case,
$$I_{S_1} = [0.3187,\ 1.2408], \qquad I_{S_2} = [0.9998,\ 1.0002],$$
for every value of $\delta$. In order to use these indicators in the eigenvalue analysis, we perform a change of variable, using (6), in the polynomial recurrence, which now reads, for $k \ge 1$,
$$U_{k+1}(\lambda) = \Bigl(\lambda\bigl(1+\gamma_S^{(k)}-\gamma_E^{(k)}\bigr) + (-1)^{k+1}\gamma_E^{(k)}\Bigr)U_k(\lambda) + \bigl(\gamma_E^{(k)}-\gamma_S^{(k)}\bigr)\bigl(\lambda+(-1)^k\bigr)^2 U_{k-1}(\lambda),$$
for which all the results obtained in Section 3 still hold, as well as the observation at the beginning of Section 4.1, stating that the bounds for the roots of $U_{k+1}$ are attained at the endpoints of the parameters. In this case, however, we cannot select a priori which of the extremal values of $\gamma_E^{(k)}$ and $\gamma_S^{(k)}$ provides the bound. In view of the low degree of the polynomial, we can instead compute its roots for every combination of the endpoints of the indicators, and take the minimum/maximum values of the roots, depending on the bound sought. In doing so, we should also take into account the following remark.
Remark 1.
The definition of $\gamma_S^{(k)}(w)$ constrains this indicator to be larger than the corresponding $\gamma_E^{(k)}(w)$. Hence, in correspondence of $\gamma_E^{(k)} = \beta_E^{(k)}$, the lower bound for $\gamma_S^{(k)}$ is $\max\{\beta_E^{(k)}, \alpha_S^{(k)}\}$.
Taking this into account, we compute the 3 roots of
$$U_3\bigl(\lambda;\ \gamma_E^{(0)}, \gamma_E^{(1)}, \gamma_E^{(2)}, \gamma_S^{(1)}, \gamma_S^{(2)}\bigr)$$
for the $2^5 = 32$ combinations of the extremal values of the indicators, and denote by $\xi_-^{LB}, \xi_-^{UB}, \xi_+^{LB}, \xi_+^{UB}$ the extremal values of the roots. In view of the discussion in Section 6.1, we finally compute the eigenvalue bounds as
$$\lambda_-^{LB} = \xi_-^{LB}, \quad \lambda_-^{UB} = \xi_-^{UB}, \quad \lambda_+^{LB} = \min\left\{\alpha_E^{(0)},\ \frac{\alpha_E^{(2)}}{1+\beta_R^{(2)}},\ \xi_+^{LB}\right\}, \quad \lambda_+^{UB} = \max\left\{\beta_E^{(0)},\ \frac{\beta_E^{(2)}}{1+\alpha_R^{(2)}},\ \xi_+^{UB}\right\},$$
which we report in Table 5, together with the exact eigenvalues.
The adherence of the bounds to the extremal eigenvalues is now impressive.
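The $2^5$ scan described above is inexpensive. A minimal Python sketch of the procedure (ours, not the authors' code; Remark 1 is enforced by clamping each $\gamma_S^{(k)}$ from below by the corresponding $\gamma_E^{(k)}$):

```python
import numpy as np
from itertools import product
from numpy.polynomial import Polynomial as P

def root_extrema(IE0, IE1, IE2, IS1, IS2):
    """Roots of U_3 over the 2^5 endpoint combinations of
    (gE0, gE1, gE2, gS1, gS2); each I* is a pair (alpha, beta).
    Returns (xi-_LB, xi-_UB, xi+_LB, xi+_UB); gR(k) = gS(k) - gE(k)."""
    lam = P([0.0, 1.0])
    neg, pos = [], []
    for gE0, gE1, gE2, gS1, gS2 in product(IE0, IE1, IE2, IS1, IS2):
        gS1, gS2 = max(gS1, gE1), max(gS2, gE2)   # Remark 1: gS >= gE
        U1 = lam - gE0
        U2 = (lam * (1 + gS1 - gE1) + gE1) * U1 - (gS1 - gE1) * (lam - 1) ** 2
        U3 = (lam * (1 + gS2 - gE2) - gE2) * U2 - (gS2 - gE2) * (lam + 1) ** 2 * U1
        for r in U3.roots().real:                 # the theory gives real roots
            (neg if r < 0 else pos).append(r)
    return min(neg), max(neg), min(pos), max(pos)
```

The four returned values are then merged with the endpoints $\alpha_E^{(0)}$, $\beta_E^{(0)}$ and those of $\bar I$ through the min/max expressions displayed above.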

6.4. Comparisons with the Block Diagonal Preconditioner for the Biot Problem

We finally compare the block diagonal and the PP preconditioners in terms of number of iterations, for different values of the parameter $\delta$, and report the results in Table 6, which shows that the preconditioner analyzed in this work represents a valid alternative to the block diagonal preconditioner, especially when the Schur complements are all well approximated. Figure 5 compares the convergence profiles of MINRES with both preconditioners for $\delta = 10^{-5}, 10^{-6}$.
A final remark is in order: in this study we have not been concerned with the cost of applying the two preconditioners at each MINRES iteration, which is crucial for evaluating their overall performance. This comparison is outside the scope of this paper, which is mainly concerned with eigenvalue distribution and number of iterations. However, even taking into account that the application of the PP preconditioner is from 1.5 to (slightly less than) 2 times more expensive than that of the block diagonal preconditioner, our results show that the PP preconditioner can be a viable alternative for accelerating the MINRES solution of multiple saddle-point linear systems.

7. Conclusions

We have developed an eigenvalue analysis of preconditioned multiple saddle-point linear systems with the SPD preconditioner (PP for short) proposed by Pearson and Potschka in [1]. The analysis is based on the (approximate) knowledge of the smallest and largest eigenvalues of a number of SPD matrices related to the preconditioned blocks of the original system. The obtained bounds turn out to be very close to the exact extremal eigenvalues of the preconditioned matrix; this study therefore represents a valuable tool to predict with some accuracy the number of MINRES iterations needed to solve the multiple saddle-point linear system. In a number of both synthetic and realistic test cases, the PP preconditioner proves more efficient than the block diagonal preconditioner when the Schur complement matrices are well approximated.

Acknowledgments

LB acknowledges financial support under the PNRR research activity, Mission 4, Component 2, Investment 1.1, funded by the EU Next-GenerationEU – #2022AKNSE4_005 (PE1) CUP C53D2300242000. LB is member of the INdAM research group GNCS.

Appendix A. Proof of Lemma 6

Proof. 
We start from the definition of $z_j(\lambda)$ and apply the recursion:
$$z_j(\lambda) = \lambda U_j(\lambda) - \bigl(\lambda+(-1)^j\bigr)^2 U_{j-1}(\lambda) = \lambda\Bigl[\Bigl(\lambda\bigl(1+\gamma_R^{(j-1)}\bigr)+(-1)^j\gamma_E^{(j-1)}\Bigr)U_{j-1}(\lambda) - \gamma_R^{(j-1)}\bigl(\lambda+(-1)^{j-1}\bigr)^2U_{j-2}(\lambda)\Bigr] - \bigl(\lambda+(-1)^j\bigr)^2 U_{j-1}(\lambda).$$
We now apply the definition to $z_{j-1}(\lambda)$:
$$z_{j-1}(\lambda) = \lambda U_{j-1}(\lambda) - \bigl(\lambda+(-1)^{j-1}\bigr)^2 U_{j-2}(\lambda).$$
Combining these two expressions we get
$$z_j(\lambda) - \gamma_R^{(j-1)}\lambda\, z_{j-1}(\lambda) = U_{j-1}(\lambda)\Bigl[\lambda^2\bigl(1+\gamma_R^{(j-1)}\bigr) + (-1)^j\lambda\,\gamma_E^{(j-1)} - \bigl(\lambda+(-1)^j\bigr)^2 - \lambda^2\gamma_R^{(j-1)}\Bigr] = U_{j-1}(\lambda)\Bigl[(-1)^j\lambda\,\gamma_E^{(j-1)} + 2(-1)^{j+1}\lambda - 1\Bigr] = U_{j-1}(\lambda)\,(-1)^{j+1}\Bigl[\lambda\bigl(2-\gamma_E^{(j-1)}\bigr) + (-1)^j\Bigr] \equiv U_{j-1}(\lambda)\,(-1)^{j+1}t_j(\lambda).$$
We know that $z_1(\lambda) = t_1(\lambda)$, and we have just proved that
$$z_j(\lambda) = \lambda\,\gamma_R^{(j-1)}z_{j-1}(\lambda) + (-1)^{j+1}U_{j-1}(\lambda)\,t_j(\lambda).$$
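Before using the identity, it can be spot-checked numerically as a polynomial identity (our Python verification, with arbitrary sample values of the indicators; the argument is purely algebraic and does not depend on these numbers):

```python
import numpy as np
from numpy.polynomial import Polynomial as P

rng = np.random.default_rng(1)
gE = rng.uniform(0.5, 1.5, size=4)   # sample gamma_E^{(0)}, ..., gamma_E^{(3)}
gR = rng.uniform(0.1, 1.0, size=4)   # sample gamma_R^{(k)}; gR[0] is unused
lam = P([0.0, 1.0])
U = [P([1.0]), lam - gE[0]]          # U_0 = 1, U_1 = lambda - gamma_E^{(0)}
for j in range(2, 5):                # recurrence defining U_2, U_3, U_4
    U.append((lam * (1 + gR[j - 1]) + (-1) ** j * gE[j - 1]) * U[-1]
             - gR[j - 1] * (lam + (-1) ** (j - 1)) ** 2 * U[-2])

z = lambda j: lam * U[j] - (lam + (-1) ** j) ** 2 * U[j - 1]
t = lambda j: lam * (2 - gE[j - 1]) + (-1) ** j

assert np.allclose((z(1) - t(1)).coef, 0.0, atol=1e-12)   # base case z_1 = t_1
for j in (2, 3, 4):   # z_j = lam*gR^{(j-1)} z_{j-1} + (-1)^{j+1} U_{j-1} t_j
    diff = z(j) - (lam * gR[j - 1] * z(j - 1) + (-1) ** (j + 1) * U[j - 1] * t(j))
    assert np.allclose(diff.coef, 0.0, atol=1e-10)
print("identity verified for j = 2, 3, 4")
```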
Sign of $z_j(\xi)$ for the largest positive root. Now let $\xi \equiv \xi_{s_{k+1}}^{(k+1)}$ be the largest positive root. In such a case the previous analysis shows that
$$\gamma_E^{(j-1)} = \begin{cases}\alpha_E^{(j-1)} & \text{if } j \text{ is even},\\ \beta_E^{(j-1)} & \text{if } j \text{ is odd}.\end{cases}$$
Using now the hypotheses, we have that
$$\operatorname{sgn}\bigl(t_j(\xi)\bigr) = \operatorname{sgn}\bigl(2-\gamma_E^{(j-1)}\bigr) = (-1)^j.$$
Now we prove by induction that $\operatorname{sgn}\bigl(z_j(\xi)\bigr) = -1$ for all $j$. The base step is
$$\operatorname{sgn}\bigl(z_1(\xi)\bigr) = \operatorname{sgn}\bigl(t_1(\xi)\bigr) = (-1)^1 = -1.$$
We know that
$$z_j(\xi) = \xi\,\gamma_R^{(j-1)}z_{j-1}(\xi) + (-1)^{j+1}U_{j-1}(\xi)\,t_j(\xi).$$
The first term is negative by the induction hypothesis. As for the second term,
$$\operatorname{sgn}\Bigl((-1)^{j+1}U_{j-1}(\xi)\,t_j(\xi)\Bigr) = (-1)^{j+1}\,(+1)\,(-1)^j = -1.$$
Thus $z_j(\xi)$ is negative as well.
Sign of $z_j(\xi)$ for the smallest negative root. Let $\xi \equiv \xi_1^{(k+1)}$ be the smallest negative root. By the assumptions, $\xi > -1-\rho$. Moreover, we know that
$$\gamma_E^{(j-1)} = \begin{cases}\beta_E^{(j-1)} & \text{if } j \text{ is even},\\ \alpha_E^{(j-1)} & \text{if } j \text{ is odd}.\end{cases}$$
Then
$$\operatorname{sgn}\bigl(t_j(\xi)\bigr) = -\operatorname{sgn}\bigl(2-\gamma_E^{(j-1)}\bigr) = (-1)^j.$$
Now we prove by induction that $\operatorname{sgn}\bigl(z_j(\xi)\bigr) = (-1)^j$. The base step is
$$\operatorname{sgn}\bigl(z_1(\xi)\bigr) = \operatorname{sgn}\bigl(t_1(\xi)\bigr) = -1.$$
Now we assume that $z_{j-1}(\xi)$ has sign $(-1)^{j-1}$. We know that
$$z_j(\xi) = \xi\,\gamma_R^{(j-1)}z_{j-1}(\xi) + (-1)^{j+1}U_{j-1}(\xi)\,t_j(\xi).$$
The first term has sign $(-1)^j$ by the induction hypothesis (since $\xi$ is negative). As for the second term,
$$\operatorname{sgn}\Bigl((-1)^{j+1}U_{j-1}(\xi)\,t_j(\xi)\Bigr) = (-1)^{j+1}\,(-1)^{j-1}\,(-1)^j = (-1)^j.$$
Thus $z_j(\xi)$ has sign $(-1)^j$. □

References

  1. Pearson, J.W.; Potschka, A. On symmetric positive definite preconditioners for multiple saddle-point systems. IMA J. Numer. Anal. 2024, 44, 1731–1750.
  2. Bergamaschi, L.; Martinez, A.; Pearson, J.W.; Potschka, A. Spectral analysis of block preconditioners for double saddle-point linear systems with application to PDE-constrained optimization. Computational Optimization and Applications 2024, 91, 423–455.
  3. Rhebergen, S.; Wells, G.N.; Wathen, A.J.; Katz, R.F. Three-field block preconditioners for models of coupled magma/mantle dynamics. SIAM J. Sci. Comput. 2015, 37, A2270–A2294.
  4. Ramage, A.; Gartland, E.C. A Preconditioned Nullspace Method for Liquid Crystal Director Modeling. SIAM Journal on Scientific Computing 2013, 35, B226–B247.
  5. Greif, C.; He, Y. Block Preconditioners for the Marker-and-Cell Discretization of the Stokes-Darcy Equations. SIAM Journal on Matrix Analysis and Applications 2023, pp. 1540–1565.
  6. Chidyagwai, P.; Ladenheim, S.; Szyld, D.B. Constraint preconditioning for the coupled Stokes-Darcy system. SIAM J. Sci. Comput. 2016, 38, A668–A690.
  7. Beik, F.P.A.; Benzi, M. Preconditioning techniques for the coupled Stokes-Darcy problem: spectral and field-of-values analysis. Numer. Math. 2022, 150, 257–298.
  8. Greif, C. A BFBt preconditioner for double saddle-point systems. IMA Journal of Numerical Analysis 2026. To appear. [CrossRef]
  9. Bakrani Balani, F.; Hajarian, M.; Bergamaschi, L. Two block preconditioners for a class of double saddle point linear systems. Applied Numerical Mathematics 2023, 190, 155–167.
  10. Bakrani Balani, F.; Bergamaschi, L.; Martínez, A.; Hajarian, M. Some preconditioning techniques for a class of double saddle point problems. Numerical Linear Algebra with Applications 2024, 31, e2551.
  11. Beik, F.P.A.; Benzi, M. Iterative methods for double saddle point systems. SIAM J. Matrix Anal. Appl. 2018, 39, 902–921.
  12. Sogn, J.; Zulehner, W. Schur complement preconditioners for multiple saddle point problems of block tridiagonal form with application to optimization problems. IMA Journal of Numerical Analysis 2018, 39, 1328–1359.
  13. Bradley, S.; Greif, C. Eigenvalue bounds for double saddle-point systems. IMA J. Numer. Anal. 2023, 43, 3564–3592.
  14. Pearson, J.W.; Potschka, A. Double saddle-point preconditioning for Krylov methods in the inexact sequential homotopy method. Numerical Linear Algebra with Applications 2024, 31, e2553.
  15. Bergamaschi, L.; Martinez, A.; Pearson, J.W.; Potschka, A. Eigenvalue bounds for preconditioned symmetric multiple saddle-point matrices. Linear Algebra with Applications 2026. Published online January 16, 2026. [CrossRef]
  16. Bergamaschi, L.; Martínez, A.; Pilotto, M. Spectral Analysis of Block Diagonally Preconditioned Multiple Saddle-Point Linear Systems with Inexact Schur Complements 2026. In preparation.
  17. Beigl, A.; Sogn, J.; Zulehner, W. Robust preconditioners for multiple saddle point problems and applications to optimal control problems. SIAM Journal on Matrix Analysis and Applications 2020, 41, 1590–1615.
  18. Ferronato, M.; Franceschini, A.; Janna, C.; Castelletto, N.; Tchelepi, H. A general preconditioning framework for coupled multi-physics problems. J. Comput. Phys. 2019, 398, 108887. [CrossRef]
  19. Bergamaschi, L.; Ferronato, M.; Martínez, A. Triangular preconditioners for double saddle point linear systems arising in the mixed form of poroelasticity equations. SIAM Journal on Matrix Analysis and Applications 2026. Accepted October 14, 2025. [CrossRef]
  20. Paige, C.C.; Saunders, M.A. Solution of Sparse Indefinite Systems of Linear Equations. SIAM J. on Numer. Anal. 1975, 12, 617–629. [CrossRef]
  21. Saad, Y.; Schultz, M.H. GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J. Sci. Statist. Comput. 1986, 7, 856–869. [CrossRef]
  22. Fischer, B. Polynomial based iteration methods for symmetric linear systems; Wiley-Teubner Series Advances in Numerical Mathematics, John Wiley & Sons, Ltd., Chichester; B. G. Teubner, Stuttgart, 1996; p. 283. [CrossRef]
  23. Greenbaum, A. Iterative methods for solving linear systems; Vol. 17, Frontiers in Applied Mathematics, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, 1997; pp. xiv+220. [CrossRef]
  24. Bergamaschi, L. On Eigenvalue distribution of constraint-preconditioned symmetric saddle point matrices. Numer. Linear Algebra Appl. 2012, 19, 754–772. [CrossRef]
  25. Phillips, P.J.; Wheeler, M.F. Overcoming the problem of locking in linear elasticity and poroelasticity: A heuristic approach. Computational Geosciences 2009, 13, 5–12.
  26. Frigo, M.; Castelletto, N.; Ferronato, M.; White, J.A. Efficient solvers for hybridized three-field mixed finite element coupled poromechanics. Computers and Mathematics with Applications 2021, 91, 36–52. [CrossRef]
Figure 1. Double saddle-point linear system. Extremal eigenvalues of the preconditioned matrix (blue dots) and bounds (red line) after 10 runs with each combination of the parameters from Table 1.
Figure 2. Multiple saddle-point linear system with N = 3 . Extremal eigenvalues of the preconditioned matrix (blue dots) and bounds (red line) after 10 runs with each combination of the parameters from Table 1.
Figure 3. Multiple saddle-point linear system with N = 4 . Extremal eigenvalues of the preconditioned matrix (blue dots) and bounds (red line) after 10 runs with each combination of the parameters from Table 1.
Figure 4. Iteration numbers of P and P D for all the tests with N = 3 . The condition number indicator κ is also displayed.
Figure 5. 3D cantilever beam problem. Convergence profiles of MINRES preconditioned with $P_D$ and $P$. Drop tolerance of the incomplete Cholesky factorization: $\delta = 10^{-5}$ (left), $\delta = 10^{-6}$ (right).
Table 1. Extremal eigenvalues of the relevant symmetric positive definite matrices used in the verification of the bounds.

Case N = 2:   α_E^(0) = 0.1,  α_R^(1) = 0.3,  α_R^(2) = 0.9;   β_E^(0) = 1.2,  β_R^(1) = 1.8,  β_R^(2) = 3
Cases N > 2:  α_E^(0) = 0.1,  α_R^(1) = ⋯ = α_R^(N) = 0.8;     β_E^(0) = 1.2,  β_R^(1) = ⋯ = β_R^(N) = 2
Table 2. 3D cantilever beam problem on a unit cube: size and number of nonzeros of the test matrices.

1/h    n_0      n_1     n_2      n_0 + n_1 + n_2   nonzeros
20     27783    8000    25200    60983             2.4 × 10^6
Table 3. Extremal values of the indicators for different values of the threshold parameter δ.

           δ = 10^-4           δ = 10^-5           δ = 10^-6           δ = 10^-7
I_{E_0}    [0.0422, 1.2036]    [0.2563, 1.2232]    [0.9600, 1.0118]    [0.9976, 1.0001]
I_{E_1}    [5 × 10^-4, 0.0889]   (for all δ)
I_{E_2}    [0.9976, 1.0001]      (for all δ)
I_{R_1}    [2 × 10^-4, 0.7790]   (for all δ)
I_{R_2}    [3 × 10^-5, 0.0023]   (for all δ)
Table 4. 3D cantilever beam problem on a unit cube: eigenvalue bounds vs. exact eigenvalues.

                          μ_-^LB     μ_-^UB     μ_+^LB    μ_+^UB
δ = 10^-4   Bounds       -2.6786    -0.0012    0.0424    1.2216
            Exact eigs   -1.2408    -0.3192    0.0453    1.2055
δ = 10^-5   Bounds       -2.4104    -0.0012    0.2564    1.2447
            Exact eigs   -1.2408    -0.3192    0.2895    1.2252
δ = 10^-6   Bounds       -1.7001    -0.0012    0.9600    1.0119
            Exact eigs   -1.2408    -0.3192    0.9600    1.0118
δ = 10^-7   Bounds       -1.6701    -0.0012    0.9976    1.0070
            Exact eigs   -1.2408    -0.3190    0.9979    1.0013
Table 5. Refined eigenvalue bounds vs. exact eigenvalues.

                          λ_-^LB     λ_-^UB     λ_+^LB    λ_+^UB
δ = 10^-4   Bounds       -2.8268    -0.2629    0.0422    1.2274
            Exact eigs   -1.2408    -0.3192    0.0453    1.2055
δ = 10^-5   Bounds       -2.4204    -0.2583    0.1809    1.2517
            Exact eigs   -1.2408    -0.3192    0.2895    1.2252
δ = 10^-6   Bounds       -1.2913    -0.2951    0.9593    1.0119
            Exact eigs   -1.2408    -0.3192    0.9600    1.0118
δ = 10^-7   Bounds       -1.2434    -0.3174    0.9979    1.0029
            Exact eigs   -1.2408    -0.3190    0.9979    1.0013
Table 6. Number of MINRES iterations for the block diagonal and the PP preconditioner, for different values of the threshold parameter δ. With δ = 0 we indicate that the exact Cholesky factor of the (1,1) block is used for preconditioning.

δ             10^-4    10^-5    10^-6    10^-7    0
its (diag)    132      83       64       63       63
its (PP)      99       61       31       23       10
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.