On Schur Forms for Matrices with Simple Eigenvalues

Mihail Mihailov Konstantinov; Petko Hristov Petkov

doi:10.20944/preprints202410.0785.v1

Submitted: 10 October 2024

Posted: 10 October 2024


Abstract

In this paper we consider the standard Schur problem for a square matrix A, namely the similarity unitary transformation of A into upper Schur form containing the eigenvalues of A on its diagonal. Since the profound work of Issai Schur (1909), this is a fundamental issue in the theory and applications of matrices. Nevertheless, certain details concerning the Schur problem need further clarification especially in connection with the perturbation analysis of the Schur decomposition relative to perturbations in A. In particular, the concept of regular solution to the perturbed Schur form is introduced and illustrated by several examples. We also introduce the concepts of diagonally spectral matrices and of quasi-Schur condensed forms of a matrix A, and show that they may be much less sensitive to perturbations in A.

Keywords: Schur canonical form; Schur condensed form; diagonally spectral matrix; quasi-Schur form; perturbations of Schur form

Subject:

Computer Science and Mathematics - Applied Mathematics

MSC: 15A21; 65G30

1. Introduction and Notation

The Schur decomposition of a general square matrix and its generalizations are major tools in both the theory and the applications of matrix analysis [5]. In this paper we consider the main definitions and properties of the Schur decomposition of a square matrix that are important from the point of view of perturbation analysis. We also introduce new concepts in this field. A number of examples are given to illustrate the results presented. The subject is rather specific and requires a large amount of notation. For the convenience of the reader, the general notation is gathered below in this section, while some specific notation appears further in the text. Some of the matrix notation is inspired by the language of the program system MATLAB [10].

Let

Z = {0, \pm 1, \pm 2, \dots}

be the set of integers and

m, n \in Z

, where

m \leq n

. We denote by

Z [m, n] = {k \in Z : m \leq k \leq n}

(or by

m : n

) the set of

n - m + 1

integers

m, m + 1, \dots, n

. We write

Z [m, m] = m

and

Z [n, m] = \emptyset

when

n > m

. The set of real (resp. complex) numbers is denoted by

R

(resp.

C ≃ R \times R

) and

i = \sqrt{- 1}

is the imaginary unit. A complex number z is written as

z = x + i y

with

x, y \in R

, or

z = | z | exp (i φ)

, where

| z | = \sqrt{x^{2} + y^{2}}

is the absolute value and

φ \in (- π, π]

is the angle of z. The complex conjugate of z is denoted as

\bar{z} = x - i y = | z | exp (- i φ)

.

The sign function

sign : R \to Z [- 1, 1]

for scalar arguments is defined as

sign (x) = - 1

,

sign (x) = 0

and

sign (x) = 1

for

x < 0

,

x = 0

and

x > 0

, resp. The sign function for real n-tuples

x = (x_{1}, x_{2}, \dots, x_{n})

is defined by the expression

sign (x) = \sum_{k = 1}^{n} 2^{1 - k} sign (x_{k}) .

The lexicographical order ≺ for n-tuples is defined as

x ≺ \tilde{x}

,

x = \tilde{x}

and

x ≻ \tilde{x}

if

sign (x - \tilde{x}) < 0

,

sign (x - \tilde{x}) = 0

and

sign (x - \tilde{x}) > 0

, resp. In other words,

x ≺ \tilde{x}

when either

x_{1} < {\tilde{x}}_{1}

, or there exists

m \in Z [2, n]

such that

x_{k} = {\tilde{x}}_{k}

for

k \in Z [1, m - 1]

and

x_{m} < {\tilde{x}}_{m}

. We write

x ⪯ \tilde{x}

if either

x ≺ \tilde{x}

or

x = \tilde{x}

. We use this lexicographical order for complex numbers

z = x + i y

written as real pairs

(x, y)

and for pairs

(m, n)

of integers

m, n \in Z

. For example, for the fourth roots

{\pm 1, \pm i}

of 1, we have

- 1 ≺ - i ≺ i ≺ 1

.
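The sign function for n-tuples and the induced lexicographical order can be checked with a short sketch. The helper names `sign`, `tuple_sign`, `lex_less` and `as_pair` are illustrative and do not come from the paper; floating-point weights are used, so the sketch is meant for short tuples only.

```python
def sign(t):
    """Scalar sign function with values -1, 0 or 1."""
    return (t > 0) - (t < 0)

def tuple_sign(x):
    """sign(x) = sum_k 2^(1-k) sign(x_k), k = 1..n; the first nonzero entry dominates."""
    return sum(2.0 ** (-k) * sign(xk) for k, xk in enumerate(x))

def lex_less(x, y):
    """True when x strictly precedes y in the lexicographical order."""
    return tuple_sign([a - b for a, b in zip(x, y)]) < 0

def as_pair(z):
    """Write the complex number z = x + iy as the real pair (x, y)."""
    return (z.real, z.imag)

# The fourth roots of 1, sorted as real pairs (Python compares tuples lexicographically).
roots = [1, -1, 1j, -1j]
ordered = sorted((complex(z) for z in roots), key=as_pair)
print(ordered)
```

The sorted list reproduces the chain stated above for the fourth roots of 1: first -1, then -i, then i, then 1.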

We denote by

C (m, n)

(resp.

R (m, n)

) the space of

m \times n

complex (resp. real) matrices

A = [A (i, j)]

with elements

A (i, j)

and we set

C (n) = C (n, n)

,

R (n) = R (n, n)

. The column m-vector

b \in C (m, 1)

with elements

b_{i} = b (i)

is written as

b = [b_{1}; b_{2}; \dots; b_{m}]

, while the row n-vector

c \in C (1, n)

with elements

c_{j} = c (j)

is denoted as

c = [c_{1}, c_{2}, \dots, c_{n}]

. A quantity

z \in C ∖ R

is said to be genuinely complex. A vector or a matrix is genuinely complex if at least one of its elements is genuinely complex.

The identity

n \times n

matrix is denoted as

I_{n} \in R (n)

. The element

I_{n} (i, j)

of

I_{n}

is the Kronecker delta symbol

d (i, j) = 1 - {sign}^{2} (i - j)

. The zero

m \times n

matrix is denoted as

O_{m, n} \in R (m, n)

with

O_{n} = O_{n, n}

, or simply as O. We denote by

L_{n} \in R (n)

the strictly lower triangular matrix with ones below its main diagonal and zeros otherwise, i.e.

L_{n} (i, j) = 1

if

i > j

and

L_{n} (i, j) = 0

if

i \leq j

. The elementary matrix with element 1 in position

(p, q)

and zero otherwise is denoted as

E_{p, q} \in R (m, n)

, i.e.

E_{p, q} (i, j) = d (i, p) d (j, q)

.

The absolute value of the matrix

A \in C (m, n)

is the matrix

| A | \in R (m, n)

with elements

| A | (i, j) = | A (i, j) |

. The transpose of the matrix

A \in C (m, n)

is denoted

A^{⊤} \in C (n, m)

and has elements

A^{⊤} (i, j) = A (j, i)

. The complex conjugate transpose of

A \in C (m, n)

is denoted by

A^{H} \in C (n, m)

and has elements

A^{H} (i, j) = \bar{A (j, i)}

. The i-th row and the j-th column of A are denoted as

A (i, :) \in C (1, n)

and

A (:, j) \in C (m, 1)

, respectively. For

A, B \in C (m, n)

we denote by

A \circ B \in C (m, n)

the element-wise product of A and B, i.e.

(A \circ B) (i, j) = A (i, j) B (i, j)

. The spectral and the Frobenius norms of the matrix

A \in C (m, n)

are denoted as

∥ A ∥

and

{∥ A ∥}_{F}

, resp.

The spectrum

spect (A)

of the matrix

A \in C (n)

is the collection, or the multiset, of the eigenvalues

λ_{k} (A) \in C

of A,

k \in Z [1, n]

, counted according to their algebraic multiplicities. With certain abuse of notation we write

spect (A) \subset C

in the general case, and

spect (A) \subset R

in the case when all eigenvalues of A are real.

The multiplicative group of unitary matrices

U \in C (n)

such that

U^{H} U = I_{n}

is denoted by

U (n)

. The group of orthogonal matrices

U \in R (n)

such that

U^{⊤} U = I_{n}

is denoted as

O (n)

. For

A \in C (n)

we denote by

Low (A) = A \circ L_{n}

and

Diag (A) = A \circ I_{n}

the strictly lower triangular and the diagonal parts of A, respectively. If x is an n-vector with elements

x (i) \in C

then

diag (x) \in C (n)

is the matrix with elements

diag (x) (i, j) = x (i) d (i, j)

.

The set of upper triangular matrices

A = A - Low (A)

is denoted as

T (n) \subset C (n)

, while the set of diagonal matrices

A = Diag (A)

is denoted as

D (n) \subset T (n)

. For

n \geq 2

the group of diagonal matrices of the form

diag (1, exp (i φ_{2}), \dots, exp (i φ_{n})),

where

φ_{k} \in R

, is denoted as

D^{*} (n) \subset U (n)

.

If

K

is a finite set then

card (K)

is the number of its elements. The set of

(n - 1)

-tuples of pairs

{(i_{1}, j_{1}), (i_{2}, j_{2}), \dots, (i_{n - 1}, j_{n - 1})}

of integers

i_{k} \in Z [1, n - 1]

,

j_{k} \in Z [2, n]

, where

i_{k} < j_{k}

, is denoted as

K (n)

.

Finally we set

ν_{n} = n (n - 1) / 2

and

μ_{n} = card (K (n)) = \binom{ν_{n}}{n - 1} = \frac{ν_{n}!}{(n - 1)! (ν_{n} - n + 1)!} .

In particular

ν_{4} = 6

and

μ_{4} = 6! / (3! 3!) = 20

. Unspecified matrix blocks are denoted by a star. The end of definitions, examples and propositions is marked by □.
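As a quick check of these counts, the following sketch evaluates ν_n and μ_n for small n. It assumes that μ_n is the binomial coefficient of ν_n over n - 1, which is the expression used for μ_n in Section 4 and which agrees with the stated value μ_4 = 20. The function names `nu` and `mu` are illustrative.

```python
from math import comb

def nu(n):
    """Number of index pairs (i, j) with 1 <= i < j <= n."""
    return n * (n - 1) // 2

def mu(n):
    """Number of ways to choose n - 1 of the nu(n) pairs (assumed binomial form)."""
    return comb(nu(n), n - 1)

for n in range(2, 7):
    print(n, nu(n), mu(n))
```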

2. Condensed Schur Forms

Let an arbitrary matrix

A \in C (n)

,

n \geq 2

, be given. Then according to the famous Schur theorem [13] there exists a factorization

A = U T U^{H}

of the matrix A, where

U \in U (n)

and

T \in T (n)

.

Definition 1.

The pair

(U, T) \in U (n) \times T (n), T = U^{H} A U,

(1)

is said to be a Schur decomposition (SD), or an upper triangular unitary decomposition of the matrix A. The matrix T is referred to as a condensed Schur form (ConSF), or an upper triangular form of A. The columns of the unitary matrix U form a Schur basis for

C (n, 1)

relative to A. □

Thus defined the condensed Schur form T of A is not unique. Hence the condensed Schur forms are not canonical but rather quasi-canonical. If

A \in R (n)

and if the spectrum of A is real then the transformation matrix may be chosen as

U \in O (n)

and we have

T = U^{⊤} A U \in R (n)

. If

A \in R (n)

and the spectrum of A is genuinely complex then a real block Schur form with

1 \times 1

and

2 \times 2

diagonal blocks may also be constructed.

Next we define two sets of matrices depending on the matrix A which play an important role in our analysis. Denote

U (A) = {U \in U (n) : U^{H} A U \in T (n)} \subset U (n)

and

T (A) = {U^{H} A U : U \in U (A)} \subset T (n) .

Thus

U (A) \subset U (n)

is the set of unitary matrices transforming the matrix A into ConSF, and

T (A)

is the set of ConSF of A. For matrices

A \in R (n)

with real spectra we denote

O (A) = {U \in O (n) : U^{⊤} A U \in T (n)} \subset O (n) .

In general the set

U (A)

is not a group and not even a groupoid, i.e.

U_{1}, U_{2} \in U (A)

does not imply

U_{1} U_{2} \in U (A)

.

The most important (actually, the only important) property of the ConSF T of the matrix A is that its diagonal elements are the eigenvalues of A, i.e.

T (k, k) = λ_{k} (A)

,

k \in Z [1, n]

. Because of the only condition

Low (T) = 0

the matrix T is only a condensed form (rather than a canonical form) of A relative to the similarity action

U (n) \times C (n) \to C (n)

, defined by

(U, A) \mapsto U^{H} A U

, of the group

U (n)

on the set

C (n)

.

Definition 2.

The problem of finding the ConSF (1) is referred to as the Schur problem (SP) for the matrix

A \in C (n)

. The general solution of the SP is the set

Schur (A) = \{(U, U^{H} A U) : U \in U (n), U^{H} A U \in T (n)\} \subset U (A) \times T (A)

of all Schur decompositions of A. A pair

(U, T) \in Schur (A)

is a particular solution of the SP for the matrix A. □

Sometimes the matrices U and T in a ConSF of A are written as

U (A)

and

T (A)

to emphasize their dependence on A. This dependence, however, is not functional. Indeed, the transformation matrix

U (A)

is never unique, e.g.

(U, T) \in Schur (A)

implies

(- U, T) \in Schur (A)

. With the exception of the case

A = λ I_{n}

,

λ \in C

, when

T (A) = A

, the upper triangular unitary equivalent form

T (A)

of A is also not unique. For the latter choice of A we have

Schur (A) = U (n) \times {A}

.

All upper triangular unitary equivalent forms of a given matrix are unitarily similar. In particular the next proposition is a direct corollary of the definitions, see e.g. [14].

Proposition 1.

Let

Π_{1} = (U_{1}, T_{1})

and

Π_{2} = (U_{2}, T_{2})

be two solutions of the SP for A. Then

T_{2} = U_{2}^{H} U_{1} T_{1} U_{1}^{H} U_{2}

. □

Proof. It suffices to observe that

A = U_{1} T_{1} U_{1}^{H} = U_{2} T_{2} U_{2}^{H}

. □

Definition 3.

The solutions

Π_{1}

and

Π_{2}

are said to be diagonally equal if

Diag (T_{1}) = Diag (T_{2})

, and diagonally different if

Diag (T_{1}) \neq Diag (T_{2})

. □

The next proposition has been known since 1933 and is attributed to H. Röseler, see e.g. [14, Theorem 2.3]. It gives sufficient and “almost necessary” conditions for diagonal equality of the solutions of the Schur problem. The formulation and the proof given below differ slightly from the known ones.

Proposition 2.

The following assertions hold true.

1. If $V = U_{1}^{H} U_{2} \in D^{*} (n)$ then the solutions $Π_{1}$ and $Π_{2}$ are diagonally equal.
2. If the matrix A has pair-wise distinct eigenvalues and the solutions $Π_{1}$ and $Π_{2}$ are diagonally equal then $V \in D^{*} (n)$ . □

Proof. To prove 1 note that the condition

V \in D^{*} (n)

is equivalent to the existence of a matrix

D \in D^{*} (n)

such that

U_{2} = U_{1} D

. In this case

T_{1} (i, j) = D (i, i) \bar{D (j, j)} T_{2} (i, j), T_{1} (i, i) = T_{2} (i, i)

and

Diag (T_{1}) = Diag (T_{2})

.

To prove 2 we use the fact that

T_{1} V = V T_{2}

. Partition the matrices in this equality as

T_{1} = [\begin{matrix} λ & * \\ 0 & Λ \end{matrix}], T_{2} = [\begin{matrix} λ & * \\ 0 & * \end{matrix}], V = [\begin{matrix} μ & u \\ v & W \end{matrix}],

where

λ \in spect (A)

,

Λ \in C (n - 1)

,

μ \in C

,

W \in C (n - 1)

and * is a matrix block of corresponding size. We have

T_{1} V = [\begin{matrix} * & * \\ Λ v & Λ W \end{matrix}], V T_{2} = [\begin{matrix} λ μ & * \\ λ v & * \end{matrix}]

and comparing the (2,1)-blocks of these matrices we get

Λ v = λ v

. Since

λ \notin spect (Λ)

we obtain

v = 0

. Hence

| μ | = 1

,

u = 0

and

V = diag (μ, W)

. Now the proof is completed by induction. □
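Assertion 1 can be illustrated numerically. The sketch below takes an arbitrary upper triangular matrix as T_1 (the test values and the helper names are hypothetical), forms T_2 = D^H T_1 D for a diagonal unitary D with D(1,1) = 1, which corresponds to U_2 = U_1 D, and checks that the diagonals coincide while the remaining elements change only in phase.

```python
import cmath

def conj_transpose(M):
    """Complex conjugate transpose of a square matrix stored as a list of rows."""
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(A, B):
    """Product of two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

# Arbitrary upper triangular test matrix playing the role of T_1.
T1 = [[1 + 2j, 0.5 - 1j, 2j],
      [0, -1 + 0j, 3 + 1j],
      [0, 0, 4 - 1j]]

# D = diag(1, exp(i*0.7), exp(-i*1.3)) belongs to D*(3).
phases = [0.0, 0.7, -1.3]
D = [[cmath.exp(1j * phases[i]) if i == j else 0j for j in range(3)]
     for i in range(3)]

T2 = matmul(conj_transpose(D), matmul(T1, D))

same_diagonal = all(abs(T1[k][k] - T2[k][k]) < 1e-12 for k in range(3))
same_moduli = all(abs(abs(T1[i][j]) - abs(T2[i][j])) < 1e-12
                  for i in range(3) for j in range(3))
print(same_diagonal, same_moduli)
```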

The MATLAB® command [U,T] = schur(A) computes a particular solution

(U, T)

of the SP for the matrix

A \in C (n)

. The aim of the computation of the ConSF T of a general matrix

A \in C (n)

is to determine the eigenvalues of A as the diagonal elements of T. But the Schur problem may be defined also for matrices A which are already in ConSF, i.e.

A \in T (n)

. For such matrices the above MATLAB® command computes the solution

(I_{n}, A)

of the Schur problem for A. At the same time the Schur problem for

A \in T (n)

has infinitely many solutions. In order to tie them down we introduce the following definition.

Definition 4.

If

A \in T (n)

then the pair

(I_{n}, A)

is called the principal solution of the Schur problem for A. □

Without additional assumptions the matrix

T \in T (n)

is only a condensed form rather than a canonical form of A relative to the similarity action of

U (n)

. The only (albeit most important) invariants for this action that are revealed by the matrix T are the eigenvalues

λ_{k} (A) = T (k, k)

,

k \in Z [1, n]

, of the matrix A which appear on the diagonal of T.

The definition of complete invariants and canonical forms for the similarity action of

U (n)

on

C (n)

, see [14], is more subtle and is not considered in full detail here. In what follows we consider, among other things, only a partial formulation of Schur canonical forms for generic matrices A, see also [1,9] and [3]. Note that from the point of view of applications the condensed forms provide the same advantages as the canonical forms. Moreover, strict canonical forms of the matrix A are rarely (if ever) used in practice, since they are usually defined by complicated conditions and procedures and are more sensitive to perturbations in A.

Let

U \in U (A)

and

D \in D^{*} (n)

. Then

U D \in U (A)

as well. Thus we have

c U \in U (A)

, where

c \in C

,

| c | = 1

, and

- U \in U (A)

in particular. This fact has an important implication. The diameter of the set

U (A)

, i.e. the maximum of

∥ U - V ∥

for

U, V \in U (A)

, is equal to 2 and is achieved for

U \in U (A)

and

V = - U \in U (A)

.

Given the matrix

A \in C (n)

, neither the ConSF

T \in T (n)

of A nor the transformation matrix

U \in U (n)

are unique in general. In fact, the ConSF T of

A \in C (n)

is unique if and only if

A = λ I_{n}

, where

λ \in C

. In this case

T = A

and

U \in U (n)

is an arbitrary unitary matrix, or, equivalently,

T (A) = {A}

and

U (A) = U (n)

.

If A has at least two different eigenvalues then we have a set

T (A)

of ConSF T with different ordering of the eigenvalues of A on the diagonal of T. The ConSF also differ in their strictly upper triangular parts.

Suppose that

spect (A)

consists of

m \leq n

pair-wise distinct elements

λ_{1}, λ_{2}, \dots, λ_{m}

with multiplicities

n_{1}, n_{2}, \dots, n_{m}

, where

n_{1} + n_{2} + \dots + n_{m} = n

. Then there are

N = N (n_{1}, n_{2}, \dots, n_{m}) = \frac{n!}{n_{1}! n_{2}! \dots n_{m}!}

different orderings of the elements

T (k, k)

on the diagonal

Diag (T)

of the ConSF T, or N diagonally different solutions of the SP for A.

Here one of the ConSF of A is the block matrix

T = [T_{k, l}]

with

T_{k, l} \in C (n_{k}, n_{l})

and

T_{k, k} \in T (n_{k})

, where

Diag (T_{k, k}) = λ_{k} I_{n_{k}}

. In the generic case

m = n

we have

N (1, 1, \dots, 1) = n!

diagonally different ConSF, while in the “most non-generic” case

m = 1

we have

N (n) = 1

and all ConSF are diagonally equal.
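The number N of diagonally different solutions is a multinomial coefficient and is easy to evaluate. A minimal sketch, with the illustrative function name `orderings`:

```python
from math import factorial

def orderings(multiplicities):
    """N(n_1, ..., n_m) = n! / (n_1! n_2! ... n_m!), where n = n_1 + ... + n_m."""
    count = factorial(sum(multiplicities))
    for nk in multiplicities:
        count //= factorial(nk)   # every intermediate quotient is an integer
    return count

print(orderings([1, 1, 1, 1]))  # generic case m = n = 4
print(orderings([2, 1, 1]))     # one double eigenvalue, n = 4
print(orderings([4]))           # single eigenvalue of multiplicity 4
```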

3. Canonical Schur Forms for Generic Matrices

In this section we summarize and reformulate some of the results concerning Schur canonical forms for the unitary similarity action of

U (n)

on the set

C (n)

. The canonical Schur form

T \in T (n)

of the matrix

A \in C (n)

is a ConSF with additional conditions imposed on its elements, see [14] and the references therein. We consider only generic matrices A with pair-wise distinct eigenvalues, for which the solution

(U, T)

of the Schur problem is continuous as a function of the matrix A. At the same time the Schur basis U for condensed forms (and hence for canonical forms as well) of a matrix A with multiple eigenvalues may be discontinuous as a function of A.

Definition 5.

For

A \in C (n)

the set

Or [A, U (n)] = {U^{H} A U : U \in U (n)} \subset C (n)

is called the equivalence class, or orbit, of the matrix A relative to the similarity action of the unitary group

U (n)

. □

Obviously

B \in Or [A, U (n)]

implies

A \in Or [B, U (n)]

and vice versa. Let

A \subset C (n)

and

C \subset T (n)

be certain sets.

Definition 6.

The matrices

A, B \in C (n)

are said to be unitary equivalent (denoted as

A \sim B

) if

B \in Or [A, U (n)]

.□

Definition 7.

The function

γ : A \to C

is said to be a canonical form for the similarity action of the group

U (n)

on the set

A

when the equality

γ (A) = γ (B)

holds if and only if

A \sim B

. □

Thus the canonical form

γ : A \to C

is a complete invariant [6] for the similarity action of the group

U (n)

on the set

A

but the converse, of course, is not true. The canonical form

γ

thus defined is a function. Informally, we also say that the image

γ (A) \in T (n)

of the matrix A under

γ

is a unitary canonical form, or Schur form, of A.

Definition 8.

The subset

A

of

C (n)

is said to be closed in the Zariski topology [6] if it is the set of common zeros of a system of polynomials in the elements of

z \in C (n)

. The subset

A \subset C (n)

is said to be open in the Zariski topology if its complement

C (n) ∖ A \subset C (n)

is closed in this topology. □

Definition 9.

A property

P

of a matrix

A \in C (n)

is said to be generic if it is fulfilled on a subset

A \subset C (n)

which is open in the Zariski topology. □

Informally, the matrix A is said to be generic relative to a given property if this property is generic.

Proposition 3.

The following properties of a matrix

A \in C (n)

are generic.

The matrix A is totally different from any fixed matrix $A_{0} \in C (n)$ , i.e. $A (k, l) \neq A_{0} (k, l)$ for $k, l \in Z [1, n]$ ; in particular $A (k, l) \neq 0$ for any given pair $(k, l)$ .
The matrix A is not normal, i.e. $A^{H} A \neq A A^{H}$ ; in particular the matrix A is not unitary.
The singular values of the matrix A are positive and pair-wise different; in particular $rank (A) = n$ .
The eigenvalues $λ_{k}$ of the matrix A satisfy the inequalities $Re (λ_{k}) \neq Re (λ_{l})$ and $Im (λ_{k}) \neq Im (λ_{l})$ for $k \neq l$ ; in particular $λ_{k} \neq λ_{l}$ for $k \neq l$ and the Jordan canonical form of A is diagonal.
Any ConSF T of the matrix A has nonzero and pair-wise different elements on and above its diagonal, i.e. $T (k, l) \neq 0$ and $T (i, j) \neq T (k, l)$ for $i \leq j$ , $k \leq l$ and $(i, j) \neq (k, l)$ . □

4. Geometry of Schur Canonical Sets

Let

ω = (ω_{1}, ω_{2}, \dots, ω_{n}) : Z [1, n] \to Z [1, n]

be a permutation of the integers

1, 2, \dots, n

and recall that

Z [1, n - 1] = {1, 2, \dots, n - 1}

and

Z [2, n] = {2, 3, \dots, n}

. Set

Z_{n} = Z [1, n - 1] \times Z [2, n]

.

Below we describe a possible set of canonical forms for the similarity action of the group

U (n)

on the subset of

C (n)

consisting of matrices with pair-wise distinct eigenvalues. Let

K (n) \subset Z_{n}^{n - 1}

be the set of

(n - 1)

-tuples

{(i_{1}, j_{1}), (i_{2}, j_{2}), \dots, (i_{n - 1}, j_{n - 1})}

of integer pairs

p_{k} = (i_{k}, j_{k})

,

k \in Z [1, n - 1]

, where

i_{k} < j_{k}

. There are

μ_{n}

such

(n - 1)

-tuples, see Table 1. Later on we shall define three important types of such sets.

Definition 10.

The conjugate pair of the pair

p = (i, j) \in Z_{n}

is

p^{τ} = {(i, j)}^{τ} = (n + 1 - j, n + 1 - i) \in Z_{n}

The pair p is self-conjugate if

p^{τ} = p

.□

Obviously, the pair

(i, j)

is self-conjugate if and only if

i + j = n + 1

.

Definition 11.

The conjugate

(n - 1)

-tuple of the

(n - 1)

-tuple

θ = (p_{1}, p_{2}, \dots, p_{n - 1})

, where

p_{k} = (i_{k}, j_{k})

,

k \in Z [1, n - 1]

, is

θ^{τ} = (p_{1}^{τ}, p_{2}^{τ}, \dots, p_{n - 1}^{τ}) .

The

(n - 1)

-tuple

θ

is self-conjugate if

θ^{τ} = θ

. □

The conjugation for pairs p and

(n - 1)

-tuples

θ

is an involution, i.e.

{({(i, j)}^{τ})}^{τ} = (i, j)

and

{(θ^{τ})}^{τ} = θ

. It corresponds to reflection relative to the anti-diagonal

(1, n), (2, n - 1), \dots, (n, 1)

of

n \times n

arrays.
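The conjugation of Definitions 10 and 11 can be sketched in a few lines. Here an (n - 1)-tuple is stored as a set of pairs, which is an assumption about the intended reading of the brace notation, and the names `conj_pair` and `conj_tuple` are illustrative.

```python
def conj_pair(p, n):
    """Conjugate pair (i, j)^tau = (n + 1 - j, n + 1 - i)."""
    i, j = p
    return (n + 1 - j, n + 1 - i)

def conj_tuple(theta, n):
    """Conjugate every pair of theta; theta is treated as a set of pairs."""
    return frozenset(conj_pair(p, n) for p in theta)

n = 4
theta = frozenset({(1, 2), (1, 3), (1, 4)})
print(sorted(conj_tuple(theta, n)))            # pairs reflected in the anti-diagonal
print(conj_tuple(conj_tuple(theta, n), n) == theta)
print(conj_pair((2, 3), n) == (2, 3))          # self-conjugate since 2 + 3 = n + 1
```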

Definition 12.

The set

K (n)

has the following important subsets.

The set $K_{1} (n) \subset K (n)$ is of type 1 if its elements are of the form

${(i_{1}, 2), (i_{2}, 3), \dots, (i_{n - 1}, n)}$
The set $K_{2} (n) \subset K (n)$ is of type 2 if its elements are of the form

${(1, j_{1}), (2, j_{2}), \dots, (n - 1, j_{n - 1})}$
The set $K_{3} (n) = K (n) ∖ (K_{1} (n) \cup K_{2} (n)) \subset K (n)$ is of type 3 if it is neither of type 1 nor of type 2. □

Note that the elements of the set

K_{1} (n)

are conjugate to the elements of the set

K_{2} (n)

.

Proposition 4.

The intersection

K_{1} (n) \cap K_{2} (n)

has a single element

θ^{@} = {(1, 2), (2, 3), \dots, (n - 1, n)}

which is a self-conjugate

(n - 1)

-tuple.□

Definition 13.

The elements of the set

K_{1} (n) \cup K_{2} (n)

are said to be proper. The elements of the set

K_{3} (n)

are said to be improper. □

There are

(n - 1)!

elements in each of the sets

K_{1} (n)

and

K_{2} (n)

and one joint element of

K_{1} (n)

and

K_{2} (n)

. Thus we have

\begin{matrix} card (K_{1} (n) \cup K_{2} (n)) = (n - 1)! + (n - 1)! - 1 = 2 (n - 1)! - 1 \\ card (K_{3} (n)) = μ_{n} - 2 (n - 1)! + 1 \end{matrix}

Example 1.

For

n = 2

there is

μ_{2} = 1

pair of indexes

(1, 2)

and it is proper. For

n = 3

there are

μ_{3} = 3

sets of pairs of indexes

{(1, 2), (1, 3)}, {(1, 2), (2, 3)}, {(1, 3), (2, 3)}

and all of them are proper. For

n = 4

there are

μ_{4} = 20

triples of pairs of indexes, of which 11 are proper, namely

\begin{matrix} {(1, 2), (2, 3), (3, 4)}, {(1, 2), (1, 3), (1, 4)}, {(1, 4), (2, 4), (3, 4)} \\ {(1, 2), (1, 3), (2, 4)}, {(1, 3), (2, 4), (3, 4)}, {(1, 2), (1, 3), (3, 4)} \\ {(1, 3), (2, 3), (3, 4)}, {(1, 2), (2, 3), (2, 4)}, {(1, 4), (2, 3), (3, 4)} \\ {(1, 2), (2, 4), (3, 4)}, {(1, 2), (2, 3), (1, 4)} \end{matrix}

and 9 are improper, namely

\begin{matrix} {(1, 3), (2, 3), (2, 4)}, {(1, 3), (1, 4), (2, 4)}, {(1, 2), (1, 4), (3, 4)} \\ {(1, 3), (1, 4), (2, 3)}, {(2, 3), (2, 4), (3, 4)}, {(1, 4), (2, 3), (2, 4)} \\ {(1, 2), (1, 3), (2, 3)}, {(1, 3), (1, 4), (3, 4)}, {(1, 2), (1, 4), (2, 4)} □ \end{matrix}
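The counts in this example can be reproduced by brute-force enumeration. The sketch below rests on one reading of Definition 12, stated here as an assumption: an (n - 1)-tuple is proper when it contains exactly one pair in every row 1, …, n - 1 or exactly one pair in every column 2, …, n. All function names are illustrative.

```python
from itertools import combinations

def pairs(n):
    """All index pairs (i, j) with 1 <= i < j <= n in lexicographical order."""
    return [(i, j) for i in range(1, n) for j in range(i + 1, n + 1)]

def one_per_row(theta, n):
    """True when theta has exactly one pair in each row 1..n-1."""
    return sorted(i for i, _ in theta) == list(range(1, n))

def one_per_column(theta, n):
    """True when theta has exactly one pair in each column 2..n."""
    return sorted(j for _, j in theta) == list(range(2, n + 1))

def classify(n):
    """Split all (n-1)-element sets of pairs into proper and improper ones."""
    proper, improper = [], []
    for theta in combinations(pairs(n), n - 1):
        if one_per_row(theta, n) or one_per_column(theta, n):
            proper.append(theta)
        else:
            improper.append(theta)
    return proper, improper

proper, improper = classify(4)
print(len(proper), len(improper))
for theta in improper:
    print(theta)
```

Under this reading the enumeration returns 11 proper and 9 improper triples for n = 4, in agreement with the formula 2 (n - 1)! - 1 for the number of proper tuples.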

Proposition 5.

The minimal and maximal elements relative to the order relation ≺ on the set

K_{1} (n)

are

θ_{1} = {(1, 2), (1, 3), \dots, (1, n)}

and

θ_{(n - 1)!} = θ^{@} = {(1, 2), (2, 3), \dots, (n - 1, n)},

resp. The minimal and maximal elements of the set

K_{2} (n)

are

θ^{@}

and

θ_{2 (n - 1)! - 1} = {(1, n), (2, n), \dots, (n - 1, n)}

, resp. □

Now we are in position to define a possible set of Schur canonical forms

S \in T (n)

for generic matrices

A \in C (n)

. There are

n! (2 (n - 1)! - 1)

such sets. The multiplier

n!

comes from the different orderings of the (simple) eigenvalues of A on the diagonal of S. The multiplier

2 (n - 1)! - 1

corresponds to different choices of proper

(n - 1)

-tuples

{(i_{1}, j_{1}), (i_{2}, j_{2}), \dots, (i_{n - 1}, j_{n - 1})}

such that the elements

S (i_{k}, j_{k})

,

k \in Z [1, n - 1]

, of S are positive.

If we assume that the eigenvalues of S are ordered as

λ_{1} ≺ λ_{2} ≺ \dots ≺ λ_{n}

then there remain

2 (n - 1)! - 1

sets of Schur canonical forms. Note that any fixed ordering of the (simple) eigenvalues of A on the diagonal of S is preserved only by unitary similarity transformations with diagonal matrices U such that

U (k, k) = exp (i φ_{k})

,

k \in Z [1, n]

.

If in particular we choose a given

(n - 1)

-tuple, say

{(1, 2), (2, 3), \dots, (n - 1, n)} \in K_{1} (n) \cap K_{2} (n)

then the set of Schur canonical forms is uniquely fixed. In this case the Schur canonical forms have the form

[\begin{matrix} λ_{1} & \oplus & * & \dots & * \\ 0 & λ_{2} & \oplus & \dots & * \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & λ_{n - 1} & \oplus \\ 0 & 0 & \dots & 0 & λ_{n} \end{matrix}]

where ⊕ denotes a positive element. Two other Schur canonical forms are

[\begin{matrix} λ_{1} & \oplus & \oplus & \dots & \oplus \\ 0 & λ_{2} & * & \dots & * \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & λ_{n - 1} & * \\ 0 & 0 & \dots & 0 & λ_{n} \end{matrix}], [\begin{matrix} λ_{1} & * & * & \dots & \oplus \\ 0 & λ_{2} & * & \dots & \oplus \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & λ_{n - 1} & \oplus \\ 0 & 0 & \dots & 0 & λ_{n} \end{matrix}]

Note that there is a similar problem with Jordan canonical forms of matrices

A \in C (n)

relative to general similarity transformations. Usually it is assumed that different orderings of the Jordan blocks do not produce different Jordan forms. Formally this means that the Jordan canonical form of A is not a single block-diagonal matrix

J \in C (n)

but a class of block-diagonal matrices, which are permutationally equivalent to J.

Definition 14.

A set

C \subset T (n)

of Schur canonical forms

T \in T (n)

for generic matrices

A \in C (n)

and a fixed

(n - 1)

-tuple

{(i_{1}, j_{1}), (i_{2}, j_{2}), \dots, (i_{n - 1}, j_{n - 1})} \in K_{1} (n) \cup K_{2} (n)

is characterized as follows.

The n diagonal elements of the matrix T are ordered as

$T (1, 1) ≺ T (2, 2) ≺ \dots ≺ T (n, n) .$
The $n - 1$ elements $T (i_{1}, j_{1}), T (i_{2}, j_{2}), \dots, T (i_{n - 1}, j_{n - 1})$ of T over the diagonal are real and positive.□

Of course, we may choose the elements

T (i_{k}, j_{k})

to be real and negative as well, or to have angles equal to a fixed value

φ_{0} \in (- π, π]

, etc.

A matrix

A \in C (n)

with eigenvalues

λ_{1} ≺ λ_{2} ≺ \dots ≺ λ_{n}

may be transformed into Schur canonical form

T \in T (n)

by the next three steps.

1. The matrix A is transformed into any Schur condensed form $T_{1} = U_{1}^{H} A U_{1} \in T (n)$ by a matrix $U_{1} \in U (n)$ . Numerically this is done by the QR algorithm [5]. For this purpose the code schur from MATLAB® may be used [10].
2. A condensed Schur form $T_{2} = U_{2}^{H} T_{1} U_{2} \in T (n)$ is constructed so that $T_{2} (k, k) = λ_{k}$ , $k \in Z [1, n]$ . This may be done by several complex plane rotations, each of which interchanges the positions of two diagonal elements $T_{1} (i, i)$ and $T_{1} (j, j)$ of $T_{1}$ such that $i < j$ but $T_{1} (i, i) ≻ T_{1} (j, j)$ , see e.g. [5].
3. A diagonal matrix $U_{3} \in U (n)$ with elements $U_{3} (1, 1) = 1$ , $U_{3} (k, k) = exp (i φ_{k})$ , $φ_{k} \in R$ , $k \in Z [2, n]$ , is chosen so that the matrix $T = U_{3}^{H} T_{2} U_{3}$ has positive elements in positions $(i_{k}, j_{k})$ .
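The interchange of two diagonal elements in the second step can be sketched for a single 2 × 2 upper triangular block. The construction below is the standard one based on the eigenvector for the second eigenvalue and is not taken from the paper; the names `swap_2x2` and `similarity` and the test values are hypothetical, and the block is assumed not to be a multiple of the identity.

```python
import math

def swap_2x2(a, b, c):
    """Unitary Q with Q^H [[a, b], [0, c]] Q = [[c, *], [0, a]].

    The first column of Q is the normalized eigenvector (b, c - a) for the
    eigenvalue c; the second column is orthogonal to it.
    """
    x1, x2 = complex(b), complex(c - a)
    norm = math.hypot(abs(x1), abs(x2))   # nonzero unless a == c and b == 0
    x1, x2 = x1 / norm, x2 / norm
    return [[x1, -x2.conjugate()],
            [x2, x1.conjugate()]]

def matmul2(A, B):
    """Product of two 2 x 2 matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def similarity(Q, T):
    """Return Q^H T Q for 2 x 2 matrices."""
    QH = [[Q[j][i].conjugate() for j in range(2)] for i in range(2)]
    return matmul2(QH, matmul2(T, Q))

# Diagonal elements in the wrong lexicographical order: 2 + i comes after -1 + 0.5i.
a, b, c = 2 + 1j, 1 - 1j, -1 + 0.5j
S = similarity(swap_2x2(a, b, c), [[a, b], [0, c]])
print(S)
```

After the transformation the diagonal of S carries c and then a, the (2,1) element vanishes up to rounding, and the modulus of the (1,2) element is unchanged.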

We recall that to introduce Schur canonical forms in the set

C (n)

relative to the similarity action of

U (n)

we use the lexicographical order ≺ on

C ≃ R \times R

. For

z_{k} = x_{k} + i y_{k} \in C

, where

x_{k}, y_{k} \in R

,

k = 1, 2

, we write

z_{1} ≺ z_{2}

if either

x_{1} < x_{2}

, or

x_{1} = x_{2}

and

y_{1} < y_{2}

.

There are

μ_{n} = \binom{ν_{n}}{n - 1}, ν_{n} = n (n - 1) / 2,

sets of generic canonical forms

C_{k} \subset T (n)

,

k \in Z [1, μ_{n}]

, for

A \in C (n)

. The values of

μ_{n}

for small values of n are given in Table 1.

There are

ν_{n}

different pairs

p_{k} = (i_{k}, j_{k})

with

1 \leq i_{k} < j_{k} \leq n

. They are ordered lexicographically according to the rule

(i_{1}, j_{1}) ≺ (i_{2}, j_{2})

if either

i_{1} < i_{2}

, or

i_{1} = i_{2}

and

j_{1} < j_{2}

. We may order the pairs

p_{k}

as

p_{1} ≺ p_{2} ≺ \dots ≺ p_{ν_{n}}

, where

k = k_{n} (i, j) = j + n (i - 1) - \frac{i (i + 1)}{2} .

(2)

Thus we have the chain of inequalities

\begin{matrix} (1, 2) ≺ (1, 3) ≺ \dots ≺ (1, n) ≺ (2, 3) ≺ \dots ≺ (2, n) ≺ \dots \\ ≺ (n - 2, n - 1) ≺ (n - 2, n) ≺ (n - 1, n) \end{matrix}

It follows from (2) that for n fixed and any

k \in Z [1, ν_{n}]

there exists a unique integer

i = i_{n} (k)

such that

a_{n} (i) \leq k \leq b_{n} (i)

, where

a_{n} (i) = 1 + \frac{(i - 1) (2 n - i)}{2}, b_{n} (i) = \frac{i (2 n - i - 1)}{2} .

The integer

i_{n} (k)

may be defined from

a_{n} (i_{n} (k)) = max {a_{n} (i) : a_{n} (i) \leq k}

or

b_{n} (i_{n} (k)) = min {b_{n} (i) : b_{n} (i) \geq k} .

Finally set

j_{n} (k) = k - n (i_{n} (k) - 1) + \frac{i_{n} (k) (i_{n} (k) + 1)}{2} .

Thus we have defined a bijection

(i, j) \mapsto k = k_{n} (i, j), k \mapsto (i, j) = (i_{n} (k), j_{n} (k)),

between the ordered sets of integers

Z [1, ν_{n}]

and integer pairs

(i, j)

, where

1 \leq i < j \leq n

.
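The bijection between pairs and their positions in the lexicographical order can be written out directly from formula (2) and the bounds a_n(i), b_n(i). The function names `k_of` and `pair_of` are illustrative; the inverse uses a simple linear search for i_n(k) rather than a closed-form expression.

```python
def k_of(i, j, n):
    """Position k = k_n(i, j) of the pair (i, j), formula (2)."""
    return j + n * (i - 1) - i * (i + 1) // 2

def pair_of(k, n):
    """Inverse map k -> (i_n(k), j_n(k)) for 1 <= k <= n (n - 1) / 2."""
    i = 1
    # b_n(i) = i (2n - i - 1) / 2 is the position of the last pair (i, n) in row i.
    while i * (2 * n - i - 1) // 2 < k:
        i += 1
    j = k - n * (i - 1) + i * (i + 1) // 2
    return (i, j)

n = 4
for k in range(1, n * (n - 1) // 2 + 1):
    print(k, pair_of(k, n))
```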

Proposition 6.

The triple of pairs of indexes

((i, k), (i, j), (l, j))

, where

i < k

,

i < j

,

l < j

and

k < j

,

i < l

, is said to be improper. The triple of pairs of indexes

((k, j), (i, j), (i, l))

, where

k < j

,

i < j

,

i < l

and

k < i

,

j < l

, is said to be improper. □

Each proper

(n - 1)

-tuple

θ \in K_{1} (n) \cup K_{2} (n)

defines a class

C (θ) \subset T (n)

of canonical forms for the unitary similarity action of the group

U (n)

on generic matrices

A \in C (n)

. These forms are upper triangular matrices S with

S (k, k) ≺ S (k + 1, k + 1)

for

k \in Z [1, n - 1]

and

S (i, j) \in R

,

S (i, j) > 0

for

(i, j) \in θ

. □

If the matrix

A \in C (n)

with eigenvalues

λ_{1} ≺ λ_{2} ≺ \dots ≺ λ_{n}

is already transformed into condensed Schur form, i.e.

A \in T (n)

, it is then easily put into canonical form as follows. First a matrix

U \in U (n)

is chosen so that

Diag (S) = diag (λ_{1}, λ_{2}, \dots, λ_{n}), S = U^{H} A U .

Then a diagonal unitary matrix D with

D (1, 1) = 1

is found so that

T = D^{H} S D \in C (θ)

. Denoting

S (i, j) = | S (i, j) | exp (i α (i, j))

and

D (i, i) = exp (i φ (i))

,

φ (1) = 0

, where

α (i, j), φ (i) \in (- π, π]

, the conditions

T (i, j) > 0

give the system of

n - 1

linear equations

φ (i) - φ (j) = α (i, j), (i, j) \in θ,

(3)

for

φ (2), φ (3), \dots, φ (n)

. If it happens that

φ (k) \notin (- π, π]

for some k then

φ (k)

is replaced by

\tilde{φ} (k) = φ (k) mod (2 π) \in (- π, π] .

(4)

Three special sets of Schur canonical forms for generic matrices

A \in C (n)

deserve attention. For these sets the system (3) for

φ (i)

,

i \in Z [2, n]

, is easily solved explicitly. The first set of generic canonical forms corresponds to the pairs of indexes

(1, j)

,

j \in Z [2, n]

, and here the solution of (3) is

φ (i) = - α (1, i) .

The second set corresponds to the index pairs

(i, n)

,

i \in Z [1, n - 1]

, the solution of (3) being

φ (i) = α (i, n) - α (1, n) .

The third set corresponds to the indexes

(i, i + 1)

,

i \in Z [1, n - 1]

, and here the solution of (3) is

φ (i) = - \sum_{k = 1}^{i - 1} α (k, k + 1) .

In all these cases the convention (4) is presupposed.
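The third case can be sketched as follows. Given an upper triangular S with nonzero superdiagonal elements, the code computes the angles φ(i) from the recursion φ(i + 1) = φ(i) - α(i, i + 1) with φ(1) = 0, wraps them according to convention (4), and forms T = D^H S D. The test matrix and the function names are hypothetical.

```python
import cmath
import math

def wrap(phi):
    """Map an angle to the interval (-pi, pi], as in convention (4)."""
    phi = math.fmod(phi, 2 * math.pi)
    if phi > math.pi:
        phi -= 2 * math.pi
    elif phi <= -math.pi:
        phi += 2 * math.pi
    return phi

def superdiagonal_phases(S):
    """Solve (3) for the pairs (i, i + 1): phi(1) = 0, phi(i+1) = phi(i) - alpha(i, i+1)."""
    phi = [0.0]
    for i in range(len(S) - 1):
        phi.append(wrap(phi[-1] - cmath.phase(S[i][i + 1])))
    return phi

def normalize(S):
    """T = D^H S D with D = diag(exp(i phi)), i.e. T(i,j) = S(i,j) exp(i (phi(j) - phi(i)))."""
    n = len(S)
    phi = superdiagonal_phases(S)
    return [[S[i][j] * cmath.exp(1j * (phi[j] - phi[i])) for j in range(n)]
            for i in range(n)]

# Upper triangular test matrix with lexicographically ordered diagonal.
S = [[-1 + 0j, 2j, 1 - 1j],
     [0, 0.5 + 1j, -3 + 0j],
     [0, 0, 2 - 2j]]
T = normalize(S)
print([T[i][i + 1] for i in range(2)])
```

The superdiagonal elements of T come out real and positive up to rounding, while the diagonal and the moduli of all elements are unchanged.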

The restrictions assumed in this section, and in particular the condition that the eigenvalues of A are pair-wise distinct, seem serious, but in fact their violation can make the perturbation analysis of the Schur decomposition meaningless. If e.g. A has two or more equal eigenvalues then the Schur basis of the perturbed Schur problem may be discontinuous as a function of the perturbation in A, see e.g. [8] and [12].

5. Real Schur Canonical Forms

The considerations above are valid for real or genuinely complex matrices with spectra that may in turn be real or genuinely complex. In particular we have the following four possibilities: 1. The matrix A is real and has real spectrum; 2. The matrix A is real and has genuinely complex spectrum (i.e. there is at least one complex conjugate pair

α \pm i β

of eigenvalues, where

0 < β \in R

); 3. The matrix A is genuinely complex and has real spectrum, and 4. The matrix A is genuinely complex and has genuinely complex spectrum.

When the matrix A is real (cases 1 and 2), i.e.

A \in R (n)

, then we may use orthogonal transformation matrices

U \in O (n)

instead of unitary ones to obtain the real Schur canonical form and the real Schur condensed forms of A. In case 1 the transformation matrix

U \in U (n)

is taken as orthogonal, i.e.

U \in O (n)

, and both the Schur canonical form and the Schur condensed forms of A are real upper triangular matrices T with the eigenvalues of A on their main diagonals.

Case 2 is slightly more subtle. Here the transformation matrix U may be chosen as orthogonal [11,14], while the canonical form and the condensed forms of A are upper block-triangular matrices with

1 \times 1

or

2 \times 2

blocks (

λ_{k} \in R

or

Λ_{k} \in R (2)

) on the main diagonal. In this case there is at least one

2 \times 2

block

Λ_{k} = [\begin{matrix} α_{k} & β_{k} \\ - β_{k} & α_{k} \end{matrix}] \in R (2)

corresponding to the eigenvalues

α_{k} + i β_{k}

,

α_{k} - i β_{k}

of A, where

α_{k}, β_{k} \in R

and

β_{k} > 0

.
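That a real block with equal diagonal elements α and off-diagonal elements β, -β has the eigenvalues α ± iβ can be checked directly from its trace and determinant. The sketch below assumes hypothetical values of α and β; the helper `eig2` is illustrative only.

```python
import cmath

def eig2(M):
    """Eigenvalues of a 2x2 matrix computed from its trace and determinant."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return (tr - root) / 2, (tr + root) / 2

alpha, beta = 0.5, 2.0                      # hypothetical values with beta > 0
block = [[alpha, beta], [-beta, alpha]]     # 2x2 diagonal block of the real form
lam_minus, lam_plus = eig2(block)
print(lam_minus, lam_plus)
```

Here the trace is 2α and the determinant is α² + β², so the discriminant equals -4β² and the eigenvalues form a complex conjugate pair.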

Let

n > 2

and suppose that the spectrum

spect (A)

contains m real elements

λ_{1}, λ_{2}, \dots, λ_{m}

and

n - m

genuinely complex elements

α_{k} + i β_{k}, α_{k} - i β_{k}

,

k = 1, 2, \dots, (n - m) / 2

, where the number

n - m

is even. Set

q = (n - m) / 2

. Then the orthogonal canonical form of A has the structure

S = U^{⊤} A U = [\begin{matrix} S_{1, 1} & S_{1, 2} \\ O_{n - m, m} & S_{2, 2} \end{matrix}]

Here

S_{1, 1} \in R (m)

,

S_{1, 2} \in R (m, n - m)

,

S_{2, 2} \in R (n - m)

,

S_{1, 1} = [\begin{matrix} λ_{1} & s_{1, 2} & \dots & s_{1, m} \\ 0 & λ_{2} & \dots & s_{2, m} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & λ_{m} \end{matrix}], S_{2, 2} = [\begin{matrix} Λ_{1} & S_{1, 2} & \dots & S_{1, q} \\ O_{2} & Λ_{2} & \dots & S_{2, q} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ O_{2} & O_{2} & \dots & Λ_{q} \end{matrix}]

and

s_{i, j} \in R

,

S_{i, j} \in R (2)

. The diagonal blocks are ordered so that

λ_{1} < λ_{2} < \dots < λ_{m}

and

α_{t} + i β_{t} ≺ α_{t + 1} + i β_{t + 1}

,

t = 1, 2, \dots, q - 1

.

6. Perturbations

Let

(U, T) \in U (n) \times T (n)

be a fixed solution of the Schur problem for the matrix

A \in C (n)

with the convention that if

A \in T (n)

then

U = I_{n}

and

T = A

. Let

δ A \in C (n)

be a perturbation in A. Usually (but not always) we suppose that the matrix

δ A

is small relative to A, e.g.

∥ δ A ∥ \leq ρ c ∥ A ∥

, where

ρ

is the rounding unit of the binary floating-point arithmetic (FPA) used in the computations [2] and c is a small positive constant. In double precision FPA we have

ρ = 2^{- 53} ≃ 1.1102 \times 10^{- 16}

.
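The value of the rounding unit can be confirmed on any platform whose floating-point numbers are IEEE 754 double precision, which is an assumption of the sketch below (it holds for standard Python builds on common hardware).

```python
import sys

rho = 2.0 ** -53                 # rounding unit quoted in the text
eps = sys.float_info.epsilon     # gap between 1.0 and the next larger double, 2**-52

print(rho)
print(rho == eps / 2)            # the rounding unit is half of the machine epsilon
print(1.0 + rho == 1.0)          # rho is lost when added to 1.0 (round to nearest even)
print(1.0 + 2 * rho > 1.0)       # twice the rounding unit is not lost
```

Perturbations of relative size ρ are therefore the smallest ones that rounding in double precision FPA can introduce.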

We often assume that the perturbation

δ A

is a 1-parameter family

δ A (ε) = ε E

, where

ε > 0

is a small parameter and

E \in C (n)

is a fixed matrix with

∥ E ∥ = 1

, i.e.

∥ δ A ∥ = ε

. The technique of the so-called fictitious small parameter can also be used in the perturbation analysis of matrix problems. Assuming that

∥ δ A ∥

is small relative to

∥ A ∥

we use the identity

δ A = ε E

, where

E = δ A / ∥ δ A ∥

and

ε

is finally set to 1.

The formulation of the perturbed Schur problem (PSP), i.e. the Schur problem for a perturbed matrix

A + δ A

, is not trivial, see the examples in the next Section. First we mention two facts.

If the PSP for a slightly perturbed matrix $A + δ A$ has a slightly perturbed solution $(U + δ U, T + δ T)$ with

$∥ δ U ∥, ∥ δ T ∥ = O (δ), δ = ∥ δ A ∥ \to 0,$

then it also has a significantly perturbed solution $(U + Δ U, T + δ T)$ with $Δ U = - 2 U - δ U$ and $∥ Δ U ∥ = 2 + O (δ)$ . More precisely, we have

$2 - ∥ δ U ∥ \leq ∥ 2 U + δ U ∥ = ∥ Δ U ∥ = ∥ U + (U + δ U) ∥ \leq 2 .$
The solution of the PSP for the matrix $A + δ A$ may have the form $(U, T + δ T)$ with $δ U = 0$ . This will happen if and only if $U^{H} δ A U \in T (n)$ and in this case $∥ δ T ∥ = ∥ δ A ∥$ .

Let

(U_{0}, T_{0}) \in U (n) \times T (n)

be a solution of the Schur problem for the matrix

A_{0} \in C (n)

under the convention that if

A_{0} \in T (n)

then

U_{0} = I_{n}

and

T_{0} = A_{0}

. Consider for simplicity the case

δ A = ε A_{1}

, where

ε \in [0, ε_{0})

,

ε_{0} > 0

, is a small parameter and

A_{1} \in C (n)

is a fixed matrix with

∥ A_{1} ∥ = 1

. Let

(U (ε), T (ε)) \in U (n) \times T (n)

be a solution to the PSP for the matrix

A (ε) : = A_{0} + ε A_{1}

, i.e.

T (ε) = U^{H} (ε) A (ε) U (ε), U (ε) \in U (n), T (ε) \in T (n) .

(5)

Since the solution of the PSP always exists, we have defined functions

U (\cdot) : [0, ε_{0}) \to U (n)

and

T (\cdot) : [0, ε_{0}) \to T (n)

through the relations (5). The problem is that there are many such functions and not all of them are suitable for perturbation analysis. The aim of the next definition is to clarify the concepts in this area.

Definition 15.

The pair

(U (ε), T (ε))

is said to be a regular solution of the PSP for the matrix

A (ε) = A_{0} + ε A_{1}

if the functions

U (\cdot)

and

T (\cdot)

are continuous on the interval

[0, ε_{0})

and

(U (0), T (0))

is the principal solution of the SP for

A_{0}

. □

A number of examples of condensed Schur forms presented in the next section illustrate the structure of these forms and the behavior of their perturbations, see also [7].

Example 2.

Let

A \in C (n)

be a scalar matrix, i.e.

A = λ I_{n}

,

λ \in C

. Then the general solution of the Schur problem for A is

U (n) \times {A}

. The opposite statement is also true in the form of the next two assertions.

If $U (A) = U (n)$ then A is a scalar matrix and $T (A) = {A}$ .
If $T (A) = {A}$ then A is a scalar matrix and $U (A) = U (n)$ . □

Example 3.

Let

A = λ I_{n} + J_{n}

, where

λ \in C

and

J_{n} = [0, I_{n - 1}; 0, 0]

is the

n \times n

Jordan block with zero eigenvalue. Then

U (A) = U (n) \cap D (n), T (A) = {λ I_{n} + X : | X | = J_{n}} . □

7. Examples of Real $2 \times 2$ Matrices

In this section we consider several examples illustrating the concepts introduced so far. In what follows “Schur form” means “condensed Schur form”. The examples concern the Schur problem and the PSP for matrices

A \in R (2)

with real spectra for which the transformation group is

O (2)

. This is the simplest non-trivial case. However, the effects observed are in fact valid for matrices

A \in C (n)

,

n > 2

, e.g. of the form

A = [A_{1, 1}, A_{1, 2}; 0, A_{2, 2}]

, where

A_{1, 1} \in R (2)

and

A_{2, 2} \in C (n - 2)

.

Matrices

A \in R (2)

correspond to linear operators

R^{2} \to R^{2}

and have the simplest nontrivial albeit rich structure. A surprisingly large number of facts about general linear operators is revealed by such matrices, see e.g. [4] and the examples below.

Example 4.

Let the matrix

A \in R (2)

have eigenvalues

λ_{1}, λ_{2}

and set

r = \sqrt{{∥ A ∥}_{F}^{2} - | λ_{1} |^{2} - {| λ_{2} |}^{2}} .

Then the following four cases are possible in which the statements are reversible.

If $λ_{1} = λ_{2} = λ$ and $r = 0$ then there exists a unique Schur form $λ I_{2}$ of A.
If $λ_{1} = λ_{2} = λ$ and $r > 0$ then there exist two Schur forms $λ I_{2} \pm r E_{1, 2}$ of A.
If $λ_{1} \neq λ_{2}$ and $r = 0$ then there exist two Schur forms $diag (λ_{1}, λ_{2})$ and $diag (λ_{2}, λ_{1})$ of A.
If $λ_{1} \neq λ_{2}$ and $r > 0$ then there exist four Schur forms

$diag (λ_{1}, λ_{2}) \pm r E_{1, 2}, diag (λ_{2}, λ_{1}) \pm r E_{1, 2}$

of A. □
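The four cases of Example 4 can be enumerated mechanically. The following is a minimal sketch, assuming a real 2 × 2 matrix with real spectrum given as nested lists; the function name `schur_forms_2x2` and the tolerance `tol` are hypothetical choices made for illustration.

```python
import math

def schur_forms_2x2(A, tol=1e-12):
    """List the condensed Schur forms of a real 2x2 matrix with real spectrum."""
    (a, b), (c, d) = A
    tr, det = a + d, a * d - b * c
    disc = tr * tr - 4 * det
    if disc < -tol:
        raise ValueError("the spectrum is not real")
    root = math.sqrt(max(disc, 0.0))
    lam1, lam2 = (tr - root) / 2, (tr + root) / 2
    fro2 = a * a + b * b + c * c + d * d            # squared Frobenius norm
    r = math.sqrt(max(fro2 - lam1 ** 2 - lam2 ** 2, 0.0))
    # Two orderings of the diagonal if the eigenvalues differ, one otherwise.
    diagonals = {(lam1, lam2), (lam2, lam1)} if root > tol else {(lam1, lam2)}
    # Two signs of the off-diagonal element if r > 0, one otherwise.
    offdiag = {r, -r} if r > tol else {0.0}
    return sorted(((p, t), (0.0, q)) for (p, q) in diagonals for t in offdiag)

for A in ([[2, 0], [0, 2]], [[2, 1], [0, 2]], [[1, 0], [0, 3]], [[1, 0], [0.5, 3]]):
    print(A, len(schur_forms_2x2(A)))
```

The four sample matrices are chosen so that each of the four cases occurs once, in the order in which the cases are listed above.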

Example 5.

Let

A = λ I_{2}

,

λ \in R

. We have

U (λ I_{2}) = O (2)

and

T (λ I_{2}) = {λ I_{2}}

. Since

λ I_{2}

is in Schur form, the principal solution of the Schur problem is

(I_{2}, λ I_{2})

. Let the matrix

λ I_{2}

be perturbed to

λ I_{2} + ε E_{2, 1}

,

ε \neq 0

. Then the Schur decomposition of the perturbed matrix is

U = I_{2} + δ U, T = λ I_{2} + δ T .

The set of transformation matrices

U (λ I_{2} + ε E_{2, 1})

consists of 4 matrices

U_{1}, - U_{1}, U_{2}, - U_{2}

, where

U_{1} = E_{1, 2} + E_{2, 1}, U_{2} = E_{1, 2} - E_{2, 1} .

In view of the equalities

U_{k} = I_{2} + δ U_{k}

, for two of these matrices we have

∥ δ U_{1} ∥ = ∥ I_{2} \pm U_{1} ∥ = 2

and for the other two we have

∥ δ U_{2} ∥ = ∥ I_{2} \pm U_{2} ∥ = \sqrt{2} .

At the same time the set of Schur forms

T (λ I_{2} + ε E_{2, 1})

consists of two matrices

λ I_{2} \pm ε E_{1, 2}

. Thus the transformation matrix

U (λ I_{2} + ε E_{2, 1})

is discontinuous (or infinitely sensitive) as a function of the perturbation parameter

ε

at the point

ε = 0

.

Consider also the multivalued function

Ψ : R \to 2^{O (2)}

, where

2^{O (2)}

is the set of subsets of

O (2)

, defined by

ε \mapsto Ψ (ε) = U (λ I_{2} + ε E_{2, 1}) .

We have

Ψ (0) = O (2)

and

Ψ (ε) = {U_{1}, - U_{1}, U_{2}, - U_{2}}

for

ε \neq 0

. Hence the function

Ψ

, i.e. the Schur basis for

R (2, 1)

relative to the matrix

λ I_{2} + ε E_{2, 1}

, is discontinuous at the point

ε = 0

, while the Schur forms of

λ I_{2} + ε E_{2, 1}

are continuous and well conditioned in

ε

. □
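The discontinuity described in Example 5 is easy to observe numerically. The sketch below assumes a hypothetical value of λ and uses the spectral norm of a 2 × 2 matrix computed from its Frobenius norm and determinant; all helper names are illustrative.

```python
import math

def matmul(X, Y):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(X):
    return [[X[j][i] for j in range(2)] for i in range(2)]

def spectral_norm(M):
    """Largest singular value of a real 2x2 matrix."""
    fro2 = sum(x * x for row in M for x in row)
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    disc = max(fro2 * fro2 - 4.0 * det * det, 0.0)
    return math.sqrt((fro2 + math.sqrt(disc)) / 2.0)

U1 = [[0.0, 1.0], [1.0, 0.0]]    # E_{1,2} + E_{2,1}
U2 = [[0.0, 1.0], [-1.0, 0.0]]   # E_{1,2} - E_{2,1}

def perturbed_schur(eps, lam=3.0):
    """Schur forms of lam*I_2 + eps*E_{2,1} and the distances ||U - I_2||."""
    A = [[lam, 0.0], [eps, lam]]
    results = []
    for U in (U1, U2):
        T = matmul(transpose(U), matmul(A, U))
        dU = [[U[i][j] - (1.0 if i == j else 0.0) for j in range(2)]
              for i in range(2)]
        results.append((T, spectral_norm(dU)))
    return results

for eps in (1e-1, 1e-8, 1e-15):
    for T, dist in perturbed_schur(eps):
        print(eps, T, dist)
```

However small ε is taken, the distance from the transformation matrix to the unperturbed choice I_2 does not decrease, while the Schur form itself differs from λ I_2 only by a term of size ε.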

Example 6.

Let

A_{0} = λ I_{2} + E_{1, 2} \in R (2)

be a Jordan block with eigenvalue

λ \in R

. The set

T (A_{0})

contains two matrices

T_{0, 1} = λ I_{2} + E_{1, 2}, T_{0, 2} = λ I_{2} - E_{1, 2},

while the set

U (A_{0})

contains four matrices

I_{2}, - I_{2}, E_{1, 1} - E_{2, 2}, E_{2, 2} - E_{1, 1} .

Let the matrix

A_{0}

be perturbed to

A (ε) = A_{0} + ε E_{2, 1}

,

ε > 0

. The eigenvalues of

A (ε)

are

λ_{1} (ε) = λ - \sqrt{ε}

,

λ_{2} (ε) = λ + \sqrt{ε}

. Setting

c (ε) = \frac{1}{\sqrt{1 + ε}}, s (ε) = - \frac{\sqrt{ε}}{\sqrt{1 + ε}}

we see that there are four Schur forms

T_{1} (ε) = [\begin{matrix} λ_{1} (ε) & t (ε) \\ 0 & λ_{2} (ε) \end{matrix}], T_{2} (ε) = [\begin{matrix} λ_{1} (ε) & - t (ε) \\ 0 & λ_{2} (ε) \end{matrix}],

and

T_{3} (ε) = [\begin{matrix} λ_{2} (ε) & t (ε) \\ 0 & λ_{1} (ε) \end{matrix}], T_{4} (ε) = [\begin{matrix} λ_{2} (ε) & - t (ε) \\ 0 & λ_{1} (ε) \end{matrix}],

where

t (ε) = 1 - ε

. The orthogonal matrices

U_{k} (ε)

that transform

A (ε)

into Schur forms

T_{k} (ε)

, respectively, are

U_{1} (ε) = [\begin{matrix} c (ε) & - s (ε) \\ s (ε) & c (ε) \end{matrix}], U_{2} (ε) = [\begin{matrix} c (ε) & s (ε) \\ s (ε) & - c (ε) \end{matrix}],

and

U_{3} (ε) = [\begin{matrix} c (ε) & s (ε) \\ - s (ε) & c (ε) \end{matrix}], U_{4} (ε) = [\begin{matrix} c (ε) & - s (ε) \\ - s (ε) & - c (ε) \end{matrix}] .

Hence there are two regular solutions of this PSP, namely

(U_{1} (ε), T_{1} (ε))

and

(U_{3} (ε), T_{3} (ε))

corresponding to the unperturbed diagonally different Schur forms

T_{0, 1}

and

T_{0, 2}

, respectively. □
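The four decompositions of Example 6 can be verified numerically. The following is a minimal sketch, assuming hypothetical values of λ and ε; the function `example6` and its helpers are illustrative names.

```python
import math

def matmul(X, Y):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(X):
    return [[X[j][i] for j in range(2)] for i in range(2)]

def example6(eps, lam=2.0):
    """Return U_k^T A(eps) U_k, k = 1..4, for A(eps) = lam*I_2 + E_{1,2} + eps*E_{2,1}."""
    A = [[lam, 1.0], [eps, lam]]
    c = 1.0 / math.sqrt(1.0 + eps)
    s = -math.sqrt(eps) / math.sqrt(1.0 + eps)
    Us = [
        [[c, -s], [s, c]],      # U_1
        [[c, s], [s, -c]],      # U_2
        [[c, s], [-s, c]],      # U_3
        [[c, -s], [-s, -c]],    # U_4
    ]
    return [matmul(transpose(U), matmul(A, U)) for U in Us]

eps = 1e-4                      # hypothetical perturbation size
for k, T in enumerate(example6(eps), start=1):
    print(k, T)
```

Up to rounding errors, each computed matrix is upper triangular with diagonal elements λ ∓ √ε in the stated order and off-diagonal element ±(1 - ε).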

Example 7.

Let

A_{0} = diag (λ_{1}, λ_{2})

,

λ_{1} \neq λ_{2}

. Here the set

T (A_{0})

contains two diagonally different Schur forms

T_{0, 1} = diag (λ_{1}, λ_{2}), T_{0, 2} = diag (λ_{2}, λ_{1})

of

A_{0}

, while the set

O (A_{0})

has 8 elements, namely

\pm E_{1, 1} \pm E_{2, 2}, \pm E_{1, 2} \pm E_{2, 1} .

Let us again choose

δ A (ε) = ε E_{2, 1}

. For

ε \neq 0

the set

T (A_{0} + ε E_{2, 1})

has four elements:

{\tilde{T}}_{1} (ε) = T_{0, 1} + ε E_{1, 2}, {\tilde{T}}_{2} (ε) = T_{0, 1} - ε E_{1, 2}

and

{\tilde{T}}_{3} (ε) = T_{0, 2} + ε E_{1, 2}, {\tilde{T}}_{4} (ε) = T_{0, 2} - ε E_{1, 2} .

The matrices

U_{1}, - U_{1}

from Example 5 transform the perturbed matrix

A_{0} + δ A (ε)

into the Schur form

{\tilde{T}}_{3} (ε)

and the matrices

U_{2}, - U_{2}

transform

A_{0} + δ A (ε)

into the Schur form

{\tilde{T}}_{4} (ε)

since

U_{1}

and

U_{2}

transform

δ A (ε)

into

\pm ε E_{1, 2} \in T (2)

, respectively.

Consider now the transformation of

A_{0} + δ A (ε)

into some of the Schur forms

{\tilde{T}}_{1} (ε)

or

{\tilde{T}}_{2} (ε)

. Define the orthogonal matrices

U_{1} (ε) = [\begin{matrix} c (ε) & s (ε) \\ s (ε) & - c (ε) \end{matrix}], U_{2} (ε) = [\begin{matrix} c (ε) & - s (ε) \\ s (ε) & c (ε) \end{matrix}],

where

c (ε) = \frac{λ_{1} - λ_{2}}{\sqrt{ε^{2} + {(λ_{1} - λ_{2})}^{2}}}, s (ε) = \frac{ε}{\sqrt{ε^{2} + {(λ_{1} - λ_{2})}^{2}}} .

We have

U_{k}^{⊤} (ε) (A + δ A (ε)) U_{k} (ε) = {\tilde{T}}_{k} (ε), k = 1, 2 .

Furthermore, for λ_{1} > λ_{2} we have

U_{2} (0) = I_{2}

and

U_{1} (0) = diag (1, - 1)

. Hence the regular solution of the PSP is

(U_{2} (ε), {\tilde{T}}_{2} (ε))

. □
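The continuity of the transformation matrices in Example 7 can be checked in the same way. The sketch below assumes hypothetical eigenvalues with λ_1 > λ_2, so that c(0) = 1; the function name `example7` is illustrative.

```python
import math

def matmul(X, Y):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(X):
    return [[X[j][i] for j in range(2)] for i in range(2)]

def example7(eps, lam1=3.0, lam2=1.0):
    """Pairs (U_k(eps), U_k(eps)^T A(eps) U_k(eps)), k = 1, 2; assumes lam1 > lam2."""
    A = [[lam1, 0.0], [eps, lam2]]              # diag(lam1, lam2) + eps*E_{2,1}
    h = math.hypot(eps, lam1 - lam2)
    c, s = (lam1 - lam2) / h, eps / h
    U1 = [[c, s], [s, -c]]
    U2 = [[c, -s], [s, c]]
    return [(U, matmul(transpose(U), matmul(A, U))) for U in (U1, U2)]

for eps in (1e-2, 1e-6, 0.0):
    for U, T in example7(eps):
        print(eps, U, T)
```

As ε tends to zero the matrix U_2(ε) tends to I_2, whereas U_1(ε) tends to diag(1, -1), which is why only the pair containing U_2(ε) qualifies as regular.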

Example 8.

Let

A_{0} = diag (λ_{1}, λ_{2}) + a E_{1, 2} \in R (2),

where

δ = | λ_{1} - λ_{2} | > 0

and

a \neq 0

. The set

T (A_{0})

of condensed Schur forms of

A_{0}

contains 4 matrices:

T_{1, 2} = diag (λ_{1}, λ_{2}) \pm a E_{1, 2}, T_{3, 4} = diag (λ_{2}, λ_{1}) \pm a E_{1, 2} .

The Schur canonical form of

A_{0}

is

S_{0} = diag (λ_{min}, λ_{max}) + | a | E_{1, 2} \in T (A_{0}),

where

λ_{min} = min {λ_{1}, λ_{2}} < λ_{max} = max {λ_{1}, λ_{2}} .

Let the matrix

A_{0}

be perturbed to

A (ε) = A_{0} + ε E_{2, 1}

, where

ε

is a small parameter such that

δ^{2} + 4 a ε > 0

, which holds in particular for

ε \in (- ε_{0}, ε_{0})

, where

ε_{0} = δ^{2} / (4 | a |)

. The condensed Schur forms of the matrix

A (ε)

are

{\tilde{T}}_{1, 2} = diag ({\tilde{λ}}_{1}, {\tilde{λ}}_{2}) \pm \tilde{a} E_{1, 2}, {\tilde{T}}_{3, 4} = diag ({\tilde{λ}}_{2}, {\tilde{λ}}_{1}) \pm \tilde{a} E_{1, 2},

where the quantities

{\tilde{λ}}_{1} = {\tilde{λ}}_{1} (ε)

,

{\tilde{λ}}_{2} = {\tilde{λ}}_{2} (ε)

and

\tilde{a} = \tilde{a} (ε)

are analytic functions of

ε

. In particular

\begin{matrix} {\tilde{λ}}_{1} = λ_{1} + \frac{a ε}{λ_{1} - λ_{2}} + O (ε^{2}), \\ {\tilde{λ}}_{2} = λ_{2} + \frac{a ε}{λ_{2} - λ_{1}} + O (ε^{2}), \\ \tilde{a} = a - ε + O (ε^{2}), ε \to 0 . \end{matrix}

Among the four condensed forms only the matrix

{\tilde{T}}_{1} = diag ({\tilde{λ}}_{1}, {\tilde{λ}}_{2}) + \tilde{a} E_{1, 2}

is regular. □
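The expansions of Example 8 can be compared with the exact quantities. The following sketch assumes hypothetical values of λ_1, λ_2 and a; it obtains the perturbed eigenvalues from the characteristic polynomial of A(ε) and the modulus of the off-diagonal element from the invariance of the Frobenius norm under orthogonal similarity.

```python
import math

def example8(eps, lam1=1.0, lam2=3.0, a=2.0):
    """Perturbed eigenvalues and |a~| for A(eps) = diag(lam1, lam2) + a*E_{1,2} + eps*E_{2,1}."""
    tr = lam1 + lam2
    det = lam1 * lam2 - a * eps
    root = math.sqrt(tr * tr - 4.0 * det)       # sqrt(delta^2 + 4*a*eps)
    sign = 1.0 if lam1 > lam2 else -1.0
    t1 = (tr + sign * root) / 2.0               # branch that equals lam1 at eps = 0
    t2 = (tr - sign * root) / 2.0               # branch that equals lam2 at eps = 0
    fro2 = lam1 ** 2 + lam2 ** 2 + a ** 2 + eps ** 2
    r = math.sqrt(max(fro2 - t1 ** 2 - t2 ** 2, 0.0))
    return t1, t2, r

lam1, lam2, a = 1.0, 3.0, 2.0                   # hypothetical data
for eps in (1e-2, 1e-3, 1e-4):
    t1, t2, r = example8(eps, lam1, lam2, a)
    print(eps,
          t1 - (lam1 + a * eps / (lam1 - lam2)),   # remainder of the first expansion
          t2 - (lam2 + a * eps / (lam2 - lam1)),   # remainder of the second expansion
          r - abs(a - eps))                        # remainder of the third expansion
```

The first two remainders decrease like ε², in agreement with the O(ε²) terms, and the third one is at the level of rounding errors.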

8. Diagonally Spectral Matrices

Denote by

Δ (n) \subset C (n)

the set of matrices

A \in C (n)

such that the multiset of its diagonal elements is equal to the multiset of its eigenvalues, i.e.

spect (A) = {A (1, 1), A (2, 2), \dots, A (n, n)}

(6)

Otherwise speaking,

Δ (n)

is the set of matrices A such that

det (A - A (k, k) I_{n}) = 0, k \in Z [1, n]

(7)

Definition 16.

The matrices

A \in Δ (n)

which satisfy (6) or (7) are said to be diagonally spectral. □

The set

Δ (n) \subset C (n)

is defined by n algebraic equations (7) (some of them may not be independent) in the elements of A and is hence a closed algebraic variety of complex dimension up to

n (n - 1)

.

Upper triangular matrices and lower triangular matrices are diagonally spectral. In particular, Schur condensed forms are diagonally spectral. More generally, for

P \in O (n)

being a permutation matrix, and

A \in C (n)

being a diagonally spectral matrix, the matrix

P^{⊤} A P

is also diagonally spectral.

Example 9.

The elements of the matrix

A \in Δ (2)

satisfy one independent algebraic equation

A (1, 2) A (2, 1) = 0

. Hence matrices

A \in Δ (2)

have the form

[\begin{matrix} * & * \\ 0 & * \end{matrix}], [\begin{matrix} * & 0 \\ * & * \end{matrix}]

where * denotes unspecified matrix elements. □

Example 10.

The matrices

A_{1}, A_{1}^{⊤}, A_{2}, A_{2}^{⊤}, A_{3}, A_{3}^{⊤} \in C (3)

, where

A_{1} = [\begin{matrix} * & * & * \\ 0 & * & * \\ 0 & 0 & * \end{matrix}], A_{2} = [\begin{matrix} * & 0 & * \\ * & * & * \\ 0 & 0 & * \end{matrix}], A_{3} = [\begin{matrix} * & * & * \\ 0 & * & 0 \\ 0 & * & * \end{matrix}],

are diagonally spectral. □

Matrices from

Δ (n)

may not be condensed in the sense that they have

n (n - 1) / 2

zero elements. In particular matrices from

Δ (n)

may have all their elements different from zero.

Example 11.

Let

z \in C

be a parameter. Then the matrices

A (z) = [\begin{matrix} 1 & 1 & 1 \\ 0 & 2 & 1 \\ z & - z & 3 \end{matrix}], B (z) = [\begin{matrix} 1 & 1 & 1 \\ z & 2 & 1 \\ - z - 2 & 2 & 3 \end{matrix}]

are diagonally spectral, i.e.

spect (A (z)) = spect (B (z)) = {1, 2, 3} .

We stress that

A (0) \in T (3)

but

B (z) \notin T (3)

for all

z \in C

. □
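The claim of Example 11 can be tested through condition (7). Since the diagonal elements 1, 2, 3 are pairwise distinct, it suffices to check that each of them annihilates the determinant. The sketch below evaluates the determinants for several hypothetical values of z; the helper names are illustrative.

```python
def det3(M):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = M
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def A(z):
    return [[1, 1, 1], [0, 2, 1], [z, -z, 3]]

def B(z):
    return [[1, 1, 1], [z, 2, 1], [-z - 2, 2, 3]]

def shifted(M, k):
    """Return M - M(k, k) * I_3, where the index k is zero-based."""
    d = M[k][k]
    return [[M[i][j] - (d if i == j else 0) for j in range(3)] for i in range(3)]

def satisfies_condition_7(M, tol=1e-12):
    """Check det(M - M(k, k) I_3) = 0 for every diagonal element."""
    return all(abs(det3(shifted(M, k))) <= tol for k in range(3))

for z in (0, 1, -2, 5, 2 + 3j):
    print(z, satisfies_condition_7(A(z)), satisfies_condition_7(B(z)))
```

For z different from 0 and -2 all elements of B(z) are non-zero, so this matrix is diagonally spectral although it has no zero elements at all.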

The main advantage of a Schur canonical or condensed form

T = U^{H} A U \in T (n), U \in U (n)

of a matrix

A \in C (n)

is that it reveals the spectrum

{λ_{1}, λ_{2}, \dots, λ_{n}}

of the matrix A as the collection of the diagonal elements

{T (1, 1), T (2, 2), \dots, T (n, n)}

of the form T. Thus the sets of Schur canonical and Schur condensed forms are subsets of the larger set (closed in the Zariski topology)

Δ (n)

.

Important observation. The requirement that the condensed form T is upper triangular, i.e.

T \in T (n)

, may lead to extreme sensitivity of the transformation pair

(T, U)

relative to perturbations in the matrix A.

Example 12.

For the matrix

A = λ I_{2}

,

λ \in C

, the pair

(T, U)

is

(λ I_{2}, I_{2})

. If we perturb A to

\tilde{A} = λ I_{2} + ε E_{2, 1}

, where

ε > 0

is arbitrarily small, the pair

(T, U)

is transformed to

(\tilde{T}, \tilde{U})

, where

\tilde{T} = λ I_{2} \pm ε E_{1, 2}

and

\tilde{U}

is any of the four matrices

\pm E_{1, 2} \pm E_{2, 1}

. Thus

∥ U - \tilde{U} ∥_{F} = 2

and the transformation matrix

U = U (ε)

is even discontinuous at the point

ε = 0

.□

This high sensitivity may not be relevant to the problem of computing the spectrum

spect (A)

of A. To avoid such artificial sensitivity in the next section we introduce the concept of quasi-Schur condensed forms.

9. Quasi-Schur Condensed Forms

Definition 17.

A matrix

S = U^{H} A U \in C (n)

,

U \in U (n)

, is said to be a quasi-Schur condensed form of

A \in C (n)

if it is block-upper triangular with

m \geq 1

diagonal blocks

S_{k, k} \in C (n_{k})

,

n = n_{1} + n_{2} + \dots + n_{m}

, i.e.

S = [\begin{matrix} S_{1, 1} & S_{1, 2} & \dots & S_{1, m} \\ O & S_{2, 2} & \dots & S_{2, m} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ O & O & \dots & S_{m, m} \end{matrix}]

and either

S_{k, k} \in T (n_{k})

or

S_{k, k}^{⊤} \in T (n_{k})

,

k \in Z [1, m]

.□

Example 13.

For

n = 2

the quasi-Schur condensed forms have the structure

A_{1}, A_{1}^{⊤}

, where

A_{1} \in T (2)

. For

n = 3

the quasi-Schur condensed forms are

A_{1}, A_{1}^{⊤}

,

A_{2}, A_{2}^{⊤}

and

A_{3}, A_{3}^{⊤}

, where

A_{1} \in T (3)

,

A_{2} = [\begin{matrix} * & * & * \\ 0 & * & 0 \\ 0 & * & * \end{matrix}], A_{3} = [\begin{matrix} * & 0 & * \\ * & * & * \\ 0 & 0 & * \end{matrix}],

and * denotes unspecified quantities. □

Quasi-Schur condensed forms are diagonally spectral but the opposite is not true for

n \geq 3

, see e.g. Example 11.

Obviously a Schur condensed form is also a quasi-Schur condensed form but the opposite may not be true (we recall that

n \geq 2

). We stress that high sensitivity of Schur forms as in Example 12 may not be observed for quasi-Schur condensed forms.
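The last remark can be illustrated on the perturbed matrix of Example 12. The sketch below assumes hypothetical values of λ and ε and covers only the case m = 1 of Definition 17, i.e. a single diagonal block that is upper or lower triangular; the function names are illustrative.

```python
def transpose(M):
    return [list(col) for col in zip(*M)]

def is_upper_triangular(M):
    """True if every element strictly below the main diagonal is zero."""
    return all(M[i][j] == 0 for i in range(len(M)) for j in range(i))

def is_quasi_schur_one_block(M):
    """Definition 17 with m = 1: M or its transpose is upper triangular."""
    return is_upper_triangular(M) or is_upper_triangular(transpose(M))

lam, eps = 3.0, 1e-10                # hypothetical values
A_pert = [[lam, 0.0], [eps, lam]]    # lam*I_2 + eps*E_{2,1}

print(is_upper_triangular(A_pert))       # is the perturbed matrix a Schur form?
print(is_quasi_schur_one_block(A_pert))  # is it a quasi-Schur condensed form?
```

The perturbed matrix is lower triangular, hence it is already a quasi-Schur condensed form of itself with U = I_2, and no change of the transformation matrix is needed.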

10. Conclusions

In this paper we have considered the Schur canonical forms for a square matrix A with pair-wise distinct eigenvalues. Sensitivity of the Schur form relative to perturbations in A was also studied. The concept of regular solution to the perturbed Schur form was introduced and illustrated by a number of examples. We have also introduced the concepts of diagonally spectral matrices (Schur forms are diagonally spectral) and of quasi-Schur condensed forms of a matrix A which may be much less sensitive to perturbations in A.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

[1] J. Brenner, The problem of unitary equivalence, Acta Mathematica, 86 (1951), pp. 297-308.
[2] IEEE Computer Society, 754-2019-IEEE Standard for Floating-Point Arithmetic, https://ieeexplore.ieee.org/document/8766229, 2019.
[3] K. Ikramov, The canonical Schur form of a matrix with simple eigenvalues, Doklady Mathematics, 77 (2008), no. 3, pp. 359-360.
[4] M. Glazman and Ju. Ljubich, Finite-Dimensional Linear Analysis: A Systematic Presentation in Problem Form, New York, Dover Books in Mathematics, Dover Publications, 2006, ISBN 978-0486453323.
[5] G. Golub and C. Van Loan, Matrix Computations, Baltimore, The Johns Hopkins University Press (4th edition), 2013, ISBN 978-1421407944.
[6] R. Hartshorne, Algebraic Geometry, Berlin, Springer-Verlag, 1977, ISBN 978-0387902449.
[7] M. Konstantinov and P. Petkov, Perturbation Methods in Matrix Analysis and Control, New York, Nova Science Publishers, 2020, ISBN 978-1536174700, BISAC: MAT034000.
[8] M. Konstantinov, P. Petkov and N. Christov, Nonlocal perturbation analysis of the Schur system of a matrix, SIAM Journal on Matrix Analysis and Applications, 15 (1994), no. 2, pp. 383-392.
[9] D. Littlewood, On unitary equivalence, Journal of the London Mathematical Society, 28 (1953), pp. 314-322.
[10] MATLAB®, Math. Graphics. Programming, Natick, MathWorks Inc. (release R2024b), 2024, www.mathworks.com.
[11] F. Murnaghan and A. Wintner, A canonical form for real matrices under orthogonal transformations, Proceedings of the National Academy of Sciences of the USA, 17 (1931), no. 7, pp. 417-420.
[12] P. Petkov, Componentwise perturbation analysis of the Schur decomposition of a matrix, SIAM Journal on Matrix Analysis and Applications, 42 (2021), no. 1, pp. 108-133.
[13] I. Schur, Beiträge zur Theorie der Gruppen linearer homogener Substitutionen, Transactions of the American Mathematical Society, 10 (1909), pp. 159-175.
[14] H. Shapiro, A survey of canonical forms and invariants for unitary similarity, Linear Algebra and its Applications, 147 (1991), pp. 101-167.

Table 1. Number of generic canonical forms

n	2	3	4	5	6	7	8	9	10
$μ_{n}$	1	3	20	210	3003	54264	1184040	30260340	886163135

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
