On Asymptotic Covariance of Unrotated Factor Solutions

Xingwei Hu; Caihong Hu; Cheng-Kuang Wu

doi:10.20944/preprints202603.1901.v1

Submitted:

23 March 2026

Posted:

24 March 2026

You are already at the latest version

Abstract

This paper derives closed-form expressions for the asymptotic covariance matrices of unrotated factor loading and uniqueness estimators for several widely used non-maximum-likelihood factor extraction methods. These include least squares, principal factor, iterative principal component, alpha factor, and image factor analysis. By expressing these results explicitly in terms of the asymptotic covariance of the sample covariance or correlation matrix, the proposed formulas facilitate straightforward computation of standard errors. When combined with the delta method from rotation criteria, they further yield analytically tractable standard errors for rotated factor loadings. Monte Carlo simulations demonstrate accurate finite-sample performance, and an empirical application illustrates practical implementation of the proposed approach.

Keywords:

factor analysis

;

asymptotic covariance

;

standard errors

;

factor loadings

;

uniqueness

;

multivariate implicit differentiation

;

structural VAR

Subject:

Computer Science and Mathematics - Probability and Statistics

MSC: 62H25; 62F12; 62H12

1. Introduction

Let

X_{1}, X_{2}, \dots, X_{p}

be random variables, collectively denoted by the random vector Y, Common or exploratory factor analysis postulates the existence of k independent common factors, with

k < p

, that underlie the observed variables Y. Let these factors be jointly represented by the random vector F. The relationship between the observed variables and the common factors is specified by

Y = c + Λ F + ϵ,

(1)

where c is a constant vector,

Λ

is a

p \times k

factor-loading matrix, and

ϵ

is a vector of p mutually independent idiosyncratic disturbances.

Let

Σ

denote the variance-covariance (or correlation) matrix of Y, and assume that the common factors have unit variance. Under these assumptions, the covariance structure of Y is given by

Σ = Λ Λ^{⊤} + Ψ,

(2)

where the superscript ⊤ denotes matrix transpose, and

Ψ

is a

p \times p

diagonal matrix whose entries correspond to the variances of the idiosyncratic disturbances, often referred to as uniquenesses.

Eq. (2) implies that the off-diagonal covariances of Y are entirely determined by the factor loadings. In contrast, the diagonal variances reflect contributions from both the common and idiosyncratic components: the common factors capture the variation shared across variables Y, while the idiosyncratic disturbances represent variable-specific noise. Let

λ_{i t}

denote the element in the i-th row and t-th column of

Λ

. The quantity

h_{i} = \sum_{t = 1}^{k} λ_{i t}^{2},

(3)

represents the proportion of the variance of

X_{i}

explained by the common factors and is known as the communality of variable

X_{i}

.

A wide range of factor-extraction methods has been proposed in the literature (e.g., Harman [1976] [7]; Lawley and Maxwell [1971] [15]; SAS [2015] [19]), including principal-factor, maximum-likelihood, and least-squares approaches. These methods aim to recover the factor model in Eq. (2) from a given sample covariance or correlation matrix

\hat{Σ}

. From a practical perspective, it is important not only to obtain point estimates of factor loadings and uniquenesses, but also to assess the precision of these estimates and to identify substantively meaningful—often rotated—factor structures. Such assessments require knowledge of the asymptotic covariance matrices (acov) of the estimators, as well as an understanding of how these covariances are affected by the chosen rotation criterion.

The asymptotic covariance of the estimators depends on both the factor-extraction method and the covariance of

\hat{Σ}

. Let n denote the sample size, and suppose that the estimator of the loading matrix can be expressed as

Λ = f (Σ)

for a given extraction method f. Assume that f is differentiable at

Σ

, with differential

d f

, which defines a linear mapping from the space of

p \times p

symmetric matrices to the space of

p \times k

matrices. By the delta method (see Rao [2009] [17]), the difference between

n^{1 / 2} d Λ

and

d f (n^{1 / 2} d Σ)

converges to zero in probability as

n \to \infty

. In this setting,

d Λ \approx \hat{Λ} - Λ

and

d Σ \approx \hat{Σ} - Σ

denote deviations from the unobservable population parameters

Σ

and

Λ

, respectively.

Let

Σ = [σ_{i j}]

and define

d σ_{i j} = {\hat{σ}}_{i j} - σ_{i j}

, where

{\hat{σ}}_{i j}

denotes the sample covariance (or correlation) between

X_{i}

and

X_{j}

. The asymptotic covariance between elements of the estimated loading matrix

\hat{Λ}

can then be expressed as

acov ({\hat{λ}}_{i r}, {\hat{λ}}_{j s}) = \sum_{m, n, x, y = 1}^{p} \frac{\partial λ_{i r}}{\partial σ_{m x}} acov ({\hat{σ}}_{m x}, {\hat{σ}}_{n y}) \frac{\partial λ_{j s}}{\partial σ_{n y}} .

Similarly, let

ψ = {(ψ_{1}, \dots, ψ_{p})}^{⊤}

denote the diagonal elements of the uniqueness matrix

Ψ

. The asymptotic covariance of the uniqueness estimators can be written as

acov ({\hat{ψ}}_{i}, {\hat{ψ}}_{j}) = \sum_{m, n, x, y = 1}^{p} \frac{\partial ψ_{i}}{\partial σ_{m x}} acov ({\hat{σ}}_{m x}, {\hat{σ}}_{n y}) \frac{\partial ψ_{j}}{\partial σ_{n y}} .

In this paper, we derive explicit expressions for the differentials

d Λ

and

d ψ

in terms of

d Σ

for several widely used factor extraction methods. The motivation for this research is twofold. First, given the covariance of

\hat{Σ}

, our results enable practitioners to compute standard errors for estimated factor loadings

\hat{Λ}

and uniquenesses

\hat{ψ}

. When the data are drawn from a multivariate normal population, for example, the covariance of

\hat{Σ}

is well-known (see Girshick [1939] [6]). Second, delta-method-based inference for rotated factor loadings (e.g., Archer and Jennrich [1973] [2]; Jennrich [1973] [10]; Yung and Hayashi [2001] [21]) requires these differentials through the chain rule of differentiation. At present, readily available standard error formulas exist primarily for maximum-likelihood factor analysis, as implemented in software such as SAS [2015] [19], reflecting a general property of maximum-likelihood estimators.

When the assumption of normality is violated, the results developed here can be combined with existing asymptotic results for sample covariances or correlations (e.g., Hsu [1949] [9]; de Leeuw [1983] [4]) to obtain more general asymptotic covariance formulas for unrotated factor loadings. While previous studies have examined the asymptotic properties of principal component analysis (e.g., Girshick [1939] [6]; Anderson [1963] [1]) and derived standard error formulas for maximum-likelihood factor analysis (Lawley [1967] [14], Jennrich and Thayer [1973] [11]), our focus is on least-squares, iterative principal component, principal, alpha, and image factor solutions, as implemented in SAS [2015] [19].

Despite the availability of maximum-likelihood (ML) factor analysis, non–ML extraction methods remain widely used due to their computational simplicity, robustness, distribution-free restrictions, and broad implementation in standard statistical software (e.g., Lin et al. [2023] [16]; Raykov et al. [2026] [18]). In many empirical applications, factor analysis is applied to binary, categorical, or qualitative data that may be high-frequency, high-dimensional, or heavy-tailed. Although recent research has emphasized robustness, high-dimensional asymptotics, and software-based inference for rotated solutions, explicit closed-form asymptotic covariance formulas for non-ML factor extraction methods remain largely unavailable. This gap substantially limits formal statistical inference and model comparison. The present paper addresses this limitation by deriving analytically tractable asymptotic covariance expressions, thereby enabling delta-method inference under both normal and non-normal settings.

Our approach is based on multivariate implicit differentiation of the estimating equations that define the factor loadings

\hat{Λ}

and uniquenesses

\hat{Σ}

under each extraction method, following techniques widely used in factor-analysis literature (e.g., Archer and Jennrich [1973] [2]; Girshick [1939] [6]; Jennrich [1973] [10]; Yung and Hayashi [2001] [21]). We assess the accuracy of the proposed standard errors for the uniqueness estimator

\hat{ψ}

through a simulation study and present an empirical application illustrating their use for the least-squares factor solution.

Accordingly, the main contributions of this paper are as follows:

(i): Derivation of explicit asymptotic covariance formulas for several non-maximum-likelihood factor solutions;
(ii): A unified implicit-differentiation framework applicable across multiple factor extraction methods;
(iii): Closed-form standard error formulas for uniqueness estimators;
(iv): A practical extension to rotated factor loadings via the delta method; and
(v): Simulation evidence demonstrating finite-sample accuracy.

2. Iterative Principal Component and Principal Factor Analysis

We begin by introducing notation. For a

p \times k

matrix

Λ

, let

\vec{Λ}

denote its vectorization obtained by stacking the columns of

Λ

, that is,

\vec{Λ} = {(λ_{11}, \dots, λ_{p 1}, λ_{12}, \dots, λ_{p 2}, \dots, λ_{1 k}, \dots, λ_{p k})}^{⊤} .

When no confusion arises, we continue to use double subscripts to refer to individual elements of

\vec{Λ}

. In particular, the

(i, r)

-th element of

Λ

, denoted by

λ_{i r}

, corresponds to the

((r - 1) p + i)

-th element of

\vec{Λ}

. For a square matrix M, let

diag (M)

denote the vector formed by the diagonal elements of M, and let

Diag (M)

denote the diagonal matrix whose diagonal is given by

diag (M)

. We also use

δ_{i j}

to denote the Kronecker delta, which equals 1 if

i = j

and 0 otherwise.

Both iterative principal component factor analysis and principal factor analysis construct the loading matrix

Λ

from the principal eigenvectors of

Σ - Ψ

:

Σ - Ψ = Λ Λ^{⊤} = \sum_{r = 1}^{k} λ_{r} λ_{r}^{⊤},

(4)

where

λ_{r}

denotes the r-th column of

Λ = [λ_{1}, λ_{2}, \dots, λ_{k}]

. These columns are ordered according to the eigenvalues of

Σ - Ψ

, with

λ_{r}

corresponding to the r-th largest eigenvalue. Without loss of generality, we assume that these eigenvalues are distinct, which is typically the case for a sample variance-covariance matrix. Consequently, the inner product between any two distinct columns of

Λ

is zero. In particular, Eq. (4) implies

σ_{i i} - ψ_{i} = \sum_{t = 1}^{k} λ_{i t}^{2}, \forall i = 1, . . ., p .

(5)

For the sample variance-covariance matrix

\hat{Σ}

, Eq. (4) may, in general, include additional items of the form

{\hat{λ}}_{r} {\hat{λ}}_{r}^{⊤}

for

r > k

. These items are omitted in accordance with the model specification in Eq. (2).

Using Eq. (4), we obtain

(Σ - Ψ) λ_{r} = Λ Λ^{⊤} λ_{r} = \sum_{s = 1}^{k} λ_{s} λ_{s}^{⊤} λ_{r} = \sum_{s = 1}^{k} λ_{s} (λ_{s}^{⊤} λ_{r}) = (λ_{r}^{⊤} λ_{r}) λ_{r} .

(6)

This result holds for any

1 \leq r \leq k

. Now fix indices

1 \leq i \leq p

and

1 \leq r \leq k

. Applying Eq. (5) to the i-th components of both sides of Eq. (6) yields

\sum_{j \neq i} σ_{i j} λ_{j r} + λ_{i r} \sum_{t = 1}^{k} λ_{i t}^{2} = λ_{i r} \sum_{j = 1}^{p} λ_{j r}^{2} .

Taking differentials on both sides, we obtain

\sum_{j \neq i} (σ_{i j} d λ_{j r} + λ_{j r} d σ_{i j}) + (\sum_{t = 1}^{k} λ_{i t}^{2}) d λ_{i r} + 2 λ_{i r} \sum_{t = 1}^{k} λ_{i t} d λ_{i t} = (\sum_{j = 1}^{p} λ_{j r}^{2}) d λ_{i r} + 2 λ_{i r} \sum_{j = 1}^{p} λ_{j r} d λ_{j r} .

After algebraic simplification, this expression reduces to

\sum_{j \neq i} (σ_{i j} - 2 λ_{i r} λ_{j r}) d λ_{j r} + (\sum_{t = 1}^{k} λ_{i t}^{2} - \sum_{j = 1}^{p} λ_{j r}^{2}) d λ_{i r} + 2 λ_{i r} \sum_{t \neq r} λ_{i t} d λ_{i t} = - \sum_{j \neq i} λ_{j r} d σ_{i j} .

Collecting all linear equations derived above, the system can be written compactly in matrix form as

A d \vec{Λ} = B d \vec{Σ},

where the coefficient of

d λ_{j t}

in the

(i, r)

-th equation is given by

A_{i r, j t} = \{\begin{matrix} \sum_{s = 1}^{k} λ_{i s}^{2} - \sum_{s = 1}^{p} λ_{s r}^{2}, & if j = i and t = r, \\ 2 λ_{i r} λ_{i t}, & if j = i but t \neq r, \\ σ_{i j} - 2 λ_{i r} λ_{j r}, & if j \neq i but t = r, \\ 0, & otherwise, \end{matrix}

for all

1 \leq i, j \leq p

and

1 \leq r, t \leq k

. Similarly, the coefficient of

d σ_{x y}

in the same equation is

B_{i r, x y} = - δ_{i x} (1 - δ_{x y}) λ_{y r}, \forall 1 \leq i, x, y \leq p, 1 \leq r \leq k .

Finally, we obtain

d \vec{Λ} = A^{- 1} B d \vec{Σ}

and, by the delta method, the asymptotic covariance matrix of

\vec{\hat{Λ}}

is given by

acov (\vec{\hat{Λ}}) = (A^{- 1} B) acov (\vec{\hat{Σ}}) {(A^{- 1} B)}^{⊤} .

3. Least-Squares Factor Analysis

In least-squares factor analysis (see Harman [1976] [7]), the objective is to minimize the sum of squared off-diagonal elements of the residual covariance matrix

\hat{Σ} - \hat{Λ} {\hat{Λ}}^{⊤}

. Define the objective function

g (Λ) = \sum_{1 \leq y < x \leq p} {(σ_{x y} - \sum_{t = 1}^{k} λ_{x t} λ_{y t})}^{2},

that is, the sum of squares of the strictly lower-triangular elements of

Σ - Λ Λ^{⊤}

.

The first-order optimality conditions yield

p \times k

constraints on the loading matrix

Λ

. Specifically, we require

\frac{\partial g (Λ)}{\partial λ_{i r}} = 0

for all

1 \leq i \leq p

and

1 \leq r \leq k

. Straightforward differentiation gives

\begin{matrix} - \frac{1}{2} \frac{\partial g}{\partial λ_{i r}} (Λ) & = & \sum_{1 \leq y < x \leq p} (σ_{x y} - \sum_{t = 1}^{k} λ_{x t} λ_{y t}) \sum_{t = 1}^{k} (δ_{i x} δ_{r t} λ_{y t} + δ_{i y} δ_{r t} λ_{x t}) \\ = & \sum_{1 \leq y < x \leq p} (δ_{i x} λ_{y r} + δ_{i y} λ_{x r}) (σ_{x y} - \sum_{t = 1}^{k} λ_{x t} λ_{y t}) \\ = & \sum_{1 \leq y < i} λ_{y r} (σ_{i y} - \sum_{t = 1}^{k} λ_{i t} λ_{y t}) + \sum_{i < x \leq p} λ_{x r} (σ_{x i} - \sum_{t = 1}^{k} λ_{x t} λ_{i t}) \\ = & \sum_{j \neq i} λ_{j r} (σ_{i j} - \sum_{t = 1}^{k} λ_{i t} λ_{j t}) . \end{matrix}

For any fixed

1 \leq i \leq p

and

1 \leq r \leq k

, the first-order condition reduces to

\sum_{j \neq i} λ_{j r} σ_{i j} = \sum_{j \neq i} λ_{j r} \sum_{t = 1}^{k} λ_{i t} λ_{j t} .

(7)

The differential form of Eq. (7) is given by

\sum_{j \neq i} (λ_{j r} d σ_{i j} + σ_{i j} d λ_{j r}) = \sum_{j \neq i} (\sum_{t = 1}^{k} λ_{i t} λ_{j t}) d λ_{j r} + \sum_{j \neq i} λ_{j r} \sum_{t = 1}^{k} (λ_{j t} d λ_{i t} + λ_{i t} d λ_{j t}) .

Grouping terms and simplifying yield

\sum_{j \neq i} λ_{j r} d σ_{i j} = \sum_{j \neq i} (\sum_{t = 1}^{k} λ_{i t} λ_{j t} - σ_{i j} + λ_{j r} λ_{i r}) d λ_{j r} + \sum_{t = 1}^{k} (\sum_{j \neq i} λ_{j r} λ_{j t}) d λ_{i t} + \sum_{j \neq i} \sum_{t \neq r} λ_{j r} λ_{i t} d λ_{j t},

which completes the differential characterization of the first-order condition.

We rewrite the preceding system of equations in matrix form as

- B d \vec{Σ} = D d \vec{Λ},

where the coefficient of

d λ_{j t}

in the

(i, r)

-th equation is defined as

D_{i r, j t} = \{\begin{matrix} \sum_{z \neq i} λ_{z r} λ_{z t}, & if j = i, \\ \sum_{z = 1}^{k} λ_{i z} λ_{j z} - σ_{i j} + λ_{i r} λ_{j r}, & if j \neq i and t = r, \\ λ_{j r} λ_{i t}, & if j \neq i and t \neq r . \end{matrix}

These definitions apply for all

1 \leq i, j \leq p

and

1 \leq r, t \leq k

. Finally, solving for

d \vec{Λ}

yields

d \vec{Λ} = - D^{- 1} B d \vec{Σ} .

4. Alpha Factor Analysis

Let H be the diagonal matrix whose i-th diagonal element is

\sqrt{h_{i}}

. Clearly,

h_{i} > 0

; otherwise, Eq. (3) would imply that the i-th row of the loading matrix

Λ

is a zero vector, rendering the corresponding factor redundant. Moreover,

H^{- 1} (Σ - Ψ) H^{- 1}

is the correlation matrix for

Λ F

and Eq (3) implies that

d h_{i} = 2 \sum_{t = 1}^{k} λ_{i t} d λ_{i t} .

(8)

To obtain the spectral decomposition of this correlation matrix, let

θ_{r}

denote its r-th largest eigenvalue, and let

β_{r}

be the corresponding eigenvector with length

θ_{r}^{1 / 2}

. Thus,

θ_{r} = β_{r}^{⊤} β_{r},

and

H^{- 1} (Σ - Ψ) H^{- 1} β_{r} = θ_{r} β_{r}, \forall 1 \leq r \leq k .

In alpha factor analysis (Kaiser and Caffrey [1965] [13]), the loading matrix

Λ = [λ_{1}, λ_{2}, \dots, λ_{k}]

is defined as

Λ = H [β_{1}, β_{2}, \dots, β_{k}] .

Additionally, let

Γ = [γ_{1}, γ_{2}, \dots, γ_{k}]

, where

γ_{r} = H^{- 1} β_{r}

. Then, for each

r = 1, \dots, k

, the following relationships—corresponding to Eqs. (9)-(11)—hold:

λ_{r} = H β_{r} = H^{2} γ_{r},

(9)

θ_{r} = β_{r}^{⊤} β_{r} = γ_{r}^{⊤} H^{2} γ_{r} = γ_{r}^{⊤} λ_{r},

(10)

(Σ - Ψ) γ_{r} = (Σ - Ψ) H^{- 1} β_{r} = H H^{- 1} (Σ - Ψ) H^{- 1} β_{r} = θ_{r} H β_{r} = γ_{r}^{⊤} λ_{r} λ_{r} .

(11)

In the remainder of this section, we first express

d \vec{Γ}

in terms of

d \vec{Λ}

using Eq. (9). We then derive a matrix equation relating

d \vec{Σ}

,

d \vec{Γ}

, and

d \vec{Λ}

from Eq. (11).

From Eq. (9), for any

1 \leq i \leq p

and

1 \leq r \leq k

, we have

λ_{i r} = h_{i} γ_{i r},

whose differential is given by

d λ_{i r} = h_{i} d γ_{i r} + 2 γ_{i r} \sum_{t = 1}^{k} λ_{i t} d λ_{i t} .

Collecting these expressions yields the following matrix representation:

d \vec{Γ} = E d \vec{Λ},

(12)

where the elements of the matrix E are defined by

E_{i r, j t} = \frac{\partial γ_{i r}}{\partial λ_{j t}} = δ_{i j} (δ_{r t} - 2 γ_{i r} λ_{i t}) / h_{i},

for all

1 \leq i, j \leq p

and

1 \leq r, t \leq k

.

For any fixed

1 \leq i \leq p

and

1 \leq r \leq k

, the i-th component of Eq. (11) can be written as

\sum_{j \neq i} σ_{i j} γ_{j r} + h_{i} γ_{i r} = λ_{i r} \sum_{j = 1}^{p} λ_{j r} γ_{j r} .

(13)

Taking differentials of Eq. (13) yields

\sum_{j \neq i} (σ_{i j} d γ_{j r} + γ_{j r} d σ_{i j}) + h_{i} d γ_{i r} + γ_{i r} d h_{i} = λ_{i r} \sum_{j = 1}^{p} (λ_{j r} d γ_{j r} + γ_{j r} d λ_{j r}) + (\sum_{j = 1}^{p} λ_{j r} γ_{j r}) d λ_{i r} .

Substituting Eq. (8) into the above expression and rearranging terms, we obtain

\sum_{j \neq i} γ_{j r} d σ_{i j} = \sum_{j \neq i} (λ_{i r} λ_{j r} - σ_{i j}) d γ_{j r} + (λ_{i r}^{2} - h_{i}) d γ_{i r} + (\sum_{j \neq i} λ_{j r} γ_{j r}) d λ_{i r} - 2 γ_{i r} \sum_{t \neq r} λ_{i t} d λ_{i t} + λ_{i r} \sum_{j \neq i} γ_{j r} d λ_{j r} .

We express all the differential forms above in matrix notation as

F d \vec{Σ} = G d \vec{Γ} + J d \vec{Λ} .

(14)

The coefficient matrices F, G, and J are defined as follows.

Matrix F: The coefficient of $d σ_{x y}$ in the $(i, r)$ -th equation of Eq. (14) is

$F_{i r, x y} = δ_{i x} (1 - δ_{i y}) γ_{y r} .$
Matrix G: The coefficient of $d γ_{j t}$ is given by

$G_{i r, j t} = \{\begin{matrix} λ_{i r}^{2} - h_{i}, & if j = i and t = r, \\ λ_{i r} λ_{j r} - σ_{i j}, & if t = r but j \neq i, \\ 0, & otherwise . \end{matrix}$
Matrix J: The coefficient of $d λ_{j t}$ is

$J_{i r, j t} = \{\begin{matrix} \sum_{s \neq i} λ_{s r} γ_{s r}, & if j = i and t = r, \\ - 2 γ_{i r} λ_{i t}, & if j = i but t \neq r, \\ λ_{i r} γ_{j r}, & if t = r but j \neq i, \\ 0, & otherwise . \end{matrix}$

Combining Eqs. (12) and (14), we obtain

F d \vec{Σ} = G d \vec{Γ} + J d \vec{Λ} = G (E d \vec{Λ}) + J d \vec{Λ} = (G E + J) \vec{Λ} .

Consequently,

d \vec{Λ} = {(G E + J)}^{- 1} F d \vec{Σ} .

5. Image Factor Analysis

Let

Δ = {(Diag (Σ^{- 1}))}^{- 1}

. In image factor analysis (see Jöreskog [1969] [12]), the covariance matrix

Σ

is modeled as

Σ = Λ Λ^{⊤} + τ Δ,

(15)

where

τ > 0

is an unknown scalar parameter. Within this framework, one may write

Ψ = τ Δ

. Jöreskog [1969] [12] proposed a maximum-likelihood procedure for estimating this model, under which the covariance matrix of the estimators follows directly from standard maximum-likelihood theory.

For the principal-factor procedure (see SAS [2015] [19]), the loading matrix

Λ

is obtained from the k largest eigenvalues and their corresponding eigenvectors of

Σ - τ Δ

. The parameter

τ

is simultaneously estimated as the coefficient in the regression of

diag (Σ - Λ Λ^{⊤})

on

diag (Δ)

, namely,

τ = \frac{〈 diag (Δ), diag (Σ - Λ Λ^{⊤}) 〉}{〈 diag (Δ), diag (Δ) 〉}

(16)

where

〈 \cdot, \cdot 〉

denotes the standard inner product.

Let

σ^{x y}

denote the element at the x-th row and y-th column of

Σ^{- 1}

. Since

Σ Σ^{- 1}

is the

p \times p

identity matrix, differentiating this identity using the product rule yields

d Σ^{- 1} = - Σ^{- 1} (d Σ) Σ^{- 1} .

In particular, for each

i = 1, . . ., p

,

d σ^{i i} = - \sum_{x, y = 1}^{p} σ^{i x} (d σ_{x y}) σ^{y i} = - \sum_{x, y = 1}^{p} σ^{i x} σ^{i y} d σ_{x y} .

Consequently,

d (\frac{1}{σ^{i i}}) = - \frac{1}{{(σ^{i i})}^{2}} d σ^{i i} = \sum_{x, y = 1}^{p} \frac{σ^{i x} σ^{i y}}{{(σ^{i i})}^{2}} d σ_{x y} .

(17)

Eq. (16) implies that

τ 〈 diag (Δ), diag (Δ) 〉 = 〈 diag (Δ), diag (Σ - Λ Λ^{⊤}) 〉,

and therefore

τ \sum_{i = 1}^{p} {(\frac{1}{σ^{i i}})}^{2} = \sum_{i = 1}^{p} \frac{1}{σ^{i i}} (σ_{i i} - h_{i}) .

Differentiating both sides yields

[\sum_{i = 1}^{p} {(\frac{1}{σ^{i i}})}^{2}] d τ + \sum_{i = 1}^{p} \frac{2 τ}{σ^{i i}} d (\frac{1}{σ^{i i}}) = \sum_{i = 1}^{p} [\frac{1}{σ^{i i}} (d σ_{i i} - d h_{i}) + (σ_{i i} - h_{i}) d (\frac{1}{σ^{i i}})] .

Substituting Eq. (17) into the above expression, we obtain

[\sum_{i = 1}^{p} {(\frac{1}{σ^{i i}})}^{2}] d τ = \sum_{x, y = 1}^{p} [\sum_{i = 1}^{p} (σ_{i i} - h_{i} - \frac{2 τ}{σ^{i i}}) \frac{σ^{i x} σ^{i y}}{{(σ^{i i})}^{2}}] d σ_{x y} + \sum_{i = 1}^{p} \frac{1}{σ^{i i}} (d σ_{i i} - 2 \sum_{t = 1}^{k} λ_{i t} d λ_{i t}) .

These expressions can be written compactly as

d τ = μ^{⊤} d \vec{Σ} + η^{⊤} d \vec{Λ},

(18)

where

μ

and

η

are vectors of dimensions

p^{2}

and

p k

, respectively. The coefficient of

d σ_{x y}

is given by

μ_{x y} = \frac{\frac{δ_{x y}}{σ^{x x}} + \sum_{i = 1}^{p} (σ_{i i} - h_{i} - \frac{2 τ}{σ^{i i}}) \frac{σ^{i x} σ^{i y}}{{(σ^{i i})}^{2}}}{\sum_{i = 1}^{p} {(\frac{1}{σ^{i i}})}^{2}}

and the coefficient of

d λ_{i r}

is

η_{i r} = - \frac{2 λ_{i r}}{σ^{i i} \sum_{z = 1}^{p} {(\frac{1}{σ^{z z}})}^{2}} .

Next, let

θ_{r}

denote the r-th largest eigenvalue of

Σ - τ Δ

, and let

λ_{r}

be the associated eigenvector with length

θ_{r}^{1 / 2}

. Then,

λ_{r}^{⊤} λ_{r} = θ_{r},

and

(Σ - τ Δ) λ_{r} = θ_{r} λ_{r} = λ_{r}^{⊤} λ_{r} λ_{r} .

For any fixed

1 \leq i \leq p

, the i-th component of this equation is given by

\sum_{j = 1}^{p} σ_{i j} λ_{j r} - \frac{τ λ_{i r}}{σ^{i i}} = λ_{i r} \sum_{j = 1}^{p} λ_{j r}^{2} .

Differentiating both sides with respect to the underlying parameters yields

\sum_{j = 1}^{p} (σ_{i j} d λ_{j r} + λ_{j r} d σ_{i j}) - \frac{τ}{σ^{i i}} d λ_{i r} - \frac{λ_{i r}}{σ^{i i}} d τ - τ λ_{i r} d (\frac{1}{σ^{i i}}) = 2 λ_{i r} \sum_{j = 1}^{p} λ_{j r} d λ_{j r} + (\sum_{j = 1}^{p} λ_{j r}^{2}) d λ_{i r} .

Applying Eq. (17) and collecting terms involving

d Σ, d Γ

, and

d τ

, we obtain

\sum_{j = 1}^{p} λ_{j r} d σ_{i j} - τ λ_{i r} \sum_{x, y = 1}^{p} \frac{σ^{i x} σ^{i y}}{{(σ^{i i})}^{2}} d σ_{x y} = \sum_{j \neq i} (2 λ_{i r} λ_{j r} - σ_{i j}) d λ_{j r} + (2 λ_{i r}^{2} - σ_{i i} + \frac{τ}{σ^{i i}} + \sum_{j = 1}^{p} λ_{j r}^{2}) d λ_{i r} + \frac{λ_{i r}}{σ^{i i}} d τ .

The matrix representation of the above system of equations can be written as

L d \vec{Σ} = P d \vec{Λ} + π d τ

(19)

where L, P, and

π

are appropriately defined coefficient matrices. In particular, the coefficient of

d σ_{x y}

in the

(i, r)

-th equation is given by

L_{i r, x y} = δ_{i x} λ_{y r} - \frac{τ λ_{i r} σ^{i x} σ^{i y}}{{(σ^{i i})}^{2}} .

The coefficient of

d λ_{j t}

is

P_{i r, j t} = \{\begin{matrix} 2 λ_{i r}^{2} - σ_{i i} + \frac{τ}{σ^{i i}} + \sum_{z = 1}^{p} λ_{z r}^{2}, & if j = i and t = r, \\ 2 λ_{i r} λ_{j r} - σ_{i j}, & if t = r but j \neq i, \\ 0, & otherwise, \end{matrix}

and the coefficient of

d τ

is

π_{i r} = \frac{λ_{i r}}{σ^{i i}},

for any

1 \leq i, j, x, y \leq p

and

1 \leq r, t \leq k

. Note that

π

is a vector of dimension

p k

.

Combining Eqs. (18) and (19) yields

d \vec{Λ} = {(P + π η^{⊤})}^{- 1} (L - π μ^{⊤}) d \vec{Σ} .

Substituting this expression back into Eq. (18) gives the differential of

τ

in terms of

d \vec{Σ}

:

d τ = [μ^{⊤} + η^{⊤} {(P + π η^{⊤})}^{- 1} (L - π μ^{⊤})] d \vec{Σ} .

(20)

6. Standard Error of Uniqueness Estimator $\hat{ψ}$

Assume that

d \vec{Λ} = M d \vec{Σ}

for some

p k \times p^{2}

matrix M. To derive the standard errors of the uniqueness estimator

\hat{ψ}

, we begin with the generic relationship from Eq. (5):

ψ_{i} = σ_{i i} - \sum_{t = 1}^{k} λ_{i t}^{2} .

Differentiating both sides yields

d ψ_{i} = d σ_{i i} - 2 \sum_{t = 1}^{k} λ_{i t} d λ_{i t}, i = 1, 2, \dots, p .

(21)

Eq. (21) can be written more compactly in matrix form as

d ψ = T d \vec{Σ} + Z d \vec{Λ},

(22)

where the coefficient of

d σ_{x y}

in the i-th equation of Eq. (22) is given by

T_{i, x y} = δ_{i x} δ_{i y}

and the coefficient of

d λ_{j t}

in the same equation is

Z_{i, j t} = - 2 δ_{i j} λ_{j t},

for all

1 \leq i, j, x, y \leq p

and

1 \leq t \leq k

. Consequently, we obtain

d ψ = (T + Z M) d \vec{Σ} .

When

Σ

is a correlation matrix,

d σ_{i i} = 0

, and the expression simplifies further to

d ψ = Z M d \vec{Σ}

.

In image factor analysis, the relationship

{\hat{ψ}}_{i} = {\hat{σ}}_{i i} - \sum_{t = 1}^{k} {\hat{λ}}_{i t}^{2}

may no longer hold. Instead, we rewrite Eq. (20) as

d τ = ϕ^{⊤} d \vec{Σ}

for some vector

ϕ

. Using the identity

ψ_{i} = \frac{τ}{σ^{i i}}

and applying Eq. (17), we obtain

d ψ_{i} = \frac{d τ}{σ^{i i}} + τ d (\frac{1}{σ^{i i}}) = \frac{1}{σ^{i i}} \sum_{x, y = 1}^{p} ϕ_{x y} d σ_{x y} + τ \sum_{x, y = 1}^{p} \frac{σ^{i x} σ^{i y}}{{(σ^{i i})}^{2}} d σ_{x y},

(23)

for all

1 \leq i \leq p

. In matrix form, this expression can be written compactly as

d ψ = Q d \vec{Σ},

where the coefficient of

d σ_{x y}

in the i-th equation is given by

Q_{i, x y} = \frac{\partial ψ_{i}}{\partial σ_{x y}} = \frac{σ^{i i} ϕ_{x y} + τ σ^{i x} σ^{i y}}{{(σ^{i i})}^{2}}, \forall 1 \leq i, x, y \leq p .

7. An Empirical and Simulation Study

In this section, we apply the least-squares factor analysis method to a sample correlation matrix consisting of nine variables measured on 211 subjects. The data were originally reported by Lawley and Maxwell [1971, p.43] [15], and the sample is assumed to be drawn from an approximately multivariate normal population. For simplicity, we set the number of common factors to

k = 2

. This choice is supported by both Horn [1965] [8] ’s parallel analysis and Velicer [1976] [20] ’s minimum average partial (MAP) test. Estimates of the factor loading matrix

\hat{Λ}

and the uniqueness vector

\hat{ψ}

are obtained using the iterative procedure described by Harman [1976] [7].

To compute the asymptotic covariances of these estimators, we first apply Girshick [1939] [6]’s formula (3.23) to derive the covariances of the sample correlation coefficients. We then use the results developed in Section 3 and Section 6 to compute the asymptotic covariance matrix of

\hat{Λ}

, as well as the standard errors of

\hat{ψ}

. To illustrate the effect of factor rotation, we orthogonally rotate

\hat{Λ}

using the varimax criterion and apply the method of Archer and Jennrich [1973] [11] to obtain the standard errors of the rotated factor loadings. Table 1 reports the parameter estimates for the two-factor least-squares model; standard errors are reported in parentheses.

To verify the theoretical derivation of the asymptotic covariance,

acov (\hat{ψ})

, presented in Section 6, we conduct a simulation study. The sample correlation matrix reported above is treated as the population correlation matrix

Σ

. Using this matrix, we generate a large number m of random correlation matrices from a Wishart distribution with parameter

(Σ, 211)

. For each simulated correlation matrix, the uniqueness parameters are estimated using the least-squares method with

k = 2

.

Based on the resulting collection of simulated estimates of

ψ

, we compute empirical standard errors for the uniqueness estimates. For

m = 50, 100, 500, 1000, 2000

, the corresponding empirical standard errors are reported in Table 2. The final column of the table presents the theoretical asymptotic standard errors computed using the formulas developed in Section 6. The results demonstrate clear convergence of the empirical standard errors toward their theoretical counterparts, although the convergence occurs at a relatively slow rate.

8. Discussions

Recent years have seen renewed interest in formal statistical inference for factor-analytic models. This resurgence is driven by a range of practical considerations, including applications involving non-normal data, concerns about robustness, and the growing complexity of covariance structures encountered in empirical work. Broadly speaking, the contemporary literature can be grouped into three main strands. The first strand emphasizes likelihood-based approaches to factor analysis and covariance structure modeling, building on classical frameworks and their subsequent extensions (e.g., Lawley [1967] [14]; Jennrich and Thayer [1973] [11]; Raykov et al. [2026] [18]). A second line of research focuses on high-dimensional and dynamic factor models, in which both the cross-sectional and temporal dimensions grow, posing new challenges for estimation and inference (e.g., Bai [2003] [3]; Fan et al. [2021] [5]). The third strand concentrates on software-oriented implementations of standard error estimation for rotated factor solutions, typically relying on numerical differentiation or sandwich-type approximations (e.g., Archer and Jennrich [1973] [2]; Yung and Hayashi [2001] [21]).

In contrast, explicit asymptotic covariance formulas for widely used non-maximum-likelihood extraction method—such as least-squares, principal, alpha, and image factor analysis—remain relatively scarce, despite their continued prevalence in empirical research and standard statistical software (e.g., Harman [1976] [7]; SAS [2015] [19]). The results developed in this paper help bridge this gap by providing closed-form, analytically tractable asymptotic covariance expressions for several classical no-ML factor solutions. By formulating these results within a unified implicit-differentiation framework, the proposed approach accommodates non-normal data and can be readily combined with existing delta-method results for factor rotation (e.g., Archer and Jennrich [1973] [2]; Jennrich [1973] [10]; Yung and Hayashi [2001] [21]). From a practical perspective, this framework enables transparent computation of standard errors without reliance on numerical differentiation or resampling. From a methodological standpoint, it complements recent advances in robust and high-dimensional factor modeling by providing a rigorous inferential foundation for classical extraction methods that remain central to applied factor analysis.

In the preceding sections, the derived asymptotic covariances of

\hat{Λ}

and

\hat{ψ}

are conditional on the assumed number of factors. Consequently, abnormal standard errors may reflect model misspecification or systematic errors—such as an incorrect choice of factor dimensionality—rather than genuine sampling variability. In particular, such distortions may arise from either over-factoring (when k is chosen too large) or under-factoring (when k is chosen too small).

For expositional simplicity, we have not explicitly imposed the symmetry condition

σ_{i j} = σ_{j i}

. In practical implementations, however, exploiting this property is important not only for reducing memory requirements but also for improving computational efficiency, especially when working with large-scale datasets.

The matrix

Σ

need not represent the variance-covariance matrix of a single group of observed data series; it may also correspond to the variance-covariance matrix of residuals in a system of equations. As an illustration, consider a structural vector autoregressive (SVAR) model with p time series:

Y_{t} = c + \sum_{s = 1}^{ζ} A_{s} Y_{t - s} + Λ F_{t} + ϵ_{t},

where

Y_{t}

is a p-dimensional vector of observed time series at time t,

ζ

denotes the lag length,

F_{t}

represents k common structural shocks, and

ϵ_{t}

captures p idiosyncratic shocks. The composite disturbance

Λ F_{t} + ϵ_{t}

therefore exhibits a two-component error structure, while the common factors F in Eq. (1) are latent and may possess a nonzero mean.

In much of the SVAR literature, idiosyncratic shocks are frequently ignored, effectively imposing the restriction

k = p

. Allowing for idiosyncratic shocks, however, substantially reduces both the number of structural shocks and the number of identification conditions required. From a computational perspective, incorporating an error-component structure—such as a common-factor model—into SVAR systems is straightforward. The formulas developed in this paper offer practical guidance for implementing such extensions and for assessing the precision of estimated coefficients associated with structural shocks.

This form of dimension reduction is particularly valuable in environments characterized by highly interconnected data. In macroeconomic modeling, structural shocks are typically assumed to be independent. Although an economy may generate a large number of interrelated observable time series, its dynamics are generally driven by only a small number of underlying shocks, such as demand, supply, and technological disturbances.

Finally, an important advantage of the derived formulas is their applicability under non-normality without requiring modification. For example, when the data are multivariate binary, the asymptotic covariance

acov (\hat{Λ})

remains well defined. More generally, for categorical data—which are common in practical factor-analysis applications—the computation of

acov (\hat{Λ})

is unlikely to pose significant difficulties.

Author Contributions

All authors have equally contributed to conceptualization, methodology, and validation. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

$Σ$	Unobservable population variance-covariance matrix of the random vector Y
$\hat{Σ}$	Sample variance-covariance matrix of the random vector Y
$Λ$	Loading matrix in the variance-covariance structure model $Σ = Λ Λ^{⊤} + Ψ$
$Ψ$	Uniqueness, a diagonal matrix
$\hat{Λ}$ , $\hat{Ψ}$	Estimates of the loading matrix and uniqueness, respectively
p and k	Dimensions of $Λ$ ; p is also the dimension for Y and, consequently, for $Σ$ and $Ψ$
$δ_{i j}$	The Kronecker delta, that equals 1 when $i = j$ and 0 otherwise
$\vec{Λ}$	The vectorization of the matrix $Λ$ , ordered by columns. Similar for $\vec{Σ}$
$ψ$	The diagonal vector of $Ψ$
$θ_{r}$	The r-th largest eigenvalue of a symmetric matrix
$σ_{x y}$	The element at the x-th row and y-th column of $Σ$
$σ^{x y}$	The element at the x-th row and y-th column of $Σ^{- 1}$ , the inverse of $Σ$
$diag (M)$	The vector of diagonal elements of the square matrix M
$Diag (M)$	The diagonal matrix with diagonal elements in M
ACOV	Asymptotic covariance
SVAR	Structural Vector AutoRegressive model

References

Anderson, T.W. Asymptotic theory for principal components analysis. Annals of Mathematical Statistics 1963, 34, 122–148. [Google Scholar] [CrossRef]
Archer, C.O.; Jennrich, R.I. Standard errors for orthogonally rotated factor loadings. Psychometrika 1973, 38, 581–592. [Google Scholar] [CrossRef]
Bai, J. Inferential theory for factor models of large dimensions. Econometrica 2003, 71(1), 135–171. [Google Scholar] [CrossRef]
de Leeuw, J. Models and methods for the analysis of correlation coefficients. Journal of Econometrics 1983, 22, 113–137. [Google Scholar] [CrossRef]
Fan, J.; Wang, K.; Zhong, Y.; Zhu, Z. Robust high-dimensional factor models with applications to statistical machine learning. Statistical Science 2021, 36(2), 303–327. [Google Scholar] [CrossRef] [PubMed]
Girshick, M.A. On the sampling theory of roots of determinantal equations. Annals of Mathematical Statistics 1939, 10, 203–224. [Google Scholar] [CrossRef]
Harman, H.H. Modern Factor Analysis (Third Edition); University of Chicago Press: Chicago, 1976. [Google Scholar]
Horn, J. A rationale and test for the number of factors in factor analysis. Psychometrika 1965, 30, 179–185. [Google Scholar] [CrossRef]
Hsu, P.L. The limiting distribution of functions of sample means and application to testing hypotheses. In Proceedings of the First Berkeley Symposium in Mathematical Statistics and Probability, 1949; pp. 359–402. [Google Scholar]
Jennrich, R.I. Standard errors for obliquely rotated factor loadings. Psychometrika 1973, 38, 593–604. [Google Scholar] [CrossRef]
Jennrich, R.I.; Thayer, D.T. A note on Lawley’s formulas for standard errors in maximum likelihood factor analysis. Psychometrika 1973, 38, 571–580. [Google Scholar] [CrossRef]
Jöreskog, K.G. Efficient estimation in image factor analysis. Psychometrika 1969, 34, 51–75. [Google Scholar] [CrossRef]
Kaiser, H.F.; Caffrey, J. Alpha factor analysis. Psychometrika 1965, 30, 1–14. [Google Scholar] [CrossRef] [PubMed]
Lawley, D.N. Some new results in maximum likelihood factor analysis. Proceedings of the Royal Society of Edinburgh 1967, 67A, 256–264. [Google Scholar] [CrossRef]
Lawley, D.N.; Maxwell, A.E. Factor Analysis as a Statistical Method; American Elsevier: New York, 1971. [Google Scholar]
Lin, T.I.; Chen, I.A.; Wang, W.L. A robust factor analysis model based on the canonical fundamental skew-t distribution. Stat Papers 2023, 64, 367–393. [Google Scholar] [CrossRef]
Rao, C.R. Linear Statistical Inference And Its Applications (Second Edition); Wiley: New York, 2009. [Google Scholar]
Raykov, T.; Penev, S.; Zhang, B. Fitting covariance structure models with arbitrary observed distributions: strong convergence of the asymptotically distribution-free estimator. Structural Equation Modeling: A Multidisciplinary Journal 2026, 33:1, 32–38. [Google Scholar] [CrossRef]
SAS. SAS/STAT® 14.1 User’s Guide The FACTOR Procedure.; SAS Institute Inc: Cary, NC, 2015. [Google Scholar]
Velicer, W.F. Determining the number of components from the matrix of partial correlations. Psychometrika 1976, 41, 321–327. [Google Scholar] [CrossRef]
Yung, Y.F.; Hayashi, K. A computational efficient method for obtaining standard error estimates for the promax and related solutions. British Journal of Mathematical and Statistical Psychology 2001, 54, 125–138. [Google Scholar] [CrossRef] [PubMed]

Table 1. Estimates and Standard Errors for Two-Factor Least-Squares Factor Analysis.

	Unrotated Factors		Rotated Factors
Variable	I	II	I	II	Uniqueness
$X_{1}$	.6639(.0397)	.3285(.0502)	.6745(.0453)	.3063(.0538)	.4512(.0557)
$X_{2}$	.6879(.0381)	.2388(.0545)	.6202(.0487)	.3815(.0555)	.4698(.0539)
$X_{3}$	.4956(.0536)	.2831(.0657)	.5328(.0583)	.2047(.0658)	.6743(.0599)
$X_{4}$	.8470(.0249)	-.3037(.0372)	.3007(.0381)	.8481(.0295)	.1904(.0412)
$X_{5}$	.7035(.0392)	-.3179(.0637)	.1990(.0493)	.7459(.0399)	.4040(.0551)
$X_{6}$	.8037(.0297)	-.3581(.0659)	.2312(.0404)	.8490(.0353)	.2258(.0526)
$X_{7}$	.6686(.0440)	.3889(.0654)	.7242(.0412)	.2717(.0513)	.4018(.0546)
$X_{8}$	.4236(.0609)	.2552(.0813)	.4656(.0639)	.1666(.0695)	.7555(.0578)
$X_{9}$	.7718(.0347)	.4398(.0598)	.8289(.0328)	.3194(.0444)	.2109(.0443)

Table 2. Simulation of Standard Errors for Uniqueness Estimates.

	Empirical S.E. of Uniqueness for Simulated Correlations					Theoretical
Uniqueness	$m = 50$	100	500	1000	2000	S.E.
$ψ_{1}$	.0499849	.0511051	.0549498	.0560314	.0556181	.0556623
$ψ_{2}$	.0513352	.0520701	.0545346	.0555878	.0549598	.0538818
$ψ_{3}$	.0666649	.0660852	.0599577	.0603062	.0598762	.0598473
$ψ_{4}$	.0405467	.0433619	.0432832	.0428048	.0415737	.0411420
$ψ_{5}$	.0481342	.0538724	.0524052	.0530417	.0544235	.0550669
$ψ_{6}$	.0501591	.0543790	.0541692	.0550827	.0546638	.0526150
$ψ_{7}$	.0524787	.0565299	.0547573	.0553479	.0546232	.0546297
$ψ_{8}$	.0507511	.0562004	.0576808	.0582864	.0580424	.0578393
$ψ_{9}$	.0394781	.0409714	.0430909	.0448280	.0438135	.0443326

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

On Asymptotic Covariance of Unrotated Factor Solutions

Abstract

Keywords:

Subject:

1. Introduction

2. Iterative Principal Component and Principal Factor Analysis

3. Least-Squares Factor Analysis

4. Alpha Factor Analysis

5. Image Factor Analysis

6. Standard Error of Uniqueness Estimator $\hat{ψ}$

7. An Empirical and Simulation Study

8. Discussions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

References

MDPI Initiatives

Important Links

Subscribe

On Asymptotic Covariance of Unrotated Factor Solutions

Abstract

Keywords:

Subject:

1. Introduction

2. Iterative Principal Component and Principal Factor Analysis

3. Least-Squares Factor Analysis

4. Alpha Factor Analysis

5. Image Factor Analysis

6. Standard Error of Uniqueness Estimator ψ ^

7. An Empirical and Simulation Study

8. Discussions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

References

MDPI Initiatives

Important Links

Subscribe

6. Standard Error of Uniqueness Estimator $\hat{ψ}$