Edgeworth Expansions When the Parameter Dimension Increases with Sample Size

Christopher Stroude Withers

doi:10.20944/preprints202511.1584.v1

Submitted: 20 November 2025

Posted: 20 November 2025


Abstract

Suppose that we have a statistical model with $q=q_n$ unknown parameters, $w=(w_1,\dots,w_q)'$, estimated by $\hat{w}$, based on a sample of size $n$. Suppose also that we have Edgeworth expansions for the density and distribution of $X_n=n^{1/2} (\hat{w}-w)$. We ask the question: How fast can $q=q_n$ increase with $n$ for the three main Edgeworth expansions to remain valid? We show that it is sufficient that $q_n=o(n^{1/6})$, if the estimate $\hat{w}$ is a standard estimate. That is, $E\ \hat{w}\rightarrow w$ as $n\rightarrow\infty$, and for $r\geq 1$, its $r$th order cumulants have magnitude $n^{1-r}$ and can be expanded in powers of $n^{-1}$. This very large class of estimates has a huge range of potential applications. When $\hat{w}=t(\bar{X})$ for $t:R^q\rightarrow R^p$ a smooth function of a sample mean $\bar{X}$ from a distribution on $R^q$, and $p_nq_n=pq\rightarrow\infty$ as $n\rightarrow\infty$, I show that the Edgeworth expansions for $\hat{w}$ remain valid if $q_n^8 p_n^6=o(n)$. For example, this holds for fixed $p=p_n$ if $q_n=o(n^{1/8})$. I also give a method that greatly reduces the number of terms needed for the 2nd and 3rd order terms in the Edgeworth expansions, that is, for the 1st and 2nd order corrections to the Central Limit Theorems (CLTs).

Keywords: standard estimate; increasing number of unknown parameters; extended Edgeworth expansions

Subject: Computer Science and Mathematics - Probability and Statistics

1. Introduction and Summary

Suppose that we have an estimate

\hat{w}

of an unknown parameter

w \in R^{q}

of a statistical model, and that

\begin{matrix} as n \to \infty, X_{n} = n^{1 / 2} (\hat{w} - w) \overset{L}{\to} X \sim N_{q} (0, V), \end{matrix}

(1)

the multivariate normal on

R^{q}

, with density and distribution

\begin{matrix} ϕ_{V} (x) = {(2 π)}^{- q / 2} {(\det V)}^{- 1 / 2} \exp (- x^{'} V^{- 1} x / 2), \quad Φ_{V} (x) = \int_{- \infty}^{x} ϕ_{V} (x) d x . \end{matrix}

(2)

Here we assume that

\hat{w}

is a standard estimate. That is,

E \hat{w} \to w

as

n \to \infty

, and for

r \geq 1

, its rth order cumulants have magnitude

n^{1 - r}

and can be expanded in powers of

n^{- 1}

. The class of standard estimates includes smooth functions of sample means or empirical distributions, based on one or more random samples, or on samples from a stationary time series. So it has a huge range of potential applications.

For

\hat{w}

non-lattice, the density and distribution of

X_{n}

of (1) can be expanded in powers of

n^{- 1 / 2}

about those of X of (1). These are the Edgeworth expansions. To be self-contained, Section 2 summarises these expansions.

How fast can

q = q_{n}

increase with n, for these expansions to hold? Section 3 shows that they hold if

q_{n} = o (n^{1 / 6})

.

Section 4 gives a theorem that reduces the number of terms needed for 2nd and 3rd order Edgeworth expansions. For example, if

q = 3

, it reduces the number of terms needed for 2nd and 3rd order Edgeworth expansions by 57% and 94%. This percentage reduction increases with q. If

q = 4

or 5, it reduces the number of terms by 65% or 69% for the 2nd order Edgeworth expansions, and by 97% or 98% for the 3rd order Edgeworth expansions.

Section 5 considers the case when

\hat{w} = t (\bar{X})

for

t : R^{q} \to R^{p}

a smooth function of a sample mean

\bar{X}

from a distribution on

R^{q}

with finite moments. When

p_{n} q_{n} = p q \to \infty

as

n \to \infty

, it shows the Edgeworth expansions for

\hat{w}

remain valid if

q_{n}^{8} p_{n}^{6} = o (n)

. For example, this holds for fixed

p = p_{n}

if

q_{n} = o (n^{1 / 8})

. These may be the first CLTs or Edgeworth expansions in which two parameters, rather than one, are allowed to increase to ∞ with n.

Earlier work done when dimension

q = q_{n}

increases with sample size n has been mainly for sample means, including some CLTs and a 2nd order Edgeworth-type expansion. [10,11] showed asymptotic normality for M-estimators with

q_{n}

regression parameters when

q_{n}^{2} / n

is large. [3] gave a CLT for M-estimates in a linear regression model of dimension

q_{n}

when

q_{n} / n \to a \in (0, 1) .

Remarkably, (8) of [2] gave a CLT for

\hat{w}

the sample mean of bounded random vectors that holds for

q_{n} = O (exp (a n^{c}))

if

c < 1 / 7

. (Substitute into their condition to confirm.) Equation (1.4) of [7] appears to show that

q_{n} = O (n^{c})

with

c < 1 / 3

suffices for a CLT for the sample mean when

X_{1}, \dots, X_{n}

are log-concave, quoting [4]. It will be interesting to see if this bound can be extended to a broader class of estimates than sample means, and if the log-concave condition can be removed.

Section 2.1 of the very recent paper [7] considers the 2nd order Edgeworth expansion for the distribution of a standardized sample mean. His Theorem 2.1 gives conditions for this which hold when

q_{n} = O (n^{c})

for any c, or even when

q_{n} = O (exp (b n^{c}))

and

c < 1 / 3 .

The bounds (1.4) and (2.4) of [7] give a 2nd order Edgeworth expansion for a sample mean, that allows for

q_{n}

of magnitude

n^{c}

if

c < 1 / 3

, a remarkable result.

[8] considered sampling from vectors, and investigated the simultaneous estimation of the marginal distributions for large

q_{n}

. [5] considered the asymptotic distributions of the canonical correlations between

X_{1} \in R^{p}

and

X_{2} \in R^{q}

with

p \leq q

. They derived asymptotic distributions of the canonical correlations when p is fixed,

q = q_{n} \to \infty

, and

q_{n} / n \to a \in [0, 1)

, as

n \to \infty

. They assumed that

X_{1}

and

X_{2}

have a joint normal distribution. [9] gave a CLT for the sample mean for large

q_{n}

. [6] gave a number of results for large dimensions. [1] and [7] considered the validity and accuracy of Edgeworth expansions of

\max_{i = 1}^{q} X_{n i}

for large q when

X_{n}

is a standardized sample mean.

2. Multivariate Edgeworth Expansions

Suppose that

\hat{w}

is a standard estimate of

w \in R^{q}

with respect to n. (n is typically the sample size.) That is,

E \hat{w} \to w

as

n \to \infty

, where we use

E

for expected value, and for

r \geq 1

and

1 \leq i_{1}, \dots, i_{r} \leq q,

the rth order cumulants of

\hat{w} = {({\hat{w}}_{1}, \dots, {\hat{w}}_{q})}^{'}

can be expanded as

\begin{matrix} {\bar{k}}^{1 - r} = k^{i_{1} \dots i_{r}} = κ ({\hat{w}}_{i_{1}}, \dots, {\hat{w}}_{i_{r}}) \approx \sum_{d = r - 1}^{\infty} n^{- d} {\bar{k}}_{d}^{1 - r}, where {\bar{k}}_{d}^{1 - r} = k_{d}^{i_{1} \dots i_{r}}, \end{matrix}

(3)

≈ indicates an asymptotic expansion, and the cumulant coefficients

{\bar{k}}_{d}^{1 - r}

may depend on n but are bounded as

n \to \infty

. So the bar replaces each

i_{k}

by k. For example,

{\bar{k}}_{0}^{1} = {\bar{w}}_{1} = w_{i_{1}}

and

{\bar{k}}_{1}^{12} = k_{1}^{i_{1} i_{2}} .

I reserve

i_{k}

for this bar notation to avoid double subscripts. So, (1) holds with

V = ({\bar{k}}_{1}^{12}), q \times q

. V may depend on n, but I assume that

\det V

is bounded away from 0.

\begin{matrix} Let {\bar{P}}_{r}^{1 - k} = P_{r}^{i_{1} \dots i_{k}} be the kth Edgeworth coefficient of order r for \hat{w} . \end{matrix}

These are Bell polynomials in the cumulant coefficients,

{\bar{k}}_{d}^{1 - r}

of (3), defined and given in [14] for

1 \leq r \leq 3

. Their importance lies in their central role in the Edgeworth expansions of

X_{n}

of (1): see (8) and (14) below.

The

{\bar{P}}_{r}^{1 - k}

needed for

r = 1, 2, 3

are given in (19)–(21) of [14]:

\begin{matrix} {\bar{P}}_{1}^{1} = {\bar{k}}_{1}^{1}, {\bar{P}}_{1}^{1 - 3} = {\bar{k}}_{2}^{1 - 3} / 6, {\bar{P}}_{2}^{12} = {\bar{k}}_{2}^{12} / 2 + {\bar{k}}_{1}^{1} {\bar{k}}_{1}^{2} / 2, \\ {\bar{P}}_{2}^{1 - 4} = {\bar{k}}_{3}^{1 - 4} / 4! + S {\bar{k}}_{1}^{4} {\bar{k}}_{2}^{1 - 3} / 6, {\bar{P}}_{2}^{1 - 6} = S {\bar{k}}_{2}^{1 - 3} {\bar{k}}_{2}^{4 - 6} / 72, \\ {\bar{P}}_{3}^{1} = {\bar{k}}_{2}^{1}, {\bar{P}}_{3}^{1 - 3} = {\bar{k}}_{3}^{1 - 3} / 6 + S {\bar{k}}_{2}^{12} {\bar{k}}_{1}^{3} / 2 + {\bar{k}}_{1}^{1} {\bar{k}}_{1}^{2} {\bar{k}}_{1}^{3} / 6, \\ {\bar{P}}_{3}^{1 - 5} = {\bar{k}}_{4}^{1 - 5} / 5! + S_{1} / 24 + S_{2} / 12 + S_{3} / 12 \\ where S_{1} = S {\bar{k}}_{3}^{1 - 4} {\bar{k}}_{1}^{5}, S_{2} = S {\bar{k}}_{2}^{12} {\bar{k}}_{2}^{345}, S_{3} = S {\bar{k}}_{1}^{1} {\bar{k}}_{1}^{2} {\bar{k}}_{2}^{345}, \\ {\bar{P}}_{3}^{1 - 7} = S {\bar{k}}_{2}^{123} {\bar{k}}_{3}^{4 - 7} / 144 + S {\bar{k}}_{2}^{123} {\bar{k}}_{2}^{4 - 6} {\bar{k}}_{1}^{7} / 72, \end{matrix}

(4)

\begin{matrix} {\bar{P}}_{3}^{1 - 9} = S {\bar{k}}_{2}^{1 - 3} {\bar{k}}_{2}^{4 - 6} {\bar{k}}_{2}^{7 - 9} / 6^{4}, \end{matrix}

(5)

where

S

is the operator that symmetrizes

{\bar{f}}^{1 - k}

over

i_{1}, \dots, i_{k}

.

Set

P (A) =

Probability that A is true. By [15], or [14], for

\hat{w}

non-lattice, the density and distribution of

X_{n}

can be expanded as

\begin{matrix} p_{X_{n}} (x) \approx \sum_{r = 0}^{\infty} n^{- r / 2} p_{r} (x), P (X_{n} \leq x) \approx \sum_{r = 0}^{\infty} n^{- r / 2} P_{r} (x), x \in R^{q}, \\ where p_{0} (x) = ϕ_{V} (x), P_{0} (x) = Φ_{V} (x), of (2), and for r \geq 1, \end{matrix}

(6)

\begin{matrix} p_{r} (x) / ϕ_{V} (x) = \sum_{k = 1}^{3 r} [{\tilde{p}}_{r k} : k - r \text{ even}] = {\tilde{p}}_{r} (x) \text{ say}, \quad {\tilde{p}}_{r k} = {\bar{P}}_{r}^{1 - k} {\bar{H}}^{1 - k}, \end{matrix}

(7)

\begin{matrix} P_{r} (x) = \sum_{k = 1}^{3 r} [P_{r k} (x) : k - r even], P_{r k} (x) = {\bar{P}}_{r}^{1 - k} {\bar{H}}_{*}^{1 - k}, \end{matrix}

(8)

\begin{matrix} and {\bar{H}}^{1 - k} = {\bar{H}}^{1 - k} (x, V) = H^{i_{1} \dots i_{k}} = ϕ_{V} {(x)}^{- 1} {\bar{O}}^{1 - k} ϕ_{V} (x), \end{matrix}

\begin{matrix} {\bar{H}}_{*}^{1 - k} = {\bar{H}}_{*}^{1 - k} (x, V) = {\bar{O}}^{1 - k} Φ_{V} (x) = \int_{- \infty}^{x} {\bar{H}}^{1 - k} ϕ_{V} (x) d x, \end{matrix}

(9)

\begin{matrix} {\bar{O}}^{1 - k} = (- {\bar{\partial}}_{1}) \dots (- {\bar{\partial}}_{k}), {\bar{\partial}}_{k} = \partial_{i_{k}}, \partial_{i} = \partial / \partial x_{i} . \end{matrix}

{\bar{H}}^{1 - k} (x, V) = {\bar{H}}^{1 - k}

and

{\bar{H}}_{*}^{1 - k} (x, V) = {\bar{H}}_{*}^{1 - k}

are the multivariate Hermite polynomial, and the integrated multivariate Hermite polynomial. By [12],

\begin{matrix} {\bar{H}}^{1 - k} = E ({\bar{y}}_{1} + \sqrt{- 1} {\bar{Y}}_{1}) \dots ({\bar{y}}_{k} + \sqrt{- 1} {\bar{Y}}_{k}) \\ where y = V^{- 1} x, Y = V^{- 1} X \sim N_{q} (0, V^{- 1}), {\bar{y}}_{k} = y_{i_{k}}, and {\bar{Y}}_{k} = Y_{i_{k}} . \end{matrix}

I use the tensor summation convention: repetition of

i_{1}, \dots, i_{k}

in

{\tilde{p}}_{r k}

of (7) and

P_{r k} (x)

of (8) implies their implicit summation over their range,

1, \dots, q

. [14] gave

{\bar{H}}^{1 - k}

explicitly for

k \leq 6

, and for

k \leq 9

when

q = 2

.

\begin{matrix} \text{Set } {\bar{μ}}^{1 - 2 k} = E {\bar{Y}}_{1} \dots {\bar{Y}}_{2 k} = \sum^{1 \cdot 3 \cdots (2 k - 1)} {\bar{V}}^{12} \dots {\bar{V}}^{2 k - 1, 2 k}, \end{matrix}

where

\sum^{N} {\bar{f}}^{1 - k}

sums

{\bar{f}}^{1 - k}

over all N permutations of

i_{1}, \dots, i_{k}

giving distinct values. For example,

\begin{matrix} {\bar{μ}}^{1 - 4} = V^{12} V^{34} + V^{13} V^{24} + V^{14} V^{23}, {\bar{H}}^{1} = {\bar{y}}_{1}, {\bar{H}}^{12} = {\bar{y}}_{1} {\bar{y}}_{2} - {\bar{V}}^{12}, \\ {\bar{H}}^{1 - 3} = {\bar{y}}_{1} {\bar{y}}_{2} {\bar{y}}_{3} - \sum^{3} {\bar{y}}_{1} {\bar{V}}^{23} = {\bar{y}}_{1} {\bar{y}}_{2} {\bar{y}}_{3} - {\bar{y}}_{1} {\bar{V}}^{23} - {\bar{y}}_{2} {\bar{V}}^{13} - {\bar{y}}_{3} {\bar{V}}^{12}, \\ {\bar{H}}_{*}^{1} = {\bar{J}}^{1}, {\bar{H}}_{*}^{12} = {\bar{J}}^{12} - {\bar{V}}^{12} Φ_{V} (x), {\bar{H}}_{*}^{1 - 3} = {\bar{J}}^{123} - \sum^{3} {\bar{J}}^{1} {\bar{V}}^{23}, where \end{matrix}

\begin{matrix} {\bar{J}}^{1 - k} = {\bar{J}}^{1 - k} (x, V) = E {\bar{Y}}_{1} \dots {\bar{Y}}_{k} I (X \leq x) = {\bar{V}}^{1, k + 1} \dots {\bar{V}}^{k, 2 k} {\bar{M}}^{k + 1 - 2 k}, \end{matrix}

(10)

\begin{matrix} \text{and } {\bar{M}}^{a - b} = {\bar{M}}^{a - b} (x, V) = E {\bar{X}}_{a} \dots {\bar{X}}_{b} I (X \leq x) = \int_{- \infty}^{x} {\bar{x}}_{a} \dots {\bar{x}}_{b} ϕ_{V} (x) d x, \end{matrix}

(11)

for

{\bar{x}}_{a} = x_{i_{a}} .

So the repeated

i_{k + 1}, \dots, i_{2 k}

in (10) imply their implicit summation over

1, \dots, q

. As x lies in

R^{q}

,

\int_{- \infty}^{x} d x

in (9) and (11), stands for

\int_{- \infty}^{x_{1}} d x_{1} \dots \int_{- \infty}^{x_{q}} d x_{q} .
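As a numerical sanity check on the low-order Hermite polynomials above, the following Python sketch evaluates them for q = 2 and compares the second-order one with a finite-difference version of the derivative definition. It assumes that the superscripted V with indices i_1, i_2 denotes the corresponding element of the inverse of V, which is what the second moments of Y ∼ N_q(0, V^{-1}) give. The matrix V, the point x, and every function name here are illustrative choices, not taken from [14].

```python
import math

def inv2(V):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = V
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def phi(x, V):
    """Bivariate normal density phi_V(x) of (2) with q = 2."""
    W = inv2(V)
    det = V[0][0] * V[1][1] - V[0][1] * V[1][0]
    quad = sum(x[i] * W[i][j] * x[j] for i in range(2) for j in range(2))
    return math.exp(-quad / 2) / (2 * math.pi * math.sqrt(det))

def hermite(idx, x, V):
    """H^{i_1...i_k}(x, V) for k = 1, 2, 3, from the explicit formulas."""
    W = inv2(V)  # assumption: W[i][j] plays the role of V^{ij}
    y = [sum(W[i][j] * x[j] for j in range(2)) for i in range(2)]
    if len(idx) == 1:
        (i,) = idx
        return y[i]
    if len(idx) == 2:
        i, j = idx
        return y[i] * y[j] - W[i][j]
    if len(idx) == 3:
        i, j, k = idx
        return (y[i] * y[j] * y[k]
                - y[i] * W[j][k] - y[j] * W[i][k] - y[k] * W[i][j])
    raise ValueError("only k <= 3 is implemented in this sketch")

def hermite2_fd(i, j, x, V, h=1e-4):
    """Central-difference estimate of phi^{-1} d_i d_j phi.

    For k = 2 the two minus signs in the operator cancel.
    """
    def shifted(di, dj):
        z = list(x)
        z[i] += di
        z[j] += dj
        return phi(z, V)
    d2 = (shifted(h, h) - shifted(h, -h)
          - shifted(-h, h) + shifted(-h, -h)) / (4 * h * h)
    return d2 / phi(x, V)

if __name__ == "__main__":
    V = [[2.0, 0.5], [0.5, 1.0]]   # hypothetical covariance
    x = [0.3, -0.7]                # hypothetical evaluation point
    for i in range(2):
        for j in range(2):
            print(i, j, hermite((i, j), x, V), hermite2_fd(i, j, x, V))
```

The closed form and the finite-difference value should agree to several decimal places, and the third-order polynomial should be unchanged under any permutation of its indices.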

(6) with the

{\bar{P}}_{r}^{1 - k}

of [14], give the Edgeworth expansions for the density and distribution of

X_{n}

of (1) to

O (n^{- 2})

.

{\tilde{p}}_{r k}

and

P_{r k}

each have

q^{k}

terms, but many are duplicates as

{\bar{P}}_{r}^{1 - k}

is symmetric in

i_{1}, \dots, i_{k}

. This is exploited in Section 4 to greatly reduce the number of terms in (8) and (14) below.

By (7), the density of

X_{n}

relative to its asymptotic value is

\begin{matrix} p_{X_{n}} (x) / ϕ_{V} (x) \approx 1 + \sum_{r = 1}^{\infty} n^{- r / 2} {\tilde{p}}_{r} (x) = 1 + n^{- 1 / 2} {\tilde{p}}_{1} (x) + O (n^{- 1}), \end{matrix}

for

x \in R^{q} .

For measurable

C \subset R^{q}

,

\begin{matrix} P (X_{n} \in C) \approx \sum_{r = 0}^{\infty} n^{- r / 2} P_{r} (C), where P_{0} (C) = Φ_{V} (C), and for r \geq 1, \end{matrix}

(12)

\begin{matrix} P_{r} (C) = E p_{r} (X) I (X \in C) = \int_{C} p_{r} (x) ϕ_{V} (x) d x = \sum_{k = 1}^{3 r} [P_{r k} (C) : k - r even], \end{matrix}

(13)

\begin{matrix} P_{r k} (C) = E {\tilde{p}}_{r k} (X) I (X \in C) = \int_{C} {\tilde{p}}_{r k} (x) ϕ_{V} (x) d x = {\bar{P}}_{r}^{1 - k} {\bar{H}}_{*}^{1 - k} (C), \end{matrix}

(14)

\begin{matrix} and {\bar{H}}_{*}^{1 - k} (C) = E {\bar{H}}^{1 - k} (X, V) I (X \in C) = \int_{C} {\bar{H}}^{1 - k} ϕ_{V} (x) d x . \end{matrix}

This paper focuses on the three Edgeworth expansions, (6) and (12), when

q \to \infty

as

n \to \infty

.

If

- C = C

, then for r odd,

P_{r k} (C) = P_{r} (C) = 0,

so that

\begin{matrix} P (X_{n} \in C) \approx \sum_{r = 0}^{\infty} n^{- r} P_{2 r} (C) = Φ_{V} (C) + n^{- 1} P_{2} (C) + O (n^{- 2}) . \end{matrix}

Examples 3 and 4 of [14] gave

P_{2} (C)

for

C = \{ x : x^{'} V^{- 1} x \leq u \}

, and

C = \{ x : | {(V^{- 1 / 2} x)}_{j} | \leq u_{j}, j = 1, \dots, q \}

.

The main take-away here is that for

x \in R^{q}

,

C \subset R^{q},

{\tilde{p}}_{r} (x), P_{r} (x), P_{r} (C)

of (7), (8) and (12), and

s = 1, 2, \dots

,

\begin{matrix} p_{X_{n}} (x) / ϕ_{V} (x) = \sum_{r = 0}^{s - 1} n^{- r / 2} {\tilde{p}}_{r} (x) + O (n^{- s / 2}), \\ P (X_{n} \leq x) = \sum_{r = 0}^{s - 1} n^{- r / 2} P_{r} (x) + O (n^{- s / 2}), \\ P (X_{n} \in C) = \sum_{r = 0}^{s - 1} n^{- r / 2} P_{r} (C) + O (n^{- s / 2}), \end{matrix}

where, for example, by (4),

\begin{matrix} {\tilde{p}}_{1} (x) = p_{1} (x) / ϕ_{V} (x) = \sum_{k = 1, 3} {\tilde{p}}_{1 k}, {\tilde{p}}_{11} = {\bar{k}}_{1}^{1} {\bar{H}}^{1}, {\tilde{p}}_{13} = {\bar{k}}_{2}^{1 - 3} {\bar{H}}^{1 - 3} / 6, \\ P_{1} (x) = \sum_{k = 1, 3} P_{1 k} (x), P_{11} (x) = {\bar{k}}_{1}^{1} {\bar{H}}_{*}^{1}, P_{13} (x) = {\bar{k}}_{2}^{1 - 3} {\bar{H}}_{*}^{1 - 3} / 6, \\ P_{1} (C) = \sum_{k = 1, 3} P_{1 k} (C), P_{11} (C) = {\bar{k}}_{1}^{1} {\bar{H}}_{*}^{1} (C), P_{13} (C) = {\bar{k}}_{2}^{1 - 3} {\bar{H}}_{*}^{1 - 3} (C) / 6 . \end{matrix}

These asymptotic expansions generally diverge, as normal moments and Hermite polynomials increase very rapidly with their degree.

3. The Case $q = q_{n} \to \infty$ as $n \to \infty$

Theorem 1.

Let

\hat{w}

be a non-lattice estimate of

w \in R^{q}

, satisfying

E \hat{w} \to w

, and (3). Set

X_{n} = n^{1 / 2} (\hat{w} - w)

. Take

s \geq 1

. Suppose that as

n \to \infty,

\begin{matrix} q = q_{n} \to \infty, and ν_{n} = n^{- 1 / 2} q_{n}^{3} \to 0, that is, q_{n} = o (n^{1 / 6}) . \end{matrix}

Then, for

{\tilde{p}}_{r} (x)

of (7),

P_{r} (x)

of (8), and

P_{r} (C)

of (13),

\begin{matrix} p_{X_{n}} (x) / ϕ_{V} (x) = \sum_{r = 0}^{s - 1} n^{- r / 2} {\tilde{p}}_{r} (x) + O (ν_{n}^{s}), \end{matrix}

(15)

\begin{matrix} P (X_{n} \leq x) = \sum_{r = 0}^{s - 1} n^{- r / 2} P_{r} (x) + O (ν_{n}^{s}), \end{matrix}

(16)

\begin{matrix} P (X_{n} \in C) = \sum_{r = 0}^{s - 1} n^{- r / 2} P_{r} (C) + O (ν_{n}^{s}) . \end{matrix}

(17)

PROOF Set

Q = q^{2}

.

{\tilde{p}}_{r k} (x)

of (7),

P_{r k} (x)

of (8), and

P_{r k} (C)

of (14), each have

q^{k}

terms for

q = q_{n}

. So

p_{r} (x)

of (7),

P_{r} (x)

of (8), and

P_{r} (C)

of (12), each have

N_{r}

terms, where

\begin{matrix} for r odd, N_{r} = q + q^{3} + q^{5} + \dots + q^{3 r} = q (Q^{(3 r + 1) / 2} - 1) / (Q - 1), \end{matrix}

(18)

\begin{matrix} for r even, N_{r} = q^{2} + q^{4} + \dots + q^{3 r} = Q (Q^{3 r / 2} - 1) / (Q - 1) . \end{matrix}

(19)

\begin{matrix} So, N_{r} = O (q^{3 r}) as n \to \infty . \end{matrix}

So

p_{r} (x), P_{r} (x)

and

P_{r} (C)

each have magnitude

q_{n}^{3 r}

as

n \to \infty .

So

n^{- s / 2} (p_{s} (x), P_{s} (x), P_{s} (C))

has magnitude

n^{- s / 2} q_{n}^{3 s} = ν_{n}^{s}

. The theorem follows. □
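The term counts (18) and (19) and the rate ν_n in the proof can be checked numerically. The Python sketch below sums q^k directly over the admissible k, compares the result with the closed forms, and evaluates ν_n for a chosen pair (n, q). The function names and the sample values are illustrative assumptions only.

```python
import math

def n_terms(q, r):
    """N_r: sum of q**k over k = 1..3r with k - r even."""
    return sum(q ** k for k in range(1, 3 * r + 1) if (k - r) % 2 == 0)

def n_terms_closed(q, r):
    """Closed forms (18) for r odd and (19) for r even, with Q = q**2."""
    if q < 2:
        raise ValueError("the closed form divides by Q - 1, so it needs q >= 2")
    Q = q * q
    if r % 2 == 1:
        return q * (Q ** ((3 * r + 1) // 2) - 1) // (Q - 1)
    return Q * (Q ** (3 * r // 2) - 1) // (Q - 1)

def nu(n, q):
    """nu_n = n^{-1/2} q^3, the quantity required to tend to 0 in Theorem 1."""
    return q ** 3 / math.sqrt(n)

if __name__ == "__main__":
    for q in (2, 3, 4, 5):
        print(q, n_terms(q, 1), n_terms(q, 2))
    # With q_n = n^{1/8}, nu_n = n^{-1/8}, which decreases slowly to 0.
    for n in (10 ** 4, 10 ** 8, 10 ** 12):
        print(n, nu(n, n ** 0.125))
```

The slow decay of ν_n in the last loop illustrates that the remainder bound O(ν_n^s) is only useful for large n or large s when q_n grows close to the n^{1/6} boundary.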

Example 1.

Let

\hat{w}

be the sample mean from a distribution on

R^{q}

with finite cross cumulants

{\bar{κ}}^{1 - r}, r \geq 1

. Then

E \hat{w} = w

, and only the leading coefficient in (3) is non-zero. As in Example 2 of [14], by (4)–(5), the non-zero Edgeworth coefficients

P_{r}^{1 - k}

needed for the three 4th order Edgeworth expansions of

\hat{w}

with

V = ({\bar{κ}}^{12})

, are

\begin{matrix} {\bar{P}}_{1}^{1 - 3} = {\bar{κ}}^{1 - 3} / 3!, {\bar{P}}_{2}^{1 - 4} = {\bar{κ}}^{1 - 4} / 4!, {\bar{P}}_{2}^{1 - 6} = S {\bar{κ}}^{1 - 3} {\bar{κ}}^{4 - 6} / 72, \\ {\bar{P}}_{3}^{1 - 5} = {\bar{κ}}^{1 - 5} / 5!, {\bar{P}}_{3}^{1 - 7} = S {\bar{κ}}^{123} {\bar{κ}}^{4 - 7} / 144, {\bar{P}}_{3}^{1 - 9} = S {\bar{κ}}^{1 - 3} {\bar{κ}}^{4 - 6} {\bar{κ}}^{7 - 9} / 6^{4} . \end{matrix}

Substitution gives

(p_{s} (x), P_{s} (x), P_{s} (C))

for

s = 1, 2, 3 .

For example, as

q_{n} = q \to \infty,

the coefficients of

n^{- 1 / 2}

in the 2nd terms in the Edgeworth expansions for

p_{X_{n}} (x) / ϕ_{V} (x), P (X_{n} \leq x)

, and

P (X_{n} \in C)

, are

\begin{matrix} {\tilde{p}}_{1} (x) = {\tilde{p}}_{13} = {\bar{κ}}^{1 - 3} {\bar{H}}^{1 - 3} / 6 = O (q_{n}^{3}), \\ P_{1} (x) = P_{13} (x) = {\bar{κ}}^{1 - 3} {\bar{H}}_{*}^{1 - 3} / 6 = O (q_{n}^{3}), \\ P_{1} (C) = P_{13} (C) = {\bar{κ}}^{1 - 3} {\bar{H}}_{*}^{1 - 3} (C) / 6 = O (q_{n}^{3}) . \end{matrix}
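To see the size of the first correction in the simplest setting, the Python sketch below takes q = 1 and a sample mean of n independent Exp(1) variables. This distribution is a hypothetical choice made only because the exact density is available: the sum is Gamma(n, 1), and the cumulants are κ_r = (r - 1)!, so V = 1 and κ_3 = 2. With V = 1 the third Hermite polynomial is x^3 - 3x. The code compares the exact density ratio with the two-term Edgeworth approximation 1 + n^{-1/2} κ_3 H_3(x)/6.

```python
import math

def exact_density(x, n):
    """Density of X_n = sqrt(n) * (sample mean - 1) for Exp(1) data.

    The sum of n Exp(1) variables is Gamma(n, 1); the density of X_n
    follows by a change of variable. Returns 0 outside the support.
    """
    s = n + math.sqrt(n) * x  # value of the sum corresponding to x
    if s <= 0:
        return 0.0
    log_f = (n - 1) * math.log(s) - s - math.lgamma(n)
    return math.sqrt(n) * math.exp(log_f)

def normal_density(x):
    """Standard normal density, the leading term since V = 1."""
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

def edgeworth_ratio(x, n, kappa3=2.0):
    """Two-term approximation to p_{X_n}(x) / phi(x) for q = 1, V = 1."""
    h3 = x ** 3 - 3 * x
    return 1 + kappa3 * h3 / (6 * math.sqrt(n))

if __name__ == "__main__":
    n = 50
    for x in (-1.0, 0.5, 1.0, 2.0):
        ratio = exact_density(x, n) / normal_density(x)
        print(x, ratio, edgeworth_ratio(x, n))
```

For moderate x the two-term value should lie much closer to the exact ratio than the leading term 1 does, which is the univariate analogue of the O(q_n^3) coefficient of n^{-1/2} above.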

Example 2.

Suppose that

X_{1}, X_{2}, \dots, X_{n}

are independent random vectors in

R^{q}

, with sample mean

\hat{w} = \bar{X}

, and that

X_{j}

has finite cross-cumulants

{\bar{κ}}_{j}^{1 - r} = κ (X_{j i_{1}}, \dots, X_{j i_{r}}) \text{ for } r \geq 1 \text{ and } i_{1}, \dots, i_{r} \text{ in } 1, \dots, q .

Then for

r \geq 1,

κ ({\hat{w}}_{i_{1}}, \dots, {\hat{w}}_{i_{r}}) = n^{1 - r} {\bar{κ}}^{1 - r} \text{ where } {\bar{κ}}^{1 - r} = n^{- 1} \sum_{j = 1}^{n} {\bar{κ}}_{j}^{1 - r} .

Suppose also, that

{{\bar{κ}}^{1 - r}}

are bounded in n, and that for

V = ({\bar{κ}}^{12})

,

\det V

is bounded away from 0, as n increases. Then the Edgeworth coefficients needed for the three 4th order Edgeworth expansions for

\hat{w}

, are given by Example 1.

4. Further Reduction of Terms

Our next theorem gives a way to reduce the number of terms in

{\tilde{p}}_{r} (x)

of (7),

P_{r} (x)

of (8), and

P_{r} (C)

of (14), from

q^{k}

, to

L_{q k} = q_{k} [1 + O (q^{- 1})]

as

q \to \infty,

where

q_{k} = (\binom{q}{k}) = q (q - 1) \dots (q - k + 1) / k! .

As we do not use

q_{n}

for q in this section, there is no ambiguity with this different use of

q_{k}

.

{\tilde{p}}_{r k} (x),

P_{r k} (x)

and

P_{r k} (C)

have k summations over

1, \dots, q

, and so have

q^{k}

terms. But many are duplicates as

{\bar{P}}_{r}^{1 - k}

is symmetric in

i_{1}, \dots, i_{k}

. Set

(\binom{k}{a \dots b}) = k! / a! \dots b!

, the multinomial coefficient. For example

(\binom{3}{111}) = 6

.

Theorem 2.

Let

T^{i_{1} \dots i_{k}} \in R

be symmetric in

i_{1}, \dots, i_{k} .

\begin{matrix} \text{Set } T^{i_{1}^{2} i_{2}} = T^{i_{1} i_{1} i_{2}} \text{ and so on, and } u_{k} = \sum_{i_{1}, \dots, i_{k} = 1}^{q} T^{i_{1} \dots i_{k}} . \\
\text{Then } u_{1} = \sum_{i_{1} = 1}^{q} T^{i_{1}}, \quad u_{2} = \sum_{i_{1} = 1}^{q} T^{i_{1}^{2}} + 2! \sum_{i_{1} > i_{2}}^{q_{2}} T^{i_{1} i_{2}}, \\
u_{3} = \sum_{i_{1} = 1}^{q} T^{i_{1}^{3}} + \binom{3}{1} \sum^{q (q - 1)} T^{i_{1}^{2} i_{2}} + 3! \sum_{i_{1} > i_{2} > i_{3}}^{q_{3}} T^{i_{1} i_{2} i_{3}}, \\
u_{4} = \sum_{i_{1} = 1}^{q} T^{i_{1}^{4}} + \binom{4}{1} \sum^{q (q - 1)} T^{i_{1}^{3} i_{2}} + \binom{4}{2} \sum_{i_{1} > i_{2}}^{q_{2}} T^{i_{1}^{2} i_{2}^{2}} + \binom{4}{211} \sum_{i_{2} > i_{3}}^{3 q_{3}} T^{i_{1}^{2} i_{2} i_{3}} \\
+ 4! \sum_{i_{1} > i_{2} > i_{3} > i_{4}}^{q_{4}} T^{i_{1} i_{2} i_{3} i_{4}}, \\
u_{5} = \sum_{i_{1} = 1}^{q} T^{i_{1}^{5}} + \binom{5}{1} \sum^{q (q - 1)} T^{i_{1}^{4} i_{2}} + \binom{5}{2} \sum^{q (q - 1)} T^{i_{1}^{3} i_{2}^{2}} + \binom{5}{311} \sum_{i_{2} > i_{3}}^{3 q_{3}} T^{i_{1}^{3} i_{2} i_{3}} + \binom{5}{221} \sum_{i_{1} > i_{2}}^{3 q_{3}} T^{i_{1}^{2} i_{2}^{2} i_{3}} \\
+ \binom{5}{2111} \sum_{i_{2} > i_{3} > i_{4}}^{4 q_{4}} T^{i_{1}^{2} i_{2} i_{3} i_{4}} + 5! \sum_{i_{1} > i_{2} > i_{3} > i_{4} > i_{5}}^{q_{5}} T^{i_{1} i_{2} i_{3} i_{4} i_{5}}, \\
u_{6} = \sum_{i_{1} = 1}^{q} T^{i_{1}^{6}} + \binom{6}{1} \sum^{q (q - 1)} T^{i_{1}^{5} i_{2}} + \binom{6}{2} \sum^{q (q - 1)} T^{i_{1}^{4} i_{2}^{2}} + \binom{6}{3} \sum_{i_{1} > i_{2}}^{q_{2}} T^{i_{1}^{3} i_{2}^{3}} \\
+ \binom{6}{411} \sum_{i_{2} > i_{3}}^{3 q_{3}} T^{i_{1}^{4} i_{2} i_{3}} + \binom{6}{321} \sum^{6 q_{3}} T^{i_{1}^{3} i_{2}^{2} i_{3}} + \binom{6}{222} \sum_{i_{1} > i_{2} > i_{3}}^{q_{3}} T^{i_{1}^{2} i_{2}^{2} i_{3}^{2}} \\
+ \binom{6}{3111} \sum_{i_{2} > i_{3} > i_{4}}^{4 q_{4}} T^{i_{1}^{3} i_{2} i_{3} i_{4}} + \binom{6}{2211} \sum_{i_{1} > i_{2}, i_{3} > i_{4}}^{6 q_{4}} T^{i_{1}^{2} i_{2}^{2} i_{3} i_{4}} \\
+ \binom{6}{21111} \sum_{i_{2} > i_{3} > i_{4} > i_{5}}^{5 q_{5}} T^{i_{1}^{2} i_{2} i_{3} i_{4} i_{5}} + 6! \sum_{i_{1} > \dots > i_{6}}^{q_{6}} T^{i_{1} \dots i_{6}}, \end{matrix}

where the sums are for distinct

i_{j}

. This reduces the number of terms in

u_{k}

from

q^{k}

to

L_{q k}

where

\begin{matrix} L_{q 1} = q, L_{q 2} = q + q_{2}, L_{q 3} = q + 2 q_{2} + q_{3}, \\ L_{q 4} = q + 3 q_{2} + 3 q_{3} + q_{4}, L_{q 5} = q + 4 q_{2} + 6 q_{3} + 4 q_{4} + q_{5}, \\ L_{q 6} = q + 5 q_{2} + 10 q_{3} + 10 q_{4} + 5 q_{5} + q_{6} . \end{matrix}

For example,

(q, k) = (5, 3) \Rightarrow L_{q k} / q^{k} = 0.28;

that is, Theorem 2 reduces the number of terms to 28% of the original count.

As a check,

q^{k} = \sum_{j = 1}^{k} S (k, j) {(q)}_{j}

where

{(q)}_{j} = j! q_{j} = q (q - 1) \dots (q - j + 1)

, and

S (k, j)

is the Stirling number of the 2nd kind tabled on p310 of Comtet (1974). For example, counting the number of terms to calculate in

u_{4}

above gives

q^{4} = q + (4 + 3) {(q)}_{2} + 6 {(q)}_{3} + {(q)}_{4}

.
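Both the reduced counts and the Stirling-number check can be confirmed by brute force. The Python sketch below counts the distinct index multisets of size k drawn from 1, ..., q, compares the count with the expressions for L_{q3} and L_{q4} and with the standard multiset count binom(q + k - 1, k), and verifies the expansion of q^k in falling factorials. The function names are illustrative, and the Stirling numbers are generated by their usual recurrence rather than read from a table.

```python
from itertools import combinations_with_replacement
from math import comb

def distinct_terms(q, k):
    """Number of index multisets i_1 <= ... <= i_k drawn from 1..q."""
    return sum(1 for _ in combinations_with_replacement(range(q), k))

def falling(q, j):
    """Falling factorial (q)_j = q (q - 1) ... (q - j + 1)."""
    out = 1
    for m in range(j):
        out *= q - m
    return out

def stirling2(k, j):
    """Stirling number of the 2nd kind via S(k,j) = j S(k-1,j) + S(k-1,j-1)."""
    if k == j:
        return 1
    if j == 0 or j > k:
        return 0
    return j * stirling2(k - 1, j) + stirling2(k - 1, j - 1)

def L3(q):
    """L_{q3} = q + 2 q_2 + q_3, with q_k the binomial coefficient."""
    return q + 2 * comb(q, 2) + comb(q, 3)

def L4(q):
    """L_{q4} = q + 3 q_2 + 3 q_3 + q_4."""
    return q + 3 * comb(q, 2) + 3 * comb(q, 3) + comb(q, 4)

if __name__ == "__main__":
    q, k = 5, 3
    print(distinct_terms(q, k), q ** k, distinct_terms(q, k) / q ** k)
    print([stirling2(4, j) for j in range(1, 5)])
```

For (q, k) = (5, 3) the first line printed gives the ratio quoted above, and the second line lists the coefficients appearing in the check of q^4.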

Taking

\begin{matrix} T^{i_{1} \dots i_{k}} = {\bar{P}}_{r}^{1 - k} {\bar{f}}^{1 - k}, \text{ where } {\bar{f}}^{1 - k} = {\bar{H}}^{1 - k}, \text{ or } {\bar{H}}_{*}^{1 - k}, \text{ or } {\bar{H}}_{*}^{1 - k} (C), \end{matrix}

(20)

with the tensor summation convention suspended, gives

u_{k} = {\tilde{p}}_{r k}

of (7), or

u_{k} = P_{r k} (x)

of (8), or

u_{k} = P_{r k} (C)

of (14). For example,

\begin{matrix} {\tilde{p}}_{r 1} = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1}} H^{i_{1}}, P_{r 1} (x) = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1}} H_{*}^{i_{1}}, P_{r 1} (C) = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1}} H_{*}^{i_{1}} (C), \\ {\tilde{p}}_{r 2} = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1} i_{1}} H^{i_{1} i_{1}} + 2 \sum_{i_{1} > i_{2}}^{q_{2}} P_{r}^{i_{1} i_{2}} H^{i_{1} i_{2}}, P_{r 2} (x) = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1} i_{1}} H_{*}^{i_{1} i_{1}} + 2 \sum_{i_{1} > i_{2}}^{q_{2}} P_{r}^{i_{1} i_{2}} H_{*}^{i_{1} i_{2}}, \\ P_{r 2} (C) = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1} i_{1}} H_{*}^{i_{1} i_{1}} (C) + 2 \sum_{i_{1} > i_{2}}^{q_{2}} P_{r}^{i_{1} i_{2}} H_{*}^{i_{1} i_{2}} (C), \\ {\tilde{p}}_{r 3} = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1} i_{1} i_{1}} H^{i_{1} i_{1} i_{1}} + 3 \sum_{i_{1} \neq i_{2}}^{q (q - 1)} P_{r}^{i_{1} i_{1} i_{2}} H^{i_{1} i_{1} i_{2}} + 6 \sum_{i_{1} > i_{2} > i_{3}}^{q_{3}} P_{r}^{i_{1} i_{2} i_{3}} H^{i_{1} i_{2} i_{3}}, \\ P_{r 3} (x) = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1} i_{1} i_{1}} H_{*}^{i_{1} i_{1} i_{1}} + 3 \sum_{i_{1} \neq i_{2}}^{q (q - 1)} P_{r}^{i_{1} i_{1} i_{2}} H_{*}^{i_{1} i_{1} i_{2}} + 6 \sum_{i_{1} > i_{2} > i_{3}}^{q_{3}} P_{r}^{i_{1} i_{2} i_{3}} H_{*}^{i_{1} i_{2} i_{3}}, \\ P_{r 3} (C) = \sum_{i_{1} = 1}^{q} P_{r}^{i_{1} i_{1} i_{1}} H_{*}^{i_{1} i_{1} i_{1}} (C) + 3 \sum_{i_{1} \neq i_{2}}^{q (q - 1)} P_{r}^{i_{1} i_{1} i_{2}} H_{*}^{i_{1} i_{1} i_{2}} (C) + 6 \sum_{i_{1} > i_{2} > i_{3}}^{q_{3}} P_{r}^{i_{1} i_{2} i_{3}} H_{*}^{i_{1} i_{2} i_{3}} (C) . \end{matrix}

This shows that we can replace

q^{k}

in the derivation of Theorem 1 by

L_{q k}

. So for

r = 1, 2

, the number of terms,

N_{r}

of (18) and (19), in

p_{r} (x)

of (7),

P_{r} (x)

of (8), and

P_{r} (C)

of (12), can be reduced to

N_{r}^{'}

where

\begin{matrix} N_{1}^{'} = L_{q 1} + L_{q 3} = 2 q + 2 q_{2} + q_{3}, \end{matrix}

(21)

\begin{matrix} N_{2}^{'} = L_{q 2} + L_{q 4} + L_{q 6} = 3 q + 9 q_{2} + 13 q_{3} + 11 q_{4} + 5 q_{5} + q_{6} . \end{matrix}

(22)

So,

q = 1 \Rightarrow N_{1} = 2, N_{1}^{'} = 2, N_{2} = 3, N_{2}^{'} = 3,

q = 2 \Rightarrow N_{1} = 10, N_{1}^{'} = 6 = 0.6 N_{1}, N_{2} = 84, N_{2}^{'} = 15 \approx 0.18 N_{2},

q = 3 \Rightarrow N_{1} = 30, N_{1}^{'} = 13 \approx 0.43 N_{1}, N_{2} = 819, N_{2}^{'} = 49 \approx 0.06 N_{2},

q = 4 \Rightarrow N_{1} = 68, N_{1}^{'} = 24 \approx 0.35 N_{1}, N_{2} = 4368, N_{2}^{'} = 129 \approx 0.03 N_{2},

q = 5 \Rightarrow N_{1} = 130, N_{1}^{'} = 40 \approx 0.31 N_{1}, N_{2} = 16275, N_{2}^{'} = 295 \approx 0.02 N_{2},

where the values of

N_{1}^{'} / N_{1}

and

N_{2}^{'} / N_{2}

are approximate.

For example, if

q = 3

, Theorem 2 reduces the number of terms needed for 2nd and 3rd order Edgeworth expansions by 57% and 94%. If

q = 4

or 5, Theorem 2 reduces the number of terms to calculate by 65% or 69% for the 2nd order Edgeworth expansions, and by 97% or 98% for the 3rd order Edgeworth expansions.

When

q = 2

, this reduction of terms was used in Section 4 of [14].
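The reduction can be written as a single rule: sum over index multisets, weighting each by the multinomial coefficient k!/(a! ... b!) of its multiplicities. The Python sketch below implements this rule and checks it against the full q^k-term sum. The symmetric array T used here is a made-up integer-valued example standing in for the product of an Edgeworth coefficient and a Hermite polynomial, and the function names are illustrative.

```python
from collections import Counter
from itertools import combinations_with_replacement, product
from math import factorial, prod

def full_sum(T, q, k):
    """u_k as the unreduced sum over all q**k index tuples."""
    return sum(T(idx) for idx in product(range(q), repeat=k))

def reduced_sum(T, q, k):
    """u_k as a weighted sum over index multisets.

    Each multiset with multiplicities a, ..., b stands for k!/(a! ... b!)
    equal terms of the full sum, because T is symmetric.
    """
    total = 0
    for idx in combinations_with_replacement(range(q), k):
        weight = factorial(k)
        for m in Counter(idx).values():
            weight //= factorial(m)
        total += weight * T(idx)
    return total

def example_T(idx):
    """Hypothetical symmetric T: depends on idx only through symmetric functions."""
    return (sum(i * i for i in idx) + 1) * prod(i + 2 for i in idx)

if __name__ == "__main__":
    for q, k in ((3, 3), (4, 5), (5, 6)):
        print(q, k, full_sum(example_T, q, k), reduced_sum(example_T, q, k))
```

In practice the saving comes from evaluating the expensive factor, such as an integrated Hermite polynomial, once per multiset rather than once per index tuple.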

Example 3.

In Example 1, using the expression for

u_{3}

in Theorem 2,

\begin{matrix} 6 {\tilde{p}}_{1} (x) = \sum_{i_{1} = 1}^{q} κ^{i_{1}^{3}} H^{i_{1}^{3}} + 3 \sum^{q (q - 1)} κ^{i_{1}^{2} i_{2}} H^{i_{1}^{2} i_{2}} + 6 \sum_{i_{1} > i_{2} > i_{3}}^{q_{3}} κ^{i_{1} i_{2} i_{3}} H^{i_{1} i_{2} i_{3}}, \\ 6 P_{1} (x) = \sum_{i_{1} = 1}^{q} κ^{i_{1}^{3}} H_{*}^{i_{1}^{3}} + 3 \sum^{q (q - 1)} κ^{i_{1}^{2} i_{2}} H_{*}^{i_{1}^{2} i_{2}} + 6 \sum_{i_{1} > i_{2} > i_{3}}^{q_{3}} κ^{i_{1} i_{2} i_{3}} H_{*}^{i_{1} i_{2} i_{3}}, \\ 6 P_{1} (C) = \sum_{i_{1} = 1}^{q} κ^{i_{1}^{3}} H_{*}^{i_{1}^{3}} (C) + 3 \sum^{q (q - 1)} κ^{i_{1}^{2} i_{2}} H_{*}^{i_{1}^{2} i_{2}} (C) + 6 \sum_{i_{1} > i_{2} > i_{3}}^{q_{3}} κ^{i_{1} i_{2} i_{3}} H_{*}^{i_{1} i_{2} i_{3}} (C) . \end{matrix}

As

{\bar{P}}_{1}^{1} = {\bar{P}}_{2}^{12} = 0

,

L_{q 1}

in (21), and

L_{q 2}

in (22) need to be deleted.

5. Functions of a Vector Sample Mean

Let

\bar{X}

be the sample mean from a non-lattice distribution on

R^{q}

with mean

μ

, and finite cross cumulants

{\bar{κ}}^{1 - r} = κ^{j_{1} \dots j_{r}}, r \geq 1

. Let

t (.) : R^{q} \to R^{p}

be a function with

i_{k}

th component

{\bar{t}}^{k} (.) = t^{i_{k}} (.)

having finite derivatives at

μ

,

\begin{matrix} {\bar{t}}_{r - s}^{k} = \partial_{j_{r}} \dots \partial_{j_{s}} {\bar{t}}^{k} (μ), \text{ where } \partial_{j} = \partial / \partial μ_{j}, r \leq s . \end{matrix}

So we extend the bar convention used earlier, using

j_{1}, j_{2}, \dots

in

1, \dots, q

for the components of

\bar{X}

, as well as

i_{1}, i_{2}, \dots

for the components of

\hat{w}

. Here we have a notation dilemma. We chose

t (.) : R^{q} \to R^{p}

, as this allows us to keep the notation

{\bar{k}}_{d}^{1 - r} = k_{d}^{i_{1} \dots i_{r}}

and

{\bar{P}}_{r}^{1 - k} = P_{r}^{i_{1} \dots i_{k}}

, as used earlier. However, now

\hat{w} \in R^{p}

not

R^{q}

. So now, implicit summation in

u_{k} = {\bar{P}}_{r}^{1 - k} {\bar{H}}^{1 - k}

is over

i_{1}, \dots, i_{k}

in

1, \dots, p

, not in

1, \dots, q

, and this

u_{k}

has

p^{k}

terms, reducible to

L_{p k}

using Theorem 2.

If I had chosen

t (.) : R^{p} \to R^{q}

, then

{\bar{k}}_{d}^{1 - r}

and

{\bar{P}}_{r}^{1 - k}

would have had to be reinterpreted as

k_{d}^{j_{1} \dots j_{r}}

and

P_{r}^{j_{1} \dots j_{k}}

, which would likely be confusing.

Let us use

\sum f^{i_{1} \dots i_{k}} \sim p^{k},

to mean

\sum f^{i_{1} \dots i_{k}}

has magnitude

p^{k}

from summing

i_{1}, \dots, i_{k}

over

1, \dots, p

.

\text{Set } \sum^{2} {\bar{t}}_{a - b}^{1} {\bar{t}}_{c - d}^{2} = {\bar{t}}_{a - b}^{1} {\bar{t}}_{c - d}^{2} + {\bar{t}}_{a - b}^{2} {\bar{t}}_{c - d}^{1} .

More generally, for

π_{1}, \dots, π_{r}

any partition of

1, \dots, k

, let

\sum^{N} {\bar{t}}_{π_{1}}^{1} \dots {\bar{t}}_{π_{r}}^{r}

denote the sum of

t_{π_{1}}^{b_{1}} \dots t_{π_{r}}^{b_{r}}

over all N permutations

b_{1}, \dots, b_{r}

of

i_{1}, \dots, i_{r}

, giving distinct values. For example,

\sum^{3} {\bar{t}}_{13}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{4}^{3} = {\bar{t}}_{13}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{4}^{3} + {\bar{t}}_{13}^{2} {\bar{t}}_{2}^{1} {\bar{t}}_{4}^{3} + {\bar{t}}_{13}^{3} {\bar{t}}_{2}^{2} {\bar{t}}_{4}^{1} .

I now give the cumulant coefficients,

{\bar{k}}_{d}^{1 - r}

of (3), for

\hat{w} = t (\bar{X})

, and track their magnitude in p from summing

i_{1}, \dots, i_{k}

over

1, \dots, p

.

Theorem 3.

For

\hat{w} = t (\bar{X}) : R^{q} \to R^{p},

(3) holds, where the cumulant coefficients

{\bar{k}}_{d}^{1 - r} = k_{d}^{i_{1} \dots i_{r}}

, needed for the Edgeworth expansions of order

s = 1, 2, 3, 4

, are given by

\begin{matrix} \text{For } s = 1 : {\bar{k}}_{1}^{12} = V_{i_{1} i_{2}} = {\bar{t}}_{1}^{1} {\bar{t}}_{2}^{2} {\bar{κ}}^{12} = \sum_{j_{1}, j_{2} = 1}^{q} t_{j_{1}}^{i_{1}} t_{j_{2}}^{i_{2}} κ^{j_{1} j_{2}} \sim q^{2}, \\ \text{For } s = 2 : {\bar{k}}_{1}^{1} = {\bar{t}}_{12}^{1} {\bar{κ}}^{12} / 2 \sim q^{2}, \\ {\bar{k}}_{2}^{1 - 3} = {\bar{t}}_{1}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{3}^{3} {\bar{κ}}^{1 - 3} + T_{1 - 4}^{1 - 3} {\bar{κ}}^{12} {\bar{κ}}^{34} \sim q^{3} + q^{4} \sim q^{4}, \text{ where } T_{1 - 4}^{1 - 3} = \sum^{3} {\bar{t}}_{13}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{4}^{3} . \\ \text{For } s = 3 : {\bar{k}}_{2}^{12} = T_{1 - 3}^{12} {\bar{κ}}^{1 - 3} / 2 + T_{1 - 4}^{12} {\bar{κ}}^{12} {\bar{κ}}^{34} / 2 \sim q^{3} + q^{4} \sim q^{4}, \text{ where} \\ T_{1 - 3}^{12} = \sum^{2} {\bar{t}}_{12}^{1} {\bar{t}}_{3}^{2}, T_{1 - 4}^{12} = \sum^{2} {\bar{t}}_{1 - 3}^{1} {\bar{t}}_{4}^{2} + {\bar{t}}_{13}^{1} {\bar{t}}_{24}^{2}, \end{matrix}

\begin{matrix} {\bar{k}}_{3}^{1 - 4} = {\bar{t}}_{1}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{3}^{3} {\bar{t}}_{4}^{4} {\bar{κ}}^{1 - 4} + T_{1 - 5}^{1 - 4} {\bar{κ}}^{1 - 3} {\bar{κ}}^{45} + T_{1 - 6}^{1 - 4} {\bar{κ}}^{12} {\bar{κ}}^{34} {\bar{κ}}^{56} \sim q^{4} + q^{5} + q^{6} \sim q^{6}, \\ \text{where } T_{1 - 5}^{1 - 4} = \sum^{12} {\bar{t}}_{14}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{3}^{3} {\bar{t}}_{5}^{4}, T_{1 - 6}^{1 - 4} = \sum^{4} {\bar{t}}_{135}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{4}^{3} {\bar{t}}_{6}^{4} + \sum^{12} {\bar{t}}_{13}^{1} {\bar{t}}_{25}^{2} {\bar{t}}_{4}^{3} {\bar{t}}_{6}^{4} . \\ \text{For } s = 4 : {\bar{k}}_{2}^{1} = {\bar{t}}_{1 - 3}^{1} {\bar{κ}}^{1 - 3} / 6 + {\bar{t}}_{1 - 4}^{1} {\bar{κ}}^{12} {\bar{κ}}^{34} / 8 \sim q^{3} + q^{4} \sim q^{4}, \\ {\bar{k}}_{3}^{1 - 3} = T_{1 - 4}^{1 - 3} {\bar{κ}}^{1 - 4} / 2 + T_{1 - 5}^{1 - 3} {\bar{κ}}^{1 - 3} {\bar{κ}}^{45} + T_{1 - 6}^{1 - 3} {\bar{κ}}^{12} {\bar{κ}}^{34} {\bar{κ}}^{56} \sim q^{4} + q^{5} + q^{6} \sim q^{6}, \\ \text{where } T_{1 - 4}^{1 - 3} = \sum^{3} {\bar{t}}_{13}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{4}^{3}, \\ T_{1 - 5}^{1 - 3} = \sum^{6} {\bar{t}}_{124}^{1} {\bar{t}}_{3}^{2} {\bar{t}}_{5}^{3} / 2 + \sum^{3} {\bar{t}}_{145}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{3}^{3} / 2 + \sum^{6} {\bar{t}}_{12}^{1} {\bar{t}}_{34}^{2} {\bar{t}}_{5}^{3} / 2 + \sum^{3} {\bar{t}}_{14}^{1} {\bar{t}}_{25}^{2} {\bar{t}}_{3}^{3}, \\ T_{1 - 6}^{1 - 3} = \sum^{3} {\bar{t}}_{1235}^{1} {\bar{t}}_{4}^{2} {\bar{t}}_{6}^{3} / 2 + \sum^{6} {\bar{t}}_{1 - 3}^{1} {\bar{t}}_{45}^{2} {\bar{t}}_{6}^{3} + \sum^{6} {\bar{t}}_{135}^{1} {\bar{t}}_{24}^{2} {\bar{t}}_{6}^{3} / 2 + {\bar{t}}_{13}^{1} {\bar{t}}_{25}^{2} {\bar{t}}_{46}^{3} . \\ {\bar{k}}_{4}^{1 - 5} = {\bar{t}}_{1}^{1} \dots {\bar{t}}_{5}^{5} {\bar{κ}}^{1 - 5} + T_{1 - 6}^{1 - 5} {\bar{κ}}^{1 - 4} {\bar{κ}}^{56} + U_{1 - 6}^{1 - 5} {\bar{κ}}^{1 - 3} {\bar{κ}}^{4 - 6} \\ + T_{1 - 7}^{1 - 5} {\bar{κ}}^{1 - 3} {\bar{κ}}^{45} {\bar{κ}}^{67} + T_{1 - 8}^{1 - 5} {\bar{κ}}^{12} {\bar{κ}}^{34} {\bar{κ}}^{56} {\bar{κ}}^{78} \sim q^{5} + \dots + q^{8} \sim q^{8}, \\ \text{where } T_{1 - 6}^{1 - 5} = \sum^{20} {\bar{t}}_{15}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{3}^{3} {\bar{t}}_{4}^{4} {\bar{t}}_{6}^{5}, U_{1 - 6}^{1 - 5} = \sum^{15} {\bar{t}}_{14}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{3}^{3} {\bar{t}}_{5}^{4} {\bar{t}}_{6}^{5}, \\ T_{1 - 7}^{1 - 5} = \sum^{30} {\bar{t}}_{146}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{3}^{3} {\bar{t}}_{5}^{4} {\bar{t}}_{7}^{5} + \sum^{60} {\bar{t}}_{14}^{1} {\bar{t}}_{26}^{2} {\bar{t}}_{3}^{3} {\bar{t}}_{5}^{4} {\bar{t}}_{7}^{5} + \sum^{60} {\bar{t}}_{14}^{1} {\bar{t}}_{56}^{2} {\bar{t}}_{2}^{3} {\bar{t}}_{3}^{4} {\bar{t}}_{7}^{5}, \\ T_{1 - 8}^{1 - 5} = \sum^{5} {\bar{t}}_{1357}^{1} {\bar{t}}_{2}^{2} {\bar{t}}_{4}^{3} {\bar{t}}_{6}^{4} {\bar{t}}_{8}^{5} / 5 + \sum^{60} {\bar{t}}_{135}^{1} {\bar{t}}_{27}^{2} {\bar{t}}_{4}^{3} {\bar{t}}_{6}^{4} {\bar{t}}_{8}^{5} + \sum^{60} {\bar{t}}_{13}^{1} {\bar{t}}_{25}^{2} {\bar{t}}_{47}^{3} {\bar{t}}_{6}^{4} {\bar{t}}_{8}^{5} . \end{matrix}

PROOF This is a special case of Theorem 2 of [13] with

{\bar{k}}_{d}^{1 - r}

replaced by

{\bar{κ}}^{1 - r} I (d = r - 1) .

□

Lemma 1.

For

{\bar{f}}^{1 - k} = f^{i_{1} \dots i_{k}} \in R

, and

r = 1, 2, 3,

\begin{matrix} {\bar{P}}_{r}^{1 - k} \sim q^{r + k} . \ \text{So, } {\bar{P}}_{r}^{1 - k} {\bar{f}}^{1 - k} \sim q^{r + k} p^{k} . \end{matrix}

\begin{matrix} \text{Set } P_{r} = \sum_{k = 1}^{3 r} [{\bar{P}}_{r}^{1 - k} {\bar{f}}^{1 - k} : k - r \text{ even}] . \ \text{Then } P_{r} \sim q^{4 r} p^{3 r} . \end{matrix}

(23)

PROOF Use Theorem 5.1 to check that for each of the Edgeworth coefficients given by (4)–(5),

{\bar{P}}_{r}^{1 - k} \sim q^{r + k} .

The dominant term in

P_{r}

is for

k = 3 r

. So, (23) holds. □
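The dominance of the k = 3r term can be checked numerically. The sketch below is illustrative only: it treats the order bounds q^{r+k} p^k of Lemma 1 as exact quantities, and the function names `order_sum` and `dominant` are hypothetical, not from the paper. It sums the bounds over k = 1, ..., 3r with k - r even and compares the total with q^{4r} p^{3r}.

```python
def order_sum(q: int, p: int, r: int) -> int:
    """Sum of the bounds q**(r + k) * p**k over k = 1..3r with k - r even."""
    return sum(
        q ** (r + k) * p ** k
        for k in range(1, 3 * r + 1)
        if (k - r) % 2 == 0
    )


def dominant(q: int, p: int, r: int) -> int:
    """The k = 3r term, q**(4r) * p**(3r), as in (23)."""
    return q ** (4 * r) * p ** (3 * r)


if __name__ == "__main__":
    for r in (1, 2, 3):
        for q, p in ((2, 2), (10, 10), (100, 100)):
            ratio = order_sum(q, p, r) / dominant(q, p, r)
            print(f"r={r} q={q} p={p} sum/dominant={ratio:.8f}")
```

Under this simplification each lower term is smaller than the next by a factor (pq)^2, so the ratio tends to 1 as pq grows.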

Theorem 4.

Set

\hat{w} = t (\bar{X}) : R^{q} \to R^{p} .

Suppose that

\hat{w}

is non-lattice, that

E \hat{w} \to w

, and that (3) holds. Suppose that as

n \to \infty,

p_{n} q_{n} = p q \to \infty,

and that

\begin{matrix} ν_{n} = n^{- 1 / 2} q_{n}^{4} p_{n}^{3} \to 0 . \ \text{That is, } q_{n}^{8} p_{n}^{6} = o (n) . \end{matrix}

(24)

Then (15)–(17) hold for

s = 1, 2, 3, 4

with

ν_{n}

of (24).

PROOF For

P_{r}

of (23),

n^{- r / 2} P_{r} \sim ν_{n}^{r}

. Now take

{\bar{f}}^{1 - k}

of (20). □

For example, (24) holds for fixed

p = p_{n}

if

q_{n} = o (n^{1 / 8})

, and for fixed

q = q_{n}

if

p_{n} = o (n^{1 / 6})

.
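As a rough numerical illustration of condition (24), the sketch below evaluates ν_n = n^{-1/2} q_n^4 p_n^3 along the path q_n = n^a with p fixed. The helper names `nu` and `nu_path`, the choice p = 2, and the grid of sample sizes are assumptions made for the example, and q_n is left as a real number rather than rounded to an integer. For a < 1/8 the values shrink, at a = 1/8 they stay constant at p^3, and for a > 1/8 they grow.

```python
def nu(n: float, q: float, p: float) -> float:
    """nu_n = n^{-1/2} * q^4 * p^3, the quantity in (24)."""
    return n ** -0.5 * q ** 4 * p ** 3


def nu_path(a: float, p: float = 2.0, ns=(1e4, 1e8, 1e12, 1e16)) -> list:
    """Values of nu_n along q_n = n**a with p held fixed (q_n not rounded)."""
    return [nu(n, n ** a, p) for n in ns]


if __name__ == "__main__":
    for a in (0.10, 0.125, 0.15):
        values = ", ".join(f"{v:.3g}" for v in nu_path(a))
        print(f"a={a}: {values}")
```

Along this path ν_n = p^3 n^{4a - 1/2}, which is why a = 1/8 is the boundary for fixed p.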

We now apply Theorem 4.1 to the components of Theorem 5.1.

Theorem 5.

\begin{matrix} \text{For } s = 1 : \ {\bar{k}}_{1}^{12} = V_{i_{1} i_{2}} \sim L_{q 2} . \\ \text{For } s = 2 : \ {\bar{k}}_{1}^{1} \sim L_{q 2}, \text{ reduced from } q^{2}, \\ {\bar{k}}_{2}^{1 - 3} \sim L_{q 3} + 3 L_{q 4}, \text{ reduced from } q^{3} + 3 q^{4} . \\ \text{For } s = 3 : \ {\bar{k}}_{2}^{12} \sim 2 L_{q 3} + 3 L_{q 4}, \text{ reduced from } 2 q^{3} + 3 q^{4}, \\ {\bar{k}}_{3}^{1 - 4} \sim L_{q 4} + 12 L_{q 5} + 16 L_{q 6}, \text{ reduced from } q^{4} + 12 q^{5} + 16 q^{6} . \\ \text{For } s = 4 : \ {\bar{k}}_{2}^{1} \sim L_{q 3} + L_{q 4}, \text{ reduced from } q^{3} + q^{4}, \\ {\bar{k}}_{3}^{1 - 3} \sim 3 L_{q 4} + 18 L_{q 5} + 16 L_{q 6}, \text{ reduced from } 3 q^{4} + 18 q^{5} + 16 q^{6}, \\ {\bar{k}}_{4}^{1 - 5} \sim L_{q 5} + (20 + 15) L_{q 6} + 150 L_{q 7} + 125 L_{q 8}, \\ \text{reduced from } q^{5} + (20 + 15) q^{6} + 150 q^{7} + 125 q^{8} . \end{matrix}

Let us write these as

{\bar{k}}_{d}^{1 - r} \sim N_{r d} .

For example,

L_{q 2} / q^{2} = (q + 1) / 2 q \to 1 / 2

as

q \to \infty

, and for

q = 4

, the number of calculations needed for

{\bar{k}}_{2}^{1 - 3}

is reduced by the factor

N_{32} / (q^{3} + 3 q^{4}) = (L_{q 3} + 3 L_{q 4}) / (q^{3} + 3 q^{4}) = 3 / 26 .
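The ratio L_{q2} / q^2 = (q + 1) / 2q quoted above corresponds to L_{q2} = q(q + 1)/2. A minimal sketch, assuming that L_{q2} counts the unordered index pairs of a symmetric q × q array (the function names are hypothetical, and only the case r = 2 is treated; the general L_{qr} is as defined earlier in the paper), enumerates those pairs directly.

```python
from itertools import combinations_with_replacement


def count_symmetric_pairs(q: int) -> int:
    """Number of index pairs (j1, j2) with 1 <= j1 <= j2 <= q."""
    return sum(1 for _ in combinations_with_replacement(range(q), 2))


def saving_ratio(q: int) -> float:
    """Unordered pairs divided by all q**2 ordered pairs."""
    return count_symmetric_pairs(q) / q ** 2


if __name__ == "__main__":
    for q in (2, 4, 10, 100):
        pairs = count_symmetric_pairs(q)
        # Matches the closed form q(q + 1)/2 implied by the text.
        assert pairs == q * (q + 1) // 2
        print(f"q={q}: {pairs} pairs, ratio {saving_ratio(q):.4f}")
```

For q = 4 this gives 10 pairs out of 16, a ratio of 5/8, and the ratio approaches 1/2 as q grows.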

One can now work out similar results for

T^{i_{1} \dots i_{k}}

of (20), and so for the terms of the Edgeworth expansions,

{\tilde{p}}_{r} (x), P_{r} (x)

and

P_{r} (C)

. For example,

\begin{matrix} {\tilde{p}}_{11} = {\bar{P}}_{1}^{1} {\bar{H}}^{1} = {\bar{k}}_{1}^{1} {\bar{H}}^{1} \sim p N_{11} = p L_{q 2}, \\ {\tilde{p}}_{13} = {\bar{P}}_{1}^{1 - 3} {\bar{H}}^{1 - 3} \sim k_{2}^{1 - 3} {\bar{H}}^{1 - 3} \sim L_{p 3} N_{32}, \\ \Rightarrow {\tilde{p}}_{1} (x) \sim p N_{11} + L_{p 3} N_{32} = p L_{q 2} + L_{p 3} (L_{q 3} + 3 L_{q 4}), \end{matrix}

and

T^{i_{1} i_{2}} = P_{2}^{i_{1} i_{2}} H^{i_{1} i_{2}}

without implicit summation,

{\tilde{p}}_{22} = {\bar{P}}_{2}^{12} {\bar{H}}^{12} = \sum_{i_{1} = 1}^{p} T^{i_{1}^{2}} + 2 \sum_{i_{1} > i_{2}}^{p_{2}} T^{i_{1} i_{2}} \sim p L_{q 2} + p_{2} L_{q 0} .

6. Conclusions

Let

\hat{w}

be a standard estimate of an unknown

w \in R^{q}

. That is,

\hat{w}

is a consistent estimate, and for

r \geq 1

, its rth order cumulants have magnitude

n^{1 - r}

, and can be expanded in powers of

n^{- 1}

. This is a very large class of estimates. It includes functions of sample moments and empirical distributions, samples of independent but not identically distributed random vectors, and samples from stationary series. Then by §2, for fixed q, and non-lattice estimates, Edgeworth-type expansions hold for the density and distribution of

X_{n} = n^{1 / 2} (\hat{w} - w)

, in terms of the Edgeworth coefficients given in [14] for

s \leq 4

. For

s \geq 1,

the first s terms of the three Edgeworth expansions give the density and distribution of

X_{n}

, and

P (X_{n} \in C)

, to

O (n^{- s / 2})

as

n \to \infty .

Theorem 3.1 shows that this remains true when

q_{n} = q \to \infty

, if the remainder,

O (n^{- s / 2})

, is replaced by

O (ν_{n}^{s})

, where

ν_{n} = n^{- 1 / 2} q_{n}^{3}

. So the three Edgeworth expansions hold when

ν_{n} \to 0

, that is, when

q_{n} = o (n^{1 / 6})

.

Theorem 4.1 gives formulas that dramatically reduce the number of terms needed by 2nd and 3rd order Edgeworth-type expansions, that is, for 1st and 2nd order corrections to the CLT.

When

\hat{w} = t (\bar{X}) : R^{q} \to R^{p}

, and

p_{n} q_{n} = p q \to \infty

, Theorem 5.2 shows that the three 4th order Edgeworth expansions hold if

q_{n}^{8} p_{n}^{6} = o (n)

. For example, this holds for fixed

p = p_{n}

if

q_{n} = o (n^{1 / 8})

.

7. Discussion

Theorem 3.1 showed that the three Edgeworth expansions considered here hold for standard estimates when

q_{n} = o (n^{1 / 6})

. Is this result optimal? It is certainly not optimal for a sample mean. For, as noted in Section 1, Theorem 2.1 of the recent paper [7] gives conditions for a 2nd order Edgeworth expansion for the distribution of a sample mean, when

q_{n} = O (n^{c})

for any c, or even when

q_{n} = O (\exp (b n^{c}))

and

c < 1 / 3 .

It will be interesting to see if such weak conditions on

q_{n}

can be extended to a wide class of estimates, such as standard estimates.

Theorem 5.2 showed that the three 4th order Edgeworth expansions hold for

\hat{w} = t (\bar{X}) : R^{q} \to R^{p},

when

q_{n}^{8} p_{n}^{6} = o (n)

. It will be interesting to see how much this condition can be weakened. It should not be hard to extend Theorem 5.2 to

\hat{w} = T (F_{n}) \in R^{q},

where

F_{n} (x)

is the empirical distribution of a random sample of size n from a distribution

F (x)

on

R^{p}

.

One can also extend this to

\hat{w}

a function of K independent sample means,

\hat{w} = t ({\bar{X}}_{1}, \dots, {\bar{X}}_{K}) : R^{q_{1}} \times \dots \times R^{q_{K}} \to R^{p} .

The method employed here should also be able to extend many of the results in the references from a sample mean to a standard estimate.

References

Chernozhukov, V., Chetverikov, D., and Kato, K. (2013) Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors. Ann. Stat., 41 (6), 2786–2819.
Chernozhukov, V., Chetverikov, D., and Kato, K. (2017). Central limit theorems and bootstrap in high dimensions. Ann. Probab., 45 (4), 2309–2352. arXiv:1412.3661.
Donoho, D. and Montanari, A. (2016) High dimensional robust m-estimation: Asymptotic variance via approximate message passing. Probability Theory and Related Fields, Springer.
Fang, X. and Koike, Y. (2021). High-dimensional central limit theorems by Stein’s method. Ann. Appl. Probab. 31, 1660–1686.
Fujikoshi, Y. and Sakurai, T. (2009) High-dimensional asymptotic expansions for the distributions of canonical correlations. Journal of Multivariate Analysis, 100 (1), 231–242.
Fujikoshi, Y., Ulyanov, V. V., and Shimizu, R. (2011). Multivariate statistics: High-dimensional and large-sample approximations. John Wiley and Sons.
Koike, Yuta (2025). High-dimensional bootstrap and asymptotic expansion. arXiv preprint arXiv:2404.05006.
Kosorok, M. and Ma, S. (2007). Marginal asymptotics for the large p, small n paradigm: With application to microarray data. Ann. Statist., 35, 1456–1486. MR2351093.
Kuelbs, J. and Vidyashankar, A.N. (2010). Asymptotic inference for high-dimensional data. Annals of Statistics, 38 (2), 836–869.
Portnoy, S. (1984). Asymptotic behavior of M-estimators of p regression parameters when p²/n is large. I. Consistency. Ann. Statist., 12, 1298–1309. MR0760690.
Portnoy, S. (1985). Asymptotic behavior of M-estimators of p regression parameters when p²/n is large. II: normal approximation, Ann. Statist., 13, 1403–1417.
Withers, C.S. (2000) A simple expression for the multivariate Hermite polynomials. Stat. Prob. Lett., 47, 165–169.
Withers, C.S. (2024) 5th order multivariate Edgeworth expansions for parametric estimates. Mathematics, 2024, 12 (6), 905, Advances in Applied Probability and Statistical Inference. https://www.mdpi.com/2227-7390/12/6/905/pdf.
Withers, C.S. (2025) Edgeworth coefficients for standard multivariate estimates. New Perspectives in Mathematical Statistics, 2nd Edition. Axioms. Page 5 has 2 typos. $k_{1}^{1}$ in (19) should be ${\bar{k}}_{1}^{1}$ . Delete /12 is S₃, 2 lines before (21).
Withers, C.S. and Nadarajah, S. (2010) Tilted Edgeworth expansions for asymptotically normal vectors. Annals of the Institute of Statistical Mathematics, 62 (6), 1113–1142.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
