Preprint
Article

This version is not peer-reviewed.

Edgeworth Expansions When the Parameter Dimension Increases with Sample Size

Submitted:

20 November 2025

Posted:

20 November 2025

You are already at the latest version

Abstract
Suppose that we have a statistical model with $q=q_n$ unknown parameters, $w=(1_1,\dots,w_q)'$, estimated by $\hat{w}$, based on a sample of size $n$. Suppose also, that we have Edgeworth expansions for the density and distribution of $X_n=n^{1/2} (\hat{w}-w)$. %We ask the question: How fast can $q=q_n$ increase with $n$ for the three main Edgeworth expansions to remain valid? We show that it is sufficient that $q_n=o(n^{1/6})$, if the estimate $\hat{w}$ is a standard estimate. That is, $E\ \hat{w}\rightarrow w$ as $n\rightarrow w$, and for $r\geq 1$, its $r$th order cumulants have magnitude $n^{1-r}$ and can be expanded in powers of $n^{-1}$. This very large class of estimates has a huge range of potential applications. When $\hat{w}=t(\bar{X})$ for $t:R^q\rightarrow R^p$ a smooth function of a sample mean $\bar{X}$ from a distribution on $R^q$, and $p_nq_n=pq\rightarrow\infty$ as $n\rightarrow\infty$, I show that the Edgeworth expansions for $\hat{w}$ remain valid if $q_n^8 p_n^6=o(n)$. For example, this holds for fixed $p=p_n$ if $q_n=o(n^{1/8})$. I also give a method that greatly reduces the number of terms needed for the 2nd and 3rd order terms in the Edgeworth expansions, that is, for the 1st and 2nd order corrections to the Central Limit Theorems (CLTs).
Keywords: 
;  ;  

1. Introduction and Summary

Suppose that we have a estimate w ^ of an unknown parameter w R q of a statistical model, and that
as   n , X n = n 1 / 2 ( w ^ w ) L X N q ( 0 , V ) ,
the multivariate normal on R q , with density and distribution
ϕ V ( x ) = ( 2 π ) q / 2 ( d e t V ) 1 / 2 exp ( x V 1 x / 2 ) , Φ V ( x ) = x ϕ V ( x ) d x .
Here we assume that w ^ is a standard estimate. That is, E w ^ w as n , and for r 1 , its rth order cumulants have magnitude n 1 r and can be expanded in powers of n 1 . The class of standard estimates includes smooth functions of sample means or empirical distributions, based on one or more random samples, or on samples from a stationary time series. So it has a huge range of potential applications.
For w ^ non-lattice, the density and distribution of X n of (1) can be expanded in powers of n 1 / 2 about those of X of (1). These are the Edgeworth expansions. To be self-contained, Section 2 summarises these expansions.
How fast can q = q n increase with n, for these expansions to hold? Section 3 shows that they hold if q n = o ( n 1 / 6 ) .
Section 4 gives a theorem that reduces the number of terms needed for 2nd and 3rd order Edgeworth expansions. For example, if q = 3 , it reduces the number of terms needed for 2nd and 3rd order Edgeworth expansions by 57% and 94%. This % reduction increases with q. If q = 4 or 5, it reduces the number of terms by 65% or 69% for the 2nd order Edgeworth expansions, and by 98% for the 3rd order Edgeworth expansions.
Section 5 considers the case when w ^ = t ( X ¯ ) for t : R q R p a smooth function of a sample mean X ¯ from a distribution on R q with finite moments. When p n q n = p q as n , it shows the Edgeworth expansions for w ^ remain valid if q n 8 p n 6 = o ( n ) . For example, this holds for fixed p = p n if q n = o ( n 1 / 8 ) . These may be the 1st CLTs or Edgeworth expansions when, not 1, but 2 parameters are allowed to increase to with n.
Earlier work done when dimension q = q n increases with sample size n, has been mainly for sample means, including some CLTs and a 2nd order Edgeworth-type expansion. [10,11] showed asymptotic normality for M-estimators with q n regression parameters when q n 2 / n is large. [3] gave a CLT for M-estimates in a linear regression model of dimension q n when q n / n a ( 0 , 1 ) . Remarkably, (8) of [2] gave a CLT for w ^ the sample mean of bounded random vectors that holds for q n = O ( exp ( a n c ) ) if c < 1 / 7 . (Substitute into their condition to confirm.) (1.4) of [7], appears to allow for q n = O ( n c ) if c < 1 / 3 to suffice for a CLT for the sample mean, when X 1 , , X n are log-concave, quoting [4]. It will be interesting to see if this bound can be extended to a broader class of estimates than sample means, and if the log-concave condition can be removed.
Section 2.1 of the very recent paper [7] considers the 2nd order Edgeworth expansion for the distribution of a standardized sample mean. His Theorem 2.1 gives conditions for this which hold when q n = O ( n c ) for any c, or even when q n = O ( exp ( b n c ) ) and c < 1 / 3 . The bounds (1.4) and (2.4) of [7] give a 2nd order Edgeworth expansion for a sample mean, that allows for q n of magnitude n c if c < 1 / 3 , a remarkable result.
[8] considered sampling from vectors, and investigated the simultaneous estimation of the marginal distributions for large q n . [5] considered the asymptotic distributions of the canonical correlations between X 1 R p and X 2 R q with p q . They derived asymptotic distributions of the canonical correlations when p is fixed, q = q n , and q n / n a [ 0 , 1 ) , as n . It assumes that X 1 , and X 2 have a joint normal distribution. [9] gave a CLT for the sample mean for large q n . [6] gave a number of results for large dimensions. [1] and [7] considered the validity and accuracy of Edgeworth expansions of max i = 1 q X n for large q when X n is a standardized sample mean.

2. Multivariate Edgeworth Expansions

Suppose that w ^ is a standard estimate of w R q with respect to n. (n is typically the sample size.) That is, E w ^ w as n , where we use E for expected value, and for r 1 and 1 i 1 , , i r q , the rth order cumulants of w ^ = ( w ^ 1 , , w ^ q ) can be expanded as
k ¯ 1 r = k i 1 i r = κ ( w ^ i 1 , , w ^ i r ) d = r 1 n d k ¯ d 1 r ,   where   k ¯ d 1 r = k d i 1 i r ,
≈ indicates an asymptotic expansion, and the cumulant coefficients  k ¯ d 1 r may depend on n but are bounded as n . So the bar replaces each i k by k. For example, k ¯ 0 1 = w ¯ 1 = w i 1 and k ¯ 1 12 = k 1 i 1 i 2 . I reserve i k for this bar notation to avoid double subscripts. So, (1) holds with V = ( k ¯ 1 12 ) , q × q . V may depend on n, but I assume that d e t V is bounded away from 0.
Let   P ¯ r 1 k = P r i 1 i k   be   the   kth   Edgeworth   coefficient   of   order   r   for   w ^ .
These are Bell polynomials in the cumulant coefficients, k ¯ d 1 r of (3), defined and given in [14] for 1 r 3 . Their importance lies in their central role in the Edgeworth expansions of X n of (1): see (8) and (14) below.
The P ¯ r 1 k needed for r = 1 , 2 , 3 are given in (19)–(21) of [14]:
P ¯ 1 1 = k ¯ 1 1 , P ¯ 1 1 3 = k ¯ 2 1 3 / 6 , P ¯ 2 12 = k ¯ 2 12 / 2 + k ¯ 1 1 k ¯ 1 2 / 2 , P ¯ 2 1 4 = k ¯ 3 1 4 / 4 ! + S k ¯ 1 4 k ¯ 2 1 3 / 6 , P ¯ 2 1 6 = S k ¯ 2 1 3 k ¯ 2 4 6 / 72 , P ¯ 3 1 = k ¯ 2 1 , P ¯ 3 1 3 = k ¯ 3 1 3 / 6 + S k ¯ 2 12 k ¯ 1 3 / 2 + k ¯ 1 1 k ¯ 1 2 k ¯ 1 3 / 6 , P ¯ 3 1 5 = k ¯ 4 1 5 / 5 ! + S 1 / 24 + S 2 / 12 + S 3 / 12 where   S 1 = S k ¯ 3 1 4 k ¯ 1 5 , S 2 = S k ¯ 2 12 k ¯ 2 345 , S 3 = S k ¯ 1 1 k ¯ 1 2 k ¯ 2 345 , P ¯ 3 1 7 = S k ¯ 2 123 k ¯ 3 4 7 / 144 + S k ¯ 2 123 k ¯ 2 4 6 k ¯ 1 7 / 72 ,
P ¯ 3 1 9 = S k ¯ 2 1 3 k ¯ 2 4 6 k ¯ 2 7 9 / 6 4 ,
where S is the operator S that symmetrizes f ¯ 1 k over i 1 , , i k .
Set P ( A ) = Probability that A is true. By [15], or [14], for w ^ non-lattice, the density and distribution of X n can be expanded as
p X n ( x ) r = 0 n r / 2 p r ( x ) , P ( X n x ) r = 0 n r / 2 P r ( x ) , x R q , where   p 0 ( x ) = ϕ V ( x ) , P 0 ( x ) = Φ V ( x ) , of   ( 2 ) ,   and   for   r 1 ,
p r ( x ) / ϕ V ( x ) = k = 1 3 r [ p ˜ r k : k r   even ] = p ˜ r ( x ) s a y , p ˜ r k = P ¯ r 1 k H ¯ 1 k ,
P r ( x ) = k = 1 3 r [ P r k ( x ) : k r   even ] , P r k ( x ) = P ¯ r 1 k H ¯ * 1 k ,
and   H ¯ 1 k = H ¯ 1 k ( x , V ) = H i 1 i k = ϕ V ( x ) 1 O ¯ 1 k ϕ V ( x ) ,
H ¯ * 1 k = H ¯ * 1 k ( x , V ) = O ¯ 1 k Φ V ( x ) = x H ¯ 1 k ϕ V ( x ) d x ,
O ¯ 1 k = ( ¯ 1 ) ( ¯ k ) , ¯ k = i k , i = / x i .
H ¯ 1 k ( x , V ) = H ¯ 1 k and H ¯ * 1 k ( x , V ) = H ¯ * 1 k are the multivariate Hermite polynomial, and the integrated multivariate Hermite polynomial. By [12],
H ¯ 1 k = E ( y ¯ 1 + 1 Y ¯ 1 ) ( y ¯ k + 1 Y ¯ k ) where   y = V 1 x , Y = V 1 X N q ( 0 , V 1 ) , y ¯ k = y i k ,   and   Y ¯ k = Y i k .
I use the tensor summation convention: repetition of i 1 , , i k in p ˜ r k of (7) and P r k ( x ) of (8) implies their implicit summation over their range, 1 , , q . [14] gave H ¯ 1 k explicitly for k 6 , and for k 9 when q = 2 .
Set   μ ¯ 1 2 k = E Y ¯ 1 Y ¯ 2 k = 1.3 ( 2 k 1 ) V ¯ 12 V ¯ 2 k 1 , 2 k ,
where N f ¯ 1 k sums f ¯ 1 k over all N permutations of i 1 , , i k giving distinct values. For example,
μ ¯ 1 4 = V 12 V 34 + V 13 V 24 + V 14 V 23 , H ¯ 1 = y ¯ 1 , H ¯ 12 = y ¯ 1 y ¯ 2 V ¯ 12 , H ¯ 1 3 = y ¯ 1 y ¯ 2 y ¯ 3 3 y ¯ 1 V ¯ 23 = y ¯ 1 y ¯ 2 y ¯ 3 y ¯ 1 V ¯ 23 y ¯ 2 V ¯ 13 y ¯ 3 V ¯ 12 , H ¯ * 1 = J ¯ 1 , H ¯ * 12 = J ¯ 12 V ¯ 12 Φ V ( x ) , H ¯ * 1 3 = J ¯ 123 3 J ¯ 1 V ¯ 23 ,   where
J ¯ 1 k = J ¯ 1 k ( x , V ) = E Y ¯ 1 Y ¯ k I ( X x ) = V ¯ 1 , k + 1 V ¯ k , 2 k M ¯ k + 1 2 k ,
and   M ¯ a b = M ¯ a b ( x , V ) = E X ¯ 1 X ¯ k I ( X x ) = x x ¯ a x ¯ b ϕ V ( x ) d x ,
for x ¯ a = x i a . So the repeated i k + 1 , , i 2 k in (10) implies their repeated summation over 1 , , q . As x lies in R q , x d x in (9) and (11), stands for x 1 d x 1 x q d x q . (6) with the P ¯ r 1 k of [14], give the Edgeworth expansions for the density and distribution of X n of (1) to O ( n 2 ) . p ˜ r k and P r k each have q k terms, but many are duplicates as P ¯ r 1 k is symmetric in i 1 , , i k . This is exploited in Section 4 to greatly reduce the number of terms in (8) and (14) below.
By (7), the density of X n relative to its asymptotic value is
p X n ( x ) / ϕ V ( x ) 1 + r = 1 n r / 2 p ˜ r ( x ) = 1 + n 1 / 2 p ˜ 1 ( x ) + O ( n 1 ) ,
for x R q . For measurable C R q ,
P ( X n C ) r = 0 n r / 2 P r ( C ) ,   where   P 0 ( C ) = Φ V ( C ) ,   and   for   r 1 ,
P r ( C ) = E p r ( X ) I ( X C ) = C p r ( x ) ϕ V ( x ) d x = k = 1 3 r [ P r k ( C ) : k r   even ] ,
P r k ( C ) = E p ˜ r k ( X ) I ( X C ) = C p ˜ r k ( x ) ϕ V ( x ) d x = P ¯ r 1 k H ¯ * 1 k ( C ) ,
and   H ¯ * 1 k ( C ) = E H ¯ 1 k ( X , V ) I ( X C ) = C H ¯ 1 k ϕ V ( x ) d x .
This paper focuses on the three Edgeworth expansions, (6) and (12), when q as n .
If C = C , then for r odd, P r k ( C ) = P r ( C ) = 0 , so that
P ( X n C ) r = 0 n r P 2 r ( C ) = Φ V ( C ) + n 1 P 2 ( C ) + O ( n 2 ) .
Examples 3 and 4 of [14] gave P 2 ( C ) for
C = { x : x V 1 x u } , and C = { x : | ( V 1 / 2 x ) j | u j , j = 1 , , q } .
The main take-away here is that for x R q , C R q ,   p ˜ r ( x ) , P r ( x ) , P r ( C ) of (7), (8) and (12), and s = 1 , 2 , ,
p X n ( x ) / ϕ V ( x ) = r = 0 s 1 n r / 2 p ˜ r ( x ) + O ( n s / 2 ) , P ( X n x ) = r = 0 s 1 n r / 2 P r ( x ) + O ( n s / 2 ) , P ( X n C ) = r = 0 s 1 n r / 2 P r ( C ) + O ( n s / 2 ) ,
where, for example, by (4),
p ˜ 1 ( x ) = p 1 ( x ) / ϕ V ( x ) = k = 1 , 3 p ˜ 1 k , p ˜ 11 = k ¯ 1 1 H ¯ 1 , p ˜ 13 = k ¯ 2 1 3 H ¯ 1 3 / 6 , P 1 ( x ) = k = 1 , 3 P 1 k ( x ) , P 11 ( x ) = k ¯ 1 1 H ¯ * 1 , P 13 ( x ) = k ¯ 2 1 3 H ¯ * 1 3 / 6 , P 1 ( C ) = k = 1 , 3 P 1 k ( C ) , P 11 ( C ) = k ¯ 1 1 H ¯ * 1 ( C ) , P 13 ( C ) = k ¯ 2 1 3 H ¯ * 1 3 ( C ) / 6 .
These asymptotic expansions generally diverge, as normal moments and Hermite polynomials increase very rapidly with their degree.

3. The Case q = q n as n

Theorem 1.
Let w ^ be a non-lattice estimate of w R q , satisfying E w ^ w , and (3). Set X n = n 1 / 2 ( w ^ w ) . Take s 1 . Suppose that as n ,
q = q n ,   and   ν n = n 1 / 2 q n 3 0 ,   that   is , q n = o ( n 1 / 6 ) .
Then, for p ˜ r ( x ) of (7), P r ( x ) of (8), and P r ( C ) of (13),
p X n ( x ) / ϕ V ( x ) = r = 0 s 1 n r / 2 p ˜ r ( x ) + O ( ν n s ) ,
P ( X n x ) = r = 0 s 1 n r / 2 P r ( x ) + O ( ν n s ) ,
P ( X n C ) = r = 0 s 1 n r / 2 P r ( C ) + O ( ν n s ) .
PROOF Set Q = q 2 . p ˜ r k ( x ) and P r k ( x ) of (8), and p ˜ r k ( C ) of (14), each have q k terms for q = q n . So p r ( x ) of (7), P r ( x ) of (8), and P r ( C ) of (12), each have N r terms, where
for   r   odd , N r = q + q 3 + q 5 + + q 3 r = q ( Q ( 3 r + 1 ) / 2 1 ) / ( Q 1 ) ,
for   r   even , N r = q 2 + q 4 + + q 3 r = Q ( Q 3 r / 2 1 ) / ( Q 1 ) .
So , N r = O ( q 3 r ) as   n .
So p r ( x ) , P r ( x ) and P r ( C ) each have magnitude q n 3 r as n .
So n s / 2 ( p s ( x ) , P s ( x ) , P s ( C ) ) has magnitude n s / 2 q n 3 s = ν n s . The theorem follows. □
Example 1.
Let w ^ be the sample mean from a distribution on R q with finite cross cumulants κ ¯ 1 r , r 1 . Then E w ^ = w , and only the leading coefficient in (3) is non-zero. As in Example 2 of [14], by (4)–(5), the non-zero Edgeworth coefficients P r 1 k needed for the three 4th order Edgeworth expansions of w ^ with V = ( κ ¯ 12 ) , are
P ¯ 1 1 3 = κ ¯ 1 3 / 3 ! , P ¯ 2 1 4 = κ ¯ 1 4 / 4 ! , P ¯ 2 1 6 = S κ ¯ 1 3 κ ¯ 4 6 / 72 , , P ¯ 3 1 5 = κ ¯ 1 5 / 5 ! , P ¯ 3 1 7 = S κ ¯ 123 κ ¯ 4 7 / 144 , P ¯ 3 1 9 = S κ ¯ 1 3 κ ¯ 4 6 κ ¯ 7 9 / 6 4 .
Substitution gives ( p s ( x ) , P s ( x ) , P s ( C ) ) for s = 1 , 2 , 3 . For example, as q n = q , the coefficients of n 1 / 2 in the 2nd terms in the Edgeworth expansions for p X n ( x ) / ϕ V ( x ) , P ( X n x ) , and P ( X n C ) , are
p ˜ 1 ( x ) = p ˜ 13 = κ ¯ 1 3 H ¯ 1 3 / 6 = O ( q n 3 ) , P 1 ( x ) = P 13 ( x ) = κ ¯ 1 3 H ¯ * 1 3 / 6 = O ( q n 3 ) , P 1 ( C ) = P 13 ( C ) = κ ¯ 1 3 H ¯ * 1 3 ( C ) / 6 = O ( q n 3 ) .
Example 2.
Suppose that X 1 , X 2 , , X n are independent random vectors in R q , with mean w ^ = X ¯ , and that X j has finite cross-cumulants
κ ¯ j 1 r = κ ( X j i 1 , , X j i r ) f o r r 1 a n d i 1 , , i r i n 1 , , q .
Then for r 1 ,
κ ( w ^ i 1 , , w ^ i r ) = n 1 r κ ¯ 1 r w h e r e κ ¯ 1 r = n 1 j = 1 n κ ¯ j 1 r .
Suppose also, that { κ ¯ 1 r } are bounded in n, and that for V = ( κ ¯ 12 ) , d e t V is bounded away from 0, as n increases. Then the Edgeworth coefficients needed for the three 4th order Edgeworth expansions for w ^ , are given by Example 3.1.

4. Further Reduction of Terms

Our next theorem gives a way to reduce the number of terms in p ˜ r ( x ) of (7), P r ( x ) of (8), and P r ( C ) of (14), from q k , to L k = q k [ 1 + O ( q 1 ) ] as q , where
q k = q k = q ( q 1 ) ( q k + 1 ) / k ! .
As we do not use q n for q in this section, there is no ambiguity with this different use of q k . k
p ˜ r k ( x ) , P r k ( x ) and P r k ( C ) have k summations over 1 , , q , and so have q k terms. But many are duplicates as P ¯ r 1 k is symmetric in i 1 , , i k . Set k a b = k ! / a ! b ! , the multinomial coefficient. For example 3 111 = 6 .
Theorem 2.
Let T i 1 i k R be symmetric in i 1 , , i k .
S e t T i 1 2 i 2 = T i 1 i 1 i 2 a n d   s o   o n ,   a n d   u k = i 1 , , i k = 1 q T i 1 i k . T h e n , u 1 = i 1 = 1 q T i 1 , u 2 = i 1 = 1 q T i 1 2 + 2 ! i 1 > i 2 q 2 T i 1 i 2 , u 3 = i 1 = 1 q T i 1 3 + 3 1 q ( q 1 ) T i 1 2 i 2 + 3 ! i 1 > i 2 > i 3 q 3 T i 1 i 2 i 3 , u 4 = i 1 = 1 q T i 1 4 + 4 1 q ( q 1 ) T i 1 3 i 2 + 4 2 i 1 > i 2 q 2 T i 1 2 i 2 2 + 4 211 i 1 > i 2 3 q 3 T i 1 2 i 2 i 3 + 4 ! i 1 > i 2 > i 3 > i 4 q 4 T i 1 i 2 i 3 i 4 , u 5 = i 1 = 1 q T i 1 5 + 5 1 q ( q 1 ) T i 1 4 i 2 + 5 2 q ( q 1 ) T i 1 3 i 2 2 + 5 221 i 1 > i 2 5 q 3 T i 1 2 i 2 2 i 3 + 5 2111 i 2 > i 3 > i 4 4 q 4 T i 1 2 i 2 i 3 i 4 + 5 ! i 1 > i 2 > i 3 > i 4 > i 5 q 5 T i 1 i 2 i 3 i 4 i 5 , u 6 = i 1 = 1 q T i 1 6 + 6 1 q ( q 1 ) T i 1 5 i 2 + 6 2 q ( q 1 ) T i 1 4 i 2 2 + 6 3 i 1 > i 2 q 2 T i 1 3 i 2 3 + 6 411 i 2 > i 3 3 q 3 T i 1 4 i 2 i 3 + 6 321 6 q 3 T i 1 3 i 2 2 i 3 + 6 222 i 1 > i 2 > i 3 q 3 T i 1 2 i 2 2 i 3 2 + 6 3111 i 2 > i 3 > i 4 4 q 4 T i 1 3 i 2 i 3 i 4 + 6 2211 i 1 > i 2 , i 3 > i 4 6 q 4 T i 1 2 i 2 2 i 3 i 4 + 6 2 1111 i 2 > i 3 > i 4 > i 5 5 q 5 T i 1 2 i 2 i 3 i 4 i 5 + 6 ! i 1 > > i 6 q 6 T i 1 i 6 ,
where the sums are for distinct i j . This reduces the number of terms in u k from q k to L q k where
L q 1 = q , L q 2 = q + q 2 , L q 3 = q + 2 q 2 + q 3 , L q 4 = q + 3 q 2 + 3 q 3 + q 4 , L q 5 = q + 4 q 2 + 5 q 3 + 4 q 4 + q 5 , L q 6 = q + 5 q 2 + 10 q 3 + 14 q 4 + 5 q 5 + q 6 .
For example, ( q , k ) = ( 5 , 3 ) L q k / q k = 0.28 ; that is, Theorem 4.1 reduces the number of terms to 28%.
As a check, q k = j = 1 k S ( k , j ) ( q ) j where ( q ) j = j ! q j = q ( q 1 ) ( q j + 1 ) , and S ( k , j ) is the Stirling number of the 2nd kind tabled on p310 of Comtet (1974). For example, counting the number of terms to calculate in u 4 above gives q 4 = q + ( 4 + 3 ) ( q ) 2 + 6 ( q ) 3 + ( q ) 4 .
Taking
T i 1 i k = P ¯ r 1 k f ¯ 1 k , w h e r e f ¯ 1 k = H ¯ 1 k , o r H ¯ * 1 k , o r H ¯ * 1 k ( C ) ,
and tensor summation is not used, gives u k = p ˜ r k of (7), or u k = P r k ( x ) of (8), or u k = P r k ( C ) of (14). For example,
p ˜ r 1 = i 1 = 1 q P r i 1 H i 1 , P r 1 ( x ) = i 1 = 1 q P r i 1 H * i 1 , P r 1 ( C ) = i 1 = 1 q P r i 1 H * i 1 ( C ) , p ˜ r 2 = i 1 = 1 q P r i 1 i 1 H i 1 i 1 + 2 i 1 > i 2 q 2 P r i 1 i 2 H i 1 i 2 , P r 2 ( x ) = i 1 = 1 q P r i 1 i 1 H * i 1 i 1 + 2 i 1 > i 2 q 2 P r i 1 i 2 H * i 1 i 2 , P r 2 ( C ) = i 1 = 1 q P r i 1 i 1 H * i 1 i 1 ( C ) + 2 i 1 > i 2 q 2 P r i 1 i 2 H * i 1 i 2 ( C ) , p ˜ r 3 = i 1 = 1 q P r i 1 i 1 i 1 H i 1 i 1 i 1 + 3 i 1 i 2 q ( q 1 ) P r i 1 i 1 i 2 H i 1 i 1 i 2 + 6 i 1 > i 2 > i 3 q 3 P r i 1 i 2 i 3 H i 1 i 2 i 3 , P r 3 ( x ) = i 1 = 1 q P r i 1 i 1 i 1 H * i 1 i 1 i 1 + 3 i 1 i 2 q ( q 1 ) P r i 1 i 1 i 2 H * i 1 i 1 i 2 + 6 i 1 > i 2 > i 3 q 3 P r i 1 i 2 i 3 H * i 1 i 2 i 3 , P r 3 ( C ) = i 1 = 1 q P r i 1 i 1 i 1 H * i 1 i 1 i 1 ( C ) + 3 i 1 i 2 q ( q 1 ) P r i 1 i 1 i 2 H * i 1 i 1 i 2 ( C ) + 6 i 1 > i 2 > i 3 q 3 P r i 1 i 2 i 3 H * i 1 i 2 i 3 ( C ) .
This shows that we can replace q k in the derivation of Theorem 3.1 by L q k . So for r = 1 , 2 , the number of terms, N r of (18) and (19), in p r ( x ) of (7), P r ( x ) of (8), and P r ( C ) of (12), can be reduced to N r where
N 1 = L q 1 + L q 3 = 2 q + 2 q 2 + q 3 ,
N 2 = L q 2 + L q 4 + L q 6 = 3 q + 10 q 2 + 10 q 3 + 15 q 4 + 5 q 5 + q 6 .
So, q = 1 N 1 = 2 , N 1 = 2 , N 2 = 3 , N 2 = 3 ,
q = 2 N 1 = 10 , N 1 = 6 = . 6 N 1 , N 2 = 84 , N 2 = 16 = . 19 N 2 ,
q = 3 N 1 = 30 , N 1 = 13 = . 43 N 1 , N 2 = 819 , N 2 = 49 = . 06 N 2 ,
q = 4 N 1 = 68 , N 1 = 24 = . 35 N 1 , N 2 = 4368 , N 2 = 127 = . 02 N 2 ,
q = 5 N 1 = 130 , N 1 = 40 = . 31 N 1 , N 2 = 16275 , N 2 = 295 = . 02 N 2 ,
where the values of N 1 / N 1 and N 2 / N 2 are approximate.
For example, if q = 3 , Theorem 4.1 reduces the number of terms needed for 2nd and 3rd order Edgeworth expansions by 57% and 94%. If q = 4 or 5, Theorem 4.1 reduces the number of terms to calculate by 65% or 69% for the 2nd order Edgeworth expansions, and by 98% for the 3rd order Edgeworth expansions.
When q = 2 , this reduction of terms was used in Section 4 of [14].
Example 3.
In Example 3.1, using the expression for u 3 in Theorem 4.1,
6 p ˜ 1 ( x ) = i 1 = 1 q κ i 1 3 H i 1 3 + 3 q ( q 1 ) κ i 1 2 i 2 H i 1 2 i 2 + 6 i 1 > i 2 > i 3 q 3 κ i 1 i 2 i 3 H i 1 i 2 i 3 , 6 P 1 ( x ) = i 1 = 1 q κ i 1 3 H * i 1 3 + 3 q ( q 1 ) κ i 1 2 i 2 H * i 1 2 i 2 + 6 i 1 > i 2 > i 3 q 3 κ i 1 i 2 i 3 H * i 1 i 2 i 3 , 6 P 1 ( C ) = i 1 = 1 q κ i 1 3 H * i 1 3 ( C ) + 3 q ( q 1 ) κ i 1 2 i 2 H * i 1 2 i 2 ( C ) + 6 i 1 > i 2 > i 3 q 3 κ i 1 i 2 i 3 H * i 1 i 2 i 3 ( C ) .
As P ¯ 1 1 = P ¯ 2 12 = 0 , L q 1 in (21), and L q 2 in (22) need to be deleted.

5. Functions of a Vector Sample Mean

Let X ¯ be the sample mean from a non-lattice distribution on R q with mean μ , and finite cross cumulants κ ¯ 1 r = κ j 1 j r , r 1 . Let t ( . ) : R q R p be a function with i k th component t ¯ k ( . ) = t i k ( . ) having finite derivatives at μ ,
t ¯ r s k = j r j s t k ( μ ) ,   where   j = / μ j , s r .
So we expand the bar convention used earlier using j 1 , j 2 , in 1 , , p as well as i 1 , i 2 , in 1 , , q as before. Here we have a notation dilemma. We chose t ( . ) : R q R p , as this allows us to keep the notation k ¯ d 1 r = k d i 1 i r and P ¯ r 1 k = P r i 1 i k , as used earlier. However, now w ^ R p not R q , so that implicit summation in, say P ¯ r 1 k = P r i 1 i k , as used earlier. So now, implicit summation in u k = P ¯ r 1 k H ¯ 1 k is over i 1 , , i k in 1 , , p , not in 1 , , q , and this u k has p k terms, reducible to L p k using Theorem 4.1.
If I had chosen t ( . ) : R p R q , then k ¯ d 1 r and P ¯ r 1 k would have had to be reinterpreted as k d j 1 j r and P r j 1 j k , which would likely be confusing.
Let us use f i 1 i k p k , to mean f i 1 i k has magnitude p k from summing i 1 , , i k over 1 , , p .
S e t 2 t ¯ a b 1 t ¯ c d 2 = t ¯ a b 1 t ¯ c d 2 + t ¯ a b 2 t ¯ c d 1 .
More generally, for π 1 , , π r any partition of 1 , , k , let N t ¯ π 1 1 t ¯ π r r denote the sum of t π 1 b 1 t π r b r over all N permutations b 1 , , b r of j 1 , , j r , giving distinct values. For example,
3 t ¯ 13 1 t ¯ 2 2 t ¯ 4 3 = t ¯ 13 1 t ¯ 2 2 t ¯ 4 3 + t ¯ 13 2 t ¯ 2 1 t ¯ 4 3 + t ¯ 13 3 t ¯ 2 2 t ¯ 4 1 .
I now give the cumulant coefficients, k ¯ d 1 r of (3), for w ^ = t ( X ¯ ) , and track their magnitude in p from summing i 1 , , i k over 1 , , p .
Theorem 3.
For w ^ = t ( X ¯ ) : R q R p , (3) holds, where the cumulant coefficients,k the k ¯ d 1 r = k d i 1 i r , needed for the 3rd order Edgeworth expansions of order s = 1 , 2 , 3 , 4 , are given by
F o r   s = 1 : k ¯ 1 12 = V i 1 i 2 = t ¯ 1 1 t ¯ 2 2 κ ¯ 12 = j 1 , j 2 = 1 q t j 1 i 1 t ¯ j 2 i 2 κ j 1 j 2 q 2 , F o r   s = 2 : k ¯ 1 1 = t ¯ 12 1 κ ¯ 12 / 2 q 2 , k ¯ 2 1 3 = t ¯ 1 1 t ¯ 2 2 t ¯ 3 3 κ ¯ 1 3 + T 1 4 1 3 κ ¯ 12 κ ¯ 34 q 2 + q 4 q 4 , w h e r e   T 1 4 1 3 = 3 t ¯ 13 1 t ¯ 2 2 t ¯ 4 3 . F o r   s = 3 : k ¯ 2 12 = T 1 3 12 κ ¯ 1 3 / 2 + T 1 4 12 κ ¯ 12 κ ¯ 34 / 2 q 3 + q 4 q 4 , w h e r e T 1 3 12 = 2 t ¯ 12 1 t ¯ 3 2 , T 1 4 12 = 2 t ¯ 1 3 1 t ¯ 4 2 + t ¯ 13 1 t ¯ 24 2 ,
k ¯ 3 1 4 = t ¯ 1 1 t ¯ 2 2 t ¯ 3 3 t ¯ 4 4 κ ¯ 1 4 + T 1 5 1 4 κ ¯ 1 3 κ ¯ 45 + T 1 6 1 4 κ ¯ 12 κ ¯ 34 κ ¯ 56 q 4 + q 5 + q 6 q 6 , w h e r e   T 1 5 1 4 = 12 t ¯ 14 1 t ¯ 2 2 t ¯ 3 3 t ¯ 5 4 , T 1 6 1 4 = 4 t ¯ 135 1 t ¯ 2 2 t ¯ 4 3 t ¯ 6 4 + 12 t ¯ 13 1 t ¯ 25 2 t ¯ 4 3 t ¯ 6 4 . F o r   s = 4 : k ¯ 2 1 = t ¯ 1 3 1 κ ¯ 1 3 / 6 + t ¯ 1 4 1 κ ¯ 12 κ ¯ 34 / 8 q 3 + q 4 q 4 , k ¯ 3 1 3 = T 1 4 1 3 κ ¯ 1 4 / 2 + T 1 5 1 3 κ ¯ 1 3 κ ¯ 45 + T 1 6 1 3 κ ¯ 12 κ ¯ 34 κ ¯ 56 q 4 + q 5 + q 6 q 6 , w h e r e   T 1 4 1 3 = 3 t ¯ 13 1 t ¯ 2 2 t ¯ 4 3 , T 1 5 1 3 = 6 t ¯ 124 1 t ¯ 3 2 t ¯ 5 3 / 2 + 3 t ¯ 145 1 t ¯ 2 2 t ¯ 3 3 / 2 + 6 t ¯ 12 1 t ¯ 34 2 t ¯ 5 3 / 2 + 3 t ¯ 14 1 t ¯ 25 2 t ¯ 3 3 , T 1 6 1 3 = 3 t ¯ 123 5 1 t ¯ 4 2 t ¯ 6 3 / 2 + 6 t ¯ 1 3 1 t ¯ 45 2 t ¯ 6 3 + 6 t ¯ 135 1 t ¯ 24 2 t ¯ 6 3 / 2 + t ¯ 13 1 t ¯ 25 2 t ¯ 46 3 . k ¯ 4 1 5 = t ¯ 1 1 t ¯ 5 5 κ ¯ 1 5 + T 1 6 1 5 κ ¯ 1 4 κ ¯ 56 + U 1 6 1 5 κ ¯ 1 3 κ ¯ 4 6 + T 1 7 1 5 κ ¯ 1 3 κ ¯ 45 κ ¯ 67 + T 1 8 1 5 κ ¯ 12 κ ¯ 34 κ ¯ 56 κ ¯ 78 p 5 + + p 8 p 8 , w h e r e   T 1 6 1 5 = 20 t ¯ 15 1 t ¯ 2 2 t ¯ 3 3 t ¯ 4 4 t ¯ 6 5 , U 1 6 1 5 = 15 t ¯ 14 1 t ¯ 2 2 t ¯ 3 3 t ¯ 5 4 t ¯ 6 5 , T 1 7 1 5 = 30 t ¯ 146 1 t ¯ 2 2 t ¯ 3 3 t ¯ 5 4 t ¯ 7 5 + 60 t ¯ 14 1 t ¯ 26 2 t ¯ 3 3 t ¯ 5 4 t ¯ 7 5 + 60 t ¯ 14 1 t ¯ 56 2 t ¯ 2 3 t ¯ 3 4 t ¯ 7 5 , T 1 8 1 5 = 5 t ¯ 1357 1 t ¯ 2 2 t ¯ 4 3 t ¯ 6 4 t ¯ 8 5 / 5 + 60 t ¯ 135 1 t ¯ 27 2 t ¯ 4 3 t ¯ 6 4 t ¯ 8 5 + 60 t ¯ 13 1 t ¯ 25 2 t ¯ 47 3 t ¯ 6 4 t ¯ 8 5 ,
PROOF This is a special case of Theorem 2 of [13] with k ¯ d 1 r replaced by κ ¯ 1 r I ( d = r 1 ) .
Lemma 1.
For f ¯ 1 k = f i 1 i k R , and r = 1 , 2 , 3 ,
P ¯ r 1 k q r + k . S o , P ¯ r 1 k f ¯ 1 k q r + k p k .
S e t   P r = k = 1 3 r [ P ¯ r 1 k f ¯ 1 k : k r   e v e n ] .   T h e n   P r q 4 r p 3 r .
PROOF Use Theorem 5.1 to check that for each of the Edgeworth coefficients given by (4)–(5), P ¯ r 1 k q r + k . The dominant term in P r is for k = 3 r . So, (23) holds. □
Theorem 4.
Set w ^ = t ( X ¯ ) : R q R p . Suppose that w ^ is non-lattice, that E w ^ w , and that (3) holds.. Suppose that as n ,   p n q n = p q , and that
ν n = n 1 / 2 q n 4 p n 3 0 .   T h a t   i s ,   q n 8 p n 6 = o ( n ) .
Then (15)–(17) hold for s = 1 , 2 , 3 , 4 with ν n of (24).
PROOF For P r of (23), n r / 2 P r ν n r . Now take f ¯ 1 k of (20). □ For example, (24) holds for fixed p = p n if q n = o ( n 1 / 8 ) , and for fixed q = q n if p n = o ( n 1 / 6 ) .
We now apply Theorem 4.1 to the components of Theorem 5.1.
Theorem 5.
F o r   s = 1 : k ¯ 1 12 = V i 1 i 2 L q 2 . F o r   s = 2 : k ¯ 1 1 L q 2 ,   r e d u c e d   f r o m   q 2 , k ¯ 2 1 3 L q 3 + 3 L q 4 ,   r e d u c e d   f r o m   q 3 + 3 q 4 . F o r   s = 3 : k ¯ 2 12 2 L q 3 + 3 L q 4 , r e d u c e d   f r o m   2 q 3 + 3 q 4 , k ¯ 3 1 4 L q 4 + 12 L q 5 + 16 L q 6 ,   r e d u c e d   f r o m   q 4 + 12 q 5 + 16 q 6 . F o r   s = 4 : k ¯ 2 1 L q 3 + L q 4 ,   r e d u c e d   f r o m   q 3 + q 4 , k ¯ 3 1 3 3 L q 4 + 18 L q 5 + 16 L q 6 ,   r e d u c e d   f r o m   3 q 4 + 18 q 5 + 16 q 6 , k ¯ 4 1 5 L q 5 + ( 20 + 15 ) L q 6 + 150 L q 7 + 125 L q 8 , r e d u c e d   f r o m   q 5 + ( 20 + 15 ) q 6 + 150 q 7 + 125 q 8 .
Let us write these as
k ¯ d 1 r N r d .
For example, L q 2 / q 2 = ( q + 1 ) / 2 q 1 / 2 as q , and for q = 4 , the number of calculations needed for k ¯ 2 1 3 is reduced by the factor N 32 / ( q 3 + 3 q 4 ) = ( L q 3 + 3 L q 4 ) / ( q 3 + 3 q 4 ) = 3 / 26 .
One can now work out similar results for T i 1 i k of (20, and so for the terms of the Edgeworth expansions, p ˜ r ( x ) , P r ( x ) and P r ( C ) . For example,
p ˜ 11 = P ¯ 1 1 H ¯ 1 = k ¯ 1 1 H ¯ 1 p N 11 = p L q 2 , p ˜ 13 = P ¯ 1 1 3 H ¯ 1 3 k 2 1 3 H ¯ 1 3 L p 3 N 32 , p ˜ 1 ( x ) p N 11 + L p 3 N 32 = p L q 2 + L p 3 ( L q 3 + 3 L q 4 ) ,
and T i 1 i 2 = P 2 i 1 i 2 H i 1 i 2 without implicit summation,
p ˜ 22 = P ¯ 2 12 H ¯ 12 = i 1 = 1 p T i 1 2 + 2 i 1 > i 2 p 2 T i 1 i 2 p L q 2 + p 2 L q 0 .

6. Conclusions

Let w ^ be a standard estimate of an unknown w R q . That is, w ^ is a consistent estimate, and for r 1 , its rth order cumulants have magnitude n 1 r , and can be expanded in powers of n 1 . This is a very large class of estimates. It includes functions of sample moments and empirical distributions, samples of independent but not identically distributed random vectors, and samples from stationary series. Then by §2, for fixed q, and non-lattice estimates, Edgeworth-type expansions hold for the density and distribution of X n = n 1 / 2 ( w ^ w ) , in terms of the Edgeworth coefficients given in [14] for s 4 . For s 1 , their first s terms of the three Edgeworth expansions, give the density and distribution of X n , and P ( X n C ) , to O ( n s / 2 ) as n . Theorem 3.1 shows that this remains true when q n = q , if the remainder, O ( n s / 2 ) , is replaced by O ( ν n s ) , where ν n = n 1 / 2 q n 3 . So the three Edgeworth expansions hold when ν n , that is, when q n = o ( n 1 / 6 ) .
Theorem 4.1 gives formulas that dramatically reduce the number of terms needed by 2nd and 3rd order Edgeworth-type expansions, that is, for 1st and 2nd order corrections to the CLT.
When w ^ = t ( X ¯ ) : R q R p , and p n q n = p q , Theorem 5.2 shows that the three 4th order Edgeworth expansions hold if q n 8 p n 6 = o ( n ) . For example, this holds for fixed p = p n if q n = o ( n 1 / 8 ) .

7. Discussion

Theorem 3.1, showed that the three Edgeworth expansions considered here, hold for standard estimates when q n = o ( n 1 / 6 ) . Is this result optimal? It is certainly not optimal for a sample mean. For, as noted in Section 1, Theorem 2.1 of the recent paper [7] gives conditions for a 2nd order Edgeworth expansion for the distribution of a sample mean, when q n = O ( n c ) for any c, or even when q n = O ( exp ( b n c ) ) and c < 1 / 3 .
It will be interesting to see if such weak conditions on q n can be extended to a wide class of estimates, such as standard estimates.
Theorem 5.2 showed that the three 4th order Edgeworth expansions hold for w ^ = t ( X ¯ ) : R q R p , when q n 8 p n 6 = o ( n ) . It will be interesting to see how much this condition can be weakened. It should not be hard to extend Theorem 5.2 to w ^ = T ( F n ) R q , where F n ( x ) is the empirical distribution of arandom sample of size n from a distribution F ( x ) on R p .
One can also extend this to w ^ a function of K independent sample means, w ^ = t ( X ¯ 1 , , X ¯ K ) : R q 1 × × R q K R p .
The method employed here should also be able to extend many of the results in the references from a sample mean to a standard estimate.

References

  1. Chernozhukov, V., Chetverikov, D., and Kato, K. (2013) Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors. Ann. Stat., 41 (6), 2786–2819.
  2. Chernozhukov, V., Chetverikov, D., and Kato, K. (2017). Central limit theorems and bootstrap in high dimensions. Ann. Probab., 45 (4), 2309–2352. arXiv:1412.3661.
  3. Donoho, D. and Montanari, A. (2016) High dimensional robust m-estimation: Asymptotic variance via approximate message passing. Probability Theory and Related Fields, Springer.
  4. Fang, X. and Koike, Y. (2021). High-dimensional central limit theorems by Stein’s method. Ann. Appl. Probab. 31, 1660–1686.
  5. Fujikoshi, Y. and Sakurai, T. (2009) High-dimensional asymptotic expansions for the distributions of canonical correlations. Journal of Multivariate Analysis, 100 (1), 231–242.
  6. Fujikoshi, Y., Ulyanov, V. V., and Shimizu, R. (2011). Multivariate statistics: High-dimensional and large-sample approximations. John Wiley and Sons.
  7. Koike, Yuta (2025). High-dimensional bootstrap and asymptotic expansion. arXiv preprint arXiv:2404.05006.
  8. Kosorok, M. and Ma, S. (2007). Marginal asymptotics for the large p, small n paradigm: With application to microarray data. Ann. Statist., 35, 1456–1486. MR2351093.
  9. Kuelbs,J. and Vidyashankar, A.N. (2010). Asymptotic inference for high-dimensional data. Annals of Statistics, 38 (2) 836–869.
  10. Portnoy, S. (1984). Asymptotic behavior of M-estimators of p regression parameters when p2/n is large. I. Consistency. Ann. Statist., 12, 1298–1309. MR0760690.
  11. Portnoy, S. (1985). Asymptotic behavior of M-estimators of p regression parameters when p2/n is large. II: normal approximation, Ann. Statist., 13, 1403–1417.
  12. Withers, C.S. (2000) A simple expression for the multivariate Hermite polynomials. Stat. Prob. Lett., 47, 165–169.
  13. Withers, C.S. (2024) 5th order multivariate Edgeworth expansions for parametric estimates. Mathematics, 2024, 12 (6), 905, Advances in Applied Probability and Statistical Inference. https://www.mdpi.com/2227-7390/12/6/905/pdf. [CrossRef]
  14. Withers, C.S. (2025) Edgeworth coefficients for standard multivariate estimates. New Perspectives in Mathematical Statistics, 2nd Edition. Axioms. Page 5 has 2 typos. k 1 1 in (19) should be k ¯ 1 1 . Delete /12 is S3, 2 lines before (21).
  15. Withers, C.S. and Nadarajah, S. (2010) Tilted Edgeworth expansions for asymptotically normal vectors. Annals of the Institute of Statistical Mathematics, 62 (6), 1113–1142. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated