Preprint
Article

This version is not peer-reviewed.

Edgeworth-Cornish-Fisher Expansions for the Mean When Sampling from a Stationary Process

A peer-reviewed article of this preprint also exists.

Submitted:

09 April 2025

Posted:

11 April 2025

You are already at the latest version

Abstract
We give the Edgeworth-Cornish-Fisher expansions for the distribution, density and quantiles of the sample mean of a stationary process.
Keywords: 
;  ;  

1. Introduction and Summary

The behaviour of a standard estimate, as described by its Edgeworth-Cornish-Fisher expansions, is governed by the coefficients obtained by expanding its cumulants. For the simplest case, the mean of independent identically distributed random variables, the cumulant expansion has only one term. In Section 2 we summarise Edgeworth-Cornish-Fisher expansions for a standard estimate.
In Section 3 and Section 4 we apply this to the mean of a sample from a stationary process for univariate and multivariate series. Remarkably, we show that for the sample mean of a stationary process, its cumulant expansion has exactly 2 terms.
Suppose that X ¯ is the mean of a sample X 1 , , X n from a stationary process { X i } in R p . So X ¯ is an unbiased estimate of μ = E X 0 . In Section 2 we show that when p = 1 , for r > 1 , its rth cumulant has the form
κ r ( X ¯ ) = a n r , r 1 n 1 r + a n r r n r .
where a n r i are bounded as n increases, and a n 21 is bounded away from 0. This makes it a special case of a standard estimate, so that Section 2 applies with a r i = a n r i for i = r 1 , r and a r i = 0 for i > r .
If a n r i = a r i + O ( e n λ r ) where λ r i > 0 , then a n r i can be replaced by a r i . Here x n = O ( y n ) means that x n / y n is bounded.
We also consider the case where observations are not sequential, as for missing data. And we consider unbiased weighted means.

2. Edgeworth-Cornish-Fisher Theory

Here we summarise the expansions of Withers (1984) for the distribution and quantiles of a standard estimate. In Section 3 we shall show that (1) holds, so that the sample mean is a special case of a standard estimate.
Univariate estimates. An estimate w ^ of an unknown w R is said to be a standard estimate with respect to n, if E w ^ w as n , and for r 1 , its rth cumulant can be expanded as
κ r ( w ^ ) i = r 1 a r i n i .
The cumulant coefficients  a r i may depend on n but are bounded as n , and a 21 is bounded away from 0. Here and below ≈ indicates an asymptotic expansion that need not converge. That is, (2) holds in the sense that
f o r I r , κ r ( w ^ ) = i = r 1 I 1 a r i n i + O ( n I ) .
For non-lattice estimates, the distribution and quantiles of
Y n = ( n / a 21 ) 1 / 2 ( w ^ w )
have asymptotic expansions in powers of n 1 / 2 :
P n ( x ) = P ( Y n x ) Φ ( x ) ϕ ( x ) r = 1 h r ( x ) n r / 2 ,
p n ( x ) = d P n ( x ) / d x ϕ ( x ) [ 1 + r = 1 h ¯ r ( x ) n r / 2 ] ,
Φ 1 ( P n ( x ) ) x r = 1 f r ( x ) n r / 2 , P n 1 ( Φ ( x ) ) x + r = 1 g r ( x ) n r / 2 ,
where Φ and φ are the unit normal distribution and density of N N ( 0 , 1 ) , and h r ( x ) , h ¯ r ( x ) , f r ( x ) , g r ( x ) are polynomials in x and the standardized cumulant coefficients { A r i } ,
A r i = a r i / a 21 r / 2 .
The expansions (2), (4), are given in Withers (1984):
h 1 ( x ) = f 1 ( x ) = g 1 ( x ) = A 11 + A 32 H 2 / 6 , h ¯ 1 ( x ) = A 11 H 1 + A 32 H 3 / 6 , h 2 ( x ) = ( A 11 2 + A 22 ) H 1 + ( A 11 A 32 + A 43 ) H 3 / 6 + A 32 2 H 5 / 72 , f 2 ( x ) = ( A 22 / 2 A 11 A 32 / 3 ) H 1 + A 43 H 3 / 24 A 32 2 ( 4 x 3 7 x ) / 36 , g 2 ( x ) = A 22 H 1 / 2 + A 43 H 3 / 24 A 32 2 ( 4 x 3 7 x ) / 36 , h ¯ 2 ( x ) = ( A 11 2 + A 22 ) H 2 + ( A 11 A 32 + A 43 ) H 4 / 6 + A 32 2 H 6 / 72 ,
where H k is the kth Hermite polynomial,
H k = H k ( x ) = ϕ ( x ) 1 ( d / d x ) k ϕ ( x ) = E ( x + i N ) k f o r k 0 , i = 1 : H 0 = 1 , H 1 = x , H 2 = x 2 1 , H 3 = x 3 3 x , H 4 = x 4 6 x 2 + 3 , H 5 = x 5 10 x 3 + 15 x , H 6 = x 6 15 x 4 + 45 x 2 15 ,
See Withers (2000) for (7). The log density has a simpler form than the density:
ln [ p n ( x ) / ϕ ( x ) ] = r = 1 b r ( x ) n r / 2 , b 1 ( x ) = h 1 ( x ) , b 2 ( x ) = A 11 2 / 2 + ( A 22 A 32 A 11 ) H 2 / 2 A 32 2 ( 3 x 4 12 x 2 + 5 ) / 24 + A 43 H 4 / 24 ,
and for r > 1 , b r ( x ) is a polynomial of order only r + 2 , while h ¯ r ( x ) is of order 3 r . See Withers and Nadarajah (2010a) for h ¯ r ( x ) and b r ( x ) .
Notation 1. 
The original Edgeworth expansion was for w ^ the mean of n independent identically distributed random variables from a distribution with rth cumulant κ r . So (1) holds with a r i = κ r I ( i = r 1 ) , and A i i 0 . An explicit formula for its general term was given in Withers and Nadarajah (2009).
Ordinary Bell polynomials. For a sequence e = ( e 1 , e 2 , ) ,  the partial ordinary Bell polynomial  B ˜ r s = B ˜ r s ( e ) , is defined by the identity
S s = r = s z r B ˜ r s ( e ) w h e r e S = r = 1 z r e r , z R . S o , B ˜ r 0 = δ r 0 , B ˜ r 1 = e r , B ˜ r r = e 1 r , B ˜ 21 = 2 e 1 e 2 ,
where δ 00 = 1 , δ r 0 = 0 for r 0 . They are tabled on p309 of Comtet (1974). The complete ordinary Bell polynomial, B ˜ r ( e ) is defined in terms of S by
e S = r = 0 z r B ˜ r s ( e ) . S o B ˜ r ( e ) = s = 0 r B ˜ r s ( e ) / s ! :
B ˜ 0 ( e ) = 1 , B ˜ 1 ( e ) = e 1 , B ˜ 2 ( e ) = e 2 + e 1 2 / 2 , B ˜ 3 ( e ) = e 3 + e 1 e 2 + e 1 3 / 6
Multivariate estimates. Suppose that w ^ is a standard estimate of w R p with respect to n. That is, E w ^ w as n , and for r 1 , 1 i 1 , , i r p , the rth order cumulants of w ^ can be expanded as
k ¯ 1 r = κ ( w ^ i 1 , , w ^ i r ) = e = r 1 k ¯ e 1 r n e , k ¯ e 1 r = k e i 1 i r ,
where the cumulant coefficients  k ¯ e 1 r = k e j 1 j r may depend on n but are bounded as n . So k ¯ 0 1 = w i 1 . Set V = ( k 2 i 1 i 2 ) , p × p . Y n converges in law to the multivariate normal N p ( 0 , V ) with p × p covariance V and distribution and density Φ V ( x ) and ϕ V ( x ) . So V may depend on n, but we assume that d e t ( V ) is bounded away from 0. By Withers and Nadarajah (2010b) or Withers (2024), Y n = n 1 / 2 ( w ^ w ) has distribution and density
P r o b . ( Y n x ) r = 0 n r / 2 P r ( x ) , p Y n ( x ) r = 0 n r / 2 p r ( x ) , x R p ,
where ( b 1 ) i = b 1 i , P 0 ( x ) = Φ V ( x ) , p 0 ( x ) = ϕ V ( x ) ,
P r ( x ) = B ˜ r ( e ( / x ) ) Φ V ( x ) , p r ( x ) = B ˜ r ( e ( / x ) ) ϕ V ( x ) , r 1 ,
e j ( s ) = r = 1 j + 2 b ¯ r + j 1 r s i 1 s i r / r ! , b ¯ r + j 1 r = b r + j i 1 i r , b 2 d + 1 i 1 i r = 0 , b 2 d i 1 i r = k d i 1 i r : e 1 = k ¯ 1 1 t ¯ 1 + k ¯ 2 1 3 t ¯ 1 t ¯ 2 t ¯ 3 / 6 , e 2 = k ¯ 2 12 t ¯ 1 t ¯ 2 / 2 + k ¯ 3 1 4 t ¯ 1 t ¯ 4 / 24 ,
This gives the Edgeworth expansion for the distribution of Y n to O ( n 3 / 2 ) . See Withers (2024) for more terms. (15) uses the tensor summation convention of implicitly summing i 1 , , i r over their range 1 , , p . For example,
f o r i = / x i a n d ¯ k = i k , P 1 ( x ) = e 1 ( / x ) ) Φ V ( x ) = r = 1 3 b ¯ r + 1 1 r ( ¯ 1 ) ( ¯ r ) Φ V ( x ) / r ! = k ¯ 1 1 ( ¯ 1 ) Φ V ( x ) + k ¯ 2 1 3 ( ¯ 1 ) ( ¯ 2 ) ( ¯ 3 ) Φ V ( x ) / 6 , p 1 ( x ) = k ¯ 1 1 ( ¯ 1 ) ϕ V ( x ) + k ¯ 2 1 3 ( ¯ 1 ) ( ¯ 2 ) ( ¯ 3 ) ϕ V ( x ) / 6 . ( ¯ 1 ) ( ¯ k ) ϕ V ( x ) = H ¯ 1 k ( x , V ) ϕ V ( x ) ,
for H ¯ 1 k = H ¯ 1 k ( x , V ) the multivariate Hermite polynomial. For their dual form see Withers and Nadarajah (2014). By Withers (2020), for i = 1 ,
H ¯ 1 k ( x , V ) = E Π j = 1 k ( y ¯ j + i Y ¯ j ) w h e r e y ¯ j = y i j , Y ¯ j = Y i j , y = V 1 x , Y N p ( 0 , V 1 ) . S o , H 1 = y 1 , H ¯ 1 = y ¯ 1 , H 12 = y 1 y 2 V 12 , H ¯ 12 = y ¯ 1 y ¯ 2 V ¯ 12 , H 1 3 = y 1 y 2 y 3 3 V 12 y 3 , H 1 4 = y 1 y 4 6 V 12 y 3 y 4 + 3 V 12 V 34 , H 1 5 = y 1 y 5 10 V 12 y 3 y 5 + 15 V 12 V 34 y 5 , H 1 6 = y 1 y 6 15 V 12 y 3 y 6 + 45 V 12 V 34 y 5 y 6 45 V 12 V 34 V 56 ,
where V i 1 i 2 is the ( i 1 , i 2 ) element of V 1 ,   V ¯ 2 j 1 j is the ( i j 1 , i j 2 ) element of V 1 , and for example,
3 V 12 y 3 = V 12 y 3 + V 13 y 2 + V 23 y 1 .
For r 1 , we can write
B ˜ r ( e ( s ) ) = k = 1 3 r [ B ¯ r 1 k s ¯ 1 s ¯ k : k r e v e n ] , w h e r e B ¯ 1 1 = k ¯ 1 1 , B ¯ 1 1 3 = k ¯ 2 1 3 / 6 , B ¯ 2 12 = k ¯ 1 1 k ¯ 1 2 + k ¯ 2 12 / 2 , B ¯ 2 1 4 = k ¯ 3 1 4 / 24 + k ¯ 1 1 k ¯ 2 2 4 / 6 + k ¯ 1 4 k ¯ 2 1 3 / 6 , B ¯ 2 1 6 = k ¯ 2 1 3 k ¯ 2 4 6 / 36 . S o , P r ( x ) = k = 1 3 r [ B ¯ r 1 k ( ¯ 1 ) ( ¯ k ) Φ V ( x ) : k r e v e n ] , p r ( x ) / ϕ V ( x ) = k = 1 3 r [ B ¯ r 1 k H ¯ 1 k ( x , V ) : k r e v e n ] = p ˜ r ( x ) s a y . S o , p ˜ 1 ( x ) = k ¯ 1 1 H ¯ 1 ( x , V ) + k ¯ 2 1 3 H ¯ 1 3 ( x , V ) / 6 , P 2 ( x ) = k = 2 , 4 , 6 B ¯ 2 1 k ( ¯ 1 ) ( ¯ k ) Φ V ( x ) , p ˜ 2 ( x ) = k = 2 , 4 , 6 B ¯ 2 1 k H ¯ 1 k ( x , V ) .
The log density can be expanded as
ln [ p n ( x ) / ϕ V ( x ) ] r = 1 n r / 2 b r ( x ) .
S o p n ( x ) / ϕ V ( x ) ] r = 0 n r / 2 B ˜ r ( b ( x ) ) w h e r e b = ( b 2 , b 2 , ) .
See Withers and Nadarajah (2016). Cornish-Fisher expansions for parametric and nonparametric standard estimates were first given in Withers (1984) and Withers (1983), and extended to functions of them in Withers (1982). In Withers and Nadarajah (2012) we gave Cornish-Fisher expansions for smooth functions of the sample cross-moments of a linear process. In Section 3 we show that this extends easily to a stationary process.

3. The Cumulants of the Sample Mean

Consider the general real stationary process , X 1 , X 0 , X 1 , with finite mean and cross-cumulants,
μ = E X 0 , k ( i 1 , , i r ) = κ ( X i 1 , , X i r ) .
Given a sequence of integers i 1 , , i r , set
i 0 = min k = 1 r i k , I k = i k i 0 0 , n ( i 1 i r ) = I 0 = max k = 1 r I k = max k = 1 r i k i 0 .
Since { X i } is stationary,
k ( i 1 i r ) = k ( I 1 I r ) .
These are not changed by permuting subscripts. Also at least one I k is zero. E X ¯ = μ . For r 2 , transforming from i k to T k = i k i 1 for k = 2 , , r ,
n r κ r ( X ¯ ) = i 1 , , i r = 1 n k ( i 1 , , i r ) = | T k | < n , k = 2 , , r ( n δ r ( T ) ) k ( 0 , T 2 , , T r )
where δ r ( T ) = max ( 0 , T 2 , , T r ) min ( 0 , T 2 , , T r ) .
For example, , δ 2 ( T ) = | T 2 | , δ 3 ( T ) = T 3 I ( 0 T 2 < T 3 ) + ( T 3 T 2 ) I ( T 2 < 0 < T 3 ) T 2 I ( T 2 < T 3 0 ) . So for r 2 , κ r ( X ¯ ) = i = r 1 r a n r i n i where a n r , r 1 = | T i | < n , i = 2 , , r k ( 0 , T 2 , , T r ) , a n r r = | T i | < n , i = 2 , , r δ r ( T ) k ( 0 , T 2 , , T r ) .
This proves that (1) holds. So the Edgeworth-Cornish-Fisher expansions of Section 2 apply to ( w ^ , w ) = ( X ¯ , μ ) with a r i in (5) replaced by these a n r i .
If the cross-cumulants k ( 0 , T 2 , , T r ) decrease exponentially in T 2 , as is true for a stationary ARMA process by Withers and Nadarajah (2012), then for r 2 ,
a n r , r 1 = a r , r 1 + O ( e n λ r ) , a n r r = a r r + O ( e n λ r ) w h e r e λ r > 0 , a r , r 1 = | T i | < , i = 2 , , r k ( 0 , T 2 , , T r ) , a r r = | T i | < , i = 2 , , r δ r ( T ) k ( 0 , T 2 , , T r ) ,
so that the Edgeworth-Cornish-Fisher expansions of Section 2 apply to
( w ^ , w ) = ( X ¯ , μ ) with a r i in (5) replaced by these a r i .
For convergence in law of n 1 / 2 ( X ¯ μ ) ) to N ( 0 , a 21 ) with
a 21 = T = k ( 0 , T ) < , under mixing conditions on a stationary process, see Section 18.4, 18.5 of Ibragimov and Linnik (1971). They also show how to express a 21 n and a 21 in terms of the spectral distribution and density.
Missing values. Now suppose that we only have observations at times t 1 , , t n . Our estimate of μ = E X 0 is then
μ ^ = μ ^ ( t 1 , , t n ) = n 1 i = 1 n X t i . S o E μ ^ = μ , a n d f o r r 2 , S k = t i k t i 1 , n r κ r ( μ ^ ) = i 1 , , i r = 1 n k ( t i 1 , , t i r ) = i 1 , , i r = 1 n k ( 0 , S 2 , , S r ) = n a r , r 1 s a y .
So if a 21 is bounded away from 0 and a r , r 1 is bounded in n, we can apply Section 2 with a r i = 0 for i r .
Weighted means. Let w n 1 , , w n n be given real numbers adding to n. For example the standardized form of the Chernoff weight i / n is w n i = 2 i / ( n + 1 ) . See Chernoff and Zacks(1964). An unbiased estimate of μ is the weighted sample mean, μ ^ w = i = 1 n w n i X i .
F o r r 2 , n r κ r ( μ ^ w ) = i 1 , , i r = 1 n w n i 1 w n i r k ( t i 1 , , t i r ) = n a r , r 1
say. So if a 21 is bounded away from 0 and a r , r 1 is bounded in n, we can apply Section 2 with a r i = 0 for i r .

4. Multivariate Edgeworth Expansions for X ¯

Suppose that , X 1 , X 0 , X 1 , lie in R p and are stationary with finite moments. For j = 1 , , p , denote the jth component of X i by X i j and the and the cross-cumulants, by
μ = E X 0 , μ j = E X 0 j , and for 1 j 1 , , j r p , % M i 1 i r j 1 j r = E X i 1 j 1 X i r , k j 1 j r i 1 , , i r = κ ( X i 1 j 1 , , X i r j r ) .
Given a sequence of integers i 1 , , i r , define i 0 , I k as in (2), and again transform from i k to T k = i k i 1 for k = 2 , , r . (3) becomes
k j 1 j r i 1 i r = k j 1 j r I 1 I r .
In general k j 1 j 2 0 , I k j 1 j 2 0 , I . By (4),
n r κ ( X ¯ j 1 , , X ¯ j r ) = i 1 , , i r = 1 n k j 1 j r i 1 , , i r = | T k | < n , k = 2 , , r ( n δ r ( T ) ) k j 1 j r 0 , T 2 , , T r . So for r 2 , κ ( X ¯ j 1 , , X ¯ j r ) = e = r 1 r k n e j 1 j r n e where k n , r 1 j 1 j r = | T i | < n , i = 2 , , r k j 1 j r 0 , T 2 , , T r , and k n r j 1 j r = | T i | < n , i = 2 , , r δ r ( T ) k j 1 j r 0 , T 2 , , T r .
This proves that a 2 term version of (11) holds. So the expansions (12)–(14) and (16) hold for the distribution and density of n 1 / 2 ( X ¯ μ ) .
If the cross-cumulants k ( j 1 j r 0 , T 2 , , T r ) decrease exponentially in T 2 , then for r 2 ,
k n , r 1 j 1 j r = k r 1 j 1 j r + O ( e n λ r ) , and k n r j 1 j r = k r j 1 j r + O ( e n λ r ) where λ r > 0 , k r 1 j 1 j r = | T i | < , i = 2 , , r k j 1 j r 0 , T 2 , , T r , and k r j 1 j r = | T i | < , i = 2 , , r δ r ( T ) k j 1 j r 0 , T 2 , , T r .

References

  1. Anderson, T. W. (1958) An introduction to multivariate analysis. John Wiley, New York.
  2. Chernoff, H. & Zacks, S. (1964) Estimating the current mean of a normal distribution which is subjected to changes in time. Ann. Math. Statist., 35, 999-1018.
  3. Comtet, L. Advanced Combinatorics; Reidel: Dordrecht, The Netherlands, 1974.
  4. Ibragimov, I.A. and Linnik, Y.V. (1971) Independent and stationary sequences of random variables. Wolters-Noordhoff, The Netherlands.
  5. Withers, C.S. (1982) The distribution and quantiles of a function of parameter estimates. Ann. Inst. Statist. Math. A, 34, 55–68. A correction to a22 is given in Appendix A of Withers (2024).
  6. Withers, C.S. (1983) Expansions for the distribution and quantiles of a regular functional of the empirical distribution with applications to nonparametric confidence intervals. Annals Statist., 11 (2), 577–587.
  7. Withers, C.S. (1984) Asymptotic expansions for distributions and quantiles with power series cumulants. Journal Royal Statist. Soc. B, 46, 389–396. Corrigendum (1986) 48, 256. For typos, see p23–24 of Withers (2024).
  8. Withers, C.S. (2020) A simple expression for the multivariate Hermite polynomials. Statistics and Prob. Letters, 47, 165-169.
  9. Withers, C.S. (2024) 5th-Order multivariate Edgeworth expansions for parametric estimates. Mathematics, 12,905, Advances in Applied Probability and Statistical Inference. https://www.mdpi.com/2227-7390/12/6/905/pdf.
  10. Withers, C.S. and Nadarajah, S.N. (2009) Charlier and Edgeworth expansions via Bell polynomials. Probability and Mathematical Statistics, 29, 271–280. For typos, see p24–25 of Withers (2024).
  11. Withers, C.S. and Nadarajah, S.N. (2010a) Expansions for log densities of asymptotically normal estimates, Statistical Papers, 51, 247–257, Springer, DOI 10.1007/s00362-008-0135-2.
  12. Withers, C.S. and Nadarajah, S. (2010b) Tilted Edgeworth expansions for asymptotically normal vectors. Annals of the Institute of Statistical Mathematics, 62 (6), 1113–1142. DOI: 10.1007/s10463-008-0206-0. For typos, see p25 of Withers (2024).
  13. Withers, C.S. and Nadarajah, S. (2012) Cornish-Fisher expansions for sample autocovariances and other functions of sample moments of linear processes. Brazilian Journal of Probability and Statistics, 26 (2), 149–166. DOI:10.1214/10-BJPS126.
  14. Withers, C.S.; Nadarajah, S. (2014) The dual multivariate Charlier and Edgeworth expansions. Stat. Probab. Lett. 2014, 87, 76–85. https://doi.org/10.1016/j.spl.2014.01.003 For typos, see p25 of Withers (2024).
  15. Withers, C.S. and Nadarajah, S. (2016) Expansions for log densities of multivariate estimates. Methodology and Computing in Applied Probability, 18, 911–920. DOI 10.1007/s11009-016-9488-5 https://rdcu.be/6n1k http://link.springer.com/article/10.1007/s11009-016-9488-5.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated