Submitted:
29 September 2025
Posted:
30 September 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Contribution and Novelty
2. Proposed Energy-based Goodness of Fit Test
3. Simulations and Results
3.1. The Univariate Energy Test Statistic
- (1)
- A term involving the integral of up to each data point.
- (2)
- A quantile-based approximation for the inter-distributional expectation .
- (3)
- A linear-order statistic approximation for the within-sample expectation .
3.2. Critical Value Simulation Under Varying and
3.3. Type I Error Control
- (1)
- Generate samples of size from the Skew-t distribution with parameters .
- (2)
- Estimate the parameters using the maximum penalized likelihood method available in the sn package.
- (3)
- Standardize the data to standard Skew-t random variables and compute the energy goodness-of-fit statistic using the formula in equation (8).
- (4)
- Determine the empirical percentile critical value under the null hypothesis.
3.4. Power Analysis under Various Alternatives
- Chi-square: asymmetric, heavy-tailed.
- Standard Gusset’s t: symmetric, heavy-tailed.
- Exponential: lighter tail, positive skew.
- SHASH: The SHASH (Sinh-Arcsinh) distribution that was discussed in [37] is a highly flexible statistical distribution defined by four parameters that separately control the location, scale, skewness, and kurtosis of a variable.
- Generalized t (GT): to assess sensitivity to misspecification.
- Log-normal: asymmetric, heavy-tailed.
3.5. Superior Performance of the Test
- For the Log-Normal () distribution, the test achieved a power of 0.9701 at a sample size of just 50, far exceeding the next best test, Watson test, which had a power of 0.3988.
- Against the GT() distribution, the test reached a power of 1.0000 for sample sizes of 150 and 200, indicating perfect detection in the simulation.
- For the Exponential (Exp(1)) distribution, the power of ranged from 0.8413 to 0.9848, consistently outperforming all other competitors.
3.6. Effect of Sample Size and Parameters
3.7. Summary
- Robust control of Type I error
- Superior power in a wide range of alternatives
- Flexibility for skewed, heavy-tailed, and multi-modal data
4. Real Data Application
- (1)
- Fit the real data with an Azzalini’s Skew-t distribution in Eq. (Section 1), and obtain the maximum likelihood estimates (MLEs) of and from the Azzalini’s sn package available in R.
- (2)
- Use the formula in Eq. (8) to calculate the energy goodness-of-fit statistic of the real data and denote it .
- (3)
- Simulate , a random sample of size from the Azzalini Skew-t distribution with parameters specified as and that were obtained in step 1.
- (4)
- Standardize the data to standard Skew-t distribution and compute the energy goodness-of-fit statistic for the simulated data using the formula in Eq. (8) and denote this value as .
- (5)
- Repeat this process for B times and obtain B energy goodness-of-fit statistics and denote them by .
- (6)
- The bootstrap p-value is therefore approximated aswhere is an indicator function that takes the value of one when and zero otherwise.
4.1. Model Fitting
4.2. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
| 22.46 | 23.88 | 23.68 | 23.15 | 22.32 | 24.02 | 23.29 | 25.11 | 22.81 | 26.25 |
|---|---|---|---|---|---|---|---|---|---|
| 21.38 | 22.52 | 26.73 | 23.57 | 25.84 | 24.06 | 23.85 | 25.09 | 23.84 | 25.31 |
| 19.69 | 26.07 | 25.50 | 23.69 | 26.79 | 25.61 | 25.06 | 24.93 | 22.96 | 20.69 |
| 23.97 | 24.64 | 25.93 | 23.69 | 25.38 | 22.68 | 23.36 | 22.44 | 22.57 | 19.81 |
| 21.19 | 20.39 | 21.12 | 21.89 | 29.97 | 27.39 | 23.11 | 21.75 | 20.89 | 22.83 |
| 22.02 | 20.07 | 20.15 | 21.24 | 19.63 | 23.58 | 21.65 | 25.17 | 23.25 | 32.52 |
| 22.59 | 30.18 | 34.42 | 21.86 | 23.99 | 24.81 | 21.68 | 21.04 | 23.12 | 20.76 |
| 23.13 | 22.35 | 22.28 | 23.55 | 19.85 | 26.51 | 24.78 | 33.73 | 30.18 | 23.31 |
| 24.51 | 25.37 | 23.67 | 24.28 | 25.82 | 21.93 | 23.38 | 23.07 | 25.21 | 23.25 |
| 22.93 | 26.86 | 21.26 | 25.43 | 24.54 | 27.79 | 23.58 | 27.56 | 23.76 | 22.01 |
| 22.34 | 21.07 |
| 0.72012 | 0.04807 | 2.84050 | 1.71094 | 0.03445 | -1.89394 | -0.31005 | -0.57509 | 0.86172 | -0.41548 |
|---|---|---|---|---|---|---|---|---|---|
| 0.36433 | -0.22248 | 0.29926 | 0.64354 | -0.56386 | 1.11657 | 0.72845 | 0.05740 | 0.30976 | -0.98359 |
| 1.05112 | 0.50295 | -0.18198 | -0.57540 | -1.15746 | 0.23768 | 3.46422 | 1.12354 | 0.44221 | 1.11165 |
| 1.30082 | 1.02085 | -0.79247 | 0.87386 | 3.68303 | -2.07084 | -1.20911 | -1.03317 | 0.89941 | 0.13265 |
| 0.65176 | 0.67913 | 1.45367 | 0.68550 | 0.17404 | 0.66942 | 1.57361 | -0.29985 | 0.56140 | -0.83242 |
| -0.04021 | 0.18102 | 0.01004 | 1.94730 | 0.35937 | 0.32866 | 1.44233 | -0.15423 | -0.90751 | -0.47739 |
| 0.15174 | -1.92561 | 4.90856 | -0.65078 | 1.24313 | -1.54428 | -2.69570 | 0.01971 | -1.07442 | -1.76365 |
| -5.81194 | 1.58303 | 1.19792 | -0.43997 | -0.56818 | -3.12699 | 1.91710 | -2.04716 | -1.70697 | -0.38406 |
| -0.41348 | -0.47691 | 0.51866 | -1.81155 | -1.01103 | 3.65839 | 1.61434 | 1.46818 | 2.66170 | 1.27794 |
| 1.15796 | -0.31825 | -0.02060 | -0.72624 | 0.59666 | 2.35185 | -0.29227 | 0.80356 | -0.34092 | -0.10061 |
| -1.51576 | 2.16291 | -0.03003 | -0.91119 | 1.83408 | 0.58546 | 0.82869 | -0.08806 | -2.06140 | 0.60994 |
| 0.98887 | -0.72824 | 0.76828 | 0.93950 | -0.34599 | -0.56234 | 1.13597 | -1.49276 | 2.28541 | 0.78178 |
| -0.08140 | -0.79072 | 0.34779 | 0.93385 | -0.42922 | 2.04042 | -2.16391 | -2.11581 | -5.23478 | 1.89304 |
| 1.03553 | 2.20559 | -1.19942 | -0.25375 | 4.23484 | -2.97650 | -0.49815 | 2.35947 | 1.86441 | 0.00475 |
| 1.08386 | -0.08465 | -4.62205 | 1.89992 | -1.12838 | 0.67104 | 1.69318 | -0.12918 | -1.45636 | 1.69665 |
| 1.95516 | -0.00938 | 0.42671 | 1.18130 | 3.17951 | -0.22362 | -1.94540 | 0.52571 | 0.36380 | 0.93793 |
| -0.81250 | -1.46181 | 0.45469 | -0.47550 | 1.53896 | -0.51577 | -0.48660 | 2.35353 | 0.27682 | -2.50678 |
| 0.84947 | 2.80318 | 0.02203 | -1.17150 | 1.17202 | 1.34784 | 2.65983 | -0.14394 | -0.23317 | -0.40371 |
| 0.38828 | 0.48028 | 1.73427 | -0.22868 | 1.34188 | 0.16449 | 1.23163 | 1.00170 | -2.31279 | -0.01233 |
| 2.26096 | 2.83808 | 0.65671 | -0.14369 | 0.04278 | 0.85135 | 0.27369 | 0.79188 | -0.09154 | 0.95816 |
| -0.69194 | 1.18793 | 0.50421 | -0.30326 | -1.16415 | -0.44834 | -0.08779 | 1.75338 | -0.78086 | 1.34322 |
| -0.22028 | -1.15622 | -1.78301 | 0.88263 | 1.46710 | 1.93162 | -1.40001 | 0.58444 | 0.85294 | 0.25483 |
| 1.35932 | 1.71179 | 0.19653 | -0.23894 | 0.10009 | -0.20712 | 1.63183 | 0.09507 | 1.98403 | -0.03795 |
| 0.59351 | 0.73065 | 2.28163 | -0.97220 | 0.79682 | -0.47030 | 1.60863 | 2.12408 | 0.22607 | 2.13644 |
| -1.35033 | -0.42855 | 1.25265 | 1.10710 | -0.67769 | 0.35695 | 0.48159 | -0.28820 |
References
- Ibragimov, M.; Ibragimov, R.; Walden, J. Heavy-tailed distributions and robustness in economics and finance; Vol. 214, Springer, 2015.
- Guo, Z.Y. Heavy-tailed distributions and risk management of equity market tail events. Journal of Risk & Control 2017, 4, 31–41. [Google Scholar]
- Cortés, I.; Reyes, J.; Iriarte, Y.A. A Weighted Skew-Logistic Distribution with Applications to Environmental Data. Mathematics 2024, 12, 1287. [Google Scholar] [CrossRef]
- Ahmad, Z.; Mahmoudi, E.; Dey, S. A new family of heavy tailed distributions with an application to the heavy tailed insurance loss data. Communications in Statistics-Simulation and Computation 2022, 51, 4372–4395. [Google Scholar] [CrossRef]
- Azzalini, A. A class of distributions which includes the normal ones. Scandinavian journal of statistics 1985, pp. 171–178.
- Azzalini, A.; Capitanio, A. The Skew-Normal and Related Families; Cambridge University Press: New York, 2014. [Google Scholar]
- Henze, N. On a Skew-t Distribution. Scandivanian Journal of Statistics 1986, 13, 271–275. [Google Scholar]
- Hasan, A.; Ning, W.; Gupta, A. A New Approach for the Skew t Distribution with Applications to Environmental Data. Advances and Applications in Statistics 2016, 49, 117–136. [Google Scholar] [CrossRef]
- Arellano-Valle, R.B.; Azzalini, A. The centred parameterization and related quantities of the skew-t distribution. Journal of Multivariate Analysis 2013, 113, 73–90. [Google Scholar] [CrossRef]
- Tagle, F.; Castruccio, S.; Genton, M.G. A hierarchical bi-resolution spatial skew-t model. Spatial Statistics 2020, 35, 100398. [Google Scholar] [CrossRef]
- Galarza, C.E.; Matos, L.A.; Castro, L.M.; Lachos, V.H. Moments of the doubly truncated selection elliptical distributions with emphasis on the unified multivariate skew-t distribution. Journal of Multivariate Analysis 2022, 189, 104944. [Google Scholar] [CrossRef]
- Hasan, A. A study of Non-central Skew t Distributions and their Applications in Data Analysis and Change Point Detection. PhD thesis, Bowling Green State University, 2013.
- Azzalini, A.; Capitanio, A. Distributions generated by perturbation of symmetry with emphasis on a multivariate Skew t-distribution. Journal of the Royal Statistical Society Series B: Statistical Methodology 2003, 65, 367–389. [Google Scholar] [CrossRef]
- Stephens, M.A. Edf statistics for goodness of fit and some comparisons. Journal of the American Statistical Association 1974, 69, 730–737. [Google Scholar] [CrossRef]
- Stephens, M.A. Tests Based on EDF Statistics. In Goodness-of-Fit Techniques; D’Agostino, R.B., Stephens, M.A., Eds.; Marcel Dekker: New York, 1986; pp. 97–193. [Google Scholar]
- Maghami, M.; Bahrami, M. Goodness of Fit Test for the Skew-T Distribution. Journal of Mathematics and Computer Science 2015, 14, 274–283. [Google Scholar] [CrossRef]
- Hasan, A.; Ning, W.; Gupta, A.K. An information-based Approach to the Change-point Problem of the Non-central Skew t Distribution with Applications to Stock Market Data. Sequential Analysis 2014, 33, 458–474. [Google Scholar] [CrossRef]
- Kim, H.J. On a Skew-t Distribution. Journal of the Korean Communications in Statistics 2001, 8, 867–873. [Google Scholar]
- Hasan, A.; Chen, Y. On The Modified Information-Based Approach to The Change Point Detection (CPD) Problem under The Non-Central Skew t Distribution. J Stat Theory Pract 2025, 19. [Google Scholar] [CrossRef]
- Rizzo, M.L. A new rotation invariant goodness-of-fit test. Ph.D Thesis: Bowling Green State University 2002.
- Sźekely, G.J. E-statistics: Energy of statistical samples. Technical Report 03-05, BGSU, Department of Mathematics and Statistics 2000.
- Opperman, L.; Ning, W. Goodness-of-fit test for skew normality based on energy statistics. Random Operators and Stochastic Equations 2020, 28, 227–236. [Google Scholar] [CrossRef]
- Ofosuhene, P. The energy goodness-of-fit Test for the Inverse gaussian distribution. Ph.D Thesis: Bowling Green State University 2020.
- Njuki, J.; Avallone, R. Energy Statistic-Based Goodness-of-Fit Test for the Lindley Distribution with Application to Lifetime Data. Stats 2025, 8. [Google Scholar] [CrossRef]
- Móri, F.T.; Sźekely, G.J.; Rizzo, M.L. On energy tests of normality. Journal of Statistical Planning and Inference 2021, 213, 1–15. [Google Scholar] [CrossRef]
- Sźekely, G.J.; Rizzo, M. A new test for multivariate normality. Journal of Multivariate Analysis 2005, 93, 58–80. [Google Scholar] [CrossRef]
- Rizzo, M.L. A test of homogeneity for two multivariate populations, Physical and Engineering Sciences section.; In: 2002 Proceedings of American Statistical Association: American Statistical Association, Alexandria, VA, 2003. [Google Scholar]
- Sźekely, G.J.; Rizzo, M.L. Testing for Equal Distributions in high Dimension. InterStat 2004, 11. [Google Scholar]
- Sźekely, G.J.; Rizzo, M.L. The Energy of Data and Distance Correlation, 1st ed.; Chapman and Hall: London, UK, 2023. [Google Scholar]
- Rizzo, M.L. New goodness-of-fit tests for Pareto distributions. ASTIN Bulletin: The Journal of the IAA 2009, 39, 691–715. [Google Scholar] [CrossRef]
- Njuki, J.; Ning, W. Energy statistic-based modified information criterion for detecting the change in distribution. Journal of Applied Statistics 2025, pp. 1–23.
- Njuki, J.M. Nonparametric Sequential tests for Change Point Analysis Using Energy Statistics. Ph.D Thesis: Bowling Green State University 2022.
- Matterson, D.S.; James, N.A. A nonparametric Approach for Multiple Change Point Analysis of Multivariate Data. Journal of the American Statistical Association 2014, 109, 334–345. [Google Scholar] [CrossRef]
- Kim, A.Y.; Marzban, C.; Percival, D.B. ; Stuetzle, W. Using labeled data to evaluate change detectors in a multivariate streaming environment. Signal Processing 2009, 89(12), 2529–2536. [Google Scholar] [CrossRef]
- Sźekely, G.J.; Rizzo, M. A Class of Statistical Based on Distances. Journal of Statistical Planning and Inference 2013, 143, 1249–1272. [Google Scholar] [CrossRef]
- Sźekely, G.J.; Rizzo, M.L. The Energy of Data. Annual Review of Statistics and Its Application 2017, 4, 447–479. [Google Scholar] [CrossRef]
- Jones, M.C. A family of distributions on the real line with four parameters. In Statistical Models and Methods for Financial Markets; Bali, T.G.; Lim, E., Eds.; Springer, 2006; pp. 75–93.
- Cook, R.D.; Weisberg, S. Bayesian density estimation using skew Student-t-normal mixtures. An Introduction to Regression Graphics; John Wiley and Sons, New York, 1994.
- Macrotrends. Apple - Stock Price History, 2025. Accessed: March 10, 2025.




| n | (1,5) | (0,5) | (-1,5) | (1,10) | (0,10) | (-1,10) | (1,30) | (0,30) | (-1,30) | |
| 50 | 3.7856 | 3.8663 | 4.0781 | 2.0441 | 1.9596 | 1.9143 | 1.3624 | 1.3332 | 1.3421 | |
| 100 | 4.5742 | 4.6635 | 4.6401 | 2.3416 | 2.3314 | 2.3443 | 1.6584 | 1.5815 | 1.6448 | |
| 150 | 5.3913 | 5.6910 | 5.3485 | 2.8291 | 2.8642 | 2.8279 | 2.0239 | 2.0291 | 2.0270 | |
| 200 | 6.3893 | 6.6986 | 6.3946 | 3.3414 | 3.4878 | 3.3710 | 2.4003 | 2.4172 | 2.3981 | |
| n | (1,5) | (0,5) | (-1,5) | (1,10) | (0,10) | (-1,10) | (1,30) | (0,30) | (-1,30) | |
| 50 | 0.0508 | 0.0532 | 0.0513 | 0.0476 | 0.0508 | 0.0513 | 0.0510 | 0.0484 | 0.0540 | |
| 100 | 0.0484 | 0.0476 | 0.0492 | 0.0511 | 0.0487 | 0.0503 | 0.0501 | 0.0558 | 0.0488 | |
| 150 | 0.0496 | 0.0506 | 0.0524 | 0.0513 | 0.0509 | 0.0491 | 0.0478 | 0.0492 | 0.0508 | |
| 200 | 0.0503 | 0.0494 | 0.0506 | 0.0505 | 0.0498 | 0.0501 | 0.0497 | 0.0496 | 0.0499 | |
| Distribution | Sample size n | ||||||
|---|---|---|---|---|---|---|---|
| 50 | 0.8369 | 0.2248 | 0.2134 | 0.2809 | 0.2460 | 0.3878 | |
| 100 | 0.8933 | 0.2924 | 0.2832 | 0.3850 | 0.3298 | 0.5061 | |
| 150 | 0.9559 | 0.4018 | 0.3638 | 0.5103 | 0.4288 | 0.6223 | |
| 200 | 0.9715 | 0.4973 | 0.4418 | 0.6184 | 0.5069 | 0.7270 | |
| 50 | 0.1066 | 0.0342 | 0.0396 | 0.0332 | 0.0382 | 0.0372 | |
| 100 | 0.1736 | 0.0356 | 0.0418 | 0.0392 | 0.0380 | 0.0384 | |
| 150 | 0.2268 | 0.0360 | 0.0398 | 0.0334 | 0.0382 | 0.0378 | |
| 200 | 0.2790 | 0.0387 | 0.0410 | 0.0413 | 0.0393 | 0.0397 | |
| 50 | 0.8413 | 0.2640 | 0.2491 | 0.3069 | 0.2692 | 0.4092 | |
| 100 | 0.9152 | 0.5090 | 0.4924 | 0.5538 | 0.5140 | 0.6128 | |
| 150 | 0.9655 | 0.6496 | 0.6224 | 0.7012 | 0.6486 | 0.7702 | |
| 200 | 0.9848 | 0.7439 | 0.7169 | 0.8012 | 0.7421 | 0.8601 | |
| 50 | 0.5734 | 0.1145 | 0.1292 | 0.1345 | 0.1302 | 0.1545 | |
| 100 | 0.6045 | 0.2131 | 0.2154 | 0.2546 | 0.2328 | 0.261 | |
| 150 | 0.6401 | 0.3032 | 0.3072 | 0.3844 | 0.3498 | 0.3958 | |
| 200 | 0.6749 | 0.4166 | 0.4196 | 0.5184 | 0.4742 | 0.5261 | |
| 50 | 0.9596 | 0.1128 | 0.1166 | 0.1306 | 0.1188 | 0.3350 | |
| 100 | 0.9916 | 0.1562 | 0.1860 | 0.2004 | 0.1966 | 0.4480 | |
| 150 | 1.0000 | 0.2306 | 0.2756 | 0.3228 | 0.3232 | 0.5816 | |
| 200 | 1.0000 | 0.3030 | 0.3798 | 0.4278 | 0.4292 | 0.6720 | |
| 50 | 0.9701 | 0.3331 | 0.1950 | 0.3988 | 0.2304 | 0.4611 | |
| 100 | 0.9931 | 0.33691 | 0.2092 | 0.3974 | 0.2441 | 0.4644 | |
| 150 | 0.9995 | 0.3449 | 0.2179 | 0.4102 | 0.2586 | 0.4775 | |
| 200 | 0.9998 | 0.3639 | 0.2409 | 0.4429 | 0.2818 | 0.5155 |
| Skew-t | Normal | Skew-cauchy | Skew-normal | |
|---|---|---|---|---|
| 21.6490 | 23.9036 | 22.7889 | 20.8765 | |
| 2.6570 | 2.7403 | 1.3762 | 4.0610 | |
| 1.6421 | - | 0.5126 | 3.2992 | |
| 4.5503 | - | - | - | |
| LogL | -236.0511 | -248.0613 | -247.5519 | -237.9670 |
| AIC | 480.1022 | 500.1227 | 501.1037 | 481.9341 |
| SIC | 490.6020 | 505.3726 | 508.9787 | 489.8090 |
| Skew-t | Normal | Skew-cauchy | Skew-normal | |
|---|---|---|---|---|
| 0.5254 | 0.2749 | 0.1798 | 1.3519 | |
| 1.1514 | 1.4259 | 0.7655 | 1.7858 | |
| -0.2285 | - | 0.1191 | -1.1688 | |
| 5.3556 | - | - | - | |
| LogL | -431.3161 | -440.3975 | -461.9316 | -438.1367 |
| AIC | 870.6321 | 884.795 | 929.8633 | 882.2733 |
| SIC | 884.6858 | 891.8219 | 940.4036 | 892.8136 |
| Test | ||||||
|---|---|---|---|---|---|---|
| Statistic value | 1.7109 | 0.0454 | 0.8731 | 0.0295 | 0.0290 | 0.2321 |
| P-value | 0.5840 | 0.9847 | 0.9847 | 0.9827 | 0.9840 | 0.9807 |
| Test | ||||||
|---|---|---|---|---|---|---|
| Statistic value | 3.9635 | 0.0357 | 1.0576 | 0.0291 | 0.0290 | 0.1737 |
| P-value | 0.5473 | 0.8260 | 0.8167 | 0.9381 | 0.9380 | 0.9913 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).