Submitted:
04 February 2026
Posted:
04 February 2026
You are already at the latest version
Abstract
Keywords:
MSC: 62P05; 62F03; 68T05
1. Introduction
1.1. Literature Review
1.1.1. Investment Risk Assessment and Financial Volatility in Emerging Markets
1.1.2. Hypothesis Testing in Financial Risk Management
1.1.3. Machine Learning Methods in Financial Risk Prediction
1.1.4. Empirical Evidence from Romania
1.1.5. Purchasing power, real economic conditions, and investment risk in emerging markets
2. Materials and Methods
2.1. Data Description
- Date: Trading day;
- Price: Closing price of the index;
- Open, High, Low: Daily opening, highest, and lowest prices, respectively;
- Vol.: Trading volume (note: volume data was unavailable for the selected period);
- Change %: Daily percentage change of the closing price.
2.2. Hypothesis Testing Framework
2.2.1. Risk Indicators
- Intraday Range (): A proxy for daily volatility, calculated as the normalized difference between the daily high and low prices relative to the opening price:
- Scaled High Price (): Daily high prices scaled by the mean high over the sample period to account for level effects:
- Purchasing Power Index (), a macroeconomic indicator reflecting real income and consumption capacity. The index is observed at monthly frequency and is lagged and carried forward to daily frequency to ensure information availability and to avoid look-ahead bias;
- Return Direction: A categorical variable indicating whether daily returns are positive (“Up”) or negative (“Down”).
2.2.2. Statistical Tests
- Two-sample t-test: To examine whether mean returns differ between two time periods (pre-2020 vs post-2020), under the null hypothesis ;
- One-way ANOVA: To test whether mean daily returns differ across months (), assessing potential seasonal effects;
- Chi-square test of independence: To evaluate the association between return direction (Up / Down) and volatility level (High / Low, defined by a median split of intraday range), under the null hypothesis of independence;
- Regression-based hypothesis testing: Ordinary least squares (OLS) regression models are estimated to test whether the constructed risk factors statistically explain daily returns:
2.3. Machine Learning Models
2.3.1. Problem Definition
- Price-based features:
- 2.
- Volatility indicators: rolling standard deviations of returns over 7 and 14 days:
- 3.
- Date-based features:
- 4.
- Sentiment features:
- Regression: daily returns () and rolling volatility ();
- Classification: return direction:
- 5.
- Macroeconomic feature:
2.3.2. Random Forest Models
- Regression prediction is the average of individual tree predictions:
- Classification prediction is based on majority voting across trees:
2.3.3. XGBoost Models
2.3.4. Model Training and Evaluation
- Regression: Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), R²;
- Classification: Accuracy, Precision, Recall, F1-score, ROC-AUC.
3. Results
3.1. Hypothesis Testing Results
| Predictor | Coefficient | Standard Error | t | P>|t| |
| Intercept | 0.2894 | 0.063 | 4.607 | 0.000 |
| Intraday Range | -25.384 | 2.216 | -11.455 | 0.000 |
| High (scaled) | 3.878e-06 | 4.64e-06 | 0.836 | 0.403 |
| Predictor | Coefficient | Standard Error | t | P>|t| |
| Intercept | 0.0248 | 0.118 | 0.211 | 0.833 |
| Intraday Range | -10.143 | 2.468 | -4.110 | 0.000 |
| High (scaled) | 0.0036 | 0.002 | 1.675 | 0.094 |
| Predictor | Coefficient | Standard Error | t | P>|t| |
| Intercept | 0.1134 | 0.041 | 2.777 | 0.006 |
| Intraday Range | 6.482e-06 | 3.87e-06 | 1.673 | 0.094 |
| High (scaled) | -8.124e-07 | 6.26e-07 | -1.298 | 0.194 |
3.2. Machine Learning Results
|
Random Forest |
Predictor | RMSE | MAE | R² |
| Results | 0.013795 | 0.00776 | 0.025803 | |
| Volatility (7d) | 0.010017 | 0.005017 | -0.051318 | |
| Volatility (14d) | 0.009518 | 0.004855 | -0.066428 | |
|
XGBoost |
Predictor | RMSE | MAE | R² |
| Results | 0.012552 | 0.00716 | 0.193472 | |
| Volatility (7d) | 0.009636 | 0.004911 | 0.02723 | |
| Volatility (14d) | 0.008973 | 0.004603 | 0.052114 |
|
Random Forest |
Predictor | RMSE | MAE | R² |
| Results | 0.013788 | 0.007783 | 0.026741 | |
| Volatility (7d) | 0.010074 | 0.00502 | -0.06327 | |
| Volatility (14d) | 0.00958 | 0.004873 | -0.08051 | |
|
XGBoost |
Predictor | RMSE | MAE | R² |
| Results | 0.012682 | 0.007087 | 0.176557 | |
| Volatility (7d) | 0.00976 | 0.004966 | 0.001935 | |
| Volatility (14d) | 0.00912 | 0.004681 | 0.020757 |
| Model |
Direction |
Accuracy | Precision | Recall | F1 | ROC-AUC |
| Random Forest | 0.5128 | 0.4577 | 0.7987 | 0.5819 | 0.5891 | |
| XGBoost | 0.6125 | 0.5461 | 0.5168 | 0.531 | 0.6906 |
| Model |
Direction |
Accuracy | Precision | Recall | F1 | ROC-AUC |
| Random Forest | 0.49 | 0.45 | 0.906 | 0.6013 | 0.621 | |
| XGBoost | 0.6154 | 0.5522 | 0.4966 | 0.523 | 0.6765 |
|
Random Forest |
Predictor | RMSE | MAE | R² |
| Results | 0.01469 | 0.009435 | -0.150164 | |
| Volatility (7d) | 0.009 | 0.006276 | -0.248638 | |
| Volatility (14d) | 0.008312 | 0.005672 | -0.194686 | |
|
XGBoost |
Predictor | RMSE | MAE | R² |
| Results | 0.012888 | 0.00808 | 0.114712 | |
| Volatility (7d) | 0.007217 | 0.004959 | 0.197051 | |
| Volatility (14d) | 0.007937 | 0.005193 | -0.089393 |
|
Random Forest |
Predictor | RMSE | MAE | R² |
| Results | 0.014214 | 0.008934 | -0.07682 | |
| Volatility (7d) | 0.0085 | 0.005844 | -0.11369 | |
| Volatility (14d) | 0.00804 | 0.005301 | -0.11771 | |
|
XGBoost |
Predictor | RMSE | MAE | R² |
| Results | 0.012846 | 0.008254 | 0.120467 | |
| Volatility (7d) | 0.007206 | 0.004767 | 0.199597 | |
| Volatility (14d) | 0.007838 | 0.005386 | -0.06244 |
| Model |
Direction |
Accuracy | Precision | Recall | F1 | ROC-AUC |
| Random Forest | 0.5556 | 0.5862 | 0.2048 | 0.3036 | 0.6009 | |
| XGBoost | 0.6325 | 0.6667 | 0.4458 | 0.5343 | 0.652 |
| Model |
Direction |
Accuracy | Precision | Recall | F1 | ROC-AUC |
| Random Forest | 0.5755 | 0.562 | 0.4639 | 0.5083 | 0.6087 | |
| XGBoost | 0.604 | 0.6195 | 0.4217 | 0.5018 | 0.65 |
|
Random Forest |
Predictor | RMSE | MAE | R² |
| Results | 0.008559 | 0.006261 | -0.200652 | |
| Volatility (7d) | 0.012927 | 0.011263 | -8.406296 | |
| Volatility (14d) | 0.012163 | 0.01051 | -10.823135 | |
|
XGBoost |
Predictor | RMSE | MAE | R² |
| Results | 0.008719 | 0.006209 | -0.245992 | |
| Volatility (7d) | 0.010733 | 0.00882 | -5.48462 | |
| Volatility (14d) | 0.010858 | 0.008608 | -8.422544 |
|
Random Forest |
Predictor | RMSE | MAE | R² |
| Results | 0.00861 | 0.006306 | -0.21497 | |
| Volatility (7d) | 0.012861 | 0.011151 | -8.31139 | |
| Volatility (14d) | 0.012276 | 0.010539 | -11.0431 | |
|
XGBoost |
Predictor | RMSE | MAE | R² |
| Results | 0.008553 | 0.006086 | -0.19888 | |
| Volatility (7d) | 0.010254 | 0.008496 | -4.91844 | |
| Volatility (14d) | 0.010976 | 0.008768 | -8.62794 |
| Model |
Direction |
Accuracy | Precision | Recall | F1 | ROC-AUC |
| Random Forest | 0.5331 | 0.5091 | 0.7029 | 0.5905 | 0.5266 | |
| XGBoost | 0.6072 | 0.6503 | 0.3891 | 0.4869 | 0.6403 |
| Model |
Direction |
Accuracy | Precision | Recall | F1 | ROC-AUC |
| Random Forest | 0.5251 | 0.5027 | 0.7824 | 0.6121 | 0.536 | |
| XGBoost | 0.5972 | 0.6319 | 0.3808 | 0.4752 | 0.637 |
4. Discussion
5. Conclusions
References
- Bekaert, G.; Harvey, C.R.; Mondino, T. Emerging equity markets in a globalized world. Emerg. Mark. Rev. 2023, 56. [Google Scholar] [CrossRef]
- Kose, M.A.; Prasad, E.; Rogoff, K.; Wei, S.-J. Financial globalization: A reappraisal. IMF Staff Pap. 2006, 6. Available online: https://ssrn.com/abstract=934448.
- Sharpe, W.F. Capital asset prices: A theory of market equilibrium under conditions of risk. J. Finance 1964, 19, 425–442. [Google Scholar]
- Fama, E.F.; French, K.R. Common risk factors in the returns on stocks and bonds. J. Financ. Econ. 1993, 33, 3–56. [Google Scholar] [CrossRef]
- Chen, N.-F.; Roll, R.; Ross, S.A. Economic forces and the stock market. J. Bus. 1986, 59, 383–403. [Google Scholar] [CrossRef]
- Ang, A.; Piazzesi, M. A no-arbitrage vector autoregression of term structure dynamics with macroeconomic and latent variables. J. Monet. Econ. 2003, 50, 745–787. [Google Scholar] [CrossRef]
- Hamilton, J.D. Regime switching models. In Macroeconometrics and Time Series Analysis. The New Palgrave Economics Collection; Durlauf, S.N., Blume, L.E., Eds.; Palgrave Macmillan, UK, 2010; pp. 202–209. [Google Scholar] [CrossRef]
- Cont, R. Empirical properties of asset returns: Stylized facts and statistical issues. Quant. Finance 2001, 1, 223–236. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2000, 29, 1189–1232. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; ACM: New York, NY, USA, 2016; pp. 785–794. [Google Scholar] [CrossRef]
- Gu, S.; Kelly, B.; Xiu, D. Empirical asset pricing via machine learning. Rev. Financ. Stud. 2020, 33, 2223–2273. [Google Scholar] [CrossRef]
- Sirignano, J.; Cont, R. Universal features of price formation in financial markets: Perspectives from deep learning. SSRN Elec. Jour 2018. [Google Scholar] [CrossRef]
- Mullainathan, S.; Spiess, J. Machine learning: An applied econometric approach. J. Econ. Perspect. 2017, 31, 87–106. [Google Scholar] [CrossRef]
- Athey, S.; Imbens, G.W.; Ramachandra, V. Machine learning methods for estimating heterogeneous causal effects. Stat. Sci. 2015, 5, 1–2. [Google Scholar]
- Chernozhukov, V.; Hansen, C.; Spindler, M. Valid post-selection and post-regularization inference: an elementary, general approach. Annu. Rev. Econ. 2015, 7, 649–688. [Google Scholar] [CrossRef]
- Bekaert, G.; Harvey, C.R.; Lundblad, C. Emerging equity markets and economic development. 2000. [Google Scholar] [CrossRef]
- Claessens, S.; Kose, M.A. Financial crises: Explanations, types, and implications. IMF Work. Pap. 2013, 28. [Google Scholar] [CrossRef]
- Forbes, K.J.; Rigobon, R. No contagion, only interdependence: Measuring stock market comovements. J. Finance 2002, 57, 2223–2261. [Google Scholar] [CrossRef]
- Calvo, G.A.; Reinhart, C.M. Fear of floating. Quart. J. of Econ. 2002, 117(2), 379–408. [Google Scholar] [CrossRef]
- Diebold, F.X.; Yilmaz, K. Measuring financial asset return and volatility spillovers, with application to global equity markets. Economic Journal 2009, 119(534), 158–171. [Google Scholar] [CrossRef]
- Ang, A.; Chen, J. Asymmetric correlations of equity portfolios. Journal of Financial Economics 2001, 63(3), 443–494. [Google Scholar] [CrossRef]
- Rey, H. Dilemma not trilemma: The global financial cycle and monetary policy independence. In National Bureau of Economic Research; 2015. [Google Scholar] [CrossRef]
- Bekaert, G.; Engstrom, E.; Xing, Y. Risk, uncertainty, and asset prices. Journal of Financial Economics 2009, 91(1), 59–82. [Google Scholar] [CrossRef]
- Brooks, C. Introductory Econometrics for Finance, 3rd ed.; Cambridge University Press: Cambridge, UK, 2014. [Google Scholar] [CrossRef]
- Cochrane, J.H. Asset Pricing (Revised Edition). Journal of Economic Behavior & Organization 2006, 60(4), 603–608. [Google Scholar] [CrossRef]
- Perron, P. The great crash, the oil price shock, and the unit root hypothesis. Econometrica 1989, 57(6), 1361–1401. [Google Scholar] [CrossRef]
- Hansen, L.P. Large sample properties of generalized method of moments estimators. Econometrica 1982, 50(4), 1029–1054. [Google Scholar] [CrossRef]
- Pesaran, M.H.; Timmermann, A. Predictability of stock returns: Robustness and economic significance. J. Finance 1995, 50, 1201–1228. [Google Scholar] [CrossRef]
- Diebold, F.X. Elements of Forecasting. In International Journal of Forecasting, 4th ed.; 2008; Volume 24, 3, pp. 552–553. [Google Scholar] [CrossRef]
- Lessmann, S.; Baesens, B.; Seow, H.-V.; Thomas, L.C. Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research. Eur. Jour. Op. Res. 2015, 247, 124–136. [Google Scholar] [CrossRef]
- Heaton, J.B.; Polson, N.G.; Witte, J.H. Deep learning for finance: Deep portfolios. Appl. Stoch. Models Bus. Ind. 2017, 33, 3–12. [Google Scholar] [CrossRef]
- Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed.; Springer, 2009. [Google Scholar] [CrossRef]
- Atanasova, C.; Schwartz, E. Stranded fossil fuel reserves and firm value. In National Bureau of Economic Research; 2019. [Google Scholar] [CrossRef]
- Bailey, D.H.; Borwein, J.M.; López de Prado, M.; Zhu, Q.J. The probability of backtest overfitting. Journal of Computational Finance 2017, 20(4), 39–69. [Google Scholar] [CrossRef]
- Guidotti, R.; Monreale, A.; Ruggieri, S.; Turini, F.; Giannotti, F.; Pedreschi, D. A survey of methods for explaining black box models. ACM Comput. Surv. 2018, 51, 1–42. [Google Scholar] [CrossRef]
- Varian, H.R. Big data: New tricks for econometrics. Journal of Economic Perspectives 2014, 28(2), 3–28. [Google Scholar] [CrossRef]
- Dedu, V.; Stoica, T. The impact of monetary policy on the Romanian economy. J. Econ. Forecast. 2014, 0(2), 71–86. [Google Scholar]
- Moldovan (Gavril), I.A. Financial market’s contribution to economic growth in Romania. Management Dynamics in the Knowledge Economy 2015, 3, 447. [Google Scholar]
- Babecký, J.; Havránek, T.; Matějů, J.; Rusnák, M.; Šmídková, K.; Vašíček, B. Leading indicators of crisis incidence: Evidence from developed and emerging economies. Journal of International Money and Finance 2013, 35(C), 1–19. [Google Scholar] [CrossRef]
- Égert, B.; Kočenda, E. Time-varying synchronization of European stock markets. Empirical Economics 2011, 40(2), 393–407. [Google Scholar] [CrossRef]
- Égert, B.; Kočenda, E. Interdependence between Eastern and Western European stock markets: Evidence from intraday data. Economic Systems 2007, 31(2), 184–203. [Google Scholar] [CrossRef]
- Lettau, M.; Ludvigson, S.C. Consumption, aggregate wealth, and expected stock returns. The Journal of Finance 2001, 56(3), 815–849. [Google Scholar] [CrossRef]
- Alesina, A.; Perotti, R. Fiscal adjustments in OECD countries: Composition and macroeconomic effects. In IMF Staff Papers; 1996. [Google Scholar] [CrossRef]
- Mendoza, E.G. Sudden stops, financial crises, and leverage. American Economic Review 2010, 100(5), 1941–1966. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
