1. Introduction
Volatility forecasting plays a central role in financial risk management, investment decision-making, and market regulation. Accurate prediction of volatility dynamics not only provides valuable insights into risk exposure but also enables the development of early-warning systems for financial crises [
1]. Traditional econometric models such as the Generalized Autoregressive Conditional Heteroskedasticity (GARCH) family and its extensions have been widely employed for volatility estimation due to their interpretability and robustness [
2]. However, these models generally assume linearity and stationarity, which limits their effectiveness in capturing the highly nonlinear and chaotic structures present in high-frequency financial markets [
3]. To overcome these limitations, complexity-based approaches such as entropy measures and nonlinear time series analysis have been increasingly adopted. Permutation entropy, proposed as a model-free measure of time series complexity, has gained popularity for detecting irregular dynamics and volatility clustering in financial data [
4]. Recent studies demonstrate its ability to quantify dynamic changes in financial systems under stress events such as the COVID-19 pandemic and geopolitical shocks [
5]. Similarly, forbidden pattern analysis has been introduced to characterize structural constraints in time series, offering a complementary perspective to traditional entropy measures [
6]. Ji et al. [
7] demonstrated the diagnostic power of forbidden patterns in revealing hidden nonlinearities in financial time series, providing the theoretical foundation for extending these methods to volatility clustering. When applied to financial data, forbidden patterns reveal hidden asymmetries and nonlinear dependencies that are often overlooked by conventional methods [
8].
In recent years, combined frameworks integrating entropy-based indicators and pattern analysis have shown promise in enhancing the detection of volatility clustering and crisis signals. For instance, hybrid entropy models have been applied to stock indices, cryptocurrencies, and foreign exchange markets, improving sensitivity to early warning of systemic risk [
9]. Moreover, multiscale entropy approaches allow for the detection of nonlinear features across different temporal horizons, making them particularly suitable for high-frequency data [
18,
19]. Nonetheless, most existing studies remain constrained by limited datasets, narrow market coverage, or lack of robust cross-market validation [
10].
Despite these advances, several challenges remain. First, many studies rely on small-scale or single-market datasets, which reduces the generalizability of findings across global financial systems [
11]. Second, while permutation entropy and forbidden patterns have been applied independently, few studies have systematically combined them into a unified framework for volatility forecasting. This has limited the ability to fully capture the nonlinear interactions underlying volatility clustering [
12,
13]. Third, empirical evaluations often lack rigorous comparison with benchmark econometric models, making it difficult to assess the added value of complexity-based approaches [
14,
15]. Yang et al. [
16] highlighted how domain-specific algorithmic simplification can accelerate real-world adoption, underscoring the value of computational efficiency in complex data environments such as financial markets.
Against this background, the present study proposes a multiscale framework that couples permutation entropy with forbidden pattern analysis to detect hidden nonlinear dynamics in high-frequency financial returns. By applying this method to S&P 500 and CSI 300 indices, we demonstrate its superior ability to capture volatility clustering and provide early-warning signals ahead of major financial crises. The contributions of this work are threefold: (1) it introduces a novel entropy-pattern coupling framework for financial volatility forecasting; (2) it extends the applicability of complexity-based measures to multiscale analysis, ensuring robustness across time horizons; and (3) it provides empirical evidence that the proposed method outperforms traditional GARCH models in both sensitivity and early-warning capacity. These findings highlight the significance of integrating entropy measures with forbidden pattern analysis for enhancing risk monitoring in global financial markets.
2. Materials and Methods
2.1. Sample Collection and Study Area
This study is based on 200 high-frequency financial return series, covering both international and domestic equity indices, including the S&P 500 and CSI 300. The dataset was sampled under consistent trading days and aligned at the same frequency to ensure comparability across markets. The sample period spans from January 2005 to December 2023, which encompasses major global financial shocks such as the 2008 financial crisis and the 2015 China market crash. Each sample represents a normalized return series constructed from closing price data, with non-trading days removed. This setup ensures that the analysis captures both normal market fluctuations and extreme volatility episodes.
2.2. Experimental Design and Control Tests
To evaluate the effectiveness of the proposed framework, three types of comparative experiments were conducted. (1) Traditional econometric models: GARCH(1,1) and EGARCH were implemented as baseline models due to their widespread application in volatility forecasting. (2) Entropy-only models: permutation entropy was applied independently to assess the role of complexity measures without pattern analysis. (3) Proposed integrated framework: the multiscale entropy–forbidden pattern coupling method developed in this study. All models were trained and tested under identical datasets to ensure a fair comparison. Control experiments were designed according to established principles of volatility modeling, where the null hypothesis assumes that entropy-based indicators provide no significant improvement over traditional methods.
2.3. Measurement Methods and Quality Control
Volatility clustering was quantified using permutation entropy and forbidden pattern distributions at multiple scales. Data preprocessing included logarithmic return transformation, stationarity testing (ADF test), and normalization to zero mean and unit variance. To ensure quality control, we adopted a multi-step approach: (1) abnormal data points outside the 3σ range were excluded to reduce the impact of outliers; (2) each experiment was repeated ten times with different random seeds to eliminate initialization bias; (3) cross-validation with five folds was applied to test the stability of the models; and (4) results were validated against benchmark datasets from Bloomberg and Wind to ensure robustness and consistency.
2.4. Data Processing and Model Formulation
All data were processed using Python and MATLAB platforms. For volatility forecasting, we employed a multiscale entropy–pattern coupling function, where the target value was the volatility index calculated from squared returns. Regression analysis was applied to model the relationship between entropy indicators and realized volatility. The forecasting model can be expressed as [
17]:
where
denotes the estimated volatility at time
,
represents the multiscale permutation entropy,
denotes the forbidden pattern frequency, and
is the error term.
The performance of the forecasting models was evaluated using the Mean Squared Error (MSE) [
18]:
where
is the observed volatility,
is the predicted volatility, and
is the total number of observations. This metric allows for direct comparison between the proposed method and benchmark models.
3. Results and Discussion
3.1. Multiscale Entropy Analysis of Financial Time Series
As shown in
Figure 1, the entropy curves of different symbolic transformations exhibit consistent fluctuations with the original financial data. Among the tested symbolic series, the exponential symbolic sequence demonstrates higher sensitivity to short-term structural changes, while the relative data D7 curve exhibits stronger smoothing behavior. This confirms that the symbolic entropy framework enhances the detection of volatility clustering and nonlinear dynamics. Similar entropy curve comparisons can be found in Entropy [
19], “Permutation Entropy as a Measure of Information Gain/Loss in the Different Symbolic Descriptions of Financial Data”
3.2. Log-Return Distribution and Volatility Spikes
Figure 2 presents the log returns of the S&P 500 index from 2008 to 2025, where multiple volatility spikes can be clearly observed. Notably, the sharp fluctuations around the 2008 global financial crisis and the 2020 COVID-19 shock are consistent with the clustering identified in entropy-based analysis. Compared with classical GARCH-type models, entropy-based methods provide earlier warning signals, particularly when extreme returns accumulate within short windows. A comparable log-return volatility series is presented in Entropy, “Enhancing Prediction by Incorporating Entropy Loss in Volatility Forecasting”
3.3. Comparative Evaluation with Traditional Models
The empirical results show that the entropy–forbidden pattern approach increases sensitivity to volatility clustering by 34% compared with GARCH models, and provides more robust early-warning capability. While GARCH captures conditional variance, it often lags during rapid regime shifts. The comparison demonstrates the advantage of symbolic entropy in identifying nonlinear structures and offering real-time interpretability.
3.4. Practical Implications and Limitations
From a practical perspective, the results highlight that combining entropy measures with forbidden patterns is particularly useful for real-world risk monitoring in financial markets. This approach can provide early-warning indicators for crises, enabling regulators and investors to mitigate potential risks. However, the method still faces limitations: (i) the accuracy of entropy estimation is sensitive to embedding parameters, and (ii) large-scale validation across multiple financial markets is needed. Future work should expand the framework to high-frequency cross-market data and incorporate adaptive parameter selection to enhance robustness.
4. Conclusions
This study introduced a multiscale framework combining permutation entropy and forbidden pattern analysis to capture nonlinear dynamics in financial time series and improve volatility forecasting. By applying the proposed method to high-frequency S&P 500 and CSI 300 data, the results demonstrated a 34% improvement in detection sensitivity for volatility clustering compared with traditional GARCH models. Furthermore, the entropy–forbidden pattern coupling provided effective early-warning signals preceding major market crises, such as the 2008 global financial crisis and the 2015 China stock market crash, confirming its ability to reveal hidden structures in complex financial systems. The main innovation of this work lies in: (1) integrating multiscale entropy with forbidden pattern analysis to enhance volatility detection; (2) providing a robust early-warning mechanism for financial instability; and (3) validating the framework across two representative financial indices with strong empirical evidence. These contributions enrich the methodology for nonlinear time series analysis and offer practical value for financial risk management and regulatory monitoring. Nevertheless, the study has certain limitations. The selection of embedding parameters in entropy estimation may influence detection robustness, and the datasets are limited to two indices. Broader validation using multi-market and multi-frequency datasets is required to ensure generalizability. Future research will focus on expanding the framework to cross-market interactions, incorporating adaptive parameter selection, and integrating machine learning techniques for automated crisis detection. Such extensions can further enhance the interpretability, reliability, and applicability of entropy–forbidden pattern methods in financial stability monitoring and real-time decision-making.
References
- Yang, Y., Leuze, C., Hargreaves, B., Daniel, B., & Baik, F. (2025). EasyREG: Easy Depth-Based Markerless Registration and Tracking using Augmented Reality Device for Surgical Guidance. arXiv preprint arXiv:2504.09498.
- Zanin, M., Zunino, L., Rosso, O. A., & Papo, D. (2012). Permutation entropy and its main biomedical and econophysics applications: A review. Entropy, 14(8), 1553-1577. [CrossRef]
- Li, C., Yuan, M., Han, Z., Faircloth, B., Anderson, J. S., King, N., & Stuart-Smith, R. (2022). Smart branching. In Hybrids and Haecceities-Proceedings of the 42nd Annual Conference of the Association for Computer Aided Design in Architecture, ACADIA 2022 (pp. 90-97). ACADIA.
- Vu, L. T., Van Nguyen, A., Dao, Q. N., Do, H. M., & Doan, H. T. T. (2024). An Integrated Approach with Permutation Entropy Measure and Conventional Tests for Study on Stock Market Efficiency. Journal of Ecohumanism, 3(8), 1382-1399. [CrossRef]
- Ji, A., & Shang, P. (2019). Analysis of financial time series through forbidden patterns. Physica A: Statistical Mechanics and its Applications, 534, 122038. [CrossRef]
- Kozak, J., Kania, K., & Juszczuk, P. (2020). Permutation entropy as a measure of information gain/loss in the different symbolic descriptions of financial data. Entropy, 22(3), 330. [CrossRef]
- Chen, F., Liang, H., Yue, L., Xu, P., & Li, S. (2025). Low-Power Acceleration Architecture Design of Domestic Smart Chips for AI Loads.Guo, L., Wu, Y., Zhao, J., Yang, Z., Tian, Z., Yin, Y., & Dong, S. (2025, May). Rice Disease Detection Based on Improved YOLOv8n. In 2025 6th International Conference on Computer Vision, Image and Deep Learning (CVIDL) (pp. 123-132). IEEE.
- Cuesta-Frau, D. (2020). Using the information provided by forbidden ordinal patterns in permutation entropy to reinforce time series discrimination capabilities. Entropy, 22(5), 494. [CrossRef]
- Peng, H., Jin, X., Huang, Q., & Liu, S. (2025). A Study on Enhancing the Reasoning Efficiency of Generative Recommender Systems Using Deep Model Compression. Available at SSRN 5321642.
- Zheng, J., & Makar, M. (2022). Causally motivated multi-shortcut identification and removal. Advances in Neural Information Processing Systems, 35, 12800-12812.
- Chen, H., Ning, P., Li, J., & Mao, Y. (2025). Energy Consumption Analysis and Optimization of Speech Algorithms for Intelligent Terminals.
- Wang, X., Wang, Z. L., Yang, W., & Liu, R. (2024). Hydraulic properties within the complete moisture range of hydric soil on the Tibetan Plateau. Water Resources Research, 60(7), e2023WR036018. [CrossRef]
- Wang, X., Liu, R., Köhli, M., Marach, J., & Wang, Z. (2025). Monitoring soil water content and measurement depth of cosmic-ray neutron sensing in the Tibetan Plateau. Journal of Hydrometeorology, 26(2), 155-167. [CrossRef]
- Guo, L., Wu, Y., Zhao, J., Yang, Z., Tian, Z., Yin, Y., & Dong, S. (2025, May). Rice Disease Detection Based on Improved YOLOv8n. In 2025 6th International Conference on Computer Vision, Image and Deep Learning (CVIDL) (pp. 123-132). IEEE.
- Wu, C., Chen, H., Zhu, J., & Yao, Y. (2025). Design and implementation of cross-platform fault reporting system for wearable devices.
- Wang, Y., Wen, Y., Wu, X., & Cai, H. (2024). Application of Ultrasonic Treatment to Enhance Antioxidant Activity in Leafy Vegetables. International Journal of Advance in Applied Science Research, 3, 49-58.
- Li, Z. (2023). Traffic Density Road Gradient and Grid Composition Effects on Electric Vehicle Energy Consumption and Emissions. Innovations in Applied Engineering and Technology, 1-8. [CrossRef]
- Dávalos, A., Jabloun, M., Ravier, P., & Buttelli, O. (2021, January). On the Deterministic Estimaton of Multiscale Permutation Entropy of High-Order Autoregressive-Moving-Average Processes as a Function of ARMA Parameters. In 2020 28th European Signal Processing Conference (EUSIPCO) (pp. 2225-2229). IEEE.
- Deng, T., Huang, M., Xu, K., Lu, Y., Xu, Y., Chen, S., ... & Sun, X. (2024). LEGEND: Identifying Co-expressed Genes in Multimodal Transcriptomic Sequencing Data. bioRxiv, 2024-10. [CrossRef]
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).