Submitted:
21 October 2025
Posted:
22 October 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Literature Review
3. Methodology
Data Collection.
Data Acquisition and Preprocessing.
Crash Event Definition.
Feature Engineering and Target Label Creation.

Model Architectures.
Training and Hyperparameter Tuning.
Evaluation Metrics.
- 1.
- Classification Metrics: For binary crash prediction (C=1 vs. C=0), the following metrics were used: Precision, Recall, F1-Score, Accuracy, and Area Under the Receiver Operating Characteristic Curve (AUC-ROC). These metrics offered insights into the models’ ability to identify crash events while managing false positives (incorrectly predicting a crash) and false negatives (failing to predict an actual crash), which is crucial in financial risk management.
- 2.
- Regression Metrics: For models that predict future crash prices, the evaluation included Mean Squared Error (MSE), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Mean Squared Log Error (MSLE), and R² Score. These metrics were selected to measure the accuracy of price predictions and evaluate the models’ capability to reflect the scale of market movements.
- 3.
-
Portfolio Performance Metrics: To evaluate the real-world usefulness and impact of the crash prediction signals, a simulated tactical allocation strategy was used. Performance was measured using financial metrics such as:
- (a)
- Drawdown: Maximum observed loss from a peak to a trough, indicating capital preservation capabilities.
- (b)
- Sharpe Ratio: Risk-adjusted return, measuring return per unit of risk.
- (c)
- Volatility Reduction: A measure of reduced-price fluctuations compared to a baseline, indicating stability.
Assumptions and Limitations.
4. Results
Overall Model Performance.
Reinforcement Learning Regression Model Comparisons.
Traditional Classification Model Comparisons.
Deep Learning Regression Model Comparisons.
Portfolio Simulation Outcomes.
Model Complexity and Interpretability.
5. Conclusions
References
- Atsalakis, G.; Valavanis, K. Surveying stock market forecasting techniques – Part I: Conventional methods. In Computation Optimization in Economics and Finance Research Compendium; Zopounidis, C., Ed.; Nova Science Publishers: New York, 2013; pp. 49–104.
- Kim, K.J. Financial time series forecasting using support vector machines. Neurocomputing 2003, 55, 307–319. [CrossRef]
- Sezer, O.B.; Gudelek, M.U.; Ozbayoglu, A.M. Financial time series forecasting with deep learning: A systematic literature review 2005–2019. Applied Soft Computing 2020, 90, 106181. [CrossRef]
- Selvin, S.; Vinayakumar, R.; Gopalakrishnan, E.A.; Menon, V.K.; Soman, K.P. Stock price prediction using LSTM, RNN and CNN-sliding window model. In Proceedings of the Proceedings of the International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2017, pp. 1643–1647. [CrossRef]
- Fischer, T.; Krauss, C. Deep learning with long short-term memory networks for financial market predictions. European Journal of Operational Research 2018, 270, 654–669. [CrossRef]
- Kara, Y.; Boyacioglu, M.A.; Baykan, Ö. Predicting direction of stock price index movement using artificial neural networks and support vector machines: The sample of the Istanbul Stock Exchange. Expert Systems with Applications 2011, 38, 5311–5319. [CrossRef]
- Chong, E.; Han, C.; Park, F.C. Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies. Expert Systems with Applications 2017, 83, 187–205. [CrossRef]
- Bao, W.; Yue, J.; Rao, Y. A deep learning framework for financial time series using stacked autoencoders and long-short term memory. PLoS One 2017, 12, e0180944. [CrossRef]
- Anastasiou, D.; Katsafados, A.; Tzomakas, C. Banks’ stock price crash risk prediction with textual analysis: a machine learning approach. Annals of Operations Research 2025. [CrossRef]
- Zhang, Y.; Zohren, S.; Roberts, S. Deep reinforcement learning for trading. Journal of Finance and Data Science 2020, 2, 25–40. [CrossRef]
- Deng, Y.; Bao, F.; Kong, Y.; Ren, Z.; Dai, Q. Deep direct reinforcement learning for financial signal representation and trading. IEEE Transactions on Neural Networks and Learning Systems 2016, 28, 653–664. [CrossRef]
- Jiang, Z.; Xu, D.; Liang, J. A deep reinforcement learning framework for the financial portfolio management problem. arXiv preprint arXiv:1706.10059, 2017. [CrossRef]
- Moody, J.; Saffell, M. Learning to trade via direct reinforcement. IEEE Transactions on Neural Networks 2001, 12, 875–889. [CrossRef]
- Chen, Y.; Li, X.; Wang, H. Hybrid CNN-BiGRU-AM Model with Anomaly Detection for Stock Price Crash Prediction. Electronics 2025, 14, 1275. [CrossRef]
- Henrique, B.M.; Sobreiro, V.A.; Kimura, H. Stock price prediction using support vector regression on daily and up to the minute prices. Journal of Finance and Data Science 2018, 4, 183–201. [CrossRef]
- Dichtl, H.; Drobetz, W.; Otto, T. Forecasting Stock Market Crashes via Machine Learning. SSRN Electronic Journal, 2021. [CrossRef]




| References | Key Contribution | How This Paper Approaches |
|---|---|---|
| Conventional & Classical ML (Atsalakis & Valavanis, 2010[1]; Kim, 2003[2]; Henrique et al., 2018[15]) | Surveys and applications of conventional econometric models and ML methods (SVM, SVR) for price/direction forecasting | Moves beyond price or direction prediction to define crash-like events as rare outcomes, framed as temporal sequence classification |
| Deep Supervised Models (Bao et al., 2017[8]; Fischer & Krauss, 2018[5]; Chong et al., 2017[7]) | Use of LSTM, stacked autoencoders, CNNs, and feature learning for financial time series forecasting | Extends from one-step regression/classification to crash detection using sliding windows; integrates DL outputs with RL policies |
| Reinforcement Learning for Trading/Portfolio (Deng et al., 2016[11]; Jiang et al., 2017[12]) | DRL applied to optimize trading strategies, learn signal representations, or manage portfolio weights | Shifts RL focus from profit maximization to crash resilience by introducing misclassification costs and portfolio risk metrics (drawdown, Sharpe) |
| Crash-specific ML Approaches (Dichtl et al., 2021[16]) | Supervised ML applied directly for crash prediction | Advances from pure supervised learning to a hybrid DL + DRL design, explicitly targeting rare crash events |
| Multimodal / Representation Focus(Chong et al., 2017[7]) | Emphasis on feature extraction and representation (e.g., PCA, RBM, autoencoders) for predictive tasks | Uses representations but goes further: aligns them with RL-driven crash-aware decision policies |
| Model | MSE | MAE | R2-Score |
|---|---|---|---|
| Deep Reinforcement Learning | 6.922 | 1.971 | 0.999 |
| Deep RL (with Moving Averages) | 6.922 | 1.971 | 0.999 |
| Transformer | 982.18 | 130.60 | -11.47 |
| Deep Deterministic Policy Gradiant (DDPG) | 0.367 | 0.486 | -3.39 |
| Graph Neural Network (GNN) | 62.87 | 6.427 | 0.910 |
| Twin Delayed DDPG (TD3) | 6.922 | 1.971 | 0.999 |
| Soft Actor-Critic (SAC) | 6.922 | 1.971 | 0.999 |
| Neural Attention Mechanism in PPO | 62.87 | 6.427 | 0.910 |
| Evolution Strategies (ES) + PPO | 6.922 | 1.971 | 0.999 |
| Meta-Learning (MAML) | 6.922 | 1.971 | 0.999 |
| Model | Precision | Recall | F1-Score | Accuracy | AUC |
|---|---|---|---|---|---|
| KNN | 0.731 | 0.787 | 0.758 | 0.773 | 0.940 |
| LR | 0.714 | 0.600 | 0.653 | 0.680 | 0.993 |
| DT | 0.711 | 0.640 | 0.673 | 0.707 | 0.843 |
| SVM | 0.613 | 0.640 | 0.626 | 0.667 | 0.828 |
| Model | MSE | R2-Score |
|---|---|---|
| LSTM | 0.093 | 0.007 |
| Autoencoder | 0.189 | 0.148 |
| CNN | 0.105 | 0.045 |
| GAN | 0.420 | -7739.66 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).