Submitted:
04 May 2026
Posted:
06 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Data and Feature Engineering
2.1. Data Sources
2.2. Predictor Variables
2.3. Stationarity and Transformations
3. Methodological Framework
3.1. Imbalanced Classification
3.2. Training, Validation, and Testing
4. Machine Learning Models
4.1. Linear Benchmark Models
4.2. Kernel-Based Model
4.3. Ensemble Tree Methods
4.4. Shallow Neural Network
4.5. Deep Sequence Models
5. Classification Performance Metrics
6. Empirical Results
6.1. Training Sample Performance
6.2. Test Sample Performance
6.3. Predicted Recession Probabilities: Full Sample
6.4. Comparison with the Original Paper
7. SHAP-Based Interpretability
8. Structural Break Analysis
- 1.
- Pre-1984 (High Inflation Era): Characterized by the oil shocks of the 1970s, the Volcker disinflation of 1979–1983, and multiple recessions driven by supply-side and monetary shocks. Macroeconomic volatility was high.
- 2.
- 1984–2007 (Great Moderation): A period of low macroeconomic volatility, declining interest rates, and relatively mild recessions (1990–91 and 2001).
- 3.
- 2008–2019 (Post-GFC): Near-zero interest rates, unconventional monetary policy, and a historically long expansion following the GFC.
- 4.
- 2020–2024 (Post-Pandemic): The exogenous COVID-19 recession, rapid recovery, high inflation, and aggressive monetary tightening.
9. Economic Value Analysis
| Model | Ann. Return | S&P 500 | Volatility | Sharpe | Max DD | Switches |
|---|---|---|---|---|---|---|
| PROBIT | 6.38% | 8.20% | 12.62% | 0.51 | −30.50% | 32 |
| GLMNET | 5.88% | 8.20% | 12.44% | 0.47 | −35.68% | 44 |
| SVM | 6.64% | 8.20% | 11.92% | 0.56 | −24.77% | 46 |
| RF | 8.53% | 8.20% | 12.94% | 0.66 | −31.87% | 20 |
| XGBoost | 8.17% | 8.20% | 12.37% | 0.66 | −27.18% | 30 |
| LightGBM | 8.66% | 8.20% | 12.46% | 0.69 | −30.26% | 32 |
| NNET | 8.57% | 8.20% | 12.01% | 0.71 | −32.36% | 40 |
| LSTM | 10.21% | 8.20% | 13.26% | 0.77 | −24.77% | 4 |
| Transformer | 11.90% | 8.20% | 14.20% | 0.84 | −24.77% | 3 |
| S&P 500 | 8.20% | – | 15.65% | 0.52 | −52.56% | – |

10. Post-Pandemic Subperiod Analysis (2020–2024)
11. Conclusion
Data Availability Statement
Conflicts of Interest
Use of Artificial Intelligence
References
- Breiman, Leo. 2001. Random forests. Machine Learning 45, 1: 5–32. [Google Scholar] [CrossRef]
- Camacho, Maximo, Gabriel Perez-Quiros, and Pilar Poncela. 2012. Markov-switching dynamic factor models in real time. CEPR Working Paper (8866). [Google Scholar]
- Chen, Tianqi, and Carlos Guestrin. 2016. XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; pp. 785–794. [Google Scholar]
- Davis, Jesse, and Mark Goadrich. 2006. The relationship between precision-recall and ROC curves. Proceedings of the 23rd International Conference on Machine Learning; pp. 233–240. [Google Scholar]
- Estrella, Arturo, and Gikas A. Hardouvelis. 1991. The term structure as a predictor of real economic activity. The Journal of Finance 46, 2: 555–576. [Google Scholar] [CrossRef]
- Estrella, Arturo, and Frederic S. Mishkin. 1996. The yield curve as a predictor of U.S. recessions. Current Issues in Economics and Finance 2, 7: 1–6. [Google Scholar] [CrossRef]
- Estrella, Arturo, and Frederic S. Mishkin. 1998. Predicting U.S. recessions: Financial variables as leading indicators. The Review of Economics and Statistics 80, 1: 45–61. [Google Scholar] [CrossRef]
- Friedman, Jerome, Trevor Hastie, and Robert Tibshirani. 2010. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software 33, 1: 1–22. [Google Scholar] [CrossRef]
- Hochreiter, Sepp, and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation 9, 8: 1735–1780. [Google Scholar] [CrossRef]
- Hyndman, Rob J., and George Athanasopoulos. 2018. Forecasting: Principles and Practice, 2nd ed. Melbourne, Australia: OTexts. [Google Scholar]
- Ke, Guolin, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. 2017. LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems 30: 3146–3154. [Google Scholar]
- Levanon, Gad, Jean-Claude Manini, Ataman Ozyildirim, Brian Schaitkin, and Jingyi Tanchua. 2011. Using a leading credit index to predict turning points in the U.S. business cycle. The Conference Board, Economics Program; pp. Working Paper 11–05. [Google Scholar]
- Lundberg, Scott M., and Su-In Lee. 2017. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems. vol. 30. [Google Scholar]
- Ng, Serena. 2014. Viewpoint: Boosting recessions. Canadian Journal of Economics 47, 1: 1–34. [Google Scholar] [CrossRef]
- Sephton, Peter. 2001. Forecasting recessions: Can we do better on MARS? Federal Reserve Bank of St. Louis Review 83, 2: 39–50. [Google Scholar] [CrossRef]
- Vapnik, Vladimir. 1996. The Nature of Statistical Learning Theory. New York: Springer. [Google Scholar]
- Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. vol. 30. [Google Scholar]
- Wright, Jonathan H. 2006. The yield curve and predicting recessions. In Finance and Economics Discussion Series. Board of Governors of the Federal Reserve System. [Google Scholar]
- Yazdani, Alireza. 2020. Machine learning prediction of recessions: An imbalanced classification approach. The Journal of Financial Data Science 2, 4: 21–32. [Google Scholar] [CrossRef]






| Series | FRED Code | Transformation (Lag) | Source |
|---|---|---|---|
| Original Predictors (Yazdani 2020) | |||
| Federal Funds Rate | FEDFUNDS | DIFF (1M) | Sephton (2001); Ng (2014) |
| Industrial Production | INDPRO | DIFF_LOG (1M) | Sephton (2001) |
| Nonfarm Payrolls | PAYEMS | DIFF_LOG (1M) | Camacho et al. (2012) |
| S&P 500 Index | – | DIFF_LOG (1M) | Estrella and Mishkin (1998) |
| 10Y Treasury Rate | GS10 | DIFF (1M) | Estrella and Mishkin (1998) |
| Unemployment Rate | UNRATE | DIFF_LOG (1M) | Ng (2014) |
| Yield Curve Slope | GS10 − FEDFUNDS | LEVEL (1M) | Estrella and Hardouvelis (1991); Wright (2006) |
| New Predictors (this paper) | |||
| Housing Starts | HOUST | DIFF_LOG (1M) | Levanon et al. (2011) |
| CPI Inflation (YoY) | CPIAUCSL | LOG DIFF (12M, 1M) | this paper |
| Labor Force Participation Rate | CIVPART | DIFF (1M) | this paper |
| M2 Money Stock (YoY) | M2SL | LOG DIFF (12M, 1M) | this paper |
| PPI (YoY) | PPIACO | LOG DIFF (12M, 1M) | this paper |
| Real Federal Funds Rate | FEDFUNDS − CPI | LEVEL (1M) | this paper |
| Yield Curve Inversion | YC | Binary Dummy (1M) | Estrella and Mishkin (1996) |
| Model | AUC | PR-AUC | F-Score | KS | Prec. | Sens. | Spec. | Brier |
|---|---|---|---|---|---|---|---|---|
| (%) | (%) | (%) | (%) | (%) | (%) | (%) | ||
| PROBIT | 96.3 | 81.1 | 71.8 | 83.7 | 58.3 | 93.6 | 90.0 | 0.074 |
| GLMNET | 96.6 | 81.0 | 71.5 | 84.2 | 57.9 | 93.6 | 89.8 | 0.073 |
| SVM | 98.8 | 86.8 | 73.4 | 98.2 | 58.0 | 100.0 | 89.2 | 0.069 |
| RF | 99.3 | 95.3 | 77.6 | 93.7 | 64.3 | 97.9 | 91.9 | 0.063 |
| XGBoost | 99.2 | 92.5 | 78.0 | 96.0 | 63.9 | 100.0 | 91.6 | 0.054 |
| LightGBM | 99.0 | 91.5 | 78.3 | 94.1 | 64.4 | 100.0 | 91.7 | 0.057 |
| NNET | 98.8 | 90.0 | 79.3 | 95.7 | 65.7 | 100.0 | 92.2 | 0.055 |
| LSTM | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 0.000 |
| Transformer | 100.0 | 100.0 | 99.4 | 100.0 | 100.0 | 98.8 | 100.0 | 0.001 |
| Model | AUC | PR-AUC | F-Score | KS | Prec. | Sens. | Spec. | Brier |
|---|---|---|---|---|---|---|---|---|
| (%) | (%) | (%) | (%) | (%) | (%) | (%) | ||
| PROBIT | 95.1 | 67.0 | 52.1 | 79.2 | 35.8 | 95.0 | 82.7 | 0.112 |
| GLMNET | 95.2 | 69.6 | 52.1 | 81.3 | 35.8 | 95.0 | 82.7 | 0.108 |
| SVM | 98.0 | 82.8 | 54.8 | 92.4 | 37.7 | 100.0 | 83.2 | 0.114 |
| RF | 99.5 | 97.0 | 73.1 | 94.0 | 59.4 | 95.0 | 93.4 | 0.067 |
| XGBoost | 99.6 | 96.7 | 65.6 | 95.9 | 48.8 | 100.0 | 89.3 | 0.063 |
| LightGBM | 99.6 | 96.4 | 66.7 | 95.9 | 50.0 | 100.0 | 89.8 | 0.063 |
| NNET | 98.0 | 75.5 | 62.5 | 93.9 | 45.5 | 100.0 | 87.8 | 0.089 |
| LSTM | 99.1 | 95.5 | 70.2 | 91.3 | 54.1 | 100.0 | 90.8 | 0.080 |
| Transformer | 99.2 | 87.6 | 92.3 | 94.6 | 94.7 | 90.0 | 99.5 | 0.015 |
| Yazdani (2020): Test 2007–2019 | This Paper: Test 2007–2024 | |||||
|---|---|---|---|---|---|---|
| Model | AUC | Sens. | F-Score | AUC | Sens. | F-Score |
| PROBIT | 90% | 89% | 71% | 95.1% | 95.0% | 52.1% |
| GLMNET | 95% | 100% | 73% | 95.2% | 95.0% | 52.1% |
| SVM | 91% | 89% | 76% | 98.0% | 100.0% | 54.8% |
| RF | 94% | 95% | 80% | 99.5% | 95.0% | 73.1% |
| XGBoost | 94% | 95% | 78% | 99.6% | 100.0% | 65.6% |
| NNET | 84% | 74% | 68% | 98.0% | 100.0% | 62.5% |
| Original paper metrics sourced from Exhibit 6 of Yazdani (2020). | ||||||
| Pre-1984 | 1984–2007 | 2008–2019 | 2020–2024 | |||||
|---|---|---|---|---|---|---|---|---|
| Model | AUC | Sens. | AUC | Sens. | AUC | Sens. | AUC | Sens. |
| PROBIT | 95.7% | 93.2% | 96.0% | 93.8% | 99.4% | 100.0% | 73.3% | 50.0% |
| GLMNET | 95.6% | 93.2% | 96.8% | 93.8% | 99.5% | 100.0% | 69.0% | 50.0% |
| SVM | 98.4% | 100.0% | 99.0% | 100.0% | 99.6% | 100.0% | 84.5% | 100.0% |
| RF | 98.5% | 98.3% | 99.8% | 100.0% | 100.0% | 100.0% | 87.9% | 50.0% |
| XGBoost | 98.5% | 100.0% | 99.8% | 100.0% | 100.0% | 100.0% | 95.7% | 100.0% |
| LightGBM | 98.1% | 100.0% | 99.5% | 100.0% | 100.0% | 100.0% | 96.6% | 100.0% |
| NNET | 98.7% | 100.0% | 98.3% | 100.0% | 99.6% | 100.0% | 91.4% | 100.0% |
| LSTM and Transformer excluded from regime table as their rolling-window predictions cover a shorter date range. | ||||||||
| Model | Months >50% | Max Prob. | Mean Prob. | AUC | Sens. |
|---|---|---|---|---|---|
| PROBIT | 24 | 100.0% | 37.5% | 73.3% | 50.0% |
| GLMNET | 23 | 100.0% | 37.1% | 69.0% | 50.0% |
| SVM | 20 | 93.3% | 42.8% | 84.5% | 100.0% |
| RF | 12 | 81.8% | 36.3% | 87.9% | 50.0% |
| XGBoost | 18 | 99.3% | 30.0% | 95.7% | 100.0% |
| LightGBM | 16 | 98.5% | 31.0% | 96.6% | 100.0% |
| NNET | 15 | 100.0% | 29.4% | 91.4% | 100.0% |
| LSTM | 19 | 99.5% | 31.5% | 72.4% | 100.0% |
| Transformer | 1 | 100.0% | 2.9% | 89.7% | 0.0% |
| 60 total observations (2020-01 to 2024-12); 2 NBER recession months (March–April 2020). | |||||
| “Months >50%” counts months with predicted recession probability above 0.50. | |||||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.