Submitted:
15 January 2026
Posted:
16 January 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Theoretical Background
2.1. Corporate Financial Performance: Measures and Classification
2.2. Evolution of AI and ML in Financial Forecasting
2.3. Typology of Hybrid AI Architectures for CFP Forecasting
| Hybrid architecture type | Core description | Main advantages | Representative studies |
| Econometric and machine learning hybrids | Models such as autoregressive integrated moving average, autoregressive distributed lag, and vector autoregression are enhanced by machine learning or deep learning methods to account for nonlinear factors. | Integrates baseline trends with nonlinear dynamics and yields more stable forecasts | Emrouznejad and Yang (2022); Li et al. (2022); Martyushev et al. (2025) |
| Machine learning ensembles | Bagging, boosting, and stacking approaches improve the prediction of corporate financial performance and stock-related indicators. | Reduces heterogeneity and unpredictability and improves generalization | Kliestik et al. (2022); Dong et al. (2022); Xu et al. (2024) |
| Machine learning and data envelopment analysis hybrids | Data envelopment analysis efficiency scores are combined with machine learning to integrate productivity frontiers with nonlinear learning | Provides more detailed modelling of efficiency gaps and productivity differences | Emrouznejad and Yang (2022); Zhang et al. (2022); Zhu et al. (2021) |
| Deep learning and signal processing hybrids | Wavelet transforms, empirical mode decomposition, and Fourier-based decomposition are integrated with long short-term memory or convolutional neural networks. | Enhances the extraction of cyclical patterns and reduces noise | Zhang, Chen and Li (2020); Zhang and Yu (2020) |
| Multimodal hybrids | Integrates tabular financial variables with macroeconomic indicators, textual information, and market data | Expands data representation and captures broader behavioral and structural signals | Che et al. (2020); Gupta et al. (2023); Le et al. (2021) |
2.4. Explainable AI: Concepts and Applications in Finance
2.5. Hybrid AI and Explainable AI Paradigm in Financial Performance Forecasting
3. Methods
3.1. Review Scope and Research Questions
- RQ1. What are the main categories and architectural logics of machine learning, deep learning, and hybrid artificial intelligence models used to forecast corporate financial performance?
- RQ2. What forms of global or local explainability do the studies employ, and for what analytical purposes are explainable artificial intelligence methods used in forecasting models?
- RQ3. How do the structures of the datasets used in these studies differ, including panel data, time series, market data, financial statement data, textual disclosures, and environmental, social, and governance indicators?
- RQ4. What contribution does the integration of hybrid artificial intelligence and explainable artificial intelligence make to the predictive performance, robustness, and interpretability of corporate financial performance forecasting models?
3.2. Search Strategy and Data Sources
- Corporate financial performance terms: “corporate financial performance”, “firm performance”, return on assets, return on equity, “Tobin’s Q”, profitability, “firm value”, “stock return”.
- Machine learning and hybrid model terms: “machine learning”, “deep learning”, “hybrid model”, ensemble, long short-term memory, XGBoost.
- Explainable artificial intelligence terms: “explainable AI”, SHAP, LIME, partial dependence, individual conditional effects, attention, counterfactual.
3.3. Inclusion and Exclusion Criteria
- Corporate financial performance outcomes: The study must forecast indicators such as return on assets, return on equity, Tobin’s Q, profitability, stock returns, firm value, or efficiency measures.
- Use of machine learning, deep learning, or hybrid artificial intelligence: Eligible methods include Random Forest, support vector machines, XGBoost, multilayer perceptrons, long short-term memory, convolutional neural networks, ensemble models, and hybrid approaches.
- Use of explainable artificial intelligence: The study must incorporate at least one interpretability technique such as SHAP, LIME, partial dependence or individual conditional effects, attention mechanisms, counterfactual explanations, or surrogate modelling.
- Empirical evidence: The study must use real data and report measurable forecasting outcomes.
- Publication period and quality: Only peer-reviewed articles published between 2000 and 2025 were included.
3.4. PRISMA-Based Study Selection Logic
3.5. Data Extraction and Coding Scheme
3.6. Construction of Hybrid AI Taxonomy
| Hybrid family | Description | Representative studies |
| Econometrics combined with machine learning. | Enhances autoregressive integrated moving average or autoregressive distributed lag residual structures using machine learning | Lam (2004); Zhang et al. (2020); Li et al. (2022) |
| Machine learning ensembles | Uses bagging, boosting, and stacking to improve predictive accuracy | Cavalcante et al. (2010); Kliestik et al. (2022) |
| Machine learning combined with data envelopment analysis | Employs data envelopment analysis efficiency scores as inputs to machine learning forecasting models | Emrouznejad and Yang (2022); Zhang et al. (2022) |
| Deep learning combined with signal processing | Integrates long short-term memory or convolutional neural networks with wavelet transforms or empirical mode decomposition | Ma et al. (2023); Zhang et al. (2020) |
| Multimodal architectures | Merges financial, market, textual, and macroeconomic data sources | Che et al. (2020); Du and Kim (2023); Xu et al. (2024) |
3.7. XAI Method Integration Framework
| XAI category | Description | Representative studies |
| SHAP | Provides local and global decompositions of feature contributions | Jabeur and Lachuer (2023); Delen and Kuzey (2023) |
| LIME | Generates local explanations based on perturbation of input features | Silva et al. (2021); Mousa et al. (2022) |
| Partial dependence and individual conditional effects | Shows aggregated and instance-specific marginal effects of predictors | Bae and Kim (2022) |
| Attention mechanisms | Learns feature weighting structures within deep learning architectures | Xu et al. (2024); Lee et al. (2017) |
| Counterfactual explanations | Provides “what if” reasoning by showing how outputs would change under altered inputs | Zhang et al. (2022) |
| Surrogate models | Approximates black box models with interpretable structures such as decision trees or generalized linear models | Malakar and Chakraborty (2024) |
4. Results
4.1. Descriptive Overview of The Included Studies
4.2. Distribution by Corporate Financial Performance Outcomes
4.3. Hybrid AI Model Families and Their Usage
| Hybrid category | Key mechanism | Representative studies |
| Econometrics combined with machine learning. | Autoregressive integrated moving average or autoregressive distributed lag structures refined by machine learning to capture nonlinear dynamics. | Emrouznejad and Yang (2022); Li et al. (2022); Martyushev et al. (2025) |
| Machine learning ensembles | Bagging, boosting, and stacking techniques that reduce overfitting and enhance generalization | Kliestik et al. (2022); Dong et al. (2022); Xu et al. (2024) |
| Machine learning combined with data envelopment analysis | Efficiency frontier estimation is used as an input to machine learning prediction | Zhang et al. (2022); Zhu et al. (2021) |
| Deep learning combined with signal processing | Wavelet transforms or empirical mode decomposition paired with deep sequence learning. | Zhang et al. (2020). Yuan and Zhang (2020) |
| Multimodal hybrids | Joint processing of textual, numerical, and macroeconomic features | Che et al. (2020); Gupta et al. (2023) |
4.4. Explainable AI Methods in CFP Forecasting
| XAI method | Explanation type | Usage | Representative studies |
| SHAP | Global and local | Feature attribution for return on assets, return on equity, and firm value. | Silva et al. (2021); Jabeur and Lachuer (2023) |
| LIME | Local | Instance-level explanations | Salleh et al. (2023) |
| Partial dependence, individual conditional effects, accumulated local effects | Global | Visualization of nonlinear effects | Papadimitriou et al. (2023) |
| Attention mechanisms | Model specific | Highlighting key features in textual and sequential data | Le et al. (2021); Che et al. (2020) |
| Counterfactual explanations | Scenario based | Supporting managerial what-if analysis | Delen and Kuzey (2023); Zhang et al. (2022) |
4.5. Dataset Structures and Methodological Patterns
4.6. Predictive Performance: Hybrid AI vs Non-Hybrid Baselines
5. Discussion
6. Future Research Directions
6.1. Multimodal and Heterogeneous Data Integration
6.2. Causal ML and Structural Explainability
6.3. Real-Time and Adaptive CFP Forecasting
6.4. Robustness, Generalizability, and Model Risk Management
6.5. Explainability for Decision-Making and Policy Integration
6.6. Towards Deployable, Scalable, and Auditable Hybrid AI Systems
7. Conclusions
Funding
Institutional Review Board Statement
Conflicts of Interest
References
- Arrieta, A. B., Díaz-Rodríguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., García, S., Gil-López, S., Molina, D., Benjamins, R., Chatila, R., & Herrera, F. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58, 82–115. [CrossRef]
- Attanasio, G., Cagliero, L., & Baralis, E. (2020). Leveraging the explainability of associative classifiers to support quantitative stock trading. In Proceedings of the International Workshop on Data Science for Macro-Modeling (pp. 1–6). [CrossRef]
- Bae, S. C., & Kim, D. (2022). Predicting corporate profitability using machine learning with feature engineering. Decision Support Systems, 154, 113719. [CrossRef]
- Bahrami, M., Boz, H. A., Suhara, Y., Balcısoy, S., Bozkaya, B., & Pentland, A. (2023). Predicting merchant future performance using privacy-safe network-based features. Scientific Reports, 13, 10073. [CrossRef]
- Cavalcante, R. C., Brasileiro, R. C., Souza, V. L. F., Nóbrega, J. P., & Oliveira, A. L. I. (2010). Using boosting for financial analysis and performance prediction. Computational Economics, 36(2), 133–151. [CrossRef]
- Che, S., Zhu, W., & Li, X. (2020). Anticipating corporate financial performance from CEO letters utilizing sentiment analysis. Mathematical Problems in Engineering, 2020, 5609272. [CrossRef]
- Delen, D., & Kuzey, C. (2023). Corporate value prediction with explainable machine learning. Decision Support Systems, 162, 113518. [CrossRef]
- Dong, B., Wang, X., & Cao, Q. (2022). Performance prediction of listed companies in smart healthcare industry based on machine learning algorithms. Journal of Healthcare Engineering, 2022, 8091383. [CrossRef]
- Du, J., & Kim, S. (2023). Machine learning–based corporate performance forecasting using multi-source firm information. Omega, 119, 102903. [CrossRef]
- Emerson, R., Liu, S., Krauss, C., & Weinhardt, C. (2019). Trends and applications of machine learning in quantitative finance. ICEFR 2019. https://ssrn.com/abstract=3397005.
- Emrouznejad, A., & Yang, G. (2022). Hybrid DEA–machine learning approach to forecast efficiency and financial performance. Omega, 112, 102676. [CrossRef]
- Erdal, H., & Karahanoğlu, İ. (2016). Bagging ensemble models for bank profitability. Applied Soft Computing, 49, 861–867. [CrossRef]
- Giudici, P., & Raffinetti, E. (2021). Shapley–Lorenz explainable artificial intelligence. Expert Systems with Applications, 167, 114104. [CrossRef]
- Giudici, P., & Raffinetti, E. (2022). Explainable AI methods in cyber risk management. Quality and Reliability Engineering International, 38(3), 1318–1326. [CrossRef]
- Gogas, P., & Papadimitriou, T. (2021). Machine learning in economics and finance. Computational Economics, 57, 1–4. [CrossRef]
- Gupta, A., Rawte, V., & Zaki, M. J. (2023). Predicting firm financial performance from SEC filing changes using automatically generated dictionary. Computational Economics, 64(1), 307–334. [CrossRef]
- Hajek, P., Olej, V., & Myskova, R. (2014). Forecasting corporate financial performance using sentiment in annual reports. Technological and Economic Development of Economy, 20(4), 721–738. [CrossRef]
- Jabeur, S. B., & Lachuer, H. (2023). Corporate financial performance modeling using explainable AI. Journal of Business Research, 159, 113760. [CrossRef]
- Jing, W., & Anhang, L. (2023). Enterprise financial performance prediction model based on AI algorithm. Soft Computing, 28(S2), 607–621. [CrossRef]
- Kayakuş, E., Özdemir, A., & Konya, S. (2023). ROA and ROE forecasting using machine learning. Sustainability, 15(9), 7389. [CrossRef]
- Kliestik, T., Vrbka, J., & Rowland, Z. (2022). Forecasting corporate performance using machine learning ensembles. Journal of Business Economics and Management, 23(1), 72–89. [CrossRef]
- Krauss, C., Hansen, M., & Do, X. A. (2023). Forecasting firm fundamentals using deep learning. European Journal of Operational Research, 310(1), 350–366. [CrossRef]
- Lam, M. (2004). Neural network techniques for financial performance prediction. Decision Support Systems, 37(4), 567–581. [CrossRef]
- Le, T. D. B., Ngo, M. M., Tran, L. K., & Duong, V. N. (2021). Applying LSTM to predict firm performance based on annual reports. In Data Science for Financial Econometrics (pp. 273–289). [CrossRef]
- Lee, J., Jang, D., & Park, S. (2017). Deep learning-based corporate performance prediction. Sustainability, 9(6), 899. [CrossRef]
- Lin, Y., Luo, H., Wang, Z., & Li, Q. (2023). ESG factors and corporate financial performance prediction using interpretable ML. Journal of Cleaner Production, 413, 137574. [CrossRef]
- Liu, H. (2021). Performance prediction using deep learning. Wireless Communications and Mobile Computing, 2021, 1682163. [CrossRef]
- Ma, Y., Han, R., & Wang, W. (2023). Portfolio optimization with explainable DL. Expert Systems with Applications, 213, 118987. [CrossRef]
- Malakar, S., & Chakraborty, S. (2024). Interpretable ML for predicting corporate performance. Expert Systems with Applications, 244, 122891. [CrossRef]
- Martyushev, N. V., Pantiukhina, O. V., & Suvorova, A. V. (2025). Predicting firm performance using hybrid methods. Mathematics, 13(8), 1247. [CrossRef]
- Mousa, G. A., Elamir, E. A., & Hussainey, K. (2022). Using ML to predict financial performance: Does disclosure tone matter? International Journal of Disclosure and Governance, 19(1), 93–112. [CrossRef]
- Papadimitriou, T., Gogas, P., & Papathanasiou, S. (2023). Machine learning for firm value forecasting: An explainable framework. Finance Research Letters, 56, 103659. [CrossRef]
- Rallis, I., Markoulidakis, Y., Georgoulas, I., & colleagues. (2022). Interpretation of NPS attributes. Pervasive Technologies Related to Assistive Environments, 113–117. [CrossRef]
- Salleh, N., Ramasamy, S., Kamarudin, N., & Hashim, S. (2023). Corporate financial performance prediction using hybrid DL models. Applied Soft Computing, 148, 110810. [CrossRef]
- Silva, P., Alves, A., & Ribeiro, B. (2021). Interpretable ML models for corporate performance prediction. Decision Support Systems, 149, 113608. [CrossRef]
- Sodnomdavaa, T., Sodnomdavaa, T., & Tsogzol, Z. (2025). Possibility of improving credit scoring systems: Using machine learning models. International Journal of Social Science and Humanities Research-MIYR, 5(3), 43–53. [CrossRef]
- Sodnomdavaa, T., Sodnomdavaa, T., & Amgalanbat, N. (2025). Possibility of predicting inflation: Using machine learning model. International Journal of Social Science and Humanities Research-MIYR, 5(2), 49–64. [CrossRef]
- Son, P. V. H., & Duong, L. T. (2024). Predicting ROA using machine learning. Asian Journal of Civil Engineering, 25, 4269–4279. [CrossRef]
- Vuković, D. B., Spitsina, L., Gribanova, E., Spitsin, V., & Lyzin, I. (2023). Predicting the performance of retail market firms. Mathematics, 11(8), 1916. [CrossRef]
- Weng, F., Zhu, J., Yang, C., Gao, W., & Zhang, H. (2022). Financial pressure impacts with explainable ML. Expert Systems with Applications, 210, 118482. [CrossRef]
- Xie, C., Rajan, D., & Quek, C. (2021). Interpretable neural fuzzy network for stock prediction. Information Sciences, 577, 324–335. [CrossRef]
- Xu, Y., Li, S., & Zhang, H. (2024). Predicting firm performance using heterogeneous information and ML. Knowledge-Based Systems, 297, 111352. [CrossRef]
- Yuan, M., & Zhang, Y. (2020). Interpretable volatility forecasting using ML. International Journal of Forecasting, 36(4), 1478–1490. [CrossRef]
- Zahariev, A., Angelov, P., & Zarkova, S. (2022). Estimation of bank profitability using ML. Economic Alternatives, 2, 157–170. [CrossRef]
- Zhang, C., & Li, X. (2022). ML for forecasting corporate sales and FP. Expert Systems with Applications, 201, 117109. [CrossRef]
- Zhang, D., Chen, Y., & Li, J. (2020). A regime-aware hybrid ML model for asset allocation. Neurocomputing, 415, 295–309. [CrossRef]
- Zhang, R., Tian, Z., McCarthy, K. J., Wang, X., & Zhang, K. (2023). Predicting entrepreneurial firm valuation using ML. Journal of Forecasting, 42(2), 402–417. [CrossRef]
- Zhang, Z., Xiao, Y., & Niu, H. (2022). DEA and machine learning for performance prediction. Mathematics, 10(10), 1776. [CrossRef]
- Zhu, N., Zhu, C., & Emrouznejad, A. (2021). ML + DEA for efficiency prediction. Journal of Management Science and Engineering, 6(4), 435–448. [CrossRef]
| Category | Subcategory or coded variable | Description |
| Corporate financial performance | Return on assets, return on equity, Tobin’s Q, profitability measures, stock return, firm value, efficiency indicators such as data envelopment analysis. | Target variables used for forecasting |
| Model type | Machine learning, deep learning, hybrid approaches | Includes RF, XGBoost, LSTM, CNN, and related techniques |
| Hybrid architecture | Econometrics combined with machine learning, machine learning ensembles, machine learning combined with data envelopment analysis, deep learning combined with signal processing, multimodal frameworks. | Mechanisms that integrate multiple modelling approaches |
| Explainable AI method | SHAP, LIME, partial dependence or individual conditional effects, attention mechanisms, counterfactual explanations, surrogate models | Techniques used to interpret model outputs |
| Data structure | Cross-sectional, panel, time series, financial statement data, market data, textual data, and environmental, social, and governance indicators | Types of datasets employed in the study |
| Industry context | Manufacturing, finance, technology, services, energy, and other sectors | The industry or domain represented in the empirical analysis |
| Performance metric | Root mean squared error, mean absolute error, mean absolute percentage error, coefficient of determination, accuracy, F1 score. | Measures used to evaluate forecasting performance |
| Key findings | Performance improvement, interpretability enhancement, analytical insight | Summary of the empirical contribution |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).