Submitted:
25 February 2025
Posted:
25 February 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- To identify the most effective interpretability methods for credit risk assessment in finance, striking an optimal balance between interpretability and predictive accuracy.
- To explore how trade-offs between interpretability and accuracy vary across different credit risk assessment tasks in finance.
- To examine the role of domain knowledge in enhancing the interpretability of credit risk assessment models in financial contexts.
- To develop a robust framework for integrating human-in-the-loop decisionmaking into credit risk assessment workflows using interpretable models.
- What are the most effective interpretability methods for credit risk assessment in finance, in terms of balancing interpretability and accuracy?
- How do the interpretability and accuracy trade-offs differ across different credit risk assessment tasks in finance?
- How does domain knowledge impact the interpretability of credit risk assessment models in finance?
- How can human-in-the-loop decision making be effectively incorporated into credit risk assessment using interpretable models in finance?
2. Literature Review
2.1. An Overview of Interpretability Methods
- Interpretability helps ensure impartiality in decision making, enabling detection and consequently correction from bias in the training dataset. • Interpretability facilitates the provision of robustness by highlighting potential adversarial perturbations that could change the prediction. • Interpretability can act as an insurance validating meaningful variables for inferring output, i.e., guaranteeing that an underlying truthful causality exists in model reasoning.
2.2. Interpretability in Context of Finance
2.3. Deep Learning and Decision Trees Model for Credit Risk Assessment
3. Methodology
3.1. Data Pre-Processing
- receive wages, fin32
- receive transfers, fin37
- receive pension, fin38
- receive agriculture, fin42
- pay utilities, fin30
- • anydigpayment, merchantpay dig, fin14 1, fin14a, fin14a1, fin14b
- 0 = It was a null cell in the raw dataset
- 1 = Yes
- 2 = No
- 3 = I do not know
- 4 = Refuse to answer
- 1 = Respondent’s main source of emergency funds is savings
- 2 = Family, relatives, or friends
- 3 = Money from working
- 4 = Borrowing from a bank, employer, or private lender
- 5 = Sale of assets
- 6 = Some other source
- 7 = Could not come up with the money
- 8 = Don’t know
- 9 = Refused to answer
Results
Machine Learning Models
4. Interpretability and Explainability of ML Models




5. Discussion
- Partial Dependence Plot (PDP) Goldstein et al. (2014)
- Individual Conditional Expectation (ICE) Goldstein et al. (2014)
- Feature Importance Fang et al. (2020)
- Global Surrogate Lualdi et al. (2022)
- Local Surrogate (LIME) Ribeiro et al. (2016)
- Shapley Value (SHAP) Lundberg and Lee (2017)
6. Conclusion
References
- Adadi, A., Berrada, M., 2018. Peeking inside the black-box: a survey on explainable artificial intelligence (xai). IEEE access 6, 52138–52160. [CrossRef]
- Addo, P.M., Guegan, D., Hassani, B., 2018. Credit Risk Analysis using Machine and Deep Learning Models. Technical Report 08/WP/2018. Universit´e Paris 1 Panth´eon-Sorbonne. [CrossRef]
- Aremu, O.O., Hyland-Wood, D., McAree, P.R., 2020. A machine learning approach to circumventing the curse of dimensionality in discontinuous time series machine data. Reliability Engineering & System Safety 195, 106706. [CrossRef]
- Arrieta, A.B., Rodriguez, N.D., Ser, J.D., Bennetot, A., Tabik, S., Gonz´alez, A.B., Garcia, S., Gil-Lopez, S., Molina, D., Benjamins, V.R., Chatila, R., Herrera, F., 2019. Explainable artificial intelligence (xai): Concepts, taxonomies, opportunities and challenges toward responsible ai. Information Fusion 58. [CrossRef]
- Attigeri, G., M, M.P.M., Pai, R.M., 2019. Framework to predict npa/willful defaults in corporate loans: A big data approach. International Journal of Electrical and Computer Engineering (IJECE) 9, 3786–3797. [CrossRef]
- Bawa, J., Goyal, V., Mitra, S., Basu, S., 2018. An analysis of npas of indian banks: Using a comprehensive framework of 31 financial ratios. IIMB Management Review 31. [CrossRef]
- Bello, O.A., 2023. Machine learning algorithms for credit risk assessment: an economic and financial analysis. International Journal of Management 10, 109–133.
- Buckley, R.P., Zetzsche, D.A., Arner, D.W., Tang, B.W., 2021. Regulating artificial intelligence in finance: Putting the human in the loop. Sydney Law Review, The 43, 43–81.
- Bussmann, N., Giudici, P., Marinelli, D., Papenbrock, J., 2020. Explainable ai in fintech risk management. Frontiers in Artificial Intelligence 3, 26. [CrossRef]
- Choudhary, R., Gianey, H.K., 2017. Comprehensive review on supervised machine learning algorithms, in: 2017 International Conference on Machine Learning and Data Science (MLDS), IEEE. pp. 37–43.
- Chow, J.C.K., 2017. Analysis of financial credit risk using machine learning URL: https://arxiv.org/ftp/arxiv/papers/1802/1802.05326.pdf.
- Crespo Ma´rquez, A., 2022. The curse of dimensionality, in: Digital Maintenance Management: Guiding Digital Transformation in Maintenance. Springer, pp. 67–86.
- Das, K., Behera, R.N., 2017. A survey on machine learning: concept, algorithms and applications. International Journal of Innovative Research in Computer and Communication Engineering 5, 1301–1309.
- Demajo, L., Vella, V., Dingli, A., 2020. Explainable ai for interpretable credit scoring. CSIT. [CrossRef]
- Demirgu¨¸c-Kunt, A., Klapper, L.F., 2012. Measuring financial inclusion: The global findex database. World bank policy research working paper .
- Emmert-Streib, F., Yli-Harja, O., Dehmer, M., 2020. Explainable artificial intelligence and machine learning: A reality rooted perspective . [CrossRef]
- Fang, F., Ventre, C., Li, L., Kanthan, L., Wu, F., Basios, M., 2020. Better model selection with a new definition of feature importance. URL: https: //arxiv.org/abs/2009.07708, arXiv:2009.07708.
- Goldstein, A., Kapelner, A., Bleich, J., Pitkin, E., 2014. Peeking inside the black box: Visualizing statistical learning with plots of individual conditional expectation. URL: https://arxiv.org/abs/1309.6392, arXiv:1309.6392. [CrossRef]
- Kadanda, D., Raj, K., 2018. Non-performing assets (npas) and its determinants: A study of indian public sector banks. Journal of Social and Economic Development 20, 193–212. [CrossRef]
- Lualdi, P., Sturm, R., Siefkes, T., 2022. Exploration-oriented sampling strategies for global surrogate modeling: A comparison between one-stage and adaptive methods. Journal of Computational Science 60, 101603. [CrossRef]
- URL:. [CrossRef]
- Lundberg, S., Lee, S.I., 2017. A unified approach to interpreting model predictions. URL: https://arxiv.org/abs/1705.07874, arXiv:1705.07874.
- Mahesh, B., 2020. Machine learning algorithms-a review. International Journal of Science and Research (IJSR).[Internet] 9, 381–386.
- Misheva, B.H., Hisra, A., Osterrieder, J., Kulkarni, O., Lin, S.F., 2021. Explainable ai in credit risk management URL: https://arxiv.org/pdf/ 2103.00949.pdf.
- Molnar, C., 2022. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable. URL: https://christophm.github.io/ interpretable-ml-book/.
- Nemenyi, P.B., 1963. Distribution-free multiple comparisons. Princeton University.
- Ribeiro, M.T., Singh, S., Guestrin, C., 2016. ”why should i trust you?”: Explaining the predictions of any classifier. URL: https://arxiv.org/ abs/1602.04938, arXiv:1602.04938.
- Yao, Y., Li, J., 2022. Operational risk assessment of third-party payment platforms: a case study of china. Financial Innovation 8, 19. [CrossRef]
- Zhou, L., Pan, S., Wang, J., Vasilakos, A.V., 2017. Machine learning on big data: Opportunities and challenges. Neurocomputing 237, 350–361. [CrossRef]






| Models | AUC | RMSE |
| M1 | 0.876280 | 0.245231 |
| M2 | 0.993066 | 0.096683 |
| M3 | 0.994803 | 0.044277 |
| D1 | 0.904914 | 0.114487 |
| D2 | 0.841172 | 0.116625 |
| D3 | 0.975266 | 0.323504 |
| D4 | 0.897737 | 0.113269 |
| Variable Name | Description | Data Type |
| economycode | Name of the economy | object |
| pop adult | Adult (15+) population using 2020 World Development Indicators (WDI) |
float64 |
| wpid random | Individual-level identifier to merge with Gallup World Poll data | int64 |
| wgt | Weight assigned to each observation | float64 |
| female | Respondent is female or male: 1 = female, 2 = male |
int64 |
| age | Respondent’s age (in years) | float64 |
| educ | Respondent’s education level: = primary or less, = secondary, = tertiary or more |
int64 |
| inc q | Respondent’s within-economy household income quintile (1 to 5) | int64 |
| emp in | Respondent is in workforce: 1 = in workforce, 2 = out |
float64 |
| urbancity f2f | Respondent lives in rural or urban area: 1 = rural, 2 = urban |
float64 |
| account | Has an account: 1 = yes, 0 = no |
int64 |
| saved | Personally saved money in the past year: 1 = yes, 0 = no |
int64 |
| borrowed | Personally borrowed money in the past year: 1 = yes, 0 = no |
int64 |
| mobileowner | Owns a mobile phone | int64 |
| internetaccess | Respondent has internet access | int64 |
| anydigpayment | Made/received a digital payment: 1 = yes, 0 = no |
int64 |
| remittances | Made/received a domestic remittance payment | int64 |
| merchantpay dig | Made a digital merchant payment: 1 = yes, 0 = no |
int64 |
| fin1 1a | Opened first account to receive a wage payment | float64 |
| fin1 1b | Opened first account to receive government money | float64 |
| fin2 | Has a debit card | int64 |
| fin4 | Used a debit card | float64 |
| fin4a | Used a debit card in store | float64 |
| fin5 | Used mobile phone or internet to access account | float64 |
| fin6 | Used mobile phone or internet to check balance | float64 |
| fin7 | Has a credit card | float64 |
| fin8 | Used a credit card | float64 |
| fin8a | Used a credit card in store | float64 |
| fin8b | Paid credit card balances in full | float64 |
| fin9 | Made a deposit into the account | float64 |
| fin9a | Made a deposit two or more times per month | float64 |
| fin10 | Withdrew from the account | float64 |
| fin10a | Withdrew two or more times per month | float64 |
| fin10b | Used account to store money | float64 |
| fin11 1 | Unbanked: use account without help | float64 |
| fin11a | Reason for no account: too far | float64 |
| fin11b | Reason for no account: too expensive | float64 |
| fin11c | Reason for no account: lack documentation | float64 |
| fin11d | Reason for no account: lack trust12 | float64 |
| fin11e | Reason for no account: religious reasons | float64 |
| fin11f | Reason for no account: lack money | float64 |
| fin11g | Reason for no account: family member already has one | float64 |
| fin11h | Reason for no account: no need for services | float64 |
| Variable Name | Description | Data Type |
| fin13 1b | Reason for no mobile money account: too expensive | float64 |
| fin13 1c | Reason for no mobile money account: lack documentation | float64 |
| fin13 1d | Reason for no mobile money account: lack money | float64 |
| fin13 1e | Reason for no mobile money account: use agent | float64 |
| fin13 1f | Reason for no mobile money account: no mobile phone | float64 |
| fin13a | Use mobile money account two or more times per month | float64 |
| fin13b | Use mobile money account to store money | float64 |
| fin13c | Use mobile money account to borrow money | float64 |
| fin13d | Use mobile money account without help | float64 |
| fin14 2 | Paid digitally for in-store purchase after COVID-19 | float64 |
| fin14c | Paid online or in cash at delivery | float64 |
| fin14c2 | Paid online for the first time after COVID-19 | float64 |
| fin24 | Main source of emergency funds in 30 days | int64 |
| fin24a | Difficulty of emergency funds in 30 days | float64 |
| fin24b | Difficulty of emergency funds in 7 days | float64 |
| fin26 | Sent domestic remittances | float64 |
| fin28 | Received domestic remittances | float64 |
| ML Model | Confusion Matrix | TP | FP | TN | FN | F1-Score | Precision | Recall |
| Support Vector Machine |
![]() |
25,752 | 8 | 12,576 | 21 | 1.00 | 1.00 | 1.00 |
| Logistic Regression | ![]() |
25,740 | 6 | 12,589 | 22 | 1.00 | 1.00 | 1.00 |
| Decision Tree Classifier |
![]() |
25,391 | 355 | 12,281 | 330 | 0.99 | 0.99 | 0.99 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).


