Submitted:
09 September 2024
Posted:
10 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Literature Review
3. Materials and Methods
3.1. Linear Discriminant Analysis
3.2. Logistic Regression
3.3. Support Vector Machine
3.4. XGBoost
3.5. Random Forest
3.6. Deep Neural Network
Deep Neural Network Architecture
3.7. Methodology
Data Preprocessing
4. Results
4.1. Descriptives
4.2. Heatmap
4.3. Confusion Matrix Analysis
4.4. Receiver Operating Characteristics (ROC)
4.5. Features Importance
5. Discussions and Implications
6. Conclusions
Funding
Data Availability Statement
Conflicts of Interest
References
- Bali, T. G., Beckmeyer, H., Mörke, M., & Weigert, F. (2023). Option Return Predictability with Machine Learning and Big Data. The Review of Financial Studies, 36(9), 3548–3602. [CrossRef]
- Bastos, J. A. (2010). Forecasting bank loans loss-given-default. Journal of Banking & Finance, 34(10), 2510–2517. [CrossRef]
- Bazarbash, M. (2019). Fintech in financial inclusion: machine learning applications in assessing credit risk. INTERNATIONAL MONETARY FUND.
- Bekkar, M., Kheliouane Djemaa, H., & Akrouf Alitouche, T. (2013). Evaluation Measures for Models Assessment over Imbalanced Data Sets . Journal of Information Engineering and Applications, 3(10), 27–38.
- Bellotti, A., Brigo, D., Gambetti, P., & Vrins, F. (2021). Forecasting recovery rates on non-performing loans with machine learning. International Journal of Forecasting, 37(1), 428–444. [CrossRef]
- Bewick, V., Cheek, L., & Ball, J. (2004). Statistics review 13: receiver operating characteristic curves. Critical Care, 8, 1–5.
- Bhandary, R., Shenoy, S. S., Shetty, A., & Shetty, A. D. (2023a). Attitudes Toward Educational Loan Repayment Among College Students: A Qualitative Enquiry. Journal of Financial Counseling and Planning, 34(2), 281–292. [CrossRef]
- Bhandary, R., Shenoy, S. S., Shetty, A., & Shetty, A. D. (2023b). Education loan repayment: a systematic literature review. Journal of Financial Services Marketing. [CrossRef]
- Boudri, I., & El Bouhadi, A. (2024). Modeling and Forecasting Historical Volatility Using Econometric and Deep Learning Approaches: Evidence from the Moroccan and Bahraini Stock Markets. Journal of Risk and Financial Management, 17(7), 300. [CrossRef]
- Breiman, L. (2001a). Random forests. Machine Learning, 45, 5–32.
- Breiman, L. (2001b). Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author). Statistical Science, 16(3). [CrossRef]
- Chahuán-Jiménez, K. (2024). Neural Network-Based Predictive Models for Stock Market Index Forecasting. Journal of Risk and Financial Management, 17(6), 242. [CrossRef]
- Chang, C.-H. (2022). Information Asymmetry and Card Debt Crisis in Taiwan. Bulletin of Applied Economics, 123–145. [CrossRef]
- Chen, T., & Guestrin, C. (2016). XGBoost. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. [CrossRef]
- Chiang, W. K., Zhang, D., & Zhou, L. (2006). Predicting and explaining patronage behavior toward web and traditional stores using neural networks: a comparative analysis with logistic regression. Decision Support Systems, 41(2), 514–531.
- Chin, W., Cheah, J.-H., Liu, Y., Ting, H., Lim, X.-J., & Cham, T. H. (2020). Demystifying the role of causal-predictive modeling using partial least squares structural equation modeling in information systems research. Industrial Management & Data Systems, 120(12), 2161–2209.
- DeVaney, S. A. (1999). Determinants of consumer’s debt repayment patterns. Consumer Interests Annual, 45.
- Dicon, H. (2024, May 8). Why Are Americans Cutting Back on Credit Card Debt? Investopedia. https://www.investopedia.com/why-americans-are-cutting-back-on-credit-card-debt-8645241.
- Drobetz, W., & Otto, T. (2021). Empirical asset pricing via machine learning: evidence from the European stock market. Journal of Asset Management, 22(7), 507–538. [CrossRef]
- Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7(2), 179–188.
- Gauthier, C., Lehar, A., & Souissi, M. (2012). Macroprudential capital requirements and systemic risk. Journal of Financial Intermediation, 21(4), 594–618. [CrossRef]
- Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow. “ O’Reilly Media, Inc.”.
- Globaldata (2023). Taiwan Cards and Payments Market Report Overview. Globaldata report store. GDFS0736CI-ST.
- Gu, S., Kelly, B., & Xiu, D. (2020). Empirical Asset Pricing via Machine Learning. The Review of Financial Studies, 33(5), 2223–2273. [CrossRef]
- Gu, Z., Eils, R., & Schlesner, M. (2016). Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics, 32(18), 2847–2849. [CrossRef]
- Hamdi, M., Mestiri, S., & Arbi, A. (2024). Artificial Intelligence Techniques for Bankruptcy Prediction of Tunisian Companies: An Application of Machine Learning and Deep Learning-Based Models. Journal of Risk and Financial Management, 17(4), 132. [CrossRef]
- Hearst, M. A., Dumais, S. T., Osuna, E., Platt, J., & Scholkopf, B. (1998). Support vector machines. IEEE Intelligent Systems and Their Applications, 13(4), 18–28.
- Henseler, J., Ringle, C. M., & Sinkovics, R. R. (2009). The use of partial least squares path modeling in international marketing. In New challenges to international marketing (Vol. 20, pp. 277–319). Emerald Group Publishing Limited.
- Hochreiter, S., Younger, A. S., & Conwell, P. R. (2001). Learning to learn using gradient descent. Artificial Neural Networks—ICANN 2001: International Conference Vienna, Austria, August 21–25, 2001 Proceedings 11, 87–94.
- Hubbard, D. W. (2020). The failure of risk management: Why it’s broken and how to fix it. John Wiley & Sons.
- Jayadev, M., Shah, N., & Vadlamani, R. (2019). Predicting educational loan defaults: Application of machine learning and deep learning models. IIM Bangalore Research Paper, 601.
- Krenker, A., Bešter, J., & Kos, A. (2011). Introduction to the artificial neural networks. Artificial Neural Networks: Methodological Advances and Biomedical Applications. InTech, 1–18.
- Lahboub, K., & Benali, M. (2024). Assessing the Predictive Power of Transformers, ARIMA, and LSTM in Forecasting Stock Prices of Moroccan Credit Companies. Journal of Risk and Financial Management, 17(7), 293. [CrossRef]
- Leong, L.-Y., Hew, T.-S., Tan, G. W.-H., & Ooi, K.-B. (2013). Predicting the determinants of the NFC-enabled mobile credit card acceptance: A neural networks approach. Expert Systems with Applications, 40(14), 5604–5620.
- Liang, Y., Jin, X., & Wang, Z. (2019). Loanliness: Predicting loan repayment ability by using machine learning methods.
- Maehara, R., Benites, L., Talavera, A., Aybar-Flores, A., & Muñoz, M. (2024). Predicting Financial Inclusion in Peru: Application of Machine Learning Algorithms. Journal of Risk and Financial Management, 17(1), 34. [CrossRef]
- McCulloch, W. S., & Pitts, W. (1943). A logical calculus of the ideas immanent in nervous activity. The Bulletin of Mathematical Biophysics, 5, 115–133.
- Musolf, A. M., Holzinger, E. R., Malley, J. D., & Bailey-Wilson, J. E. (2022). What makes a good prediction? Feature importance and beginning to open the black box of machine learning in genetics. Human Genetics, 141(9), 1515–1528.
- Netzer, O., Lemaire, A., & Herzenstein, M. (2019). When words sweat: Identifying signals for loan default in the text of loan applications. Journal of Marketing Research, 56(6), 960–980.
- Nti, I. K., Nyarko-Boateng, O., & Aning, J. (2021). Performance of Machine Learning Algorithms with Different K Values in K-fold CrossValidation. International Journal of Information Technology and Computer Science, 13(6), 61–71. [CrossRef]
- Ofori, K. S., Fianu, E., Omoregie, O. K., Odai, N. A., & Oduro-Gyimah, F. (2014). Predicting credit default among micro borrowers in Ghana. Research Journal of Finance and Accounting ISSN, 1697–2222.
- Puli, S., Thota, N., & Subrahmanyam, A. C. V. (2024). Assessing Machine Learning Techniques for Predicting Banking Crises in India. Journal of Risk and Financial Management, 17(4), 141. [CrossRef]
- Saraiva, A. (2024, August 8). US Consumer Borrowing Rises Less Than Forecast on Credit Cards. Bloomberg. https://www.bloomberg.com/news/articles/2024-08-07/us-consumer-credit-misses-forecast-on-lower-card-balances.
- Schreiner, M. (2000). Credit Scoring for Microfinance: Can It Work. Journal of Microfinance / ESR Review, 2(2).
- Sperandei, S. (2014). Understanding logistic regression analysis. Biochemia Medica, 12–18. [CrossRef]
- Suhadolnik, N., Ueyama, J., & Da Silva, S. (2023). Machine Learning for Enhanced Credit Risk Assessment: An Empirical Approach. Journal of Risk and Financial Management, 16(12), 496. [CrossRef]
- Thomas, L. C., Matuszyk, A., So, M. C., Mues, C., & Moore, A. (2016). Modelling repayment patterns in the collections process for unsecured consumer debt: A case study. European Journal of Operational Research, 249(2), 476–486. [CrossRef]
- Vapnik, V. (1998). The support vector method of function estimation. In Nonlinear modeling: Advanced black-box techniques (pp. 55–85). Springer.
- Varian, H. R. (2014). Big Data: New Tricks for Econometrics. Journal of Economic Perspectives, 28(2), 3–28. [CrossRef]
- Viswanathan, P. K., & Shanthi, S. K. (2017). Modelling Credit Default in Microfinance—An Indian Case Study. Journal of Emerging Market Finance, 16(3), 246–258. [CrossRef]
- Wang, Y., Tong, L., & Zhao, Y. (2024). Revolutionizing Hedge Fund Risk Management: The Power of Deep Learning and LSTM in Hedging Illiquid Assets. Journal of Risk and Financial Management, 17(6), 224. [CrossRef]
- Yeh, I-Cheng. (2016). Default of credit card clients. UCI Machine Learning Repository 10: C55S3H.
- Zheng, A., & Casari, A. (2018). Feature engineering for machine learning: principles and techniques for data scientists. “ O’Reilly Media, Inc.”.







| Feature ID | Feature Code | Description |
|---|---|---|
| X1 | Limit_bal | The amount of credit that the card holder is entitled to avail. It includes individual and family credit. |
| X2 | Sex | (Gender) 1=male, 2=female |
| X3 | Education | 1= graduate, 2 = university, 3 = high school, 4=others. |
| X4 | Marital status | 1=Married, 2=Single, 3=Others |
| X5 | Age | 21 years to 79 years |
| X6 to X11 | History of past payment month wise | Repayment status codes-1 = paid duly 1 = payment delay for one month 2 = payment delay for two months … 9 = payment delay for 9 months and above |
| X12 to X17 | Amount of bill statement | X12 = amount of bill statement for September 2005 X13 = amount of bill statement for August 2005 … X17 = amount of bill statement for April 2005 |
| X18 to X23 | Amount of previous payment | X18 = amount paid in September 2005 X19 = amount paid in August 2005 … X23 = amount paid in April 2005 |
| Actual | Prediction | |
| 0 (Negative) | 1 (Positive) | |
| 0 (Negative) | True Negative (TN) | False Postive (FP) |
| 1 (Positive) | False Negative (FN) | True Positive (TP) |
| Metric | Formula |
|---|---|
| Accuracy | |
| Error rate = 1 - Accuracy | |
| Sensitivity (or Recall, Accuracy of positive examples) | |
| Specificity (Accuracy of Negative examples) | |
| Prescision | |
| F1-Score | 2* |
| G-mean |
| LDA | LR | SVM | ||||||
|---|---|---|---|---|---|---|---|---|
| Actual | Prediction | Actual | Prediction | Actual | Prediction | |||
| 0 | 1 | 0 | 1 | 0 | 1 | |||
| 0 | 4529 | 158 | 0 | 4549 | 138 | 0 | 4560 | 127 |
| 1 | 988 | 325 | 1 | 1002 | 311 | 1 | 1010 | 303 |
| XGBoost | RF | DNN | ||||||
| Actual | Prediction | Actual | Prediction | Actual | Prediction | |||
| 0 | 1 | 0 | 1 | 0 | 1 | |||
| 0 | 4406 | 281 | 0 | 4417 | 270 | 0 | 4400 | 287 |
| 1 | 819 | 494 | 1 | 832 | 481 | 1 | 805 | 508 |
| Mertric | LDA | LR | SVM | XGBoost | RF | DNN |
|---|---|---|---|---|---|---|
| Accuracy | 0.8090 | 0.8100 | 0.8105 | 0.8167 | 0.8163 | 0.8180 |
| Sensitivity or Recall | 0.2475 | 0.2369 | 0.2308 | 0.3762 | 0.3663 | 0.3869 |
| Specifivity | 0.9663 | 0.9706 | 0.9729 | 0.9400 | 0.9424 | 0.9388 |
| Precision | 0.6729 | 0.6927 | 0.7047 | 0.6374 | 0.6405 | 0.6390 |
| F1 Score | 0.3619 | 0.3530 | 0.3477 | 0.4732 | 0.4661 | 0.4820 |
| G—mean | 0.4891 | 0.4794 | 0.4738 | 0.5947 | 0.5876 | 0.6027 |
| AUC | 0.72 | 0.73 | 0.71 | 0.77 | 0.76 | 0.77 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).