Submitted:
04 October 2024
Posted:
08 October 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Dataset Description
- Annual Income (AI): This typically refers to the total amount of money an individual earns in a year before taxes and other deductions. This figure is crucial in evaluating a person’s creditworthiness because it provides an indication of their ability to repay borrowed funds.
-
Debt-to-Income Ratio (DTI): This ratio as indicated by [47,48] is a measure used by lenders to evaluate a borrower’s ability to manage monthly payments and repay debts. It is the percentage of a person’s Gross Monthly Income (GMI) that goes towards paying their Total Monthly Debt Payments (TMDP). DTI may be computed by the following formula:Taking into consideration the fact that , it can be easily derived that DTI and AI are connecting according to:Furthermore, the TMDP (Total Monthly Debt Payments) can be decomposed into the sum of all minimum monthly payments on revolving balances (P) such as credit cards and lines of credit and other debts (Q) including loans and mortgages [49]. Formally, this can be expressed as:where:
- –
- represents the minimum monthly payment on the i-th revolving credit account.
- –
- represents the monthly payment on the j-th non-revolving debt account.
The total number of revolving credit accounts is , encompassing all types of credit that allow the borrower to access a maximum credit limit on a recurring basis as long as the account remains in good standing. The total number of non-revolving debt accounts is , which includes all types of credit with a fixed payment schedule and a predetermined number of payments. -
Revolving Balance (RB): Refers to the amount of credit that remains unpaid at the conclusion of a billing cycle [50]. It can be calculated as the sum of the outstanding balances on all revolving credit accounts as:where identifies the outstanding balance on the i-th revolving credit account. A connection between RB and TMDP may be established by considering the minimum monthly payments on revolving credit accounts. Assuming that is typically a fraction of the revolving balance which is determined by the minimum payment rate r (a common rate might be around 1- of the revolving balance), we could write that:Therefore, the total amount of payments on revolving balances could be expressed as:which finally yields that
- Revolving Utilization (RU): It is also known as credit utilization ratio, is a key metric in credit scoring that measures the percentage of a borrower’s available revolving credit that is currently being used [51]. It indicates how much of the available credit limits are being utilized by the borrower. Lenders and credit scoring models use this ratio to assess credit risk, with a lower utilization rate generally being favorable as it suggests responsible credit usage. The revolving utilization ratio may be calculated by the following equation:where TRCL is the acronym for Total Revolving Credit Limits referring to the sum of all credit limits on the available revolving credit accounts. TRCL can, in turn, be computed as:where is the credit limit on the i-th revolving credit account. In other words, provides the upper bound for the outstanding balance on the i-th revolving credit account such that .
- Inquiries Last 6 Months (ILSM): This represents the count of credit inquiries made by lenders into an individual’s credit report over the past six months. These inquiries occur when a consumer applies for new credit, such as credit cards, mortgages, or auto loans. Each time a lender requests a copy of a credit report to evaluate an application, it registers as an inquiry. According to [52], credit inquiries are an important factor in credit scoring models because they can indicate a consumer’s credit-seeking behavior. Multiple inquiries in a short period might suggest that a consumer is experiencing financial stress or taking on more debt than they can manage, which can be a red flag for lenders. However, the impact of inquiries on credit scores is generally small compared to other factors such as payment history and debt levels.
-
Delinquencies in the Last 2 Years (DLTY): This is the total number of instances where a borrower has failed to make timely payments on their credit obligations within the past two years. A delinquency typically occurs when a payment is overdue by a specified period (e.g., 30, 60, or 90 days past due). This metric is crucial in assessing a borrower’s creditworthiness and financial reliability, as frequent delinquencies can indicate financial distress or poor financial management. Delinquencies are a critical factor in credit risk assessment [53] for several reasons:
- Months Since Last Delinquency (MSLD): This measurement corresponds to the number of months that have elapsed since a borrower last missed a payment on any credit account. This metric is important in credit risk assessment as it provides insight into the recency of a borrower’s financial difficulties [53]. The longer the period since the last delinquency, the better it reflects on the borrower’s current financial stability and reliability.
-
Public Records (PR): This is the total count of derogatory public records that appear on a borrower’s credit report. These records are legal documents that are accessible to the public and typically include serious credit events such as bankruptcies, tax liens, and civil judgments. Each of these records can significantly impact a borrower’s credit score and creditworthiness due to the severity of the financial issues they indicate [58,59]. Three main categories of public record filings may be discerned including:
- Bankruptcies: Legal proceedings involving a person or business that is unable to repay outstanding debts. Bankruptcies can remain on a credit report for up to 10 years.
- Tax Liens: Claims made by the government when taxes are not paid on time. Tax liens can severely affect credit scores and remain on credit reports for several years, even after being paid.
- Civil Judgments: Court rulings against a person in a lawsuit, usually involving the repayment of debt. Civil judgments can remain on a credit report for up to seven years.
Public record filings constitute extremely important determinants in credit risk assessment [60] since they can by conceived as indicators of severe financial distress. They reflect significant issues in managing finances, which are critical for estimating the credit risk of a borrower. Verily, the presence of public records on a credit report can decidedly reduce the credit score of a given individual. Credit scoring models like FICO and Vantage Score heavily penalize public records due to their serious nature. Moreover, the count and type of public records are utilized by lender in order to assess the risk associated with extending new credit. An increased number of derogatory public records may result in higher interest rates, lower credit limits, or denial of credit applications. - Public Record Bankruptcies (PRB): The number of bankruptcy filings appearing in the credit report of a applicant.
-
Total Current Balance to High Credit Ratio (BHCR): This ratio compares the Total Current Balance (TCB) on all installment accounts to the Highest Credit Limit (HCL) granted on these accounts. It is a metric used to assess how much of the available credit a borrower is currently using relative to their highest credit limit, providing insight into their credit utilization and financial behavior [61]. It is easy to deduce that BHCR can be calculate as:This measure can provide useful insight concerning the percentage of the highest available credit a borrower is currently utilizing. Apparently, higher credit utilization rates are associated with with higher credit risk. BHCR may be thought of as an additional indicator of the financial behavior of an individual where an increased credit utilization ratio may suggest an over-reliance on credit. Once again, higher BHCR values can lead to higher interest rates, lower credit limits, or even denial of credit. Unlike RU, BHCR pertains to installment accounts such as mortgages and auto loans where there exists a fixed payment schedule and a predetermined loan amount. Furthermore, BHCR affects the long-term assessment of debt management, while RU focuses on assessing the the short-term debt management reflecting the borrower’s dependence on credit.
-
Balance to Credit Limit on All Trades (BCLA): This metric can be defined as:stands for Total Current Balances on All Trades representing the sum of all outstanding balances on the borrower’s credit accounts, including both revolving and installment accounts. is the acronym used for Total Credit Limits on All Trades corresponding to the sum of all credit limits on the borrower’s credit accounts. and can be expressed based on the previously defined quantities as:In this context, BCLA may be re-expressed as:The aforementioned ratio provides insight into how much of the available credit a borrower is using across all credit accounts, not just revolving credit. BCLA is an important indicator of credit utilization and financial behavior, and is used in credit risk assessment to evaluate a borrower’s ability to manage debt. Taking into consideration Eqs. 8 and 11, it is straightforward to understand that an alternative formula for BCLA can be obtained as:Eq.16 suggests that BCLA is actually a weighted average of the quantities RU and BHCR where the weighting coefficients are given by TRCL and HCL respectively.
-
Total Revolving High Credit/Credit Limit (TRHC): This measure quantifies the highest amount of credit ever utilized on revolving credit accounts relative to the total credit limits available on those accounts. It provides insight into the maximum credit exposure a borrower has reached in their revolving accounts, offering a perspective on their peak credit utilization [62]. TRHC is defined according to the equation below:TRHC corresponds to the Total High Credit on revolving accounts, which is the highest amount of credit ever utilized on revolving credit accounts. TRCL is the Total Revolving Credit Limits as mentioned previously in this section. TRHC provides a different perspective on a borrower’s credit utilization and risk profile by reflecting the highest debt levels of an individual relative to available credit. This measurement can help lenders to evaluate a borrower’s efficiency in managing credit limits and how frequently higher levels of credit utilization are approached or exceeded.
3. Symbolic Regression
4. Layered Regression Models
5. Experimental Results
5.1. GP Regression
5.2. Gaussian Support Vector Machines—GSVM Regression
5.3. Multilayer Perceptrons—MLP Regression
5.4. Radial Basis Function Networks—RBFN Regression
5.5. Regression Trees
5.6. Challenges in Regression Accuracy Across Higher-Index Layers
6. Interpretable FICO Score Models
6.1. Insights from Comparative Statics
6.1.1. Dynamic Sensitivity Analysis of Variables at Layer 1
6.1.2. Dynamic Sensitivity Analysis of Variables at Layer 1
7. Conclusions & Future Work
Appendix A
| Training | Testing | ||||
| Measure | t-Statistic | p-Value | t-Statistic | p-Value | |
| RMSE () | 16.5310 | 1.2729e-09 | 16.4654 | 1.3326e-09 | |
| MAE () | 9.5911 | 5.6120e-07 | 9.4685 | 6.4402e-07 | |
| () | -7.3849 | 8.4449e-06 | -7.4042 | 8.2279e-06 | |
| RMSE () | 14.3256 | 6.5691e-09 | 14.2521 | 6.9654e-09 | |
| MAE () | 6.4930 | 2.9669e-05 | 5.9779 | 6.4317e-05 | |
| () | -5.4275 | 1.5313e-04 | -5.1304 | 2.4891e-04 | |
| Training | Testing | ||||
| Measure | t-Statistic | p-Value | t-Statistic | p-Value | |
| RMSE () | 16.5259 | 1.2775e-09 | 16.4617 | 1.3361e-09 | |
| MAE () | 9.4390 | 6.6583e-07 | 9.3757 | 7.1538e-07 | |
| () | -7.4702 | 7.5285e-06 | -7.4262 | 7.9874e-06 | |
| RMSE () | 14.3218 | 6.5892e-09 | 14.2591 | 6.9265e-09 | |
| MAE () | 6.1405 | 5.0178e-05 | 5.8225 | 8.1810e-05 | |
| () | -5.5358 | 1.2866e-04 | -5.2638 | 1.9982e-04 | |
| Training | Testing | ||||
| Measure | t-Statistic | p-Value | t-Statistic | p-Value | |
| RMSE () | 16.5286 | 1.2750e-09 | 16.4527 | 1.3446e-09 | |
| MAE () | 7.2390 | 1.0301e-05 | 7.2260 | 1.0485e-05 | |
| () | -5.9443 | 6.7732e-05 | -5.9822 | 6.3887e-05 | |
| RMSE () | 14.3524 | 6.4306e-09 | 14.3422 | 6.4832e-09 | |
| MAE () | 5.8596 | 7.7214e-05 | 5.9594 | 6.6174e-05 | |
| () | -4.9460 | 3.3864e-04 | -4.8186 | 4.2003e-04 | |
| Training | Testing | ||||
| Measure | t-Statistic | p-Value | t-Statistic | p-Value | |
| RMSE () | 16.5270 | 1.2765e-09 | 16.4670 | 1.3312e-09 | |
| MAE () | 8.1019 | 3.3021e-06 | 7.7605 | 5.1253e-06 | |
| () | -6.0252 | 5.9811e-05 | -5.7756 | 8.8031e-05 | |
| RMSE () | 14.3516 | 6.4351e-09 | 14.3213 | 6.5917e-09 | |
| MAE () | 5.8078 | 8.3710e-05 | 5.3632 | 1.6992e-04 | |
| () | -5.4000 | 1.6009e-04 | -5.0101 | 3.0413e-04 | |
References
- Van Gestel, T.; Baesens, B. Credit Risk Management: Basic concepts: Financial risk components, Rating analysis, models, economic and regulatory capital; OUP Oxford, 2008.
- He, H.; Zhang, W.; Zhang, S. A novel ensemble method for credit scoring: Adaption of different imbalance ratios. Expert Systems with Applications 2018, 98, 105–117. [Google Scholar] [CrossRef]
- Hanić, A.; Žunić, E.; Dželihodžić, A. Scoring Models of Bank Credit Policy Management. Economic analysis 2013, 46, 12–27. [Google Scholar]
- Agarwal, S.; Rosen, R.J.; Yao, V. Why do borrowers make mortgage refinancing mistakes? Management Science 2016, 62, 3494–3509. [Google Scholar] [CrossRef]
- https://www.fico.com. Accessed: 2024-05-01.
- https://vantagescore.com. Accessed: 2024-05-01.
- https://www.myfico.com/credit-education/whats-in-your-credit-score. Accessed: 2024-05-01.
- http://www.vantagescore.com/machinelearningWP. Accessed: 2024-05-01.
- https://www.myfico.com/credit-education/credit-scores/whats-not-in-your-credit-score. Accessed: 2024-05-01.
- Albanesi, S.; Vamossy, D.F. Predicting consumer default: A deep learning approach. Technical report, National Bureau of Economic Research, 2019.
- https://www.lendingclub.com. Accessed: 2021-05-01.
- Zhao, H.; Ge, Y.; Liu, Q.; Wang, G.; Chen, E.; Zhang, H. P2P lending survey: platforms, recent advances and prospects. ACM Transactions on Intelligent Systems and Technology (TIST) 2017, 8, 1–28. [Google Scholar] [CrossRef]
- Chi, G.; Ding, S.; Peng, X. Data-driven robust credit portfolio optimization for investment decisions in P2P lending. Mathematical Problems in Engineering 2019, 2019. [Google Scholar] [CrossRef]
- https://www.prosper.com. Accessed: 2021-05-01.
- Munkhdalai, L.; Munkhdalai, T.; Namsrai, O.E.; Lee, J.Y.; Ryu, K.H. An empirical comparison of machine-learning methods on bank client credit assessments. Sustainability 2019, 11, 699. [Google Scholar] [CrossRef]
- Leong, C.K. Credit risk scoring with bayesian network models. Computational Economics 2016, 47, 423–446. [Google Scholar]
- Amaro, M.M. Credit scoring: comparison of non-parametric techniques against logistic regression. PhD thesis, 2020.
- Feng, X.; Xiao, Z.; Zhong, B.; Qiu, J.; Dong, Y. Dynamic ensemble classification for credit scoring using soft probability. Applied Soft Computing 2018, 65, 139–151. [Google Scholar]
- Dumitrescu, E.I.; Hué, S.; Hurlin, C. ; others. Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds 2021.
- Dumitrescu, E.; Hué, S.; Hurlin, C.; Tokpavi, S. Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects. European Journal of Operational Research 2022, 297, 1178–1192. [Google Scholar] [CrossRef]
- Wang, H.; Kou, G.; Peng, Y. Multi-class misclassification cost matrix for credit ratings in peer-to-peer lending. Journal of the Operational Research Society 2020, pp.1–12.
- Dzik-Walczak, A.; Heba, M. An implementation of ensemble methods, logistic regression, and neural network for default prediction in Peer-to-Peer lending. Zbornik radova Ekonomskog fakulteta u Rijeci: časopis za ekonomsku teoriju i praksu 2021, 39, 163–197. [Google Scholar] [CrossRef]
- Fernandez, C.; Provost, F.; Han, X. Counterfactual explanations for data-driven decisions 2019.
- Moscato, V.; Picariello, A.; Sperlí, G. A benchmark of machine learning approaches for credit score prediction. Expert Systems with Applications 2021, 165, 113986. [Google Scholar] [CrossRef]
- Namvar, A.; Naderpour, M. Handling uncertainty in social lending credit risk prediction with a Choquet fuzzy integral model. 2018 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). IEEE, 2018, pp. 1–8.
- Serrano-Cinca, C.; Gutiérrez-Nieto, B. The use of profit scoring as an alternative to credit scoring systems in peer-to-peer (P2P) lending. Decision Support Systems 2016, 89, 113–122. [Google Scholar]
- Ye, X.; Dong, L.a.; Ma, D. Loan evaluation in P2P lending based on random forest optimized by genetic algorithm with profit score. Electronic Commerce Research and Applications 2018, 32, 23–36. [Google Scholar] [CrossRef]
- Tuoremaa, H. A multi-gene symbolic regression approach for predicting LGD: A benchmark comparative study, 2023.
- Horn, D.M. Credit scoring using genetic programming. PhD thesis, 2017.
- Ong, C.S.; Huang, J.J.; Tzeng, G.H. Building credit scoring models using genetic programming. Expert systems with applications 2005, 29, 41–47. [Google Scholar] [CrossRef]
- Huang, J.J.; Tzeng, G.H.; Ong, C.S. Two-stage genetic programming (2SGP) for the credit scoring model. Applied Mathematics and Computation 2006, 174, 1039–1053. [Google Scholar] [CrossRef]
- Pławiak, P.; Abdar, M.; Acharya, U.R. Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring. Applied Soft Computing 2019, 84, 105740. [Google Scholar] [CrossRef]
- Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature machine intelligence 2019, 1, 206–215. [Google Scholar] [CrossRef]
- Barocas, S.; Hardt, M.; Narayanan, A. Fairness and Machine Learning. fairmlbook. org, 2019.
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 2016, pp. 785–794.
- Colombani, J. The Fair Credit Reporting Act. Suffolk UL Rev. 1979, 13, 63. [Google Scholar]
- Doshi-Velez, F.; Kim, B. Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608, arXiv:1702.08608 2017.
- Boyd, W.E. Federal Consumer Credit Protection Act–A Consumer Perspective. Notre Dame Law. 1969, 45, 171. [Google Scholar]
- Lipton, Z.C. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 2018, 16, 31–57. [Google Scholar] [CrossRef]
- Ricardo, B.Y.; Berthier, R.N. Modern information retrieval: the concepts and technology behind search. New Jersey, USA: Addi-son-Wesley Professional 2011.
- Schmidt, M.; Lipson, H. Distilling free-form natural laws from experimental data. science 2009, 324, 81–85. [Google Scholar] [CrossRef] [PubMed]
- Christoph, M. Interpretable machine learning: A guide for making black box models explainable; Leanpub, 2020.
- https://www.kaggle.com/wordsforthewise/lending-club. Accessed: 2021-05-01.
- Emekter, R.; Tu, Y.; Jirasakuldech, B.; Lu, M. Evaluating credit risk and loan performance in online Peer-to-Peer (P2P) lending. Applied Economics 2015, 47, 54–70. [Google Scholar] [CrossRef]
- Serrano-Cinca, C.; Gutiérrez-Nieto, B.; López-Palacios, L. Determinants of default in P2P lending. PloS one 2015, 10, e0139427. [Google Scholar] [CrossRef]
- Polena, M.; Regner, T. Determinants of borrowers’ default in P2P lending under consideration of the loan risk class. Games 2018, 9, 82. [Google Scholar] [CrossRef]
- Szwabe, A.; Misiorek, P. Decision trees as interpretable bank credit scoring models. Beyond Databases, Architectures and Structures. Facing the Challenges of Data Proliferation and Growing Variety: 14th International Conference, BDAS 2018, Held at the 24th IFIP World Computer Congress, WCC 2018, Poznan, Poland, September 18-20, 2018, Proceedings 14. Springer, 2018, pp. 207–219. [CrossRef]
- https://www.investopedia.com/terms/d/dti.asp. Accessed: 2024-07-01.
- https://www.rocketmortgage.com/learn/debt-to-income-ratio. Accessed: 2024-07-01.
- Thomas, L.; Crook, J.; Edelman, D. Credit scoring and its applications; SIAM, 2017.
- https://www.investopedia.com/terms/c/credit-utilization-rate.asp. Accessed: 2024-07-01.
- https://www.myfico.com/credit-education/credit-reports/credit-checks-and-inquiries. Accessed: 2024-07-01.
- Kim, H.; Cho, H.; Ryu, D. An empirical study on credit card loan delinquency. Economic Systems 2018, 42, 437–449. [Google Scholar] [CrossRef]
- Guan, C.; Suryanto, H.; Mahidadia, A.; Bain, M.; Compton, P. Responsible credit risk assessment with machine learning and knowledge acquisition. Human-Centric Intelligent Systems 2023, 3, 232–243. [Google Scholar] [CrossRef]
- Bhattacharya, A.; Biswas, S.K.; Mandal, A. Credit risk evaluation: a comprehensive study. Multimedia Tools and Applications 2023, 82, 18217–18267. [Google Scholar] [CrossRef]
- Abdou, H.A.; Pointon, J. Credit scoring, statistical techniques and evaluation criteria: a review of the literature. Intelligent systems in accounting, finance and management 2011, 18, 59–88. [Google Scholar] [CrossRef]
- Kamimura, E.S.; Pinto, A.R.F.; Nagano, M.S. A recent review on optimisation methods applied to credit scoring models. Journal of Economics, Finance and Administrative Science 2023. [CrossRef]
- https://www.experian.com/blogs/ask-experian/public-records-that-appear-on-your-report/. Accessed: 2024-07-01.
- https://fastercapital.com/content/The-Influence-of-Public-Records-on-Credit-Scoring-Analysis.html#Introduction-to-Public-Records-and-Credit-Scoring-Analysis. Accessed: 2024-07-01.
- Nagypal, E.; Fulford, S. The Equilibrium Effect of Information in Consumer Credit Markets: Public Records and Credit. SSRN Electronic Journal 2023. [Google Scholar] [CrossRef]
- https://www.investopedia.com/terms/b/balancetolimit-ratio.asp. Accessed: 2024-05-01.
- https://www.experian.com/blogs/ask-experian/credit-education/score-basics/credit-utilization-rate/. Accessed: 2024-05-01.
- Searson, D. GPTIPS genetic programming & symbolic regression for MATLAB user guide 2009.
- Searson, D.P.; Leahy, D.E.; Willis, M.J. GPTIPS: an open source genetic programming toolbox for multigene symbolic regression. Proceedings of the International multiconference of engineers and computer scientists. Citeseer, 2010, Vol. 1, pp. 77–80.
- Brevoort, K.P.; Grimm, P.; Kambara, M. Credit invisibles and the unscored. Cityscape 2016, 18, 9–34. [Google Scholar] [CrossRef]
- Avery, R.B.; Calem, P.S.; Canner, G.B. Credit report accuracy and access to credit. Fed. Res. Bull. 2004, 90, 297. [Google Scholar] [CrossRef]
- Sengupta, R.; Bhardwaj, G. Credit scoring and loan default. International Review of Finance 2015, 15, 139–167. [Google Scholar] [CrossRef]
- Keys, B.J.; Mukherjee, T.; Seru, A.; Vig, V. Did securitization lead to lax screening? Evidence from subprime loans. The Quarterly journal of economics 2010, 125, 307–362. [Google Scholar] [CrossRef]
- Giesecke, K.; Longstaff, F.A.; Schaefer, S.; Strebulaev, I. Corporate bond default risk: A 150-year perspective. Journal of financial Economics 2011, 102, 233–250. [Google Scholar] [CrossRef]
| 1 | Actually, since all credit-related features are normalized in the interval. |
| 2 | Apparently, the M-th FICO bin will be given as:
|
| 3 |
Figure 5 verifies that this is not exactly the case for the considered classes of FICO, at least, as far as the utilized dataset is concerned. |
| 4 | Initial experiments have shown that the regression accuracy of a model trained on the complete dataset is significantly low. |
| 5 |
represents the set of m-dimensional credit-related feature vectors associated with each candidate borrower in the dataset. |
| 6 | The first and last anchor points correspond to the minimum and maximum distances, defined as and , respectively. |
| 7 | The L-th range of distance values is specifically defined as:
|
| 8 | Additional features to experiment: Car, Credit Card, Debt Consolidation, Home Improvement, House Purchase, Major Purchase, Medical, Moving, Renewable Energy, Small Business, Vacation, Wedding |
| 9 | This threshold value corresponds to the minimum achieved by the top of the evolved population of models for the first five layers across all folds. |
| 10 | |
| 11 | In this setting, we assume that the primary variable driving changes in the FICO score is , such that . |
| 12 | In this setting, we assume that the primary variable driving changes in the FICO score is , such that . |
| 13 | Since is normalized in the interval, we have that , which, in turn, suggests that . |







| Min | Max | Normalized Mean | Normalized STD | |
|---|---|---|---|---|
| Annual Income (AI) | 10008 | 699587 | ||
| Debt-to-Income Ratio (DTI) | 0 | |||
| Revolving Balance (RB) | 0 | 1696796 | ||
| Revolving Utilization (RU) | 0 | 100 | ||
| Inquiries Last 6 Months (ILSM) | 0 | 5 | ||
| Delinquencies in the last 2 years (DLTY) | 0 | 29 | ||
| Months since the last Delinquency (MSLD) | 0 | 226 | ||
| Public Records (PR) | 0 | 61 | ||
| Total Current Balance-to-High Credit Ratio (BHCR) | 0 | 558 | ||
| Balance-to-Credit Limit on all trades (BCLA) | 0 | 204 | ||
| Total Revolving High Credit/Credit Limit (TRHC) | 100 | 1652700 | ||
| Public Record Bankruptcies (PRB) | 0 | 9 |
| FICO Class | Layer 1 | Layer 2 | Layer 3 | Layer 4 | Layer 5 | Layer 6 | Layer 7 | Layer 8 | Layer 9 | Layer 10 | Layer 11 | Layer 12 | Layer 13 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 20.6428 | 20.6291 | 20.6291 | 20.6291 | 20.6291 | 20.6291 | 20.6291 | 20.6291 | 20.6291 | 20.6291 | 20.6291 | 20.6291 | 20.5769 |
| 2 | 18.2905 | 18.2789 | 18.2789 | 18.2789 | 18.2789 | 18.2789 | 18.2789 | 18.2789 | 18.2789 | 18.2789 | 18.2789 | 18.2789 | 18.2080 |
| 3 | 15.6701 | 15.6607 | 15.6607 | 15.6607 | 15.6607 | 15.6607 | 15.6607 | 15.6607 | 15.6607 | 15.6607 | 15.6607 | 15.6607 | 15.5857 |
| 4 | 12.8605 | 12.8536 | 12.8536 | 12.8536 | 12.8536 | 12.8536 | 12.8536 | 12.8536 | 12.8536 | 12.8536 | 12.8536 | 12.8536 | 12.8191 |
| 5 | 9.9807 | 9.9763 | 9.9763 | 9.9763 | 9.9763 | 9.9763 | 9.9763 | 9.9763 | 9.9763 | 9.9763 | 9.9763 | 9.9763 | 9.9257 |
| 6 | 7.3778 | 7.3757 | 7.3757 | 7.3757 | 7.3757 | 7.3757 | 7.3757 | 7.3757 | 7.3757 | 7.3757 | 7.3757 | 7.3757 | 7.3689 |
| 7 | 2.6776 | 2.6797 | 2.6797 | 2.6797 | 2.6797 | 2.6797 | 2.6797 | 2.6797 | 2.6797 | 2.6797 | 2.6797 | 2.6797 | 2.7010 |
| 8 | 3.7636 | 3.7647 | 3.7647 | 3.7647 | 3.7647 | 3.7647 | 3.7647 | 3.7647 | 3.7647 | 3.7647 | 3.7647 | 3.7647 | 3.7587 |
| 9 | 2.4886 | 2.4908 | 2.4908 | 2.4908 | 2.4908 | 2.4908 | 2.4908 | 2.4908 | 2.4908 | 2.4908 | 2.4908 | 2.4908 | 2.5262 |
| 10 | 1.7367 | 1.7396 | 1.7396 | 1.7396 | 1.7396 | 1.7396 | 1.7396 | 1.7396 | 1.7396 | 1.7396 | 1.7396 | 1.7396 | 1.7832 |
| 11 | 1.3366 | 1.3398 | 1.3398 | 1.3398 | 1.3398 | 1.3398 | 1.3398 | 1.3398 | 1.3398 | 1.3398 | 1.3398 | 1.3398 | 1.3505 |
| 12 | 0.9981 | 1.0016 | 1.0016 | 1.0016 | 1.0016 | 1.0016 | 1.0016 | 1.0016 | 1.0016 | 1.0016 | 1.0016 | 1.0016 | 1.0140 |
| 13 | 0.7299 | 0.7336 | 0.7336 | 0.7336 | 0.7336 | 0.7336 | 0.7336 | 0.7336 | 0.7336 | 0.7336 | 0.7336 | 0.7336 | 0.7343 |
| 14 | 0.2770 | 0.2811 | 0.2811 | 0.2811 | 0.2811 | 0.2811 | 0.2811 | 0.2811 | 0.2811 | 0.2811 | 0.2811 | 0.2811 | 0.3016 |
| 15 | 0.4529 | 0.4569 | 0.4569 | 0.4569 | 0.4569 | 0.4569 | 0.4569 | 0.4569 | 0.4569 | 0.4569 | 0.4569 | 0.4569 | 0.4589 |
| 16 | 0.3342 | 0.3383 | 0.3383 | 0.3383 | 0.3383 | 0.3383 | 0.3383 | 0.3383 | 0.3383 | 0.3383 | 0.3383 | 0.3383 | 0.3802 |
| 17 | 0.2023 | 0.2065 | 0.2065 | 0.2065 | 0.2065 | 0.2065 | 0.2065 | 0.2065 | 0.2065 | 0.2065 | 0.2065 | 0.2065 | 0.2054 |
| 18 | 0.1143 | 0.1186 | 0.1186 | 0.1186 | 0.1186 | 0.1186 | 0.1186 | 0.1186 | 0.1186 | 0.1186 | 0.1186 | 0.1186 | 0.1617 |
| 19 | 0.0484 | 0.0527 | 0.0527 | 0.0527 | 0.0527 | 0.0527 | 0.0527 | 0.0527 | 0.0527 | 0.0527 | 0.0527 | 0.0527 | 0.0962 |
| 20 | 0.0176 | 0.0220 | 0.0220 | 0.0220 | 0.0220 | 0.0220 | 0.0220 | 0.0220 | 0.0220 | 0.0220 | 0.0220 | 0.0220 | 0.0437 |
| Run Parameter | Value |
|---|---|
| Population Size | 100 |
| Maximum Generations | 50 |
| Input Variables | 4 |
| Training Instances | 20472 |
| Tournament Size | 10 |
| Elite Fraction | |
| Maximum Genes | 1 |
| Maximum Depth | 5 |
| Maximum Total Nodes | |
| Ephemeral Random Constants Probability | |
| Crossover Probability | |
| Mutation Probability |
| - 4 Features | - 12 Features | |||||
|---|---|---|---|---|---|---|
| Layer | RMSE | MAE | RMSE | MAE | ||
| 1 | 0.0611 | 0.050 | 0.84 | 0.0597 | 0.050 | 0.85 |
| 2 | 0.0791 | 0.064 | 0.74 | 0.0681 | 0.056 | 0.81 |
| 3 | 0.0896 | 0.071 | 0.67 | 0.0736 | 0.060 | 0.77 |
| 4 | 0.0954 | 0.075 | 0.62 | 0.0814 | 0.065 | 0.72 |
| 5 | 0.1018 | 0.080 | 0.57 | 0.0834 | 0.066 | 0.71 |
| 6 | 0.1067 | 0.083 | 0.53 | 0.0928 | 0.073 | 0.64 |
| 7 | 0.1100 | 0.085 | 0.50 | 0.1072 | 0.087 | 0.52 |
| 8 | 0.1147 | 0.088 | 0.45 | 0.1204 | 0.098 | 0.40 |
| 9 | 0.1187 | 0.091 | 0.41 | 0.1275 | 0.099 | 0.32 |
| 10 | 0.1231 | 0.094 | 0.37 | 0.1236 | 0.093 | 0.36 |
| 11 | 0.1286 | 0.098 | 0.31 | 0.1232 | 0.091 | 0.37 |
| 12 | 0.1342 | 0.102 | 0.25 | 0.1266 | 0.091 | 0.33 |
| 13 | 0.1485 | 0.112 | 0.12 | 0.1265 | 0.092 | 0.36 |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0611 | 0.050 | 0.84 | 0.0598 | 0.050 | 0.85 | |
| 2 | 0.0791 | 0.064 | 0.74 | 0.0681 | 0.056 | 0.81 | |
| 3 | 0.0895 | 0.071 | 0.67 | 0.0737 | 0.060 | 0.77 | |
| 4 | 0.0954 | 0.075 | 0.62 | 0.0814 | 0.065 | 0.73 | |
| 5 | 0.1019 | 0.080 | 0.57 | 0.0835 | 0.066 | 0.71 | |
| 6 | 0.1067 | 0.083 | 0.53 | 0.0930 | 0.073 | 0.64 | |
| 7 | 0.1100 | 0.085 | 0.50 | 0.1071 | 0.087 | 0.52 | |
| 8 | 0.1149 | 0.088 | 0.45 | 0.1202 | 0.098 | 0.40 | |
| 9 | 0.1187 | 0.091 | 0.41 | 0.1279 | 0.099 | 0.32 | |
| 10 | 0.1229 | 0.094 | 0.37 | 0.1236 | 0.093 | 0.36 | |
| 11 | 0.1286 | 0.098 | 0.31 | 0.1229 | 0.090 | 0.37 | |
| 12 | 0.1344 | 0.102 | 0.25 | 0.1271 | 0.092 | 0.33 | |
| 13 | 0.1487 | 0.112 | 0.11 | 0.1264 | 0.092 | 0.36 | |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0004 | 0.046 | 0.86 | 0.0004 | 0.041 | 0.89 | |
| 2 | 0.0004 | 0.043 | 0.84 | 0.0004 | 0.041 | 0.87 | |
| 3 | 0.0005 | 0.042 | 0.82 | 0.0004 | 0.042 | 0.86 | |
| 4 | 0.0005 | 0.040 | 0.82 | 0.0004 | 0.044 | 0.83 | |
| 5 | 0.0005 | 0.040 | 0.80 | 0.0005 | 0.046 | 0.82 | |
| 6 | 0.0005 | 0.039 | 0.79 | 0.0005 | 0.047 | 0.80 | |
| 7 | 0.0005 | 0.038 | 0.79 | 0.0005 | 0.043 | 0.80 | |
| 8 | 0.0005 | 0.039 | 0.79 | 0.0005 | 0.037 | 0.81 | |
| 9 | 0.0005 | 0.039 | 0.78 | 0.0005 | 0.032 | 0.82 | |
| 10 | 0.0005 | 0.041 | 0.77 | 0.0005 | 0.033 | 0.81 | |
| 11 | 0.0005 | 0.042 | 0.76 | 0.0005 | 0.038 | 0.81 | |
| 12 | 0.0006 | 0.049 | 0.72 | 0.0005 | 0.047 | 0.77 | |
| 13 | 0.0008 | 0.076 | 0.49 | 0.0007 | 0.065 | 0.63 | |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0012 | 0.047 | 0.86 | 0.0011 | 0.044 | 0.88 | |
| 2 | 0.0013 | 0.043 | 0.84 | 0.0012 | 0.043 | 0.86 | |
| 3 | 0.0014 | 0.042 | 0.82 | 0.0013 | 0.045 | 0.85 | |
| 4 | 0.0014 | 0.040 | 0.81 | 0.0014 | 0.046 | 0.82 | |
| 5 | 0.0015 | 0.040 | 0.80 | 0.0015 | 0.048 | 0.80 | |
| 6 | 0.0015 | 0.039 | 0.79 | 0.0015 | 0.050 | 0.79 | |
| 7 | 0.0015 | 0.039 | 0.79 | 0.0015 | 0.046 | 0.79 | |
| 8 | 0.0015 | 0.039 | 0.79 | 0.0015 | 0.040 | 0.79 | |
| 9 | 0.0016 | 0.040 | 0.77 | 0.0014 | 0.034 | 0.81 | |
| 10 | 0.0016 | 0.041 | 0.76 | 0.0015 | 0.036 | 0.79 | |
| 11 | 0.0016 | 0.043 | 0.76 | 0.0015 | 0.040 | 0.79 | |
| 12 | 0.0018 | 0.050 | 0.71 | 0.0016 | 0.050 | 0.75 | |
| 13 | 0.0024 | 0.078 | 0.48 | 0.0021 | 0.070 | 0.59 | |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0004 | 0.047 | 0.86 | 0.0004 | 0.043 | 0.88 | |
| 2 | 0.0004 | 0.045 | 0.84 | 0.0004 | 0.044 | 0.87 | |
| 3 | 0.0005 | 0.045 | 0.83 | 0.0004 | 0.045 | 0.86 | |
| 4 | 0.0005 | 0.043 | 0.82 | 0.0004 | 0.046 | 0.84 | |
| 5 | 0.0005 | 0.043 | 0.81 | 0.0005 | 0.049 | 0.83 | |
| 6 | 0.0005 | 0.044 | 0.80 | 0.0005 | 0.048 | 0.82 | |
| 7 | 0.0005 | 0.043 | 0.80 | 0.0005 | 0.045 | 0.82 | |
| 8 | 0.0005 | 0.044 | 0.80 | 0.0005 | 0.041 | 0.82 | |
| 9 | 0.0005 | 0.044 | 0.78 | 0.0004 | 0.036 | 0.83 | |
| 10 | 0.0005 | 0.045 | 0.78 | 0.0005 | 0.038 | 0.81 | |
| 11 | 0.0005 | 0.047 | 0.77 | 0.0005 | 0.042 | 0.81 | |
| 12 | 0.0005 | 0.054 | 0.72 | 0.0005 | 0.051 | 0.77 | |
| 13 | 0.0008 | 0.080 | 0.49 | 0.0007 | 0.069 | 0.65 | |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0012 | 0.047 | 0.86 | 0.0011 | 0.044 | 0.88 | |
| 2 | 0.0013 | 0.046 | 0.84 | 0.0012 | 0.046 | 0.86 | |
| 3 | 0.0014 | 0.045 | 0.82 | 0.0013 | 0.047 | 0.84 | |
| 4 | 0.0014 | 0.043 | 0.82 | 0.0014 | 0.048 | 0.83 | |
| 5 | 0.0014 | 0.044 | 0.81 | 0.0014 | 0.051 | 0.81 | |
| 6 | 0.0015 | 0.044 | 0.80 | 0.0014 | 0.050 | 0.81 | |
| 7 | 0.0015 | 0.043 | 0.80 | 0.0015 | 0.047 | 0.80 | |
| 8 | 0.0015 | 0.044 | 0.79 | 0.0015 | 0.043 | 0.80 | |
| 9 | 0.0015 | 0.045 | 0.78 | 0.0014 | 0.038 | 0.81 | |
| 10 | 0.0016 | 0.046 | 0.77 | 0.0015 | 0.040 | 0.79 | |
| 11 | 0.0016 | 0.047 | 0.77 | 0.0015 | 0.044 | 0.79 | |
| 12 | 0.0017 | 0.055 | 0.72 | 0.0016 | 0.053 | 0.75 | |
| 13 | 0.0024 | 0.081 | 0.48 | 0.0021 | 0.071 | 0.61 | |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0004 | 0.048 | 0.85 | 0.0004 | 0.046 | 0.87 | |
| 2 | 0.0005 | 0.057 | 0.78 | 0.0005 | 0.053 | 0.82 | |
| 3 | 0.0006 | 0.060 | 0.74 | 0.0005 | 0.058 | 0.78 | |
| 4 | 0.0006 | 0.059 | 0.73 | 0.0005 | 0.061 | 0.75 | |
| 5 | 0.0006 | 0.061 | 0.70 | 0.0006 | 0.065 | 0.72 | |
| 6 | 0.0006 | 0.061 | 0.69 | 0.0006 | 0.070 | 0.68 | |
| 7 | 0.0006 | 0.060 | 0.69 | 0.0007 | 0.079 | 0.60 | |
| 8 | 0.0006 | 0.060 | 0.69 | 0.0008 | 0.089 | 0.49 | |
| 9 | 0.0006 | 0.061 | 0.67 | 0.0008 | 0.087 | 0.45 | |
| 10 | 0.0006 | 0.062 | 0.66 | 0.0008 | 0.080 | 0.50 | |
| 11 | 0.0006 | 0.061 | 0.66 | 0.0008 | 0.081 | 0.48 | |
| 12 | 0.0007 | 0.065 | 0.62 | 0.0008 | 0.085 | 0.44 | |
| 13 | 0.0009 | 0.091 | 0.36 | 0.0008 | 0.085 | 0.44 | |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0012 | 0.048 | 0.85 | 0.0012 | 0.046 | 0.87 | |
| 2 | 0.0015 | 0.057 | 0.78 | 0.0014 | 0.053 | 0.82 | |
| 3 | 0.0017 | 0.060 | 0.74 | 0.0015 | 0.058 | 0.78 | |
| 4 | 0.0017 | 0.059 | 0.73 | 0.0016 | 0.061 | 0.75 | |
| 5 | 0.0018 | 0.061 | 0.70 | 0.0017 | 0.065 | 0.72 | |
| 6 | 0.0018 | 0.061 | 0.69 | 0.0018 | 0.070 | 0.68 | |
| 7 | 0.0018 | 0.061 | 0.69 | 0.0021 | 0.079 | 0.60 | |
| 8 | 0.0018 | 0.060 | 0.69 | 0.0023 | 0.089 | 0.49 | |
| 9 | 0.0019 | 0.061 | 0.67 | 0.0024 | 0.087 | 0.45 | |
| 10 | 0.0019 | 0.062 | 0.66 | 0.0023 | 0.080 | 0.50 | |
| 11 | 0.0019 | 0.061 | 0.65 | 0.0023 | 0.081 | 0.48 | |
| 12 | 0.0020 | 0.065 | 0.62 | 0.0024 | 0.085 | 0.44 | |
| 13 | 0.0026 | 0.091 | 0.36 | 0.0025 | 0.085 | 0.44 | |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0004 | 0.048 | 0.86 | 0.0004 | 0.045 | 0.87 | |
| 2 | 0.0005 | 0.056 | 0.79 | 0.0004 | 0.052 | 0.83 | |
| 3 | 0.0006 | 0.061 | 0.74 | 0.0005 | 0.055 | 0.81 | |
| 4 | 0.0006 | 0.063 | 0.71 | 0.0005 | 0.057 | 0.78 | |
| 5 | 0.0006 | 0.066 | 0.67 | 0.0005 | 0.059 | 0.76 | |
| 6 | 0.0006 | 0.066 | 0.66 | 0.0006 | 0.062 | 0.73 | |
| 7 | 0.0007 | 0.068 | 0.63 | 0.0006 | 0.063 | 0.70 | |
| 8 | 0.0007 | 0.070 | 0.62 | 0.0007 | 0.073 | 0.58 | |
| 9 | 0.0007 | 0.071 | 0.59 | 0.0007 | 0.068 | 0.61 | |
| 10 | 0.0007 | 0.072 | 0.57 | 0.0007 | 0.071 | 0.56 | |
| 11 | 0.0007 | 0.073 | 0.57 | 0.0007 | 0.073 | 0.55 | |
| 12 | 0.0007 | 0.075 | 0.53 | 0.0008 | 0.078 | 0.51 | |
| 13 | 0.0008 | 0.086 | 0.44 | 0.0008 | 0.079 | 0.53 | |
| - 4 Features | - 12 Features | ||||||
| Layer | RMSE | MAE | RMSE | MAE | |||
| 1 | 0.0013 | 0.049 | 0.85 | 0.0012 | 0.046 | 0.87 | |
| 2 | 0.0015 | 0.056 | 0.78 | 0.0014 | 0.053 | 0.82 | |
| 3 | 0.0017 | 0.062 | 0.73 | 0.0015 | 0.056 | 0.80 | |
| 4 | 0.0018 | 0.064 | 0.70 | 0.0016 | 0.058 | 0.77 | |
| 5 | 0.0019 | 0.067 | 0.66 | 0.0016 | 0.061 | 0.75 | |
| 6 | 0.0019 | 0.068 | 0.64 | 0.0017 | 0.063 | 0.72 | |
| 7 | 0.0020 | 0.070 | 0.62 | 0.0018 | 0.064 | 0.68 | |
| 8 | 0.0021 | 0.071 | 0.60 | 0.0022 | 0.075 | 0.56 | |
| 9 | 0.0021 | 0.073 | 0.57 | 0.0021 | 0.070 | 0.59 | |
| 10 | 0.0022 | 0.074 | 0.55 | 0.0022 | 0.073 | 0.54 | |
| 11 | 0.0022 | 0.074 | 0.54 | 0.0022 | 0.075 | 0.53 | |
| 12 | 0.0023 | 0.076 | 0.51 | 0.0023 | 0.080 | 0.48 | |
| 13 | 0.0026 | 0.088 | 0.40 | 0.0024 | 0.081 | 0.49 | |
| Layer | AI | DTI | RB | RU | |
|---|---|---|---|---|---|
| 1 | 0.1278 | 0.1051 | 0.3345 | 0.4326 | 0.8401 |
| 2 | 0.1234 | 0.0588 | 0.2762 | 0.5416 | 0.7309 |
| 3 | 0.1340 | 0.1127 | 0.1706 | 0.5827 | 0.6476 |
| 4 | 0.1689 | 0.0611 | 0.1917 | 0.5783 | 0.6091 |
| 5 | 0.1819 | 0.0389 | 0.2827 | 0.4964 | 0.5522 |
| Layer | AI | DTI | RB | RU | ILSM | DLTY | MSLD | PR | BHCR | BCLA | TRHC | PRB | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0.0530 | 0.0236 | 0.0170 | 0.3423 | 0.0072 | 0.0137 | 0.2480 | 0.0340 | 0.0295 | 0.1348 | 0.0262 | 0.0707 | 0.8339 |
| 2 | 0.0551 | 0.0101 | 0.0178 | 0.4339 | 0.0302 | 0.0119 | 0.2134 | 0.0356 | 0.0403 | 0.0634 | 0.0095 | 0.0788 | 0.7881 |
| 3 | 0.0255 | 0.0154 | 0.0255 | 0.2916 | 0.0303 | 0.0178 | 0.2357 | 0.0285 | 0.0932 | 0.0499 | 0.0386 | 0.1479 | 0.7439 |
| 4 | 0.0287 | 0.0143 | 0.0208 | 0.2237 | 0.0523 | 0.0186 | 0.2502 | 0.0380 | 0.0222 | 0.1333 | 0.0545 | 0.1434 | 0.6755 |
| 5 | 0.0904 | 0.0149 | 0.0194 | 0.2676 | 0.0284 | 0.0306 | 0.2220 | 0.0172 | 0.0845 | 0.0635 | 0.0411 | 0.1203 | 0.6949 |
| Feature Set | Expression |
|---|---|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).