Submitted:
31 January 2024
Posted:
01 February 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Materials and Methods
3.1. Datasets
3.2. Baseline
3.3. Experimental Design
3.3.1. Data Preprocessing
3.3.2. Interpretability Metrics
3.3.3. Model Evaluation and Hyperparameter Optimization
4. Results
4.1. Classification Results
4.2. Interpretability Results
4.3. Summary of Comparative Experiments
4.4. Two Examples
4.4.1. bank-marketing
4.4.2. german
5. Discussion
5.1. Why Should We Avoid a Mixture of Categorical and Numerical Attributes?
5.2. Optimal Selection of the Pareto Solutions
5.3. Decision Lists v.s. Decision Trees
5.4. as a Metric of Interpretability
5.5. Limitation
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. Implementation details and hyperparameters
Appendix A.1. XGBoost
| Parameter | Space |
|---|---|
| UniformInt(1, 10) | |
| LogUniform(1e-4,1.0) | |
| # Iterations | 50 |
Appendix A.2. FBTs
| Parameter | Space |
|---|---|
| UniformInt(1, 10) | |
| {auc, None} | |
| # Iterations | 50 |
Appendix A.3. RuleCOSI+
| Parameter | Space |
|---|---|
| Uniform(0.0, 0.95) | |
| Uniform(0.0, 0.5) | |
| c | Uniform(0.1, 0.5) |
| # Iterations | 50 |
Appendix A.4. Re-RX with J48graft
- [51]
| Parameter | Space |
|---|---|
| UniformInt(1, 5) | |
| LogUniform(5e-3, 0.1) | |
| LogUniform(1e-6, 1e-2) | |
| # Iterations | 50 |
| Parameter | Space |
|---|---|
| {2, 4, 8, …, 128} | |
| Uniform(0.1, 0.5) | |
| LogUniform(0.001, 0.25) | |
| Uniform(0.05, 0.4) | |
| Uniform(0.05, 0.4) | |
| # Iterations | 50 |
Appendix A.5. DT
| Parameter | Space |
|---|---|
| UniformInt(1, 10) | |
| Uniform(0.0, 0.5) | |
| Uniform(0.0, 0.5) | |
| # Iterations | 100 |
Appendix A.6. J48graft
| Parameter | Space |
|---|---|
| {2, 4, 8, …, 128} | |
| Uniform(0.1, 0.5) | |
| # Iterations | 100 |
Appendix B. Results for other metrics
| dataset | FBTs | RuleCOSI+ | Re-RX with J48graft | J48graft | DT |
|---|---|---|---|---|---|
| heart | |||||
| australian | |||||
| mammographic | |||||
| tictactoe | |||||
| german | |||||
| biodeg | |||||
| banknote | |||||
| bank-marketing | |||||
| spambase | |||||
| occupancy | |||||
| ranking | 4.0 | 2.1 | 2.5 | 4.1 | 2.3 |
| dataset | FBTs | RuleCOSI+ | Re-RX with J48graft | J48graft | DT |
|---|---|---|---|---|---|
| heart | |||||
| australian | |||||
| mammographic | |||||
| tictactoe | |||||
| german | |||||
| biodeg | |||||
| banknote | |||||
| bank-marketing | |||||
| spambase | |||||
| occupancy | |||||
| ranking | 2.8 | 3.3 | 3.3 | 2.5 | 3.1 |
| dataset | FBTs | RuleCOSI+ | Re-RX with J48graft | J48graft | DT |
|---|---|---|---|---|---|
| heart | |||||
| australian | |||||
| mammographic | |||||
| tictactoe | |||||
| german | |||||
| biodeg | |||||
| banknote | |||||
| bank-marketing | |||||
| spambase | |||||
| occupancy | |||||
| ranking | 2.9 | 1.1 | 3.9 | 3.8 | 3.0 |
| dataset | FBTs | RuleCOSI+ | Re-RX with J48graft | J48graft | DT |
|---|---|---|---|---|---|
| heart | |||||
| australian | |||||
| mammographic | |||||
| tictactoe | |||||
| german | |||||
| biodeg | |||||
| banknote | |||||
| bank-marketing | |||||
| spambase | |||||
| occupancy | |||||
| ranking | 3.0 | 2.1 | 4.0 | 3.0 | 2.9 |
References
- Saeed, W.; Omlin, C. Explainable AI (XAI): A systematic meta-survey of current challenges and future opportunities. Knowl. Based Syst. 2023, 263, 110273. [CrossRef]
- Arrieta, A.; Díaz-Rodríguez, N.; Del Ser, J.; Bennetot, A.; Tabik, S.; Barbado, A.; Garcia, S.; Gil-Lopez, S.; Molina, D.; Benjamins, R.; Chatila, R.; Herrera, F. Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Inf. Fusion 2020, 58, 82–115. [CrossRef]
- Adadi, A.; Berrada, M. Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI). IEEE Access 2018, 6, 52138–52160. [CrossRef]
- Zhang, Y.; Tiňo, P.; Leonardis, A.; Tang, K. A Survey on Neural Network Interpretability. IEEE Trans. Emerg. Top Comput. Intell. 2021, 5, 726–742. [CrossRef]
- Demajo, L.M.; Vella, V.; Dingli, A. Explainable AI for Interpretable Credit Scoring. Computer Science & Information Technology (CS & IT). AIRCC Publishing Corporation, 2020. [CrossRef]
- Petch, J.; Di, S.; Nelson, W. Opening the Black Box: The Promise and Limitations of Explainable Machine Learning in Cardiology. Can. J. Cardiol. 2022, 38, 204–213. [CrossRef]
- Weber, L.; Lapuschkin, S.; Binder, A.; Samek, W. Beyond explaining: Opportunities and challenges of XAI-based model improvement. Inf. Fusion 2023, 92, 154–176. [CrossRef]
- Vilone, G.; Longo, L. Classification of Explainable Artificial Intelligence Methods through Their Output Formats. Mach. Learn. Knowl. Extr. 2021, 3, 615–661. [CrossRef]
- Cabitza, F.; Campagner, A.; Malgieri, G.; Natali, C.; Schneeberger, D.; Stoeger, K.; Holzinger, A. Quod erat demonstrandum? - Towards a typology of the concept of explanation for the design of explainable AI. Expert Syst. Appl. 2023, 213, 118888. [CrossRef]
- Deck, L.; Schoeffer, J.; De-Arteaga, M.; Kühl, N. A Critical Survey on Fairness Benefits of XAI, 2023, [arXiv:cs.AI/2310.13007]. [CrossRef]
- Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 2019, 1, 206–215. [CrossRef]
- Zihni, E.; Madai, V.I.; Livne, M.; Galinovic, I.; Khalil, A.A.; Fiebach, J.B.; Frey, D. Opening the black box of artificial intelligence for clinical decision support: A study predicting stroke outcome. PLOS ONE 2020, 15. [CrossRef]
- Yang, C.C. Explainable Artificial Intelligence for Predictive Modeling in Healthcare. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2022, 6, 228–239. [CrossRef]
- Lipton, Z.C. The Mythos of Model Interpretability, 2017, [arXiv:cs.LG/1606.03490]. [CrossRef]
- Breiman, L. Bagging predictors. MACHINE LEARNING 1996, 24, 123–140. [CrossRef]
- Breiman, L. Random forests. MACHINE LEARNING 2001, 45, 5–32. [CrossRef]
- Mason, L.; Baxter, J.; Bartlett, P.; Frean, M. Boosting Algorithms as Gradient Descent. Advances in Neural Information Processing Systems; Solla, S.; Leen, T.; Müller, K., Eds. MIT Press, 1999, Vol. 12.
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Association for Computing Machinery: New York, NY, USA, 2016; KDD ’16, pp. 785–794. [CrossRef]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.Y. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. Proceedings of the 31st International Conference on Neural Information Processing Systems; Curran Associates Inc.: Red Hook, NY, USA, 2017; NIPS’17, pp. 3149–3157.
- Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: unbiased boosting with categorical features, 2019, [arXiv:cs.LG/1706.09516]. [CrossRef]
- Sagi, O.; Rokach, L. Ensemble learning: A survey. WIREs Data Mining and Knowledge Discovery 2018, 8, e1249, [https://wires.onlinelibrary.wiley.com/doi/pdf/10.1002/widm.1249]. [CrossRef]
- Mahbooba, B.; Timilsina, M.; Sahal, R.; Serrano, M. Explainable artificial intelligence (XAI) to enhance trust management in intrusion detection systems using decision tree model. Complexity 2021, 2021, 1–11. [CrossRef]
- Shulman, E.; Wolf, L. Meta Decision Trees for Explainable Recommendation Systems. Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society; Association for Computing Machinery: New York, NY, USA, 2020; AIES ’20, pp. 365–371. [CrossRef]
- Blanco-Justicia, A.; Domingo-Ferrer, J.; Martínez, S.; Sánchez, D. Machine learning explainability via microaggregation and shallow decision trees. Knowl. Based Syst. 2020, 194, 105532. [CrossRef]
- Sachan, S.; Yang, J.B.; Xu, D.L.; Benavides, D.E.; Li, Y. An explainable AI decision-support-system to automate loan underwriting. Expert Syst. Appl. 2020, 144. [CrossRef]
- Yang, L.H.; Liu, J.; Ye, F.F.; Wang, Y.M.; Nugent, C.; Wang, H.; Martinez, L. Highly explainable cumulative belief rule-based system with effective rule-base modeling and inference scheme. Knowl. Based Syst. 2022, 240. [CrossRef]
- Li, H.; Wang, Y.; Zhang, S.; Song, Y.; Qu, H. KG4Vis: A Knowledge Graph-Based Approach for Visualization Recommendation. IEEE Transactions on Visualization and Computer Graphics 2022, 28, 195–205. [CrossRef]
- Setiono, R.; Baesens, B.; Mues, C. Recursive Neural Network Rule Extraction for Data With Mixed Attributes. IEEE Transactions on Neural Networks 2008, 19, 299–307. [CrossRef]
- Hayashi, Y.; Nakano, S. Use of a Recursive-Rule eXtraction algorithm with J48graft to achieve highly accurate and concise rule extraction from a large breast cancer dataset. Informatics in Medicine Unlocked 2015, 1, 9–16. [CrossRef]
- Friedman, J.H.; Popescu, B.E. Predictive learning via rule ensembles. The Annals of Applied Statistics 2008, 2, 916 – 954. [CrossRef]
- Deng, H. Interpreting tree ensembles with inTrees. International Journal of Data Science and Analytics 2019, 7, 277–287. [CrossRef]
- Hara, S.; Hayashi, K. Making Tree Ensembles Interpretable: A Bayesian Model Selection Approach. Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics; Storkey, A.; Perez-Cruz, F., Eds. PMLR, 2018, Vol. 84, Proceedings of Machine Learning Research, pp. 77–85.
- Sagi, O.; Rokach, L. Explainable decision forest: Transforming a decision forest into an interpretable tree. Inf. Fusion 2020, 61, 124–138. [CrossRef]
- Sagi, O.; Rokach, L. Approximating XGBoost with an interpretable decision tree. Information Sciences 2021, 572, 522–542. [CrossRef]
- Obregon, J.; Kim, A.; Jung, J.Y. RuleCOSI: Combination and simplification of production rules from boosted decision trees for imbalanced classification. Expert Syst. Appl. 2019, 126, 64–82. [CrossRef]
- Obregon, J.; Jung, J.Y. RuleCOSI+: Rule extraction for interpreting classification tree ensembles. Inf. Fusion 2023, 89, 355–381. [CrossRef]
- Hayashi, Y. Synergy effects between grafting and subdivision in Re-RX with J48graft for the diagnosis of thyroid disease. Knowl. Based Syst. 2017, 131, 170–182. [CrossRef]
- Hayashi, Y.; Oishi, T. High accuracy-priority rule extraction for reconciling accuracy and interpretability in credit scoring. New Generation Computing 2018, 36, 393–418. [CrossRef]
- Chakraborty, M.; Biswas, S.K.; Purkayastha, B. Recursive Rule Extraction from NN using Reverse Engineering Technique. New Generation Computing 2018, 36, 119–142. [CrossRef]
- Hayashi, Y. NEURAL NETWORK RULE EXTRACTION BY A NEW ENSEMBLE CONCEPT AND ITS THEORETICAL AND HISTORICAL BACKGROUND: A REVIEW. International Journal of Computational Intelligence and Applications 2013, 12, 1340006. [CrossRef]
- Hayashi, Y. Application of a rule extraction algorithm family based on the Re-RX algorithm to financial credit risk assessment from a Pareto optimal perspective. Operations Research Perspectives 2016, 3, 32–42. [CrossRef]
- Hayashi, Y.; Takano, N. One-Dimensional Convolutional Neural Networks with Feature Selection for Highly Concise Rule Extraction from Credit Scoring Datasets with Heterogeneous Attributes. Electronics 2020, 9. [CrossRef]
- Hayashi, Y. Does Deep Learning Work Well for Categorical Datasets with Mainly Nominal Attributes? Electronics 2020, 9. [CrossRef]
- Kelly, M.; Longjohn, R.; Nottingham, K. UCI Machine Learning Repository. https://archive.ics.uci.edu.
- Webb, G.I. Decision Tree Grafting from the All-Tests-but-One Partition. Proceedings of the 16th International Joint Conference on Artificial Intelligence; Morgan Kaufmann Publishers Inc.: San Francisco, CA, USA, 1999; Vol. 2, IJCAI’99, pp. 702–707.
- Quinlan, J.R. C4.5: Programs for Machine Learning; Morgan Kaufmann, 1993.
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; Vanderplas, J.; Passos, A.; Cournapeau, D.; Brucher, M.; Perrot, M.; Édouard Duchesnay. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 2011, 12, 2825–2830.
- Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A Next-generation Hyperparameter Optimization Framework. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2019. [CrossRef]
- Bradley, A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 1997, 30, 1145–1159. [CrossRef]
- WELCH, B. THE GENERALIZATION OF STUDENTS PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARIANCES ARE INVOLVED. BIOMETRIKA 1947, 34, 28–35. [CrossRef]
- Loshchilov, I.; Hutter, F. Decoupled Weight Decay Regularization. International Conference on Learning Representations, 2019. [CrossRef]
| 1 | Kaggle is a platform for predictive modeling and analytics competitions in which statisticians and data miners compete to produce the best models for predicting and describing the datasets uploaded by companies and users. |
| 2 | |
| 3 | |
| 4 | We used OneHotEncoder from scikit-learn [47] |
| 5 | |
| 6 | |
| 7 | |
| 8 | |
| 9 | |
| 10 | |
| 11 | |
| 12 | |
| 13 | |
| 14 | |
| 15 |





| dataset | #instances | #features | #cate | #cont | major class ratio |
|---|---|---|---|---|---|
| heart | 270 | 13 | 7 | 6 | 0.55 |
| australian | 690 | 14 | 6 | 8 | 0.555 |
| mammographic | 831 | 4 | 2 | 2 | 0.52 |
| tictactoe | 958 | 9 | 9 | 0 | 0.65 |
| german | 1000 | 20 | 7 | 13 | 0.70 |
| biodeg | 1055 | 41 | 0 | 41 | 0.66 |
| banknote | 1372 | 4 | 0 | 4 | 0.55 |
| bank-marketing | 4521 | 16 | 9 | 7 | 0.89 |
| spambase | 4601 | 57 | 0 | 57 | 0.60 |
| occupancy | 8143 | 5 | 0 | 5 | 0.79 |
| dataset | FBTs | RuleCOSI+ | Re-RX with J48graft | J48graft | DT |
|---|---|---|---|---|---|
| heart | |||||
| australian | |||||
| mammographic | |||||
| tictactoe | |||||
| german | |||||
| biodeg | |||||
| banknote | |||||
| bank-marketing | |||||
| spambase | |||||
| occupancy | |||||
| ranking | 2.9 | 2.1 | 4.0 | 3.2 | 2.8 |
| dataset | FBTs | RuleCOSI+ | Re-RX with J48graft | J48graft | DT |
|---|---|---|---|---|---|
| heart | |||||
| australian | |||||
| mammographic | |||||
| tictactoe | |||||
| german | |||||
| biodeg | |||||
| banknote | |||||
| bank-marketing | |||||
| spambase | |||||
| occupancy | |||||
| ranking | 4.5 | 1.6 | 3.3 | 3.8 | 1.8 |
| dataset | FBTs | RuleCOSI+ | Re-RX with J48graft | J48graft | DT |
|---|---|---|---|---|---|
| heart | |||||
| australian | |||||
| mammographic | |||||
| tictactoe | |||||
| german | |||||
| biodeg | |||||
| banknote | |||||
| bank-marketing | |||||
| spambase | |||||
| occupancy | |||||
| ranking | 3.5 | 4.7 | 1.5 | 3.1 | 2.2 |
| method | |||
|---|---|---|---|
| FBTs | |||
| RuleCOSI+ | |||
| Re-RX with J48graft | |||
| J48graft | |||
| DT |
| RuleCOSI+ | coverage | |
|---|---|---|
| 0.744 | ||
| 0.148 | ||
| 0.109 | ||
| Re-RX with J48graft | coverage | |
| 0.820 | ||
| 0.108 | ||
| 0.044 | ||
| 0.0 | ||
| 0.029 | ||
| DT | coverage | |
| 0.891 | ||
| 0.025 | ||
| 0.084 |
| Re-RX with J48graft | coverage | |
|---|---|---|
| 0.971 | ||
| 0.0 | ||
| 0.029 |
| RuleCOSI+ | coverage | |
|---|---|---|
| 0.329 | ||
| 0.283 | ||
| 0.388 | ||
| Re-RX with J48graft | coverage | |
| 0.394 | ||
| 0.063 | ||
| 0.16 | ||
| 0.067 | ||
| 0.013 | ||
| 0.022 | ||
| 0.012 | ||
| 0.19 | ||
| 0.079 | ||
| DT | coverage | |
| 0.394 | ||
| 0.325 | ||
| 0.281 |
| Re-RX with J48graft | coverage | |
|---|---|---|
| 0.394 | ||
| 0.063 | ||
| 0.067 | ||
| 0.207 | ||
| 0.19 | ||
| 0.079 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).