Submitted:
11 December 2024
Posted:
12 December 2024
You are already at the latest version
Abstract

Keywords:
1. Introduction
2. Material and Methods
2.1. State-of-the Art of Breast Cancer Classification
2.2. Overview of Major Supervised Algorithms Used for Breast Cancer Classification
2.3. The Patient Rule Induction Method
2.3.1. Overview
2.3.2. Related Work
3. PRIM Based Framework for Breast Cancer Classification and Explanation
3.1. Presentation of the Framework
3.1.1. Step 1: Data Preparation and Defining Learning Objectives
3.1.2. Step 2: Building the Boxes with PRIM on Random Feature Selection
3.1.3. Step 3: Handling Rule Conflict
3.1.4. Step 4: Organizing and Pruning Rules Using Metarules
3.1.5. Step 5: Validation of a Classifier
3.2. Illustrative Example of the PRIM Based Classification Framework
4. Experimental Setup for the Empirical Evaluation of the PRIM Based Framework for Breast Cancer Classification
5. Results and Limitations
5.1. Empirical Results
5.2. Limitations
6. Discussion and Conclusion
Acknowledgments
References
- Hou, C.; Zhong, X.; He, P.; Xu, B.; Diao, S.; Yi, F. . Li, J. Predicting breast cancer in Chinese women using machine learning techniques: algorithm development. JMIR medical informatics 2020, 8, e17364. [Google Scholar] [CrossRef]
- Naji, M.A.; El Filali, S.; Aarika, K.; Benlahmar, E.H.; Abdelouhahid, R.A.; Debauche, O. Machine learning algorithms for breast cancer prediction and diagnosis. Procedia Computer Science 2021, 191, 487–492. [Google Scholar] [CrossRef]
- Prastyo, P.H.; Paramartha, I.G.Y.; Pakpahan, M.S.M.; Ardiyanto, I. Predicting Breast Cancer: A Comparative Analysis of Machine Learning Algorithms. In Proceeding International Conference on Science and Engineering; April 2020; Vol. 3, pp. 455–459.
- Ahmad, L.G.; Eshlaghy, A.T.; Poorebrahimi, A.; Ebrahimi, M.; Razavi, A.R. Using three machine learning techniques for predicting breast cancer recurrence. J Health Med Inform 2013, 4, 3. [Google Scholar]
- Tseng, Y.J.; Huang, C.E.; Wen, C.N.; Lai, P.Y.; Wu, M.H.; Sun, Y.C. . Lu, J.J. Predicting breast cancer metastasis by using serum biomarkers and clinicopathological data with machine learning technologies. International journal of medical informatics 2019, 128, 79–86. [Google Scholar] [CrossRef] [PubMed]
- Gupta, S.; Kumar, D.; Sharma, A. Data mining classification techniques applied for breast cancer diagnosis and prognosis. Indian Journal of Computer Science and Engineering (IJCSE) 2011, 2, 188–195. [Google Scholar]
- Li, J.; Zhou, Z.; Dong, J.; Fu, Y.; Li, Y.; Luan, Z.; Peng, X. Predicting breast cancer 5-year survival using machine learning: A systematic review. PloS one 2021, 16, e0250370. [Google Scholar] [CrossRef] [PubMed]
- Nassif, A.B.; Talib, M.A.; Nasir, Q.; Afadar, Y.; Elgendy, O. Breast cancer detection using artificial intelligence techniques: A systematic literature review. Artificial Intelligence in Medicine 2022, 102276. [Google Scholar] [CrossRef] [PubMed]
- Abreu, P.H.; Santos, M.S.; Abreu, M.H.; Andrade, B.; Silva, D.C. Predicting breast cancer recurrence using machine learning techniques: a systematic review. ACM Computing Surveys (CSUR) 2016, 49, 1–40. [Google Scholar] [CrossRef]
- Houfani, D.; Slatnia, S.; Kazar, O.; Zerhouni, N.; Merizig, A.; Saouli, H. Machine learning techniques for breast cancer diagnosis: literature review. In International Conference on Advanced Intelligent Systems for Sustainable Development; Springer, Cham, 2020; pp. 247–254.
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining; August 2016; pp. 785–794. August 2016, 785–794.
- Breiman, L. Random Forests. Machine Learning 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Friedman, J.; Hastie, T.; Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. Journal of statistical software 2010, 33, 1. [Google Scholar] [CrossRef] [PubMed]
- FRIEDMAN, Jerome H. et FISHER, Nicholas I. Bump hunting in high-dimensional data. Statistics and Computing, 1999; 9, 123–143. [CrossRef]
- R. Reed, "Pruning algorithms-a survey," in IEEE Transactions on Neural Networks, vol. 4, no. 5, pp. 740–747, Sept. 1993. [CrossRef]
- Fürnkranz, J. Pruning Algorithms for Rule Learning. Machine Learning 1997, 27, 139–172. [Google Scholar] [CrossRef]
- Berrado, Abdelaziz, and George, C. Runger. "Using metarules to organize and group discovered association rules." Data mining and knowledge discovery 2007, 14, 409–431.
- Sáez, J.A.; Galar, M.; Krawczyk, B. Addressing the overlapping data problem in classification using the one-vs-one decomposition strategy. IEEE Access 2019, 7, 83396–83411. [Google Scholar] [CrossRef]
- Das, B.; Krishnan, N.C.; Cook, D.J. Handling class overlap and imbalance to detect prompt situations in smart homes. In 2013 IEEE 13th international conference on data mining workshops; IEEE: December 2013; pp. 266–273.
- NASSIH, Rym, and Abdelaziz BERRADO. "Towards a patient rule induction methodbased classifier." 2019 1st International Conference on Smart Systems and Data Science (ICSSD). IEEE, 2019.
- Breiman, L.; Friedman, J.; Olshen, R.; Stone, C. Cart. Classification and Regression Trees. 1984.
- Asri, H.; Mousannif, H.; Al Moatassime, H.; Noel, T. Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Computer Science 2016, 83, 1064–1069. [Google Scholar] [CrossRef]
- Lindgren, T. On handling conflicts between rules with numerical features. In Proceedings of the 2006 ACM symposium on Applied computing; April 2006; pp. 37–41.
- Lindgren, T. Methods for rule conflict resolution. In European Conference on Machine Learning; Springer: Berlin, Heidelberg, September 2004; pp. 262–273. [Google Scholar]
- Nassih, Rym, and Abdelaziz Berrado. "State of the art of Fairness, Interpretability and Explainability in Machine Learning: Case of PRIM." Proceedings of the 13th International Conference on Intelligent Systems: Theories and Applications. 2020.
- Oviedo, F.; Ferres, J.L.; Buonassisi, T.; Butler, K.T. Interpretable and explainable machine learning for materials science and chemistry. Accounts of Materials Research 2022, 3, 597–607. [Google Scholar] [CrossRef]
- Linardatos, P.; Papastefanopoulos, V.; Kotsiantis, S. Explainable ai: A review of machine learning interpretability methods. Entropy 2020, 23, 18. [Google Scholar] [CrossRef]
- Demsar J, Curk T, Erjavec A, Gorup C, Hocevar T, Milutinovic M, Mozina M, Polajnar M, Toplak M, Staric A, Stajdohar M, Umek L, Zagar L, Zbontar, J. ; Zitnik M, Zupan, B. Orange: Data Mining Toolbox in Python, Journal of Machine Learning Research 2013, 14, 2349−2353.
- Powers, D.M. Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv 2020, arXiv:2010.16061. [Google Scholar]
- Shokri, A.; Walker, J.P.; van Dijk, A.I.; Wright, A.J.; Pauwels, V.R. Application of the patient rule induction method to detect hydrologic model behavioral parameters and quantify uncertainty. Hydrological Processes 2018, 32, 1005–1025. [Google Scholar] [CrossRef]
- Kwakkel, J.H. A generalized many-objective optimization approach for scenario discovery. Futures Foresight Science 2019, 1, e8. [Google Scholar] [CrossRef]
- Su, H.C.; Sakata, T.; Herman, C.; Dolins, S. U.S. U.S. Patent No. 6,643,646. U.S. Patent and Trademark Office: Washington, DC, 2003.
- Dyson, G. An Application of the Patient Rule-Induction Method to Detect Clinically Meaningful Subgroups from Failed Phase III Clinical Trials. International journal of clinical biostatistics and biometrics 2021, 7. [Google Scholar]
- Nassih, R.; Berrado, A. Potential for PRIM based classification: A literature review. In Proceedings of the International Conference on Industrial Engineering and Operations Management Pilsen, Czech Republic, 2019, July 23-26.
- Worldwide cancer data | World Cancer Research Fund International (wcrf.
- Fatima,.N.; Liu,.L.; Hong,.S. and Ahmed.H, "Prediction of Breast Cancer, Comparative Review of Machine Learning Techniques, and Their Analysis," in IEEE Access, vol. 8, pp. 150360–150376, 2020. [CrossRef]
- Jacob, D.S.; Viswan, R.; Manju, V.; PadmaSuresh, L.; Raj, S. A survey on breast cancer prediction using data miningtechniques. In 2018 Conference on Emerging Devices and Smart Systems (ICEDSS), March 2018; pp. 256–258. IEEE.
- Zand, H.K.K. A comparative survey on data mining techniques for breast cancer diagnosis and prediction. Indian Journal of Fundamental and Applied Life Sciences 2015, 5, 4330–4339. [Google Scholar]






| Rules for class = 1 | Coverage | Density | Dimension | Support |
|---|---|---|---|---|
|
R1: 128.0 < Glucose < 199.0 AND 17.0 < SkinThickness < 99.0 AND 0.0 < Insulin < 520.0 AND 0.257 < DiabetesPedigreeFunction < 2.42 AND 25.0 < Age < 57.0 R2: 111.0 < Glucose < 199.0 AND 56.0 < BloodPressure < 122.0 AND 0.0 < SkinThickness < 43.0 AND 0.078 < DiabetesPedigreeFunction < 1.37 AND 32.0 < Age < 54.0 R3: 100.0 < Glucose < 199.0 AND 0.253 < DiabetesPedigreeFunction < 1.16 AND 29.0 < Age < 62.0 R4: 90.0 < Glucose < 199.0 AND 0.1495 < DiabetesPedigreeFunction < 2.42 AND 22.0 < Age < 81.0 R5: 0.0 < BloodPressure < 82.0 AND 12.0 < SkinThickness < 99.0 AND 0.0 < Insulin < 99.0 AND 0.1265 < DiabetesPedigreeFunction < 2.42 AND 24.0 < Age < 81.0 R6: 89.0 < Glucose < 199.0 AND 0.1265 < DiabetesPedigreeFunction < 2.42 R7: 128.0 < Glucose < 199.0 AND 17.0 < SkinThickness < 99.0 AND 25.0 < Age < 56.0 R8: 101.0 < Glucose < 199.0 AND 60.0 < BloodPressure < 85.0 AND 0.0 < SkinThickness < 26.0 AND 33.0 < Age < 52.0 R9: 109.0 < Glucose < 199.0 AND 12.0 < SkinThickness < 99.0 AND 31.0 < Age < 59.0 R10: 124.0 < Glucose < 199.0 AND 0.0 < SkinThickness < 0.0 AND 25.0 < Age < 53.0 R11: 95.0 < Glucose < 199.0 AND 22.0 < Age < 62.0 R12: 0.0 < BloodPressure < 85.0 AND 7.0 < SkinThickness < 99.0 AND 26.0 < Age < 56.0 R13: 93.0 < Glucose < 199.0 AND 60.0 < BloodPressure < 92.0 R14: 130.0 < Glucose < 199.0 AND 30.05 < BMI < 67.1 R15: 109.0 < Glucose < 199.0 AND 27.85 < BMI < 67.1 R16: 95.0 < Glucose < 199.0 AND 22.79 < BMI < 67.1 R17: 7.0 < Pregnancies < 9.0 AND 145.0 < Glucose < 199.0 AND 0.0 < Insulin < 495.0 R18: 128.0 < Glucose < 199.0 AND 18.0 < SkinThickness < 99.0 AND 74.0 < Insulin < 478.0 R19: 109.0 < Glucose < 199.0 R20: 7.0 < Pregnancies < 17.0 AND 84.0 < Glucose < 199.0 R21: 0.0 < Pregnancies < 3.0 AND 77.0 < Glucose < 199.0 AND 13.0 < SkinThickness < 45.0 AND 36.0 < Insulin < 846.0 R22: 4.0 < Pregnancies < 6.0 AND 0.0 < Glucose < 105.0 AND 0.0 < SkinThickness < 42.0 AND 0.0 < Insulin < 156.0 R23: 0.0 < Pregnancies < 2.0 AND 90.0 < Glucose < 199.0 AND 0.0 < SkinThickness < 42.0 AND 0.0 < Insulin < 15.0 R24: 8.0 < Pregnancies < 17.0 AND 24.0 < SkinThickness < 43.0 AND 31.0 < BMI < 45.90 R25: 30.05 < BMI < 67.1 R26: 7.0 < Pregnancies < 17.0 AND 0.0 < BloodPressure < 94.0 AND 23.15 < BMI < 67.1 R27: 4.0 < Pregnancies < 17.0 AND 0.0 < BloodPressure < 80.0 AND 0.0 < SkinThickness < 24.0 R28: 3.0 < Pregnancies < 17.0 AND 0.0 < SkinThickness < 33.0 R29: 0.0 < BloodPressure < 86.0 AND 23.15 < BMI < 29.5 R30: 28.1 < BMI < 67.1 AND 0.20 < DiabetesPedigreeFunction < 2.42 AND 31.0 < Age < 60.0 R31: 29.0 < Insulin < 846.0 AND 26.1 < BMI < 67.1 AND 0.1275 < DiabetesPedigreeFunction < 2.42 AND 28.0 < Age < 53.0 R32: 26.9 < BMI < 67.1 AND 0.1265 < DiabetesPedigreeFunction < 2.42 AND 25.0 < Age < 62.0 R33: 0.0 < Insulin < 194.0 AND 22.79 < BMI < 67.1 AND 0.1195 < DiabetesPedigreeFunction < 0.817 AND 23.0 < Age < 81.0 R34: 22.0 < Age < 54.0 R35: 24.75 < BMI < 67.1 R36: 30.85 < BMI < 67.1 R37: 23.25 < BMI < 67.1 R38: 0.0 < BMI < 23.05 R39: 7.0 < Pregnancies < 12.0 AND 110.0 < Insulin < 846.0 AND 0.188 < DiabetesPedigreeFunction < 2.42 R40: 0.3235 < DiabetesPedigreeFunction < 2.42 R41: 7.0 < Pregnancies < 12.0 AND 64.0 < BloodPressure < 122.0 AND 0.1215 < DiabetesPedigreeFunction < 0.2825 R42: 0.11 < DiabetesPedigreeFunction < 0.2825 R43: 0.0 < Insulin < 140.0 AND 0.086 < DiabetesPedigreeFunction < 2.42 R44: 7.0 < Pregnancies < 9.0 AND 145.0 < Glucose < 199.0 AND 0.0 < Insulin < 495.0 R45: 134.0 < Glucose < 199.0 AND 0.0 < Insulin < 478.0 R46: 109.0 < Glucose < 199.0 R47: 7.0 < Pregnancies < 17.0 AND 84.0 < Glucose < 199.0 R48: 0.0 < Pregnancies < 3.0 AND 78.0 < Glucose < 199.0 AND 36.0 < Insulin < 846.0 R49: 4.0 < Pregnancies < 6.0 AND 0.0 < Glucose < 104.0 AND 0.0 < Insulin < 156.0 R50: 0.0 < Pregnancies < 2.0 AND 90.0 < Glucose < 199.0 AND 0.0 < Insulin < 15.0 R51: 28.1 < BMI < 67.1 AND 0.20 < DiabetesPedigreeFunction < 2.42 AND 31.0 < Age < 60.0 R52: 26.70 < BMI < 35.45 AND 0.1275 < DiabetesPedigreeFunction < 2.42 AND 30.0 < Age < 53.0 R53: 29.95 < BMI < 67.1 AND 0.1265 < DiabetesPedigreeFunction < 2.42 AND 25.0 < Age < 81.0 R54: 23.35 < BMI < 67.1 AND 0.1275 < DiabetesPedigreeFunction < 0.6535 AND 28.0 < Age < 61.0 R55:0.1195 < DiabetesPedigreeFunction < 2.42 AND 22.0 < Age < 60.0 R56: 27.85 < BMI < 67.1 AND 21.0 < Age < 62.0 |
0.29 0.24 0.16 0.24 0.02 0.03 0.34 0.14 0.08 0.11 0.23 0.03 0.03 0.51 0.26 0.18 0.14 0.22 0.48 0.06 0.04 0.02 0.01 0.11 0.69 0.08 0.06 0.03 0.03 0.5 0.1 0.21 0.11 0.059 0.018 0.74 0.24 0.01 0.12 0.56 0.067 0.20 0.03 0.14 0.40 0.31 0.05 0.04 0.02 0.01 0.5 0.09 0.21 0.05 0.12 0.02 |
0.77 0.63 0.52 0.22 0.15 0.13 0.75 0.63 0.53 0.71 0.22 0.25 0.16 0.73 0.39 0.22 0.88 0.64 0.39 0.4 0.12 0.14 0.09 0.75 0.43 0.45 0.27 0.14 0.09 0.62 0.49 0.36 0.22 0.11 0.11 0.46 0.24 0.04 0.82 0.37 0.43 0.25 0.16 0.88 0.59 0.34 0.4 0.11 0.14 0.09 0.62 0.53 0.38 0.32 0.14 0.17 |
5 5 3 3 5 2 3 4 3 3 2 3 2 2 2 2 3 3 1 2 4 4 4 3 1 3 3 2 2 3 4 3 4 1 1 1 1 1 3 1 3 1 2 3 2 1 2 3 3 3 3 3 3 3 2 2 |
0.13 0.13 0.10 0.38 0.05 0.08 0.16 0.08 0.05 0.05 0.37 0.05 0.08 0.24 0.22 0.28 0.05 0.12 0.42 0.05 0.12 0.06 0.05 0.05 0.55 0.05 0.07 0.07 0.12 0.27 0.07 0.20 0.17 0.18 0.05 0.56 0.34 0.08 0.05 0.52 0.05 0.27 0.07 0.06 0.23 0.31 0.05 0.13 0.06 0.05 0.27 0.06 0.18 0.05 0.29 0.05 |
| Rules for class = 0 | ||||
|
R1: 94.0 < Glucose < 157.0 AND 0.0 < BloodPressure < 88.0 AND 60.0 < Insulin < 228.0 AND 0.078 < DiabetesPedigreeFunction < 0.899 AND 21.0 < Age < 49.0 R2: 89.0 < Glucose < 183.0 AND 0.0 < BloodPressure < 90.0 AND 0.0 < SkinThickness < 41.0 AND 0.0 < Insulin < 190.0 AND 0.078 < DiabetesPedigreeFunction < 1.1855 AND 21.0 < Age < 59.0 R3: 80.0 < Glucose < 189.0 AND 52.0 < BloodPressure < 82.0 AND 12.0 < SkinThickness < 39.0 AND 49.0 < Insulin < 394.0 AND 0.259 < DiabetesPedigreeFunction < 2.42 R4: 70.0 < BloodPressure < 106.0 AND 16.0 < SkinThickness < 50.0 AND 0.0 < Insulin < 145.0 AND 0.1535 < DiabetesPedigreeFunction < 0.712 R5: 52.0 < BloodPressure < 122.0 AND 0.0 < Insulin < 485.0 AND 0.239 < DiabetesPedigreeFunction < 2.42 R6: 0.0 < Glucose < 189.0 AND 0.11 < DiabetesPedigreeFunction < 1.143 R7: 93.0 < Glucose < 137.0 AND 54.0 < BloodPressure < 88.0 AND 7.0 < SkinThickness < 40.0 AND 21.0 < Age < 52.0 R8: 90.0 < Glucose < 157.0 AND 23.25 < BMI < 41.65 R9: 19.20 < BMI < 47.34 R10: 0.0 < Pregnancies < 0.0 AND 13.0 < SkinThickness < 45.0 AND 63.0 < Insulin < 291.0 R11: 2.0 < Pregnancies < 7.0 AND 92.0 < Glucose < 133.0 AND 0.0 < SkinThickness < 39.0 AND 73.0 < Insulin < 267.0 R12: 1.0 < Pregnancies < 8.0 AND 105.0 < Glucose < 169.0 AND 0.0 < SkinThickness < 47.0 AND 74.0 < Insulin < 846.0 R13: 1.0 < Pregnancies < 17.0 AND 80.0 < Glucose < 199.0 AND 0.0 < SkinThickness < 41.0 AND 0.0 < Insulin < 220.0 R14: 56.0 < Glucose < 199.0 AND 0.0 < SkinThickness < 51.0 AND 0.0 < Insulin < 474.0 R15: 0.0 < BloodPressure < 88.0 AND 21.45 < BMI < 43.55 R16: 0.0 < Pregnancies < 10.0 AND 17.0 < SkinThickness < 46.0 AND 20.6 < BMI < 46.15 R17: 0.0 < BloodPressure < 94.0 AND 0.0 < SkinThickness < 47.0 AND 0.0 < BMI < 51.15 R18: 40.0 < Insulin < 215.0 AND 25.1 < BMI < 41.65 AND 0.078 < DiabetesPedigreeFunction < 1.18 AND 21.0 < Age < 46.0 R19: 20.6 < BMI < 43.34 AND 0.078 < DiabetesPedigreeFunction < 0.9155 R20: 15.0 < Insulin < 846.0 AND 0.0 < BMI < 46.6 AND 0.247 < DiabetesPedigreeFunction < 2.2125000000000004 R21: 0.0 < Insulin < 14.0 R22: 0.0 < BloodPressure < 88.0 AND 21.45 < BMI < 43.55 R23: 0.0 < BloodPressure < 106.0 AND 19.20 < BMI < 46.150 R24: 0.0 < BloodPressure < 108.0 R25: 0.0 < Pregnancies < 1.0 AND 62.0 < BloodPressure < 84.0 AND 60.0 < Insulin < 265.0 R26: 2.0 < Pregnancies < 7.0 AND 70.0 < BloodPressure < 88.0 AND 56.0 < Insulin < 160.0 AND 0.078 < DiabetesPedigreeFunction < 0.69 R27: 0.0 < BloodPressure < 90.0 AND 0.0 < Insulin < 220.0 AND 0.1405 < DiabetesPedigreeFunction < 1.1855 R28: 0.0 < Pregnancies < 3.0 AND 52.0 < BloodPressure < 106.0 AND 14.0 < Insulin < 540.0 AND 0.094 < DiabetesPedigreeFunction < 2.42 R29: 1.0 < Pregnancies < 17.0 AND 64.0 < BloodPressure < 108.0 AND 0.098 < DiabetesPedigreeFunction < 2.42 R30: 0.0 < Pregnancies < 2.0 AND 65.0 < Insulin < 291.0 R31: 0.0 < Pregnancies < 10.0 AND 89.0 < Glucose < 169.0 AND 56.0 < Insulin < 846.0 R32: 1.0 < Pregnancies < 17.0 AND 75.0 < Glucose < 199.0 AND 0.0 < Insulin < 0.0 R33: 0.0 < Pregnancies < 6.0 AND 56.0 < Glucose < 187.0 R34: 0.0 < Glucose < 195.0 R35: 23.25 < BMI < 42.5 AND 0.078 < DiabetesPedigreeFunction < 1.09AND 21.0 < Age < 58.0 R36: 19.45 < BMI < 49.65AND 0.2355 < DiabetesPedigreeFunction < 1.31 R37: 0.10 < DiabetesPedigreeFunction < 2.42 AND 22.0 < Age < 81.0 |
0.25 0.42 0.07 0.06 0.14 0.05 0.32 0.61 0.36 0.07 0.10 0.14 0.51 0.15 0.84 0.06 0.08 0.31 0.55 0.06 0.07 0.84 0.11 0.04 0.12 0.07 0.64 0.06 0.08 0.24 0.18 0.38 0.16 0.04 0.77 0.16 0.07 |
0.76 0.66 0.84 0.71 0.54 0.43 0.71 0.68 0.62 0.86 0.79 0.70 0.63 0.57 0.66 0.78 0.57 0.75 0.63 0.67 0.56 0.66 0.66 0.54 0.85 0.81 0.64 0.78 0.57 0.78 0.66 0.63 0.61 0.48 0.67 0.67 0.49 |
5 6 5 4 3 2 4 2 1 3 4 4 4 3 2 3 3 4 2 3 1 2 2 1 3 4 3 4 3 2 3 3 2 1 3 2 2 |
0.22 0.42 0.06 0.06 0.16 0.07 0.30 0.58 0.37 0.05 0.09 0.13 0.53 0.17 0.83 0.05 0.09 0.27 0.56 0.06 0.08 0.83 0.11 0.05 0.09 0.05 0.65 0.05 0.10 0.20 0.18 0.40 0.17 0.06 0.74 0.16 0.09 |
| Datasets | Nb of instances | Nb of attributes | Class labels | Class distribution |
| Wisconsin SEER ISPY1-clinica Mammographic-masses NKI dataset |
569 4024 168 961 272 |
32 12 18 6 1570 |
Malignant : 1 Benign: 0 Alive: 0 Dead: 1 No: not dead :0 Yes: dead: 1 1: malignant 0: benign 1: dead 0: alive |
359 210 3408 616 32 136 445 516 195 77 |
| Number of rules per class | ||||
|---|---|---|---|---|
| Before the Metarules | After the Metarules | |||
| 0 | 1 | |||
| Wisconsin SEER IYSP1-clinica Mammographic-masses NKI dataset |
36 28 5 14 38 |
45 72 12 12 54 |
19 15 5 9 12 |
34 44 11 9 33 |
| ACCURACY | PRECISION | RECALL | F1-SCORE | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RF | XG | LR | PRIM based classifier | RF | XGB | LR | PRIM based classifier | RF | XGB | LR | PRIM based classifier | RF | XGB | LR | PRIM based classifier | |
| Wisconsin SEER IYSP1-clinica Mammographic-masses NKI dataset |
97.6 94.4 98.7 95.8 98 |
95.4 97.5 96.3 98.6 97.5 |
94.1 88.7 89.8 92.4 85.6 |
96.8 98.4 95.3 97.2 95.6 |
98.2 95.6 98.9 97.7 97.9 |
98 97.6 95.2 96.3 94.5 |
94.8 88.1 86.5 92.4 93.3 |
95.6 97.1 94.6 96.3 96.7 |
93.9 98 97.2 97.5 97.8 |
96.7 97.2 95.4 98.7 96.7 |
85.4 86.7 89.9 93.2 94.1 |
94.2 95.6 94.8 95.8 97.1 |
96 | 97.3 | 89.9 | 94.9 96.3 94.7 96.1 96.9 |
| 96. 7 | 97.3 | 87.4 | ||||||||||||||
| 98 | 95.3 | 88.2 | ||||||||||||||
| 97. 6 | 97.4 | 92.8 | ||||||||||||||
| 97. 8 | 95.6 | 93.7 | ||||||||||||||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).