Submitted:
05 February 2026
Posted:
06 February 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Methods and Materials
3.1. Data Description
3.2. Preprocessing Steps
3.3. Fuzzy Feature Engineering
3.4. Proposed Hybrid Ensemble Model Architecture
3.5. Model Training, Validation, and Evaluation Metrics
3.6. Experimental Setup
3.6.1. Dataset Overview
3.7. Role of Base Learners in the Ensemble Framework
3.8. Performance Evaluation of the Proposed Model
3.9. Effect of Threshold Optimization
4. Discussion
| Model | Accuracy | F1-score | AUC | Key Strength | Clinical Implication |
|---|---|---|---|---|---|
| Random Forest | 0.816 | 0.812 | 0.873 | Strong discrimination | Reliable baseline model |
| XGBoost | 0.800 | 0.800 | 0.856 | Nonlinear boosting | Sensitive to hyperparameters |
| LightGBM | 0.784 | 0.783 | 0.835 | Computational efficiency | Moderate discrimination |
| Hard Voting | 0.816 | 0.814 | – | Stability | Threshold-independent |
| Soft Voting | 0.800 | 0.799 | 0.862 | Probabilistic fusion | Improved calibration |
| Stacking | 0.800 | 0.800 | 0.866 | Robust ranking | Better threshold robustness |
| Intelligent Weighted Ensemble | 0.816 | 0.814 | 0.862 | Balanced aggregation | Clinically robust decisions |
5. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
References
- Tanha, J.; Salarabadi, H.; Aznab, M.; Farahi, A.; Zoberi, M. Relationship among prognostic indices of breast cancer using classification techniques. Informatics in Medicine Unlocked 2020, 18, 100265.
- Schreurs, M.A.C.; Schmidt, M.K.; Hollestelle, A.; Schaapveld, M.; van Asperen, C.J.; Ausems, M.G.E.M.; van de Beek, I.; Broekema, M.F.; Collée, J.M.; van der Hout, A.H.; et al. Cancer risks for other sites in addition to breast in CHEK2 c.1100delC families. Genetics in Medicine 2024, 26, 101171.
- Di Grazia, G.; Conti, C.; Nucera, S.; Stella, S.; Massimino, M.; Giuliano, M.; Schettini, F.; Vigneri, P.; Martorana, F. Bridging the Gap: the role of MDM2 inhibition in overcoming treatment resistance in breast cancer. Critical Reviews in Oncology/Hematology 2025, p. 104834.
- Hlauschek, D.; Fesl, C.; Gnant, M. The impact of analysis methodology details on invasive breast cancer-free survival in randomized clinical trials. ESMO Open 2025, 10, 105324.
- Muñoz, J.; Nucera, S.; Garcia, N.R.; Cebrecos, I.; Oses, G.; Ganau, S.; Sanfeliu, E.; Jares, P.; Marín-Aguilera, M.; Galván, P.; et al. Paclitaxel-related type I Kounis Syndrome in a very young patient with HER2-positive breast cancer and the role of genomics to disentangle a complex therapeutic scenario: a case report and narrative review. The Breast 2025, p. 104465.
- Jin, Y.; Xue, J.; Li, X.; Zhong, X. Downregulation of PIP4K2C inhibits the breast cancer cell proliferation, migration and invasion. Translational Oncology 2025, 57, 102420.
- Borugadda, P.; Kalluri, H.K. A comprehensive analysis of artificial intelligence, machine learning, deep learning and computer vision in food science. Journal of Future Foods 2025.
- Salarabadi, H.; Iraji, M.S.; Salimi, M.; Zoberi, M. Improved COVID-19 diagnosis using a hybrid transfer learning model with fuzzy edge detection on CT scan images. Advances in Fuzzy Systems 2024, 2024, 3249929.
- Rahman, M.A.; Bhuiyan, T.; Ali, M.A. Enhancing aviation safety: Machine learning for real-time ADS-B injection detection through advanced data analysis. Alexandria Engineering Journal 2025, 126, 262–276.
- Zhao, Y.; Zhang, W.; Liu, X. Grid search with a weighted error function: Hyper-parameter optimization for financial time series forecasting. Applied Soft Computing 2024, 154, 111362.
- Wang, H.; Zhang, J.; Li, Y.; Wang, D.; Zhang, T.; Yang, F.; Zhang, Y.; Yang, L.; Li, P. Deep-learning features based on F18 fluorodeoxyglucose positron emission tomography/computed tomography (18F-FDG PET/CT) to predict preoperative colorectal cancer lymph node metastasis. Clinical Radiology 2024, 79, e1152–e1158.
- Chen, L.; Su, Y.; Wang, Y.; Yu, Y.; Huang, D.; Zhang, M.; Chen, X.; Ye, X.; He, Y.; Xue, E.; et al. Ultrasound radiomics-based machine learning and SHapley Additive exPlanations method predicting pathological prognostic stage in breast cancer: a bicentric and validation study. Clinical Breast Cancer 2025.
- Song, Y.; Wang, C.; Zhou, Y.; Sun, Q.; Lin, Y. A multi-omics-based prognostic model for elderly breast cancer by machine learning: insights from hypoxia and immunity of tumor microenvironment. Clinical Breast Cancer 2025.
- Ahmadian, M.; Bodalal, Z.; Adib, M.; Mohammadi Ziabari, S.S.; Bos, P.; Martens, R.M.; Agrotis, G.; Vens, C.; Karssemakers, L.; Al-Mamgani, A.; et al. Explainable feature selection combining particle swarm optimisation with adaptive LASSO for MRI radiogenomics: Predicting HPV status in oropharyngeal cancer. Computer Methods and Programs in Biomedicine 2026, 275, 109204. [CrossRef]
- Lin, N.; Abbas-Aghababazadeh, F.; Su, J.; Wu, A.J.; Lin, C.; Shi, W.; Xu, W.; Haibe-Kains, B.; Liu, F.F.; Kwan, J.Y.Y. Development of machine learning models for predicting radiation dermatitis in breast cancer patients using clinical risk factors, patient-reported outcomes, and serum cytokine biomarkers. Clinical Breast Cancer 2025.
- Miglietta, F.; Collesei, A.; Vernieri, C.; Giarratano, T.; Giorgi, C.A.; Girardi, F.; Griguolo, G.; Cacciatore, M.; Botticelli, A.; Vingiani, A.; et al. Development of two machine learning models to predict conversion from primary HER2-0 breast cancer to HER2-low metastases: a proof-of-concept study. ESMO Open 2025, 10, 104087.
- Zhang, X.; Li, P.; Ji, L.; Zhang, Y.; Zhang, Z.; Guo, Y.; Zhang, L.; Jing, S.; Dong, Z.; Tian, J.; et al. A machine learning-based prognostic signature utilizing MSC proteomics for predicting bladder cancer prognosis and treatment response. Translational Oncology 2025, 54, 102349.
- Gao, F.; Huang, Y.; Yang, M.; He, L.; Yu, Q.; Cai, Y.; Shen, J.; Lu, B. Machine learning-based cell death marker for predicting prognosis and identifying tumor immune microenvironment in prostate cancer. Heliyon 2024, 10.
- Maimaitiyiming, A.; An, H.; Xing, C.; Li, X.; Li, Z.; Bai, J.; Luo, C.; Zhuo, T.; Huang, X.; Maimaiti, A.; et al. Machine learning-driven mast cell gene signatures for prognostic and therapeutic prediction in prostate cancer. Heliyon 2024, 10.
- Liu, L.; Zhang, Y.; Zhao, X.h.; Zhu, M.; Liang, J.w.; Jiang, Z.y.; Zhang, J.q.; Ma, J.l.; Wang, Z.Z.; Gan, M.f.; et al. Predicting clinical prognosis in gastric cancer using deep learning-based analysis of tissue pathomics images. Computer Methods and Programs in Biomedicine 2025, p. 108895.
- Wu, X.; Chen, M.; Liu, K.; Wu, Y.; Feng, Y.; Fu, S.; Xu, H.; Zhao, Y.; Lin, F.; Lin, L.; et al. Molecular classification of geriatric breast cancer displays distinct senescent subgroups of prognostic significance. Molecular Therapy Nucleic Acids 2024, 35.
- Emily, M.; Meidioktaviana, F.; Nabiilah, G.Z.; Moniaga, J.V. Comparative analysis of machine learning and survival analysis for breast cancer prediction. Procedia Computer Science 2024, 245, 759–767.
- Lin, B.Y.; Mohammadi Ziabari, S.S.; Nasser Al Husaini, Y.; Mansoor Alsahag, A.M. SG-MuRCL: Smoothed Graph-Enhanced Multi-Instance Contrastive Learning for Robust Whole-Slide Image Classification. Information 2026, 17, 37. [CrossRef]
- Göransson, S.; Olofsson, H.; Johansson, H.J.; Yan, F.; Vogiatzakis, C.; Liang, S.; Bellato, H.M.; Masvidal, L.; Aksoylu, I.; Hartman, J.; et al. Mechanical control of breast cancer malignancy by promotion of mevalonate pathway enzyme synthesis. Matrix Biology 2025.
- Zhang, J.; Liu, R.; Gao, S.; Chen, W.; Han, X.; Wang, Z.; Zhou, H.; Wang, Y.; Chen, J.; Ma, Y.; et al. A phase I dose-escalation and dose-expansion study of tibremciclib, a novel CDK4/6 inhibitor, monotherapy and in combination with fulvestrant in HR-positive/HER2-negative advanced breast cancer. ESMO Open 2025, 10, 105121.
- Tsutsumi, C.; Ohuchida, K.; Tsutsumi, H.; Shimada, Y.; Yamada, Y.; Son, K.; Hayashida, S.; Katayama, N.; Mochida, Y.; Iwamoto, C.; et al. TIM3 on natural killer cells regulates antibody-dependent cellular cytotoxicity in HER2-positive gastric cancer. Cancer Letters 2025, 611, 217412.
- Molinelli, C.; Bruzzone, M.; Blondeaux, E.; Ruelle, T.; Lanzavecchia, C.; De Laurentiis, M.; Russo, S.; Riccardi, F.; Sini, V.; Cognetti, F.; et al. The journey of patients affected by metastatic hormone receptor-positive/HER2-negative breast cancer from CDK4/6 inhibitors to second-line treatment: a real-world analysis of 701 patients enrolled in the GIM14/BIOMETA study. European Journal of Cancer 2024, 213, 115113.
- Zhang, Q.; Guo, D.; Xu, G.; Xie, R.; Deng, Y.; Fu, P.; Wan, J. Tumor-associated macrophages suppress estrogen receptor-β expression in triple-negative breast cancer through the PI3K/AKT pathway. Experimental and Therapeutic Medicine 2026, 31, 1–10.
- Ye, G.; Zhang, C.; Zhuang, Y.; Liu, H.; Song, E.; Li, K.; Liao, Y. An advanced nomogram model using deep learning radiomics and clinical data for predicting occult lymph node metastasis in lung adenocarcinoma. Translational Oncology 2024, 44, 101922.
- Bressane, A.; Toda, D.H.R.; Negri, R.G.; Formiga, J.K.S.; Bankole, A.O.; Bankole, A.R.; Sharifi, S.; Moruzzi, R. Physics-informed feature engineering with fuzzy symbolic regression for predicting settling velocity in water treatment. Journal of Water Process Engineering 2025, 78, 108749.
- Siqueira, V.S.d.M.; Cuadros, M.A.S.L.; Munaro, C.J.; de Almeida, G.M. Expert system for early sign stuck pipe detection: Feature engineering and fuzzy logic approach. Engineering Applications of Artificial Intelligence 2024, 127, 107229.
- Liu, J.; Wu, J.; Pan, J.; Zhao, P. Fuzzification-back propagation neural network-based model prediction for robotic arm positioning error reduction. Engineering Applications of Artificial Intelligence 2025, 151, 110639.
- Wang, J.; Feng, X.; Yu, Y.; Wang, X.; Werghi, N.; Han, X.; Zhou, H.; Shi, K.; Zhong, S.; Cai, J. Fuzzy actor–critic learning-based interpretable control and stability-informed guarantee with error mapping for discrete-time nonlinear system. Chaos, Solitons & Fractals 2025, 199, 116878.
- Li, P.; Cheng, G.; Zhao, Y.; Liu, L.; Chen, M.; Ma, Z.; Chen, X. An interpretable reasoning schema for fuzzy knowledge graph under interval type-2 fuzzy model. Applied Soft Computing 2025, p. 113534.
- Ni, C.; Huang, H.; Cui, P.; Ke, Q.; Tan, S.; Ooi, K.T.; Liu, Z. Light Gradient Boosting Machine (LightGBM) to forecasting data and assisting the defrosting strategy design of refrigerators. International Journal of Refrigeration 2024, 160, 182–196.
- Makwana, S.; Vaghela, D.; Chan, C.K.; Naik, N. Novel hybrid approach of random forest and stacking ensemble to improve fresh water yield prediction in mobile wick solar still. Desalination and Water Treatment 2025, p. 101490.
- Koohmishi, M.; Connolly, D.P. Railway ballast fouling detection using thermal imaging: Integration of LSTM and XGBoost. Transportation Geotechnics 2025, p. 101889.
- Gao, J.; Zhang, J.; Gong, Y.; Yang, K.; Quan, W.; Li, L.; Wu, Y.; Wang, R.; Cheng, H. A robust soil quality index for mountain ecosystems driven by XGBoost and network analysis. Geoderma 2025, 463, 117558.
- Hendiani, S.; Walther, G. Double-layer multi-criteria group decision-making approach using neutralized possibility degree-based decision matrix with fuzzy information. Engineering Applications of Artificial Intelligence 2024, 133, 108276.





| Model | Accuracy | F1-score | AUC |
|---|---|---|---|
| Random Forest | 0.816 | 0.812 | 0.873 |
| XGBoost | 0.800 | 0.800 | 0.856 |
| LightGBM | 0.784 | 0.783 | 0.835 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).