Submitted:
10 June 2025
Posted:
10 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methodology
2.1. Data Preprocessing and Feature Engineering
2.2. Feature Selection and Correlation Analysis
3. Experiments
3.1. Model Construction and Training
3.2. Model Performance Evaluation
3.3. SHAP-Based Interpretability Analysis
4. Conclusions
References
- R. B. Nozari, et al., “A novel behavior-based recommendation system for e-commerce,” arXiv preprint arXiv:2403.18536, 2024.
- V. Pleskach, et al., “An e-commerce recommendation systems based on analysis of consumer behavior models,” in Proc. IntSol, 2023.
- S. S. Li and E. Karahanna, “Online recommendation systems in a B2C e-commerce context: A review and future directions,” J. Assoc. Inf. Syst., vol. 16, no. 2, pp. 2, 2015. [CrossRef]
- S. Sameena, et al., “Personalized product recommendation system for e-commerce platforms,” in ITM Web Conf., vol. 74, EDP Sciences, 2025.
- J.-P. Cabrera-Sánchez, et al., “Online recommendation systems: Factors influencing use in e-commerce,” Sustainability, vol. 12, no. 21, pp. 8888, 2020. [CrossRef]
- Y. Wu and Y. Yusof, “Emerging trends in real-time recommendation systems: A deep dive into multi-behavior streaming processing and recommen- dation for e-commerce platforms,” J. Internet Serv. Inf. Secur., vol. 14, no. 4, pp. 45–66, 2024. [CrossRef]
- M. A. Raji, et al., “E-commerce and consumer behavior: A review of AI-powered personalization and market trends,” GSC Adv. Res. Rev., vol. 18, no. 3, pp. 66–77, 2024.
- J. B. Schafer, J. A. Konstan, and J. Riedl, “E-commerce recommendation applications,” Data Min. Knowl. Discov., vol. 5, pp. 115–153, 2001. [CrossRef]
- S. Somavarapu and V. Gupta, “Analyzing and visualizing user behavior in e-commerce: A machine learning approach,” unpublished, 2025.
- S. Iseal and M. Halli, “Customer behavior analysis and purchase prediction in e-commerce,” unpublished, 2025.
- P. Pande, A. K. Kulkarni, and V. Ramalingam, “Big data analytics in e-commerce driving business decisions through customer behavior insights,” in ITM Web Conf., vol. 76, EDP Sciences, 2025.
- Z. Liu, et al., “Development and validation of interpretable machine learning models for triage patients admitted to the intensive care unit,” PLoS One, vol. 20, no. 2, pp. e0317819, 2025. [CrossRef]
- Q. Sun, et al., “Machine learning-based assessment of diabetes risk,” Appl. Intell., vol. 55, no. 2, pp. 1–13, 2025. [CrossRef]
- H. Jiang, et al., “ReIU: An efficient preliminary framework for Alzheimer patients based on multi-model data,” Front. Public Health, vol. 12, pp. 1449798, 2025. [CrossRef]
- C. Xu, et al., “Research on the nonlinear relationship between carbon emissions from residential land and the built environment: A case study of Susong County, Anhui Province using the XGBoost-SHAP model,” Land, vol. 14, no. 3, pp. 440, 2025. [CrossRef]
- O. Folorunsho, et al., “Explainable ensemble deep learning model for predicting diabetic retinopathy based on APTOS 2019 eye pack dataset,” LAUTECH J. Eng. Technol., vol. 19, no. 1, pp. 1–14, 2025.
- C. Huang, et al., “Development and validation of an interpretable machine learning model for predicting left atrial thrombus or spontaneous echo contrast in non-valvular atrial fibrillation patients,” PLoS One, vol. 20, no. 1, pp. e0313562, 2025.
- M. M. Monowar, et al., “Advanced sleep disorder detection using multi-layered ensemble learning and advanced data balancing techniques,” Front. Artif. Intell., vol. 7, pp. 1506770, 2025. [CrossRef]
- Y. Cheng and K. Wang, “A high-precision machine learning-based prediction model for delayed graft function (DGF) in Chinese kidney transplant patients: A multicenter study,” unpublished, 2025.
- T. R. Noviandy, et al., “An explainable multi-model stacked classifier approach for predicting hepatitis C drug candidates,” Sci, vol. 6, no. 4, 2024. [CrossRef]







| Model | Accuracy | Precision | Recall | F1-Score | ROC-AUC |
| Decision Tree | 0.941 | 0.366 | 0.283 | 0.319 | 0.629 |
| Gradient Boosting | 0.951 | 0.688 | 0.207 | 0.318 | 0.916 |
| Logistic Regression | 0.956 | 0.625 | 0.189 | 0.290 | 0.926 |
| XGBoost | 0.946 | 0.615 | 0.150 | 0.242 | 0.910 |
| Random Forest | 0.955 | 0.750 | 0.113 | 0.197 | 0.895 |
| Naive Bayes | 0.393 | 0.067 | 0.906 | 0.125 | 0.685 |
| KNN | 0.952 | 0.667 | 0.038 | 0.071 | 0.689 |
| SVM | 0.952 | 0.000 | 0.000 | 0.000 | 0.881 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).