Submitted:
02 June 2026
Posted:
03 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1. Literature Review
2.1. Machine Learning Applied to Student Stress and Mental Health
2.2. Ensemble Models for Predicting Stress and Psychological Well-Being
2.3. Metrics for Evaluation, Validation, and Probabilistic Calibration
2.4. Explainability and Interpretation of Risk Factors Using XAI
2.5. Research Gap and Contribution of this Study
3. Methodology
3.1. Analytical Design
3.2. Data Source
3.3. Definition of the Outcome and Initial Debugging
3.4. Variable Typing and Preprocessing
3.5. Data Partitioning and Imbalance Control
3.6. Algorithms and Hyperparameter Tuning
3.7. Nested Validation and Model Selection Criteria
3.8. Statistical Comparison, Final Evaluation, and Calibration
3.9. Considerations Regarding Rigor and Reproducibility
3.10. Complementary Explainability Strategy
4. Results
4.1. Comparative Performance in Nested Cross-Validation
4.2. Formal Model Selection and Statistical Comparison
4.3. Final Evaluation on the Holdout Set
4.4. Probabilistic Quality and Interpretability
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Douwes, R.; Metselaar, J.; Pijnenborg, G.H.M.; Boonstra, N. Well-Being of Students in Higher Education: The Importance of a Student Perspective. Cogent Educ 2023, 10, 2190697. [CrossRef]
- Arif, N.M.N.A.; Roslan, N.S.; Ismail, S.B.; Nayak, R.D.; Jamian, M.R.; Mohamad Ali Roshidi, A.S.; Edward, T.C.; Kamal, M.A.; Mohd Amin, M.M.; Shaari, S.; et al. Prevalence and Associated Factors of Psychological Distress and Burnout among Medical Students: Findings from Two Campuses. Int J Environ Res Public Health 2021, 18, 8446. [CrossRef]
- March-Amengual, J.-M.; Cambra Badii, I.; Casas-Baroy, J.-C.; Altarriba, C.; Comella Company, A.; Pujol-Farriols, R.; Baños, J.-E.; Galbany-Estragués, P.; Comella Cayuela, A. Psychological Distress, Burnout, and Academic Performance in First Year College Students. Int J Environ Res Public Health 2022, 19, 3356. [CrossRef]
- Pascoe, M.C.; Hetrick, S.E.; Parker, A.G. The Impact of Stress on Students in Secondary School and Higher Education. Int J Adolesc Youth 2020, 25, 104–112. [CrossRef]
- Smith, M.D.; Wesselbaum, D. Global Evidence on the Prevalence of and Risk Factors Associated with Stress. J Affect Disord 2025, 374, 179–183. [CrossRef]
- Li, H. Multicultural Data Assistance Mining Analysis for Ideological and Political Education in Smart Education Platforms Using Artificial Intelligence. Wireless Netw 2025, 31, 567–581. [CrossRef]
- Gustavson, K.; Knudsen, A.K.; Nesvåg, R.; Knudsen, G.P.; Vollset, S.E.; Reichborn-Kjennerud, T. Prevalence and Stability of Mental Disorders among Young Adults: Findings from a Longitudinal Study. BMC Psychiatry 2018, 18, 65. [CrossRef]
- Tan, W.; Chen, L.; Zhang, Y.; Xi, J.; Hao, Y.; Jia, F.; Hall, B.J.; Gu, J.; Wang, S.; Lin, H.; et al. Regional Years of Life Lost, Years Lived with Disability, and Disability-Adjusted Life-Years for Severe Mental Disorders in Guangdong Province, China: A Real-World Longitudinal Study. Glob Health Res Policy 2022, 7, 17. [CrossRef]
- Ullah, H.; Arbab, S.; Liu, C.; Du, Q.; Khan, S.A.; Khan, S.; Dad, O.; Tian, Y.; Li, K. Source of Stress-Associated Factors Among Medical and Nursing Students: A Cross-Sectional Study. J Nurs Manag 2025, 2025, 9928649. [CrossRef]
- Conley, C.S.; Durlak, J.A.; Dickson, D.A. An Evaluative Review of Outcome Research on Universal Mental Health Promotion and Prevention Programs for Higher Education Students. J Am Coll Health 2013, 61, 286–301. [CrossRef]
- Ahorsu, D.K.; Sánchez Vidaña, D.I.; Lipardo, D.; Shah, P.B.; Cruz González, P.; Shende, S.; Gurung, S.; Venkatesan, H.; Duongthipthewa, A.; Ansari, T.Q.; et al. Effect of a Peer-led Intervention Combining Mental Health Promotion with Coping-strategy-based Workshops on Mental Health Awareness, Help-seeking Behavior, and Wellbeing among University Students in Hong Kong. Int J Ment Health Syst 2021, 15, 6. [CrossRef]
- Roshan, R.; Hamid, S.; Kumar, R.; Hamdani, U.; Naqvi, S.; Zill-e-Huma; Adeel, U. Utilizing the CFIR Framework for Mapping the Facilitators and Barriers of Implementing Teachers Led School Mental Health Programs – A Scoping Review. Soc Psychiatry Psychiatr Epidemiol 2025, 60, 535–548. [CrossRef]
- Zhang, Z.; Chen, H.; Ye, Y.; Chen, H.; Guo, H.; Zhou, J. Entropy-Based Risk Network Identification in Adolescent Self-Injurious Behavior Using Machine Learning and Network Analysis. Transl Psychiatry 2025, 15, 299. [CrossRef]
- Zhang, Z. Early Warning Model of Adolescent Mental Health Based on Big Data and Machine Learning. Soft Comput 2024, 28, 811–828. [CrossRef]
- Mittal, S.; Mahendra, S.; Sanap, V.; Churi, P. How Can Machine Learning Be Used in Stress Management: A Systematic Literature Review of Applications in Workplaces and Education. Int J Inf Manag Data Insights 2022, 2, 100110. [CrossRef]
- Geronimo, S.M.; Hernandez, A.A.; Abisado, M.B. Academic Stress of Students in Higher Education Using Machine Learning: A Systematic Literature Review. In Proceedings of the 2023 IEEE 13th International Conference on System Engineering and Technology (ICSET); October 2023; pp. 141–146.
- Arias, E.M.; Parraga-Alava, J.; Montenegro, D.Z. Stress Detection among Higher Education Students: A Comprehensive Systematic Review of Machine Learning Approaches. In Proceedings of the 2024 Tenth International Conference on eDemocracy & eGovernment (ICEDEG); June 2024; pp. 1–8.
- Hasan, M.E.; Arif, M.; Rakibul Hasan, S.M.; Muwanguzi, M.; Abaatyo, J.; Kaggwa, M.M.; ALmerab, M.M.; Atroszko, P.A.; Muhit, M.; Al-Mamun, F.; et al. Prevalence, Associated Factors, and Machine Learning-Based Prediction of Depression, Anxiety, and Stress among University Students: A Cross-Sectional Study from Bangladesh. J Health Popul Nutr 2025, 44, 361. [CrossRef]
- Wang, Y.; Lu, S.; Harter, D. Towards Collaborative and Intelligent Learning Environments Based on Eye Tracking Data and Learning Analytics: A Survey. IEEE Access 2021, 9, 137991–138002. [CrossRef]
- Barreto, A.; Zhai, J.; Adjouadi, M. Non-Intrusive Physiological Monitoring for Automated Stress Detection in Human-Computer Interaction. In Proceedings of the Human–Computer Interaction; Lew, M., Sebe, N., Huang, T.S., Bakker, E.M., Eds.; Springer: Berlin, Heidelberg, 2007; pp. 29–38.
- Broglia, E.; Barkham, M. Adopting the Principles and Practices of Learning Health Systems in Universities and Colleges: Recommendations for Delivering Actionable Data to Improve Student Mental Health. Cogent Mental Health 2024, 3, 2301339. [CrossRef]
- Umematsu, T.; Sano, A.; Taylor, S.; Picard, R.W. Improving Students’ Daily Life Stress Forecasting Using LSTM Neural Networks. In Proceedings of the 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI); May 2019; pp. 1–4.
- Ding, Y.; Chen, X.; Fu, Q.; Zhong, S. A Depression Recognition Method for College Students Using Deep Integrated Support Vector Algorithm. IEEE Access 2020, 8, 75616–75629. [CrossRef]
- Skrbinjek, V.; Dermol, V. Predicting Students’ Satisfaction Using a Decision Tree. Tert Educ Manag 2019, 25, 101–113. [CrossRef]
- Leevy, J.L.; Khoshgoftaar, T.M.; Bauder, R.A.; Seliya, N. A Survey on Addressing High-Class Imbalance in Big Data. J Big Data 2018, 5, 42. [CrossRef]
- Hasanin, T.; Khoshgoftaar, T.M.; Leevy, J.L.; Bauder, R.A. Severely Imbalanced Big Data Challenges: Investigating Data Sampling Approaches. J Big Data 2019, 6, 107. [CrossRef]
- Li, Z.; Kamnitsas, K.; Glocker, B. Overfitting of Neural Nets Under Class Imbalance: Analysis and Improvements for Segmentation. In Proceedings of the Medical Image Computing and Computer Assisted Intervention – MICCAI 2019; Shen, D., Liu, T., Peters, T.M., Staib, L.H., Essert, C., Zhou, S., Yap, P.-T., Khan, A., Eds.; Springer International Publishing: Cham, 2019; pp. 402–410.
- Carrington, A.M.; Manuel, D.G.; Fieguth, P.W.; Ramsay, T.; Osmani, V.; Wernly, B.; Bennett, C.; Hawken, S.; Magwood, O.; Sheikh, Y.; et al. Deep ROC Analysis and AUC as Balanced Average Accuracy, for Improved Classifier Selection, Audit and Explanation. IEEE Trans Pattern Anal Mach Intell 2023, 45, 329–341. [CrossRef]
- Chicco, D.; Warrens, M.J.; Jurman, G. The Matthews Correlation Coefficient (MCC) Is More Informative Than Cohen’s Kappa and Brier Score in Binary Classification Assessment. IEEE Access 2021, 9, 78368–78381. [CrossRef]
- Martinović, M.; Dokic, K.; Pudić, D. Comparative Analysis of Machine Learning Models for Predicting Innovation Outcomes: An Applied AI Approach. Appl Sci 2025, 15, 3636. [CrossRef]
- Abu-Shaira, M.; Shi, W. Unveiling Statistical Significance of Online Regression Over Multiple Datasets. In Proceedings of the 2024 IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR); August 2024; pp. 274–279.
- Carvalho, D.V.; Pereira, E.M.; Cardoso, J.S. Machine Learning Interpretability: A Survey on Methods and Metrics. Electronics 2019, 8, 832. [CrossRef]
- Anand, R.V.; Md, A.Q.; Urooj, S.; Mohan, S.; Alawad, M.A.; C., A. Enhancing Diagnostic Decision-Making: Ensemble Learning Techniques for Reliable Stress Level Classification. Diagnostics 2023, 13, 3455. [CrossRef]
- Saklani, S.; Manchanda, M.; Bisht, G.; Thapa, K. Comparing Random Forest and XGBoost for Sentiment Classification of Student Social Media Posts: A Case Study on Pre-Board Exam Stress. In Proceedings of the 2025 Global Conference in Emerging Technology (GINOTECH); May 2025; pp. 1–6.
- Nnadi, L.C.; Watanobe, Y.; Rahman, M.M.; John-Otumu, A.M. Prediction of Students’ Adaptability Using Explainable AI in Educational Machine Learning Models. Appl Sci 2024, 14, 5141. [CrossRef]
- Islam, M.T.; Ashraf, K.; Hosen, Md.H.; Nawar, S.; Asgar, S. Predictive Modeling of Anxiety Levels in Bangladeshi University Students: A Voting-Based Approach with LIME and SHAP Explanations. In Proceedings of the 2024 International Conference on Advances in Computing, Communication, Electrical, and Smart Systems (iCACCESS); March 2024; pp. 01–06.
- Hooshyar, D.; Yang, Y. Problems With SHAP and LIME in Interpretable AI for Education: A Comparative Study of Post-Hoc Explanations and Neural-Symbolic Rule Extraction. IEEE Access 2024, 12, 137472–137490. [CrossRef]
- A. Marouf, A.; F. Ashrafi, A.; Ahmed, T.; Emon, T. A Machine Learning Based Approach for Mapping Personality Traits and Perceived Stress Scale of Undergraduate Students. IJMECS 2019, 11, 42–47. [CrossRef]
- A. Flesia, L.; Monaro, M.; Mazza, C.; Fietta, V.; Colicino, E.; Segatto, B.; Roma, P. Predicting Perceived Stress Related to the Covid-19 Outbreak through Stable Psychological Traits and Machine Learning Models. JCM 2020, 9, 3350. [CrossRef]
- Naghavi, A.; Teismann, T.; Asgari, Z.; Mohebbian, M.R.; Mansourian, M.; Mañanas, M.Á. Accurate Diagnosis of Suicide Ideation/Behavior Using Robust Ensemble Machine Learning: A University Student Population in the Middle East and North Africa (MENA) Region. Diagnostics 2020, 10, 956. [CrossRef]
- Liu, Y.; Xie, Y.-N.; Li, W.-G.; He, X.; He, H.-G.; Chen, L.-B.; Shen, Q. A Machine Learning-Based Risk Prediction Model for Post-Traumatic Stress Disorder during the COVID-19 Pandemic. Medicina 2022, 58, 1704. [CrossRef]
- Daza, A.; Saboya, N.; Necochea-Chamorro, J.I.; Zavaleta Ramos, K.; Vásquez Valencia, Y.D.R. Systematic Review of Machine Learning Techniques to Predict Anxiety and Stress in College Students. Inform Med Unlocked 2023, 43, 101391. [CrossRef]
- Tariq, R.; Orozco-del-Castillo, M.G.; Zamir, M.T.; Ramírez-Montoya, M.S.; Wilberforce, T. Explainable Artificial Intelligence for Predictive Modeling of Student Stress in Higher Education. Sci Rep 2025, 15, 38375. [CrossRef]
- Pujadas, E.R.; Díaz-Caneja, C.M.; Stevanovic, D.; Quintero, M.F.; Martín-Isla, C.; Hernández-González, J.; Atehortúa, A.; Lazrak, N.; Pries, L.; Delespaul, P.; et al. Longitudinal Prediction of Mental Health Outcomes in Vulnerable Youth Using Machine Learning. Cogn Comput 2025, 17, 152. [CrossRef]
- Pereira, M.G.; Santos, M.; Magalhães, R.; Rodrigues, C.; Araújo, O.; Durães, D. Burnout Risk Profiles in Psychology Students: An Exploratory Study with Machine Learning. Behav Sci 2025, 15, 505. [CrossRef]
- Younis, E.M.G.; Zaki, S.M.; Kanjo, E.; Houssein, E.H. Evaluating Ensemble Learning Methods for Multi-Modal Emotion Recognition Using Sensor Data Fusion. Sensors 2022, 22, 5611. [CrossRef]
- Almadhor, A.; Sampedro, G.A.; Abisado, M.; Abbas, S. Efficient Feature-Selection-Based Stacking Model for Stress Detection Based on Chest Electrodermal Activity. Sensors 2023, 23, 6664. [CrossRef]
- Hadhri, S.; Hadiji, M.; Labidi, W. A Voting Ensemble Classifier for Stress Detection. J Inf Telecommun 2024, 8, 399–416. [CrossRef]
- Campanella, S.; Altaleb, A.; Belli, A.; Pierleoni, P.; Palma, L. A Method for Stress Detection Using Empatica E4 Bracelet and Machine-Learning Techniques. Sensors 2023, 23, 3565. [CrossRef]
- Haghish, E.F.; Nes, R.B.; Obaidi, M.; Qin, P.; Stänicke, L.I.; Bekkhus, M.; Laeng, B.; Czajkowski, N. Unveiling Adolescent Suicidality: Holistic Analysis of Protective and Risk Factors Using Multiple Machine Learning Algorithms. J Youth Adolescence 2024, 53, 507–525. [CrossRef]
- Mahajan, P.; Uddin, S.; Hajati, F.; Moni, M.A. Ensemble Learning for Disease Prediction: A Review. Healthcare 2023, 11, 1808. [CrossRef]
- Shanto, S.S.; Jony, A.I. Interpretable Ensemble Learning Approach for Predicting Student Adaptability in Online Education Environments. Knowledge 2025, 5, 10. [CrossRef]
- Geng, S.; Wang, J.; Xia, Y.; Niu, B.; Deng, X.; Wu, X. Predicting Generalized Anxiety Disorder among Chinese Depressed Adolescents: An Explainable Machine Learning Approach. BMC Med Inform Decis Mak 2025, 25, 406. [CrossRef]
- Liu, C.; Yu, S. Students’ Stress Prediction and Explainable Analysis Based on Improved Decision Trees. Front Psychol 2026, 16. [CrossRef]
- Rois, R.; Ray, M.; Rahman, A.; Roy, S.K. Prevalence and Predicting Factors of Perceived Stress among Bangladeshi University Students Using Machine Learning Algorithms. J Health Popul Nutr 2021, 40, 50. [CrossRef]
- Chen, Q.; Lee, B.G. Deep Learning Models for Stress Analysis in University Students: A Sudoku-Based Study. Sensors 2023, 23, 6099. [CrossRef]
- Villar, A.; De Andrade, C.R.V. Supervised Machine Learning Algorithms for Predicting Student Dropout and Academic Success: A Comparative Study. Discov Artif Intell 2024, 4, 2. [CrossRef]
- Tang, B.; Li, S.; Zhao, C. Predicting the Performance of Students Using Deep Ensemble Learning. J Intell 2024, 12, 124. [CrossRef]
- De Filippis, R.; Foysal, A.A. Comprehensive Analysis of Stress Factors Affecting Students: A Machine Learning Approach. Discov Artif Intell 2024, 4, 62. [CrossRef]
- Abdul Rahman, H.; Kwicklis, M.; Ottom, M.; Amornsriwatanakul, A.; H. Abdul-Mumin, K.; Rosenberg, M.; Dinov, I.D. Machine Learning-Based Prediction of Mental Well-Being Using Health Behavior Data from University Students. Bioengineering 2023, 10, 575. [CrossRef]
- Xia, Z.; Lee, C.E.; Chen, C.-H.; Kuo, J.-Y.; Lim, K.Y.H. Mental States and Cognitive Performance Monitoring for User-Centered e-Learning System: A Case Study. In Advances in Transdisciplinary Engineering; Moser, B.R., Koomsap, P., Stjepandić, J., Eds.; IOS Press, 2022 ISBN 978-1-64368-338-6.



| Model | Accuracy (%) | Balanced accuracy (%) | F1-weighted (%) | MCC | ROC-AUC weighted (%) | Brier score | Total time (s) |
|---|---|---|---|---|---|---|---|
| Gradient Boosting | 89.55 ± 3.16 | 89.52 ± 3.21 | 89.54 ± 3.17 | 0.845 ± 0.047 | 98.59 ± 0.92 | 0.1594 ± 0.0485 | 615.02 |
| XGBoost | 89.43 ± 3.83 | 89.41 ± 3.84 | 89.43 ± 3.83 | 0.842 ± 0.057 | 98.57 ± 0.99 | 0.1542 ± 0.0622 | 94.47 |
| LightGBM | 89.32 ± 4.09 | 89.29 ± 4.11 | 89.27 ± 4.09 | 0.841 ± 0.061 | 98.51 ± 1.05 | 0.1492 ± 0.0536 | 386.71 |
| Voting | 88.64 ± 4.67 | 88.61 ± 4.69 | 88.63 ± 4.69 | 0.831 ± 0.070 | 98.43 ± 1.09 | 0.1316 ± 0.0418 | 122.25 |
| AdaBoost | 88.41 ± 4.07 | 88.40 ± 4.10 | 88.43 ± 4.08 | 0.831 ± 0.061 | 98.05 ± 1.55 | 0.5433 ± 0.0189 | 257.11 |
| Random Forest | 88.41 ± 3.55 | 88.40 ± 3.58 | 88.40 ± 3.56 | 0.828 ± 0.052 | 98.37 ± 1.06 | 0.1248 ± 0.0357 | 161.25 |
| Stacking | 88.41 ± 4.44 | 88.37 ± 4.48 | 88.39 ± 4.46 | 0.828 ± 0.066 | 98.36 ± 1.13 | 0.1492 ± 0.0394 | 2451.79 |
| Bagging | 88.41 ± 3.74 | 88.37 ± 3.77 | 88.38 ± 3.73 | 0.827 ± 0.056 | 98.41 ± 1.06 | 0.1266 ± 0.0368 | 7.84 |
| Extra Trees | 87.39 ± 3.73 | 87.39 ± 3.73 | 87.41 ± 3.72 | 0.813 ± 0.056 | 98.43 ± 0.98 | 0.1240 ± 0.0355 | 135.51 |
| Rank | Model | F1-weighted average | MCC average | ROC-AUC weighted average |
Total time (s) |
|---|---|---|---|---|---|
| 1 | Gradient Boosting | 0.895403 | 0.845225 | 0.985948 | 615.02 |
| 2 | XGBoost | 0.894329 | 0.842304 | 0.985749 | 94.47 |
| 3 | LightGBM | 0.892704 | 0.841016 | 0.985059 | 386.71 |
| 4 | Voting | 0.886323 | 0.831309 | 0.984270 | 122.25 |
| 5 | AdaBoost | 0.884322 | 0.830861 | 0.980513 | 257.11 |
| Model | F1-weighted average | SD | Average rank |
|---|---|---|---|
| XGBoost | 0.894329 | 0.038301 | 3.35 |
| LightGBM | 0.892704 | 0.040937 | 4.05 |
| Gradient Boosting | 0.895403 | 0.031678 | 4.50 |
| Voting | 0.886323 | 0.046867 | 4.80 |
| AdaBoost | 0.884322 | 0.040831 | 5.00 |
| Bagging | 0.883807 | 0.037348 | 5.20 |
| Random Forest | 0.884032 | 0.035562 | 5.60 |
| Stacking | 0.883914 | 0.044634 | 5.65 |
| Extra Trees | 0.874056 | 0.037175 | 6.85 |
| Metrics | Value |
|---|---|
| Accuracy | 0.881818 |
| Balanced accuracy | 0.882132 |
| F1-weighted | 0.881792 |
| F1-macro | 0.881938 |
| Weighted accuracy | 0.883877 |
| Sensitivity/weighted recall | 0.881818 |
| Macro precision | 0.883842 |
| Sensitivity/recall (macro) | 0.882132 |
| ROC-AUC macro | 0.986089 |
| ROC-AUC weighted | 0.986078 |
| Cohen’s Kappa | 0.822746 |
| MCC | 0.823690 |
| Brier score | 0.137442 |
| Class | Precision | Sensitivity (recall) | F1-score | Support |
|---|---|---|---|---|
| 0 | 0.93 | 0.84 | 0.88 | 74 |
| 1 | 0.88 | 0.92 | 0.90 | 72 |
| 2 | 0.85 | 0.89 | 0.87 | 74 |
| Macro average | 0.88 | 0.88 | 0.88 | 220 |
| Weighted average | 0.88 | 0.88 | 0.88 | 220 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.s |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).