Submitted:
06 September 2024
Posted:
10 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Related Work
2. Methods
2.1. Dataset and Patient Population
2.2. Exploratory Analysis
2.3. Xgboost BME Approach
- Build a forest of trees using a standard Xgboost algorithm with as the training set responses in logit scale and as the corresponding training set of covariates, j=1, …, n. Since logits of are continuous and binary classification using Xgboost is considered, the values were converted back to binary labels using median as the threshold. Given the high class imbalance, with the outcome class (mortality) constituting fewer than 3% of data, employing the median as a threshold dynamically modifies the decision boundary to better detect rare positive instances. Since the Xgboost now models only the fixed effects component of the response, it was necessary to update the hyperparameters. Random stratified 3-fold Grid Search Cross Validation was applied using the training dataset with the same hyperparameter search criteria as that for the Xgboost NC model similar to previous studies.[1,3] A maximum of 30 combinations was imposed to allow for variability of parameters across iterations.
- Obtain an estimate of using the training data on Xgboost in logit scale.
- estimate using and as inputs into the Gauss Hermite Quadrature using an approach similar to Simchoni et al.,[12] where . The number of quadrature was set at 80, as determine through pilot experiments, satisfying k < 2m – 1, where k represents the degree of the polynomial for numerical integration and m is the parameter adjusting set here as the number of random effect levels.
- = - , i= 1, ..., n, where represents the fixed component of the response and is re-binarized to 0 and 1 using the median of .
2.4. Validation Approach
2.4.1. Xgboost BME and NC Variant Models
2.4.2. Performance by Sample Size
2.4.3. Visualization of Parameters
2.4.4. Baseline Models
3. Results
3.1. Exploratory Analysis

3.2. Model Validation: Comparison Using All Samples
3.2.1. Xgboost BME and NC Variant Models
| Model Category | ECE | AUC | Brier | F1 | Net Benefit | CEM | CEM lower 95% CI | CEM upper 95% CI |
|---|---|---|---|---|---|---|---|---|
| standardized Xgboost BME | 0.998 | 0.854 | 0.977 | 0.293 | 0.908 | 0.739 | 0.7391 | 0.7397 |
| unstandardized Xgboost BME | 0.997 | 0.854 | 0.977 | 0.294 | 0.908 | 0.740 | 0.7396 | 0.7402 |
| standardized Xgboost NC | 0.997 | 0.854 | 0.977 | 0.295 | 0.908 | 0.741 | 0.7405 | 0.7411 |
| unstandardized Xgboost NC | 0.997 | 0.854 | 0.977 | 0.293 | 0.908 | 0.740 | 0.7394 | 0.7400 |

3.2.2. Glmer and Glm Variant Models
| Model Category | ECE | AUC | Brier | F1 | Net Benefit | CEM | CEM lower 95% CI | CEM upper 95% CI |
|---|---|---|---|---|---|---|---|---|
| standardized glmer | 0.993 | 0.827 | 0.973 | 0.269 | 0.889 | 0.719 | 0.7182 | 0.7188 |
| unstandardized glmer | 0.993 | 0.827 | 0.973 | 0.269 | 0.889 | 0.718 | 0.7178 | 0.7184 |
| unstandardized glm | 0.994 | 0.826 | 0.973 | 0.270 | 0.889 | 0.719 | 0.7183 | 0.7189 |
| standardized glm | 0.994 | 0.826 | 0.973 | 0.269 | 0.889 | 0.718 | 0.7181 | 0.7187 |
3.3. Performance by Sample Size
3.3.1. Unstandardized Xgboost BME and Standardized Xgboost NC Models

3.3.2. Unstandardized Glm and Standardized Glmer Models


3.4. Visualization of Parameters



4. Discussion
4.1. Technical Perspective
4.2. Relevance to Clinical Practice
4.2.1. Cardiac Surgery Perspective
4.2.2. Cardiology Perspective
5. Future work and Limitations
6. Conclusion
Supplementary Section: Dataset
References
- S. S et al., ‘Comparison of Machine Learning Techniques in Prediction of Mortality following Cardiac Surgery: Analysis of over 220,000 patients from a Large National Database’, European journal of cardio-thoracic surgery : official journal of the European Association for Cardio-thoracic Surgery, Aug. 2023. [CrossRef]
- T. Dong et al., ‘Performance Drift in Machine Learning Models for Cardiac Surgery Risk Prediction: Retrospective Analysis’, JMIRx Med, vol. 5, no. 1, p. e45973, Jun. 2024. [CrossRef]
- T. Dong et al., ‘Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study’, DIGITAL HEALTH, vol. 9, p. 20552076231187605, Jan. 2023. [CrossRef]
- N. K. Kumar, G. S. Sindhu, D. K. Prashanthi, and A. S. Sulthana, ‘Analysis and Prediction of Cardio Vascular Disease using Machine Learning Classifiers’, in 2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS), Mar. 2020, pp. 15–21. [CrossRef]
- P. Tiwari, K. L. Colborn, D. E. Smith, F. Xing, D. Ghosh, and M. A. Rosenberg, ‘Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation’, JAMA Network Open, vol. 3, no. 1, pp. e1919396–e1919396, Jan. 2020. [CrossRef]
- Mehrtash, W. M. Wells, C. M. Tempany, P. Abolmaesumi, and T. Kapur, ‘Confidence Calibration and Predictive Uncertainty Estimation for Deep Medical Image Segmentation’, IEEE Transactions on Medical Imaging, vol. 39, no. 12, pp. 3868–3878, Dec. 2020. [CrossRef]
- Huang et al., ‘Performance Metrics for the Comparative Analysis of Clinical Risk Prediction Models Employing Machine Learning. [Miscellaneous Article]’, Circulation: Cardiovascular Quality & Outcomes, vol. 14, no. 10, Oct. 2021. [CrossRef]
- E. W. Steyerberg et al., ‘Assessing the Performance of Prediction Models: A Framework for Traditional and Novel Measures’, Epidemiology, vol. 21, no. 1, pp. 128–138, Jan. 2010. [CrossRef]
- J. Allyn et al., ‘A Comparison of a Machine Learning Model with EuroSCORE II in Predicting Mortality after Elective Cardiac Surgery: A Decision Curve Analysis’, PLOS ONE, vol. 12, no. 1, p. e0169772, Jan. 2017. [CrossRef]
- M. Gregorich, S. Strohmaier, D. Dunkler, and G. Heinze, ‘Regression with Highly Correlated Predictors: Variable Omission Is Not the Solution’, Int J Environ Res Public Health, vol. 18, no. 8, p. 4259, Apr. 2021. [CrossRef]
- S.-K. Ng and G. J. McLachlan, ‘Extension of mixture-of-experts networks for binary classification of hierarchical data’, Artificial Intelligence in Medicine, vol. 41, no. 1, pp. 57–67, Sep. 2007. [CrossRef]
- G. Simchoni and S. Rosset, ‘Integrating random effects in deep neural networks’, J. Mach. Learn. Res., vol. 24, no. 1, p. 156:7402-156:7458, Mar. 2024.
- Hajjem, F. Bellavance, and D. Larocque, ‘Mixed-effects random forest for clustered data’, Journal of Statistical Computation and Simulation, Jun. 2014, Accessed: Jun. 14, 2024. [Online]. Available: https://www.tandfonline.com/doi/abs/10.1080/00949655.2012.741599.
- T. Dong et al., ‘Random effects adjustment in machine learning models for cardiac surgery risk prediction: a benchmarking study’, Jun. 12, 2023, medRxiv. [CrossRef]
- T. Dong et al., ‘Deep recurrent reinforced learning model to compare the efficacy of targeted local versus national measures on the spread of COVID-19 in the UK’, BMJ Open, vol. 12, no. 2, p. e048279, Feb. 2022. [CrossRef]
- P. Kokol, M. Kokol, and S. Zagoranski, ‘Machine learning on small size samples: A synthetic knowledge synthesis’, Sci Prog, vol. 105, no. 1, p. 00368504211029777, Feb. 2022. [CrossRef]
- J. Marin, ‘Evaluating Synthetically Generated Data from Small Sample Sizes: An Experimental Study’, Jan. 21, 2023, arXiv: arXiv:2211.10760. [CrossRef]
- S. Lutakamale and Y. Z. Manyesela, ‘Machine Learning-Based Fingerprinting Positioning in Massive MIMO Networks: Analysis on the Impact of Small Training Sample Size to the Positioning Performance’, SN COMPUT. SCI., vol. 4, no. 3, p. 286, Mar. 2023. [CrossRef]
- G. Lu, B. Li, W. Yang, and J. Yin, ‘Unsupervised feature selection with graph learning via low-rank constraint’, Multimed Tools Appl, vol. 77, no. 22, pp. 29531–29549, Nov. 2018. [CrossRef]
- M. Baquedano et al., ‘Outcome monitoring and risk stratification after cardiac procedure in neonates, infants, children and young adults born with congenital heart disease: protocol for a multicentre prospective cohort study (Children OMACp)’, BMJ Open, vol. 13, no. 8, p. e071629, Aug. 2023. [CrossRef]
- H. Schmid, P. C. Stark, J. A. Berlin, P. Landais, and J. Lau, ‘Meta-regression detected associations between heterogeneous treatment effects and study-level, but not patient-level, factors’, Journal of Clinical Epidemiology, vol. 57, no. 7, pp. 683–697, Jul. 2004. [CrossRef]
- A. Cook, S.-Y. Oh, and M. V. Pusic, ‘Accuracy of Physicians’ Electrocardiogram Interpretations’, JAMA Intern Med, vol. 180, no. 11, pp. 1–11, Nov. 2020. [CrossRef]
- L. Pecchia, P. Melillo, M. Sansone, and M. Bracale, ‘Heart Rate Variability in healthy people compared with patients with Congestive Heart Failure’, in 2009 9th International Conference on Information Technology and Applications in Biomedicine, Nov. 2009, pp. 1–4. [CrossRef]
- L. Pecchia, P. Melillo, M. Sansone, and M. Bracale, ‘Discrimination power of short-term heart rate variability measures for CHF assessment’, IEEE Trans Inf Technol Biomed, vol. 15, no. 1, pp. 40–46, Jan. 2011. [CrossRef]
- P. Melillo, R. Fusco, M. Sansone, M. Bracale, and L. Pecchia, ‘Discrimination power of long-term heart rate variability measures for chronic heart failure detection’, Med Biol Eng Comput, vol. 49, no. 1, pp. 67–74, Jan. 2011. [CrossRef]
- M. Putnikovic, Z. Jordan, Z. Munn, C. Borg, and M. Ward, ‘Use of Electrocardiogram Monitoring in Adult Patients Taking High-Risk QT Interval Prolonging Medicines in Clinical Practice: Systematic Review and Meta-analysis’, Drug Saf, vol. 45, no. 10, pp. 1037–1048, Oct. 2022. [CrossRef]
- R. C. Brindle, A. T. Ginty, A. C. Phillips, and D. Carroll, ‘A tale of two mechanisms: a meta-analytic approach toward understanding the autonomic basis of cardiovascular reactivity to acute psychological stress’, Psychophysiology, vol. 51, no. 10, pp. 964–976, Oct. 2014. [CrossRef]
- N. Stoller et al., ‘Large regional variation in cardiac closure procedures to prevent ischemic stroke in Switzerland a population-based small area analysis’, PLoS One, vol. 19, no. 1, p. e0291299, Jan. 2024. [CrossRef]
- C. Schenker et al., ‘Regional variation and temporal trends in transcatheter and surgical aortic valve replacement in Switzerland: A population-based small area analysis’, PLoS One, vol. 19, no. 1, p. e0296055, Jan. 2024. [CrossRef]
- R. J. Sela and J. S. Simonoff, ‘RE-EM trees: a data mining approach for longitudinal and clustered data’, Mach Learn, vol. 86, no. 2, pp. 169–207, Feb. 2012. [CrossRef]
- B. E. Ankenman, A. I. Avilés, and J. C. Pinheiro, ‘Optimal Designs for Mixed-Effects Models with Two Random Nested Factors’, Statistica Sinica, vol. 13, no. 2, pp. 385–401, 2003.
- T. Snijders and R. Bosker, ‘Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling’. https://lst-iiep.iiep-unesco.org/cgi-bin/wwwi32.exe/[in=epidoc1.in]/?t2000=013777/(100), Jan. 1999.
- Bates, M. Mächler, B. Bolker, and S. Walker, ‘Fitting Linear Mixed-Effects Models Using lme4’, Journal of Statistical Software, vol. 67, pp. 1–48, Oct. 2015. [CrossRef]
- F. Zuur, E. N. Ieno, N. J. Walker, A. A. Saveliev, and G. M. Smith, ‘Mixed Effects Modelling for Nested Data’, in Mixed effects models and extensions in ecology with R, A. F. Zuur, E. N. Ieno, N. Walker, A. A. Saveliev, and G. M. Smith, Eds., New York, NY: Springer, 2009, pp. 101–142. [CrossRef]
- D. J. Bauer, D. M. McNeish, S. A. Baldwin, and P. J. Curran, ‘Analyzing nested data: Multilevel modeling and alternative approaches’, in The Cambridge handbook of research methods in clinical psychology, in Cambridge handbooks in psychology. , New York, NY, US: Cambridge University Press, 2020, pp. 426–443. [CrossRef]
- B. Fernández-Castilla, L. Jamshidi, L. Declercq, S. N. Beretvas, P. Onghena, and W. Van den Noortgate, ‘The application of meta-analytic (multi-level) models with multiple random effects: A systematic review’, Behav Res, vol. 52, no. 5, pp. 2031–2052, Oct. 2020. [CrossRef]
- B. Rasouli et al., ‘Combining high quality data with rigorous methods: emulation of a target trial using electronic health records and a nested case-control design’, BMJ, vol. 383, p. e072346, Dec. 2023. [CrossRef]
- J. P. A. Ioannidis and H.-O. Adami, ‘Nested Randomized Trials in Large Cohorts and Biobanks: Studying the Health Effects of Lifestyle Factors’, Epidemiology, vol. 19, no. 1, p. 75, Jan. 2008. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).