Submitted:
06 June 2026
Posted:
09 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Epidemic of Post-Operative Complications and Global Patient Safety Mandates
1.2. The Digital Divide and Institutional Friction of Infrastructure-Heavy AI in Public Healthcare
1.3. The Interpretability Crisis and Cognitive Psychology of "Black-Box" Algorithmic Resistance
1.4. Mathematical Information Compression as a Biomedical Engineering Solution
2. Materials and Methods
2.1. Cohort Selection, Multi-Center Registry Architecture, and Ethical De-identification Protocols
2.2. Combinatorial Topology of High-Cardinality Variables: The Empirical Risk Pooling Protocol

- High-Risk Diagnoses Group (ECRd > 0.30): This group isolates codes exhibiting an empirical complication probability exceeding 30%. It encompasses critical acute presentation states and highly invasive surgical conditions, specifically codes: S_11, E_6, E_5, E_1, E_3, O_3, P_7, E_2, and ORL_2.
- Moderate-Risk Diagnoses Group (0.10 < ECRd < 0.30): This group isolates intermediate codes exhibiting an empirical complication probability between 10% and 30%. It encompasses standard major visceral interventions, including codes: ORL_1, ORL_5, C_3, P_1, S_10, O_1, P_3, C_1, P_8, P_2, O_4, E_4, ORL_3, P_4, P_6, and S_6.
- Standard-Risk Diagnoses Group (ECRd < 0.10): This group isolates baseline codes where the complication incidence falls below 10%, capturing elective, minimally invasive, or low-complexity procedures. All remaining diagnostic codes in the registry were mapped to this tier.
2.3. Mathematical Framework of Univariate Parameter Screening


2.4. Multivariate Logistic Regression Optimization and Covariate Adjustment Matrix



2.5. Linear Metric Alignment and Integer Transformation Mechanics



2.6. Signal Detection Verification, Receiver Operating Characteristics, and Alarm Fatigue Modeling

3. Results
3.1. Cohort Demographics and Primary Descriptive Statistics

3.2. Univariate Screening Filters and Wald Statistical Extraction
- SEX (x2= 2.14, p = 0.143) failed to cross the significance threshold and was blocked from model integration.
- INSURER (x2= 3.89, p = 0.048) demonstrated a borderline association but was ultimately excluded to maximize model parsimony and avoid index over-complication.
- HOSPITALITY-NAME (x2= 842.15, p < 0.001) and the engineered variable DIAGNOSIS-GROUP (x2= 2314.60, p < 0.001) both revealed powerful associations with the target outcome and were passed into the multivariate optimization pipeline.
3.3. Compilation of the Integer Scorecard (S-CRI) Blueprint

3.4. Diagnostic Accuracy Profiles and Confusion Matrix Decomposition
- Sensitivity (True Positive Rate): 66.56% (95% CI: 64.7%–68.4%) — The index successfully catches approximately two-thirds of all patients who develop a post-surgical complication.
- Specificity (True Negative Rate): 89.64% (95% CI: 89.2%–90.1%) — The index correctly filters out nearly 90% of uncomplicated cases, successfully eliminating excessive false alarms.
- Overall Accuracy: 86.52% — The absolute correctness of the scoring model across the multi-center dataset.
3.5. Longitudinal Stratification of Empirical Risk Tiers
- Very Low Risk Tier (S-CRI Score < -10): Represents 34.2% of the cohort (n = 6,828$). The empirical complication rate within this tier is 0.23%. Clinical response: Standard post-operative tracking; excellent candidate for accelerated clinical discharge pathways.
- Low Risk Tier (S-CRI Score -10 to -5): Represents 47.9% of the cohort (n = 9,563$). The empirical complication rate is 4.12%. Clinical response: Standard post-operative observation and routine vitals tracking.
- Moderate Risk Tier (S-CRI Score -4 to -1): Represents 11.2% of the cohort ($n = 2,236$). The empirical complication rate rises to 30.22%. Clinical response: Triggers primary clinical screening alert; requests a comprehensive bedside nursing review and routine wound assessment.
- High Risk Tier (S-CRI Score 0 to 5): Represents 5.1% of the cohort (n = 1,018$). The empirical complication rate escalates to 53.24%. Clinical response: Mandatory daily clinical review by the senior surgical team; initiation of target-directed prophylactic care (e.g., specialized antibiotic or anticoagulant regimens).
- Very High Risk Tier (S-CRI Score > 5): Represents 1.6% of the cohort (n = 320). The empirical complication rate reaches 77.41%. Clinical response: Immediate clinical red-flag alert; transfer to a high-dependency step-down unit or intensive tracking protocols, combined with a mandatory multi-disciplinary case audit.
4. Discussion
4.1. The Symbiotic Ecosystem: S-CRI as a Transparent Interface for Multi-Criteria Predictive Systems
4.2. Information Theory and the Mathematical Efficiency of Parsimonious Scaling
4.3. Length of Stay as a Biological and Structural Proxy for Nosocomial Vulnerability
4.4. Human Factors Engineering: Algorithmic Calibration to Neutralize Bedside Alert Fatigue
4.5. Clinical Implementation Pathways and Strategic Resource Tiering
4.6. Digital Health Ecosystem Integration: EHR Pipelines, Dashboards, and Automated Monitoring
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Meara, J. G.; Leather, A. J.; Hagander, L.; Alkire, B. C.; Alonso, N.; Bekele, A.; Yip, W. Global Surgery 2030: Evidence and solutions for achieving health, welfare, and economic development. The Lancet (Replaces the "Move, A. B." placeholder for global surgical system infrastructure stress). 2015, 386(9993), 569–624. [Google Scholar] [CrossRef] [PubMed]
- World Health Organization. WHO calls for urgent action to reduce patient harm in healthcare . [Press release]. 13 September 2019. Available online: https://www.who.int/news/item/13-09-2019-who-calls-for-urgent-action-to-reduce-patient-harm-in-healthcare.
- Michou, I.; Maroulis, I.; Chatzilygeroudis, I.; Koutsojannis, C. Machine Learning for Predicting Medical Error Risks in Greek Surgery Departments. Appl. Sci. 2026, 16(11), 5411. [Google Scholar] [CrossRef]
- Huynh, A.L.; Roy, T.J.; Jackson, K.N.; Lee, A.G.; Liaw, W.; Hossain, M.M. Applications of artificial intelligence-based conversational agents in healthcare: A systematic umbrella review. Int. J. Med. Inform. 2026, 207, 106204. [Google Scholar] [CrossRef]
- Gkikas, M. A.; Vrettos, K.; Tsinopoulos, I.; Tzamalis, A. Preoperative prediction of intraoperative complications in cataract surgery: A machine learning approach. Int. Ophthalmol. 2025, 45(1), 471. [Google Scholar] [CrossRef]
- Kuhn, T. N.; Engelhardt, W. D.; Kahl, V. H.; et al. Artificial intelligence–driven patient selection for preoperative portal vein embolization for patients with colorectal cancer liver metastases. J. Vasc. Interv. Radiol. 2025, 36(3), 477–488. [Google Scholar] [CrossRef]
- Wachter, R. M. Patient safety at ten: Unmistakable progress, troubling gaps. Health Aff. 2010, 29(1), 165–173. [Google Scholar] [CrossRef] [PubMed]
- Mock, C. N.; Donkor, P.; Gawande, A.; Jamison, D. T.; Kruk, M. E.; Debas, H. T. Essential surgery: Key messages from Disease Control Priorities. In The Lancet, 3rd edition; 2015; Volume 385, 9983, pp. 2209–2219. [Google Scholar] [CrossRef]
- Makary, M. A.; Daniel, M. Medical error—The third leading cause of death in the US. BMJ 2016, 353, i2139. [Google Scholar] [CrossRef] [PubMed]
- Barach, P.; Small, S. D. Reporting and preventing medical mishaps: Lessons from non-medical near miss reporting systems. BMJ 2000, 320(7237), 759–763. [Google Scholar] [CrossRef]
- Sarker, S. K.; Vincent, C. Errors in surgery. Int. J. Surg. 2005, 3(1), 79–81. [Google Scholar] [CrossRef]
- Institute of Medicine. To err is human: Building a safer health system; National Academies Press, 2000. [Google Scholar] [CrossRef]
- Flin, R.; O’Connor, P. Safety at the sharp end: A guide to non-technical skills, 2nd ed.; CRC Press, 2017. [Google Scholar] [CrossRef]
- Marsh, K. M.; Burt, C. G.; Brooks, D. C.; Fanning, R. M.; Minter, R. M.; Quillin, R. C., III. Defining and studying errors in surgical care: A systematic review. Ann. Surg. 2022, 275(6), 1067–1073. [Google Scholar] [CrossRef]
- Henriksen, K.; Battles, J. B.; Marks, E. S.; Lewin, D. I. (Eds.) AHRQ Publication No. 05-0021Advances in patient safety: From research to implementation; (Vols. 1–4). Agency for Healthcare Research and Quality., 2005; Available online: https://www.ncbi.nlm.nih.gov/books/NBK20545/.
- Duclos, A.; Frits, M. L.; Iannaccone, C.; Bates, D. W.; Classen, D. C. Safety of inpatient care in surgical settings: Cohort study. BMJ 2024, 387, e080480. [Google Scholar] [CrossRef]
- Cohen, A. J.; Lui, H.; Zheng, M.; Cheema, B.; Cohen, S. M.; Maggard-Gibbons, M. Rates of serious surgical errors in California and plans to prevent recurrence. JAMA Netw. Open 2021, 4(5), e217058. [Google Scholar] [CrossRef]
- Elfanagely, O.; Messahel, A.; Ghanem, O.; Farag, A.; Thourani, V. H. Machine learning and surgical outcomes prediction: A systematic review. J. Surg. Res. 2021, 264, 346–361. [Google Scholar] [CrossRef]
- Obermeyer, Z.; Powers, B.; Vogeli, C.; Mullainathan, S. Dissecting racial bias in an algorithm used to manage the health of populations. Science 2019, 366(6464), 447–453. [Google Scholar] [CrossRef] [PubMed]
- Locke, S.; Bashall, A.; Al-Adely, S.; Moore, J.; Wilson, A. J.; Kitchen, G. B. Natural language processing in medicine: A review. Trends Anaesth. Crit. Care 2021, 38, 4–9. [Google Scholar] [CrossRef]
- Michou, I.; Maroulis, I.; Koutsojannis, C. Machine learning for medical error prevention in departments of surgery: A review of challenges and biases. World J. Biomed. Pharm. Sci. 2025, 22(1), 410. Available online: https://journalwjbphs.com/content/machine-learning-medical-error-prevention-departments-surgery-review-challenges-and-biases.
- Shanafelt, T. D.; Balch, C. M.; Bechamps, G. J.; Russell, T.; Dyrbye, L.; Satele, D.; Collicott, P.; Novotny, P. J.; Sloan, J.; Freischlag, J. Burnout and medical errors among American surgeons. Ann. Surg. 2010, 251(6), 995–1000. [Google Scholar] [CrossRef] [PubMed]
- Al-Ghunaim, T. A.; Johnson, J.; Bui, A.; Melton, G. B. Surgeon burnout and its association with patient safety outcomes. Am. J. Surg. 2022, 224(1), 228–234. [Google Scholar] [CrossRef] [PubMed]
- Carter, D. The surgeon as a risk factor. Adv. Surg. 2002, 36, 141–165. [Google Scholar] [CrossRef]
- Classen, D. C.; Resar, R.; Griffin, F.; Federico, F.; Frankel, T.; Kimmel, N.; Whittington, J. C.; Frankel, A.; Seger, A.; James, B. C. “Global trigger tool” shows that adverse events in hospitals may be ten times greater than previously measured. Health Aff. 2011, 30(4), 581–589. [Google Scholar] [CrossRef]
- Bilimoria, K. Y.; Liu, Y.; Paruch, J. L.; Zhou, L.; Kmiecik, T. E.; Ko, C. Y.; Cohen, M. E. Development and evaluation of the universal ACS NSQIP surgical risk calculator: A decision aid and informed consent tool for patients and surgeons. Ann. Surg. 2013, 258(1), 4–12. [Google Scholar] [CrossRef]
- de Vries, E.N.; Ramrattan, M.A.; Smorenburg, S.M.; Gouma, D.J.; Boermeester, M.A. The incidence and nature of in-hospital adverse events: A systematic review. BMJ Qual. Saf. 2008, 17, 216–223. [Google Scholar] [CrossRef] [PubMed]
- Ning, Y.; Li, S.; Ong, M. E. H.; Xie, F.; Chakraborty, B.; Ting, D. S. W.; Liu, N. A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study. npj Digit. Med. 2022, 5, Article 163. [Google Scholar] [CrossRef] [PubMed]




| Clinical Variable Parameter | Regression Coefficient (β) | Standard Error (SE) | Adjusted Odds Ratio (95% CI) | Converted S-CRI Points |
|---|---|---|---|---|
| Model Intercept ($\beta_0$) | -3.1142 | 0.0512 | — | -3.1 (Base Offset) |
| Length of Stay (Per Day) | +0.3646 | 0.0084 | 1.44 (1.42–1.46) | +1.0 point per day |
| Hospital Classification | ||||
| Pathological Department | 0.0000 | — | 1.00 (Reference) | 0.0 points |
| Surgical Department | -0.2552 | 0.0421 | 0.77 (0.71–0.84) | -0.7 points |
| Without Addition (Specialized) | -2.4063 | 0.1245 | 0.09 (0.07–0.11) | -6.6 points |
| Diagnostic Risk Group | ||||
| High-Risk Diagnoses | 0.0000 | — | 1.00 (Reference) | 0.0 points |
| Moderate-Risk Diagnoses | -0.7656 | 0.0581 | 0.46 (0.41–0.52) | -2.1 points |
| Standard-Risk Diagnoses | -3.8647 | 0.0914 | 0.02 (0.01–0.03) | -10.6 points |
| Predicted Negative (S-CRI Score < -4) |
Predicted Positive (S-CRI Score ≥ -4) |
Total Rows | |
|---|---|---|---|
| Actual Uncomplicated (0) | 15,477 (True Negatives) | 1,788 (False Positives) | 17,265 |
| Actual Complicated (1) | 903 (False Negatives) | 1,797 (True Positives) | 2,700 |
| Total Columns | 16,380 | 3,585 | 19,965 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).