Submitted:
15 September 2025
Posted:
17 September 2025
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
2. Results
3. Discussion
4. Materials and Methods
4.1. Synthetic Data Generation
4.2. Simulation Informed Feature Set
4.3. Model Training and Validation
4.3.1. Computational Model and Data Generator
4.3.2. Machine Learning Configuration, Validation Strategy, and Interpretability Procedures
4.3.3. Rule-Based Baseline (Deterministic Classifier Derived from the Label Definition)
4.3.4. Evaluation and Transparency
4.3.5. Anti-Leak Ablations and Calibrated-Rule Comparator
- Full-features ML (baseline).
- No-label-features ML: all predictors except H:Q, LSI, and CCI.
- Label-features-only ML: using only H:Q, LSI, and CCI.
- Calibrated rule score: distance-to-threshold rule, isotonic-calibrated.
4.4. Interpretability and Reporting
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Kellis, E.; Galanis, N.; Kofotolis, N. Hamstring-to-quadriceps ratio in female athletes with a previous hamstring injury, anterior cruciate ligament reconstruction, and controls. Sports 2019, 7, 214. [CrossRef]
- Afonso J, Rocha-Rodrigues S, Clemente FM, Aquino M, Nikolaidis PT, Sarmento H, Fílter A, Olivares-Jabalera J and Ramirez-Campillo R. The Hamstrings: Anatomic and Physiologic Variations and Their Potential Relationships With Injury Risk. Front. Physiol. 2021; 12:694604. [CrossRef]
- Mănescu, D.C.; Mănescu, A.M. Artificial Intelligence in the Selection of Top-Performing Athletes for Team Sports: A Proof-of-Concept Predictive Modeling Study. Appl. Sci. 2025, 15, 9918. [CrossRef]
- Ruas, C.V.; Pinto, R.S.; Haff, G.G.; Lima, C.D.; Pinto, M.D.; Brown, L.E. Alternative methods of determining hamstrings-to-quadriceps ratios: A comprehensive review. Sports Med. Open 2019, 5, 11. [CrossRef]
- Högberg, J.; Forssblad, M.; Hägglund, M.; et al. No association between hamstrings-to-quadriceps strength ratio and risk of second ACL injury within two years of return to sport. Sports Med. Open 2024, 10, 4. [CrossRef]
- Gandarias-Madariaga, A.; Martínez-Serrano, A.; Alcaraz, P.E.; Calleja-González, J.; López del Campo, R.; Resta, R.; Zubillaga-Zubiaga, A. Hamstring Strain Injury Patterns in Spanish Professional Male Football (Soccer): A Systematic Video Analysis of 78 Match Injuries. J. Funct. Morphol. Kinesiol. 2025, 10, 201. [CrossRef]
- Torres, G.; Armada-Cortés, E.; Rueda, J.; San Juan, A.F.; Navarro, E. Comparison of Hamstrings and Quadriceps Muscle Activation in Male and Female Professional Soccer Players. Appl. Sci. 2021, 11, 738. [CrossRef]
- Mănescu, D.C. Elements of the specific conditioning in football at university level. Marathon 2015, 7(1), 107-111.
- Tabor, P.; Iwańska, D.; Mazurkiewicz, A.; Urbanik, C.; Mastalerz, A. The Hamstring/Quadriceps Ratio in Young Men and Its Relationship with the Functional Symmetry of the Lower Limb in Young Men. Symmetry 2021, 13, 2033. [CrossRef]
- Zore, M.R.; Kregar Velikonja, N.; Hussein, M. Pre- and Post-Operative Limb Symmetry Indexes and Estimated Preinjury Capacity Index of Muscle Strength as Predictive Factors for the Risk of ACL Reinjury: A Retrospective Cohort Study of Athletes after ACLR. Appl. Sci. 2021, 11, 3498. [CrossRef]
- Kaeding CC, Léger-St-Jean B, Magnussen RA. Epidemiology and Diagnosis of Anterior Cruciate Ligament Injuries. Clin Sports Med. 2017 Jan;36(1):1-8. [CrossRef]
- Mănescu, D.C. Fundamente teoretice ale activității fizice. 2013, Editura ASE.
- Stojanović, M.D.M.; Andrić, N.; Mikić, M.; Vukosav, N.; Vukosav, B.; Zolog-Șchiopea, D.-N.; Tăbăcar, M.; Melinte, R.M. Effects of Eccentric-Oriented Strength Training on Return to Sport Criteria in Late-Stage Anterior Cruciate Ligament (ACL)-Reconstructed Professional Team Sport Players. Medicina 2023, 59, 1111. [CrossRef]
- Roggio, F., Trovato, B., Sortino, M., & Musumeci, G. (2024). A comprehensive analysis of the machine learning pose estimation models used in human movement and posture analyses: A narrative review. Heliyon, 10, e39977. [CrossRef]
- Calderón-Díaz, M.; Silvestre Aguirre, R.; Vásconez, J.P.; Yáñez, R.; Roby, M.; Querales, M.; Salas, R. Explainable Machine Learning Techniques to Predict Muscle Injuries in Professional Soccer Players through Biomechanical Analysis. Sensors 2024, 24, 119. [CrossRef]
- Manescu, D.C. Solutions to fight against overtraining in bodybuilding routine. Marathon 2013, 5(2), 182- 186.
- Miralles-Iborra, A.; Moreno-Pérez, V.; Del Coso, J.; Courel-Ibáñez, J.; Elvira, J.L.L. Reliability of a Field-Based Test for Hamstrings and Quadriceps Strength Assessment in Football Players. Appl. Sci. 2023, 13, 4918. [CrossRef]
- Merlo, A.; Campanini, I.; Merletti, R.; Di Natali, G.; Cescon, C.; Vieira, T.M.M. Electrode Size and Placement for Surface EMG Bipolar Detection from the Brachioradialis Muscle: A Scoping Review. Sensors 2021, 21, 7322. [CrossRef]
- D’Amico, M.; Kinel, E.; D’Amico, G.; Roncoletta, P. A Self-Contained 3D Biomechanical Analysis Lab for Complete Automatic Spine and Full Skeleton Assessment of Posture, Gait and Run. Sensors 2021, 21, 3930. [CrossRef]
- Roggio, F.; Di Grande, S.; Cavalieri, S.; Falla, D.; Musumeci, G. Biomechanical Posture Analysis in Healthy Adults with Machine Learning: Applicability and Reliability. Sensors 2024, 24, 2929. [CrossRef]
- Manescu, D.C. Alimentaţia în fitness şi bodybuilding. 2010, Editura ASE.
- Van Melick, N.; Van der Weegen, W.; Van der Horst, N.; Bogie, R. Quadriceps and Hamstrings Strength Reference Values for Athletes with and without ACL Reconstruction Who Play Popular Pivoting Sports: A Scoping Review. J. Orthop. Sports Phys. Ther. 2022, 52, 142–155. [CrossRef]
- Grygorowicz, M.; Michałowska, M.; Walczak, T.; Owen, A.; Grabski, J.K.; Pyda, A.; et al. Discussion about different cut-off values of conventional hamstring-to-quadriceps ratio used in hamstring injury prediction among professional male football players. PLoS ONE 2017, 12, e0188974. [CrossRef]
- Loeza Magaña, P.; Valdez Solis, I.G.; Fernández Carapia, D.D.; Alcalá Morales, L.E.; Arias Vázquez, P.I.; Quezada González, H.R. Hamstrings/Quadriceps Ratio in Isokinetic Tests: Are We Looking in the Wrong Direction? Apunts Sports Med. 2023, 58, 100410. [CrossRef]
- Manescu, D.C. Bazele generale ale antrenamentului sportiv. 2025, Editura Risoprint. https://www.risoprint.ro/detaliicarte.php?id=2515.
- Voukelatos, D.; Evangelidis, P.E.; Pain, M.T. The Hamstrings-to-Quadriceps Functional Ratio Expressed over the Full Angle–Angular-Velocity Range Using a Limited Number of Data Points. R. Soc. Open Sci. 2022, 9, 210696. [CrossRef]
- Baumgart, C.; Welling, W.; Hoppe, M.W.; Freiwald, J.; Gokeler, A. Angle-Specific Analysis of Isokinetic Quadriceps and Hamstring Torques and Ratios in Patients after ACL Reconstruction. BMC Sports Sci. Med. Rehabil. 2018, 10, 23. [CrossRef]
- Van Eetvelde H, Mendonça LD, Ley C, Seil R, Tischer T. Machine learning methods in sport injury prediction and prevention: a systematic review. J Exp Orthop. 2021; 8:27. [CrossRef]
- Leckey C, van Dyk N, Doherty C, Lawlor A, Delahunt E. Machine learning approaches to injury risk prediction in sport: a scoping review with evidence synthesis. Br J Sports Med. 2024. [CrossRef]
- Croisier JL, Ganteaume S, Binet J, Genty M, Ferret JM. Strength imbalances and prevention of hamstring injury in professional soccer players: a prospective study. Am J Sports Med. 2008; 36(8):1469-1475. [CrossRef]
- Baroni, B.M.; Ruas, C.V.; Ribeiro-Alvares, J.B.; Pinto, R.S. Hamstring-to-Quadriceps Torque Ratios of Professional Male Soccer Players: A Systematic Review. J. Strength Cond. Res. 2020, 34, 281–293. [CrossRef]
- Dingenen B, Gokeler A. Optimization of the return-to-sport paradigm after anterior cruciate ligament reconstruction: a critical step back to move forward. Sports Med. 2017; 47(8):1487-1500. [CrossRef]
- Wellsandt E, Failla MJ, Snyder-Mackler L. Limb symmetry indexes can overestimate knee function after anterior cruciate ligament injury. J Orthop Sports Phys Ther. 2017; 47(5):334-338. [CrossRef]
- Manescu, D.C. Noțiuni complementare ale antrenamentului sportiv. 2025, Editura Risoprint. https://www.risoprint.ro/detaliicarte.php?id=2516.
- Paterno, M.V.; Rauh, M.J.; Thomas, S.; Hewett, T.E.; Schmitt, L.C. Return-to-Sport Criteria After Anterior Cruciate Ligament Reconstruction Fail to Identify the Risk of Second Anterior Cruciate Ligament Injury. J. Athl. Train. 2023, 57(9–10), 937–945. [CrossRef]
- Kellis, E.; Sahinis, C.; Baltzopoulos, V. Is Hamstrings-to-Quadriceps Torque Ratio Useful for Predicting Anterior Cruciate Ligament and Hamstring Injuries? A Systematic and Critical Review. J. Sport Health Sci. 2023, 12, 1–13. [CrossRef]
- Dauty, M.; Menu, P.; Fouasson-Chailloux, A. Cutoffs of Isokinetic Strength Ratio and Hamstring Strain Prediction in Professional Soccer Players. Scand. J. Med. Sci. Sports 2018, 28, 276–281. [CrossRef]
- Trepczynski, A.; Kutzner, I.; Kornaropoulos, E.; Taylor, W.R.; Duda, G.N.; Bergmann, G.; Heller, M.O. Impact of antagonistic muscle co-contraction on in vivo knee contact forces. J. NeuroEng. Rehabil. 2018, 15, 103. [CrossRef]
- Preece, S.J.; Jones, R.K.; Brown, C.A.; Cacciatore, T.W. Reductions in co-contraction following neuromuscular re-education are associated with decreased knee joint loading. BMC Musculoskelet. Disord. 2016, 17, 415. [CrossRef]
- Kotsifaki, A.; Korakakis, V.; Whiteley, R.; Van Rossom, S.; Jonkers, I. Measuring only hop distance during single leg hop testing is insufficient to detect deficits in knee function after ACL reconstruction: a systematic review. Br. J. Sports Med. 2020, 54(3), 139–153. [CrossRef]
- Petrigna, L.; Trovato, B.; Roggio, F.; Castorina, A.; Musumeci, G. Molecular Assessment of Healthy Pathological Articular Cartilages in Physically Active People: A Scoping Review. Int. J. Mol. Sci. 2023, 24, 3662. [CrossRef]
- Brophy RH, Schmitz L, Wright RW, et al. Return to Play and Future ACL Injury Risk After ACL Reconstruction in Soccer Athletes From the Multicenter Orthopaedic Outcomes Network (MOON) Group. The American Journal of Sports Medicine. 2012; 40(11):2517-2522. [CrossRef]
- Della Villa S, Boldrini L, Ricci M, et al. Clinical Outcomes and Return-to-Sports Participation of 50 Soccer Players After Anterior Cruciate Ligament Reconstruction Through a Sport-Specific Rehabilitation Protocol. Sports Health. 2011;4(1):17-24. [CrossRef]
- Webster, K.E.; Hewett, T.E. Meta-analysis of meta-analyses of anterior cruciate ligament injury reduction training programs. J. Orthop. Res. 2018, 36(10), 2696–2708. [CrossRef]
- Bouju S, Lauritzen JB, Journé A, Jørgensen HL. Return to sports after anterior cruciate ligament surgery with hamstring or patella tendon autograft - a systematic review. Dan Med J. 2024 Jun 19;71(7):A09230599. [CrossRef]
- Toole, A.R.; Ithurburn, M.P.; Rauh, M.J.; Hewett, T.E.; Paterno, M.V.; Schmitt, L.C. Young athletes cleared for sports participation after ACL reconstruction: how many actually meet recommended return-to-sport criteria cutoffs? J. Orthop. Sports Phys. Ther. 2017, 47(11), 825–833. [CrossRef]
- Moraiti C.O. et al. Anterior cruciate ligament reconstruction results in alterations in gait variability. Gait Posture 2010; 32(2):169–175. [CrossRef]
- De Oliveira E.A. et al. Linear and nonlinear measures of gait variability after anterior cruciate ligament reconstruction. J Electromyogr Kinesiol 2019; 46:21–27. [CrossRef]
- Armitano-Lago C. et al. Gait Variability Structure Linked to Worse Cartilage Composition Post-ACL Reconstruction. Med Sci Sports Exerc 2023; 55(8):1499–150. [CrossRef]
- Ito N. et al. Identifying Gait Pathology after ACL Reconstruction Using Temporal Characteristics of Kinetics and Electromyography. Med Sci Sports Exerc 2022; 54(6):923–93. [CrossRef]
- Lin P.E., Sigward S.M. Subtle alterations in whole-body mechanics during gait following anterior cruciate ligament reconstruction. Gait Posture 2019; 68:494–499. [CrossRef]
- Claudino J.G., Capanema D., de Souza T., Serrão J.C., Machado Pereira A.C., Nassis G.P. Current Approaches to the Use of Artificial Intelligence for Injury Risk Assessment and Performance Prediction in Team Sports: a Systematic Review. Sports Med Open. 2019; 5:28. [CrossRef]
- Mănescu, A.M.; Grigoroiu, C.; Smîdu, N.; Dinciu, C.C.; Mărgărit, I.R.; Iacobini, A.; Mănescu, D.C. Biomechanical Effects of Lower Limb Asymmetry During Running: An OpenSim Computational Study. Symmetry 2025, 17, 1348. [CrossRef]
- Badau, D.; Badau, A.; Ene-Voiculescu, V.; Ene-Voiculescu, C.; Teodor, D.F.; Sufaru, C.; Dinciu, C.C.; Dulceata, V.; Manescu, D.C.; Manescu, C.O. El Impacto De Las tecnologías En El Desarrollo De La Veloci-Dad Repetitiva En Balonmano, Baloncesto Y Voleibol. Retos 2025, 64, 809–824.
- Penichet-Tomas, A. Applied Biomechanics in Sports Performance, Injury Prevention, and Rehabilitation. Appl. Sci. 2024, 14, 11623. [CrossRef]
- Ekstrand J., Hägglund M., Waldén M. Epidemiology of muscle injuries in professional football (soccer). Am J Sports Med. 2011;39(6):1226–1232. [CrossRef]
- Mănescu, D.C. Computational Analysis of Neuromuscular Adaptations to Strength and Plyometric Training: An Integrated Modeling Study. Sports 2025, 13, 298. [CrossRef]
- Petersen J., Hölmich P. Evidence based prevention of hamstring injuries in sport. Br J Sports Med. 2005;39(6):319–323. [CrossRef]
- Bourne M.N., Timmins R.G., Opar D.A., Pizzari T., Ruddy J.D., Williams M.D., Shield A.J. An evidence-based framework for strengthening exercises to prevent hamstring injury. Sports Med. 2018;48(2):251–267. [CrossRef]
- Van Haren I.E.P.M. et al. Return to sport after anterior cruciate ligament reconstruction—prognostic factors and prognostic models: A systematic review. Ann Phys Rehabil Med. 2025; 68(3):101921. [CrossRef]
- Girdwood M.A. et al. Hop performance trajectory after ACL reconstruction: systematic review and longitudinal meta-analysis. Sports Med. 2025; 55(1):101-113. [CrossRef]
- Badau, D.; Badau, A.; Joksimović, M.; Manescu, C.O.; Manescu, D.C.; Dinciu, C.C.; Margarit, I.R.; Tudor, V.; Mujea, A.M.; Neofit, A.; et al. Identifying the Level of Symmetrization of Reaction Time According to Manual Lateralization between Team Sports Athletes, Individual Sports Athletes, and Non-Athletes. Symmetry 2024, 16, 28. [CrossRef]
- Seçkin A.Ç. Review on wearable technology in sports: concepts, applications and challenges for injury prevention and performance monitoring. Appl Sci. 2023;13(18):10399. [CrossRef]
- Mănescu, D.C. Big Data Analytics Framework for Decision-Making in Sports Performance Optimization. Data 2025, 10, 116. [CrossRef]
- Oh S. et al. Rehabilomics strategies enabled by cloud-based wearable sensor networks for motor rehabilitation. J Med Internet Res. 2025; 27(1):e54790. [CrossRef]










| Configuration | ROC-AUC (95% CI) | PR-AUC | Balanced Acc. (95% CI) | F1 | Brier (95% CI) | Calib. Slope (95% CI) | Calib. Intercept | Net Benefit @ p=0.20 |
|---|---|---|---|---|---|---|---|---|
| Full-features ML | 0.933 (0.908–0.958) | 0.918 | 0.943 (0.924–0.962) |
0.940 | 0.056 (0.041–0.072) | 1.00 (0.95–1.05) |
~0.00 | 0.34 |
| No-label-features ML | 0.804 (0.769–0.837) | 0.781 | 0.823 (0.794–0.851) |
0.823 | 0.118 (0.104–0.132) | 0.95 (0.90–1.01) |
+0.02 | 0.12 |
| Label-features-only ML | 0.915 (0.890–0.939) | 0.897 | 0.926 (0.905–0.946) |
0.928 | 0.065 (0.052–0.080) | 1.01 (0.96–1.06) |
~0.00 | 0.31 |
| Calibrated rule score | 0.881 (0.852–0.908) | 0.856 | 0.902 (0.878–0.925) |
0.902 | 0.085 (0.072–0.100) | 0.97 (0.92–1.02) |
+0.01 | 0.28 |
| Metric | Gradient Boosting (95% CI) | CI Width | Logistic Regression (95% CI) | Δ vs Logistic Regression | Clinical Interpretation |
|---|---|---|---|---|---|
| ROC–AUC | 0.933 (0.908–0.958) |
0.050 | 0.703 (0.671–0.735) |
+0.230 | Excellent discrimination between balanced and imbalanced athletes |
| Balanced Accuracy | 0.943 (0.924–0.962) |
0.038 | 0.706 (0.672–0.738) |
+0.237 | Reliable detection across classes despite prevalence differences |
| PR–AUC | 0.918 (0.892–0.943) |
0.051 | 0.745 (0.708–0.782) |
+0.173 | High precision–recall balance, robust under class imbalance |
| F1 Score | 0.940 (0.919–0.958) |
0.039 | 0.637 (0.603–0.671) |
+0.303 | Strong trade-off between sensitivity and specificity |
| Brier Score | 0.056 (0.041–0.072) |
0.031 | 0.201 (0.177–0.227) |
–0.145 | Well-calibrated probabilities usable as clinical risk estimates |
| Feature | Permutation Importance (Normalized) | Rank | Threshold Used in Labeling | Clinical Cut-Off Relevance | Expected Effect on Imbalance | Interpretation Dimension | Clinical References / Rationale |
|---|---|---|---|---|---|---|---|
| Dynamic H:Q ratio (H:Qdyn) | 0.36 | 1 | < 0.60 or >1.20 |
Standard ACL risk / return-to-sport clearance values | Low H:Q → ↑ quadriceps dominance (ACL strain); High H:Q → hamstring overuse | Muscle balance | Dynamometry and simulation studies confirm H:Q imbalance as key ACL risk factor |
| Knee-moment LSI | 0.31 | 2 | > ±12% | Commonly applied clearance criterion | High LSI reflects unilateral weakness or compensatory load shift | Symmetry / load distribution | Rehabilitation literature uses ±10–15% as critical threshold |
|
Co-contraction index (CCI) |
0.15 | 3 | > 0.58 | Reflects inefficient stabilization strategies | Elevated CCI → ↑ joint compression, delayed rehabilitation | Stability strategy | EMG-based studies in ACL patients document maladaptive co-activation |
| Vertical GRF LSI | 0.07 | 4 | Not used for labeling | Secondary asymmetry indicator | Increased asymmetry = unloading weaker limb | External load distribution | Kinetic studies link GRF asymmetry with persistent deficits post-injury |
| Stance-time LSI | 0.05 | 5 | Not used for labeling | Secondary asymmetry marker | Timing differences suggest neuromuscular imbalance | Temporal symmetry | Gait rehab protocols assess stance-time asymmetry as proxy of recovery |
| Time-to-peak knee flexion moment | 0.04 | 6 | Not used for labeling | No fixed clinical cut-off | Delays/advances reflect fatigue or compensation | Coordination / timing | Fatigue and injury studies report altered TTP as compensatory sign |
| Stride-to-stride variability | 0.02 | 7 | Not used for labeling | Contextual only | Increased variability = instability, fatigue | Motor consistency | Variability metrics widely used as fatigue/instability marker |
| Predictor | Partial Dependence Shape | Threshold (s) |
Direction of Risk | Observed Model Effect | Clinical Interpretation | Mechanistic Rationale | Clinical Application |
|---|---|---|---|---|---|---|---|
| Knee-moment LSI | U-shaped | ± 12% | ↑ risk beyond cut-off | Probability of imbalance increases sharply when asymmetry exceeds 12% | Confirms use of ± 10–15% as clearance threshold in return-to-sport testing | Unilateral weakness or compensatory load shift increases ACL strain | Return-to-sport clearance, rehabilitation monitoring |
| Dynamic H:Q ratio (H:Qdyn) | Monotonic decreasing (within 0.6–1.2) | < 0.60 or > 1.20 |
↑ risk at extremes | Ratios < 0.60 linked to quadriceps dominance; > 1.20 linked to hamstring overuse |
Matches ACL injury risk definitions and clearance criteria | Quadriceps dominance → ↑ ACL loading; hamstring over-dominance → inefficiency and strain | ACL risk screening, injury prevention, athlete profiling |
| Co-contraction index (CCI) | Positive monotonic | > 0.58 | ↑ risk with higher values | Higher co-contraction predicted increased imbalance | Reflects maladaptive stabilization (inefficient co-activation) | Excessive co-contraction raises joint compression and delays recovery | Neuromuscular training, rehabilitation follow-up |
| Speed | ROC-AUC | ROC-AUC (95% CI) | PR-AUC | Balanced Accuracy | F1 Score | Brier Score | Clinical Interpretation |
|---|---|---|---|---|---|---|---|
| Slow | 0.941 | 0.915–0.962 | 0.927 | 0.956 | 0.954 | 0.044 | Baseline condition; stable performance at lower intensity |
| Moderate | 0.933 | 0.902–0.953 | 0.915 | 0.936 | 0.934 | 0.063 | Standard rehab testing speed (~3 m·s⁻¹); robust detection |
| Fast | 0.914 | 0.889–0.945 | 0.910 | 0.930 | 0.923 | 0.061 | High-intensity stress test; performance remains reliable |
| Metric | Value | 95% CI | Δ vs. Logistic | Formula | Source (TP/FP/TN/FN) | Clinical Application |
|---|---|---|---|---|---|---|
| Sensitivity | 0.919 | 0.893 –0.944 | + 0.210 | TP / (TP+FN) | 249 / 22 | Screening for imbalance (few missed cases) |
| Specificity | 0.967 | 0.950 –0.981 | + 0.261 | TN / (TN+FP) | 292 / 10 | Rule out false positives; return-to-sport clearance |
| PPV | 0.961 | 0.940 –0.978 | + 0.214 | TP / (TP+FP) | 249 / 10 | Confidence when imbalance predicted |
| NPV | 0.930 | 0.905 –0.952 | + 0.225 | TN / (TN+FN) | 292 / 22 | Reassurance when balance predicted |
| Accuracy | 0.943 | 0.924 –0.962 | + 0.237 | (TP+TN)/(TP+TN+FP+FN) | 249 / 10 / 292 / 22 | Overall reliability |
| Balanced Accuracy | 0.943 | 0.924 –0.962 | + 0.237 | (Sensitivity+Specificity)/2 | – | Robust metric correcting for prevalence |
| F1 Score | 0.940 | 0.919–0.958 | + 0.303 | 2TP/(2TP+FP+FN) | 249 / 10 / 22 | Balanced trade-off between sensitivity and precision |
| Predictor | Relative Importance | Threshold Relevance | Observed Effect | Mechanistic Rationale | Interpretation Dimension | Clinical Interpretation | Clinical Application |
|---|---|---|---|---|---|---|---|
| Stride-to-stride variability | 2% | No clinical cut-off | Slight ↑ imbalance probability with higher variability | Instability reflects neuromuscular fatigue and inconsistent motor unit recruitment | Motor consistency | Identifies fatigue-related instability and compensatory variability | Fatigue monitoring, motor control training |
| Time-to-peak knee flexion moment (TTP-KFM) | 4% | Not standardized | Premature or delayed peaks linked to borderline misclassifications | Altered timing indicates compensatory strategies or fatigue-related delays | Coordination / timing | Sensitive to neuromuscular control shifts after injury | Rehab progression, fatigue assessment |
| Vertical GRF LSI | 7% | No fixed cut-off | Mild contribution to imbalance classification | Load asymmetry indicates unloading of weaker limb | External load distribution | Detects subtle asymmetries in ground reaction force profiles | Return-to-sport monitoring, gait retraining |
| Stance-time LSI | 5% | No fixed cut-off | Minor effect, complementary to knee-moment LSI | Asymmetry in stance time reflects residual deficits | Temporal symmetry | Captures small but clinically meaningful gait asymmetries | Fine-grained rehabilitation evaluation |
| Parameter | Implementation | Purpose / Rationale | Clinical / Biomechanical Relevance | Data Realism Strategy | Numerical Details (per Subject / Trial) |
Aggregate stats (Cohort-level) |
|---|---|---|---|---|---|---|
| Subjects | 160 synthetic individuals, bimodal latent imbalance distribution | Balanced vs. imbalanced subpopulations; unit of analysis = subject | Mirrors athletes with vs. without imbalance |
Gaussian bimodal sampling anchored to H:Q-informed imbalance | 80 balanced, 80 imbalanced |
160 subjects total |
| Trials per subject | 2–5 trials, 3 running speeds |
Captures intra-individual variability | Reflects repeated-measures protocols in biomechanics/rehab |
Randomized trial count per subject |
Median = 3; Mean ≈ 3.58; Range = 2–5 |
573 trials total |
| Speeds | Slow, Moderate, Fast |
Tests imbalance across task intensities | Aligns with lab protocols (~3 and ~4 m·s⁻¹) |
Added stride-to-stride noise on target velocity | 2.8 ± 0.1; 3.4 ± 0.1; 4.2 ± 0.1 m·s⁻¹ |
191 trials/condition |
| Variability | Inter-individual + stride-to-stride |
Represents population and gait heterogeneity | Matches variability in kinetics/EMG | Multiplicative trial-wise perturbations | CV inter-individual ≈ 12%; intra-trial ≈ 5% |
— |
| Noise | ~5% random label flips |
Simulates misclassification & measurement error | Reflects empirical uncertainty | Random label reassignment on OOF labels | ≈ 29 flips total; by speed: 10 slow, 9 moderate, 10 fast |
≈5% of 573 trials affected |
| Class distribution by speed | Balanced vs. imbalanced per condition |
Ensures comparable prevalence across speeds | Avoids confounding by task intensity |
Stratified generation per condition | Slow = 96/95; Moderate = 96/95; Fast = 96/95 (balanced/imbalanced) |
Balanced = 288; Imbalanced = 285 |
| Label thresholds | H:Q <0.6, LSI >10–15%, CCI ↑ early stance |
Defines imbalance prevalence and severity | Corresponds to ACL risk and return-to-sport criteria | Threshold-based labeling with controlled prevalence shift | LSI cut-off 10%→15% = −1.8% prevalence; H:Q shift −0.05 = +2.1% |
Net effect ≈ ±2–3% imbalance prevalence |
| Aggregate output | Generated synthetic dataset | Provides reproducible in-silico cohort |
Transparent methodological testbed | Seed-controlled generator | — | 573 labeled trials total |
| Aspect | Implementation in Model | Rationale | References / Common Use | Impact on Labeling | Expected Effect on ML Performance | Interpretability Dimension |
|---|---|---|---|---|---|---|
| Population heterogeneity | 160 virtual subjects; latent imbalance propensity from bimodal Gaussian | Mimics variation in athletes; ensures balanced/imbalanced clusters | Simulation-based heterogeneity | Indirect | Provides realistic variance for model generalization | Contextual variability |
| Flexor–extensor balance | Dynamic H:Q ratio (integrated moments) | Central to ACL stability, hamstring injury risk | Gold-standard ratio in sports medicine | Direct (H:Q <0.60 or >1.20) | Major predictor of imbalance | Muscle balance |
| Inter-limb asymmetry | LSIs for knee moment, stance, vGRF | Reflects unilateral weakness or compensatory loading | Return-to-sport criterion (±10–15%) | Direct (knee-moment LSI >12%) | Strong discriminative power | Symmetry / load distribution |
| Neuromuscular control | Synthetic activations → CCI | Captures stabilizing co-contraction | EMG/simulation practice | Direct (CCI >0.58) | Moderate predictor; adds nuance | Stability strategy |
| Temporal coordination | Time-to-peak knee flexion moment | Identifies compensatory timing shifts | Fatigue/post-injury gait analyses | None | Secondary role; refines interpretation | Timing / coordination |
| Variability modeling | Gaussian noise; CV computed | Mimics stride-to-stride inconsistency | Variability as control proxy | None | Adds robustness; improves error analysis | Motor consistency |
| Measurement noise | ~5% random label flips | Emulates misclassification and imperfect ground truth | Standard ML validation | Indirect | Stress-test for classifier calibration | Label uncertainty |
| Label definition | Composite thresholds (H:Q, LSI, CCI) | Anchors imbalance in task-specific biomechanical rules | Sports medicine thresholds | Direct | Ensures ground-truth plausibility | Clinical face validity |
| Criterion | Threshold | Biomechanical Relevance | Expected Biomechanical Consequence | Interpretability Dimension | Evidence Base | Role in Model Evaluation |
|---|---|---|---|---|---|---|
| Dynamic H:Q ratio | < 0.60 or > 1.20 |
Proxy for hamstrings–quadriceps balance; debated cut-offs in sports medicine | Low ratio → ↑ ACL strain; High ratio → hamstring overuse | Muscle balance | Commonly reported in isokinetic & simulation studies | Used for labeling and as top predictor in ML |
| Knee-moment LSI | > 12% (absolute) | Indicator of inter-limb asymmetry in joint loading | Reflects unilateral weakness or compensatory strategies | Symmetry / load distribution |
±10–15% widely used as return-to-sport cut-off | Labeling criterion and interpretable asymmetry index |
| Early-stance CCI | > 0.58 | Quantifies stabilizing co-activation of hamstrings and quadriceps | Excessive co-contraction → ↑ joint compression, inefficient stabilization | Stability strategy | Documented in EMG and simulation contexts | Labeling criterion and interpretability dimension |
| Random label noise | ~ 5% | Mimics imperfect ground truth and misclassification | Increases robustness to uncertainty | Label uncertainty | Standard ML stress-test technique | Labeling only (not used as predictor) |
| Component | Implementation | Rationale | Impact on Analysis | Interpretability Dimension |
|---|---|---|---|---|
| Algorithm | Gradient Boosting, decision tree base learners | Balances predictive accuracy with transparency; widely used in biomedical and sports data | Provides non-linear modeling without black-box opacity | Allows feature importance and partial dependence analysis |
| Tree depth | max_depth = 3 | Prevents overfitting; restricts complexity to meaningful biomechanical interactions | Ensures stable generalization across subjects | Shallow trees preserve clarity of marginal effects |
| Probability calibration | Isotonic regression | Produces calibrated risk estimates; superior to Platt scaling for non-linear boundaries | Probabilities can be interpreted as empirical imbalance risk | Enhances clinical interpretability of outputs |
| Validation scheme | Subject-wise GroupKFold, k = 5 |
Prevents data leakage; ensures subject-independent evaluation | Metrics reflect generalization to unseen individuals | Strengthens methodological rigor |
| Baseline model | Logistic regression (calibrated) | Provides linear reference for benchmarking GBM | Demonstrates added value of non-linear modeling | Coefficients interpretable as linear effects |
| Performance metrics | ROC-AUC, PR-AUC, balanced accuracy, F1, Brier score |
Capture discrimination, calibration, and robustness | Multi-dimensional evaluation of classifier | Supports transparent reporting |
| Uncertainty quantification | 2000× bootstrap on out-of-fold predictions |
Estimates confidence intervals for all metrics | Demonstrates robustness of findings | CI reporting aids reproducibility |
| Global interpretability | Permutation feature importance (hold-out set) |
Identifies predictors most influential for classification | Links model output to biomechanical determinants | Highlights task-specific predictors (H:Q, LSI, CCI) |
| Local interpretability | Partial dependence plots | Visualizes marginal predictor effects | Ensures plausibility of model decisions |
Direct mapping to biomechanical constructs |
| Condition | ROC-AUC (95% CI) | CI Width | PR-AUC (95% CI) | Balanced Acc. (95% CI) | F1 (95% CI) | Brier | Calib. slope | Class prevalence (% imbalanced) | Net Benefit @0.5 | ΔROC vs. baseline | ΔROC vs. Logistic |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Baseline (H:Q <0.60/>1.20; LSI >12%; CCI >0.58; 5% noise) | 0.933 (0.908–0.958) | 0.050 | 0.918 (0.892–0.943) | 0.943 (0.924–0.962) | 0.940 (0.919–0.958) | 0.056 | 1.00 | 47.3% | 0.34 | – | +0.230 |
| H:Q thresholds 0.55 / 1.25 | 0.927 (0.900–0.952) | 0.052 | 0.912 (0.886–0.939) | 0.936 (0.916–0.955) | 0.934 (0.913–0.952) | 0.060 | 0.98 | 45.1% | 0.33 | –0.006 | +0.224 |
| H:Q thresholds 0.65 / 1.15 | 0.921 (0.892–0.948) | 0.056 | 0.907 (0.879–0.934) | 0.932 (0.910–0.951) | 0.928 (0.905–0.947) | 0.064 | 0.97 | 49.6% | 0.32 | –0.012 | +0.218 |
| LSI threshold 10% | 0.936 (0.911–0.959) | 0.048 | 0.920 (0.895–0.945) | 0.946 (0.926–0.963) | 0.942 (0.921–0.959) | 0.055 | 1.01 | 52.1% | 0.35 | +0.003 | +0.233 |
| LSI threshold 15% | 0.922 (0.896–0.948) | 0.052 | 0.909 (0.882–0.936) | 0.931 (0.909–0.950) | 0.927 (0.904–0.946) | 0.066 | 0.96 | 43.7% | 0.31 | –0.011 | +0.219 |
| CCI threshold 0.55 | 0.934 (0.910–0.958) | 0.048 | 0.919 (0.893–0.944) | 0.944 (0.925–0.962) | 0.940 (0.918–0.958) | 0.057 | 1.00 | 48.5% | 0.34 | +0.001 | +0.231 |
| CCI threshold 0.60 | 0.918 (0.889–0.945) | 0.056 | 0.905 (0.877–0.932) | 0.929 (0.907–0.949) | 0.925 (0.902–0.944) | 0.068 | 0.95 | 46.8% | 0.30 | –0.015 | +0.214 |
| Noise 0% (ideal labels) | 0.939 (0.915–0.961) | 0.046 | 0.923 (0.898–0.947) | 0.948 (0.929–0.964) | 0.945 (0.924–0.962) | 0.052 | 1.02 | 47.0% | 0.35 | +0.006 | +0.236 |
| Noise 10% (stress test) | 0.910 (0.880–0.939) | 0.059 | 0.893 (0.864–0.922) | 0.917 (0.894–0.938) | 0.919 (0.895–0.940) | 0.073 | 0.93 | 47.6% | 0.29 | –0.023 | +0.207 |
| Depth | Estimators | Learning Rate | ROC-AUC (95% CI) | CI Width | PR-AUC | Balanced Acc. | F1 | Brier | Calib. slope | ΔROC vs. baseline | ΔROC vs. Logistic |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 100 | 0.10 | 0.902 (0.873–0.931) | 0.058 | 0.881 | 0.906 | 0.908 | 0.082 | 0.91 | –0.031 | +0.199 |
| 2 | 200 | 0.10 | 0.910 (0.882–0.938) | 0.056 | 0.889 | 0.913 | 0.916 | 0.078 | 0.92 | –0.023 | +0.207 |
| 3 | 100 | 0.05 | 0.928 (0.902–0.953) | 0.051 | 0.909 | 0.935 | 0.932 | 0.062 | 0.98 | –0.005 | +0.225 |
| 3 | 200 | 0.10 (final) | 0.933 (0.908–0.958) | 0.050 | 0.918 | 0.943 | 0.940 | 0.056 | 1.00 | – | +0.230 |
| 3 | 500 | 0.10 | 0.936 (0.910–0.960) | 0.050 | 0.920 | 0.946 | 0.943 | 0.054 | 0.99 | +0.003 | +0.233 |
| 4 | 200 | 0.10 | 0.939 (0.913–0.962) | 0.049 | 0.922 | 0.947 | 0.944 | 0.052 | 1.02 | +0.006 | +0.236 |
| 5 | 200 | 0.10 | 0.941 (0.914–0.965) | 0.051 | 0.924 | 0.949 | 0.946 | 0.051 | 1.08 | +0.008 | +0.238 |
| 3 | 200 | 0.20 | 0.927 (0.900–0.953) | 0.053 | 0.907 | 0.934 | 0.930 | 0.061 | 1.04 | –0.006 | +0.224 |
| Source of Bias / Assumption | Description | Potential Impact | Severity | Interpretability Dimension | Evidence Base | Mitigation Strategy |
|---|---|---|---|---|---|---|
| Threshold-based labeling | Fixed cut-offs for H:Q, LSI, CCI | May over- or under-estimate imbalance prevalence | Moderate | Labeling / class balance | Sports medicine debates on H:Q cut-offs and LSI ±10–15% | Sensitivity analysis across threshold ranges (Section 2.6) |
| Synthetic subject distribution | Bimodal Gaussian latent profiles | Simplifies real variability; may omit intermediate cases | High | Generalization / external validity | Common in simulation studies; lacks mixed phenotypes | Transparent reporting; future validation on real athletes |
| Label noise (5%) | Random flips introduced for robustness | Increases uncertainty in performance metrics | Low | Calibration / stability | Standard ML stress-test technique | Bootstrap CIs; robustness analysis with 0–10% noise |
| Cross-validation scheme | Subject-wise GroupKFold | May underestimate generalization error vs. real data | Moderate | Generalization | Recommended in biomechanics ML, but limited to synthetic cohorts | Strict grouping; external validation recommended |
| Interpretability methods | Permutation importance, partial dependence | May oversimplify feature interactions | Moderate | Feature transparency | Widely used in interpretable ML; known limitations | Use multiple methods; report global + local perspectives |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).