Submitted:
02 November 2023
Posted:
02 November 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Data Preprocessing and Exploratory Data Analysis
2.2. Predictiction Models
2.2.1. Random Forest (RF)
2.2.2. Multiple Linear Regression (MLR)
2.3. Performance Metrics
2.4. Feature Importance Analysis
2.4.1. Random Forest (RF)
2.4.2. Extreme Gradient Boosting (XGBoost) and Adaptive Boosting (AdaBoost)
3. Results
3.1. Prediction Model
3.2. Feature Importances
4. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Glassman, G. Understanding CrossFit. Crossfit Journal 2007, 56, 1–2.
- Sprey, J. W.; Ferreira, T.; de Lima, M. V.; Duarte Jr, A.; Jorge, P. B.; Santili, C. An epidemiological profile of CrossFit athletes in Brazil. Orthopaedic Journal of Sports Medicine 2016, 4 (8), 2325967116663706. [CrossRef]
- CrossFit | About Affiliation. [cited 2020 Feb 9]. Available from: https://www.crossfit.com/affiliate.
- International Weightlifting Federation - International Weightlifting Federation. [cited 2020 Jan 25].
- IPF- International Powerlifting Federation IPF. [cited 2020 Jan 25].
- Maté-Muñoz, J. L.; Lougedo, J. H.; Barba, M.; et al. Muscular fatigue in response to different modalities of CrossFit sessions. PLoS One 2017, 12 (7), e0181855. [CrossRef]
- Feito, Y.; Heinrich, K. M.; Butcher, S. J.; Poston, W. S. C. High-intensity functional training (HIFT): Definition and research implications for improved fitness. Sports 2018, 6 (3), 76. [CrossRef]
- Claudino, J. G.; Gabbett, T. J.; Bourgeois, F.; Souza, H. S.; Miranda, R. C.; Mezêncio, B.; et al. CrossFit overview: Systematic review and meta-analysis. Sports Medicine Open 2018, 4 (1), 11. [CrossRef]
- Jacob, N.; Novaes, J. S.; Behm, D. G.; Vieira, J. G.; Dias, M. R.; Vianna, J. M. Characterization of hormonal, metabolic, and inflammatory responses in CrossFit® training: A systematic review. Frontiers in Physiology 2020, 11, 1001. [CrossRef]
- De Souza, R. A. S.; Da Silva, A. G.; De Souza, M. F.; Souza, L. K. F.; Roschel, H.; Da Silva, S. F.; et al. A systematic review of CrossFit® workouts and dietary and supplementation interventions to guide nutritional strategies and future research in CrossFit®. International Journal of Sport Nutrition and Exercise Metabolism 2021, 31, 187–205. [CrossRef]
- Meier, N.; Schlie, J.; Schmidt, A. Physiological effects of regular CrossFit® training and the impact of the COVID-19 pandemic—A systematic review. Frontiers in Physiology 2023, 14, 1146718. [CrossRef]
- Butcher, S. J.; Neyedly, T. J.; Horvey, K. J.; Benko, C. R. Do physiological measures predict selected CrossFit® benchmark performance? Open Access Journal of Sports Medicine 2015, 6, 241. pmid:26261428. [CrossRef]
- Martínez-Gómez, R.; Valenzuela, P. L.; Barranco-Gil, D.; Moral-González, S.; García-González, A.; Lucia, A. Full-Squat as a Determinant of Performance in CrossFit. International Journal of Sports Medicine 2019, 40 (09), 592–6. [CrossRef]
- Serafini, P. R.; Feito, Y.; Mangine, G. T. Self-reported measures of strength and sport-specific skills distinguish ranking in an international online fitness competition. Journal of Strength and Conditioning Research 2018, 32 (12), 3474–84. pmid:28195976. [CrossRef]
- Bellar, D.; Hatchett, A.; Judge, L.; Breaux, M.; Marcus, L. The relationship of aerobic capacity, anaerobic peak power, and experience to performance in CrossFit exercise. Biology of Sport 2015, 32 (4), 315–20. pmid:26681834. [CrossRef]
- Barbieri, J. F.; Correia, R. F.; Castaño, L. A. A.; Brasil, D. V. C.; Ribeiro, A. N. Comparative and correlational analysis of the performance from 2016 CrossFit Games high-level athletes. Manual Therapy, Posturology & Rehabilitation Journal = Revista Manual Therapy 2017, 15. [CrossRef]
- Mangine, G. T.; Stratton, M. T.; Almeda, C. G.; Roberts, M. D.; Esmat, T. A.; VanDusseldorp, T. A.; Feito, Y. Physiological differences between advanced CrossFit athletes, recreational CrossFit participants, and physically-active adults. PLoS One 2020, 15 (4), e0223548. [CrossRef]
- Leitão, L.; Dias, M.; Campos, Y.; Vieira, J. G.; Sant’Ana, L.; Telles, L. G.; Vianna, J. Physical and physiological predictors of FRAN CrossFit® WOD athlete’s performance. International Journal of Environmental Research and Public Health 2021, 18 (8), 4070. [CrossRef]
- Mangine, G. T.; Tankersley, J. E.; McDougle, J. M.; Velazquez, N.; Roberts, M. D.; Esmat, T. A.; Feito, Y. Predictors of CrossFit open performance. Sports 2020, 8 (7), 102. [CrossRef]
- Meier, N.; Rabel, S.; Schmidt, A. Determination of a CrossFit® benchmark performance profile. Sports 2021, 9 (6), 80. [CrossRef]
- Liu, H.; Zhu, X. Design of the Physical Fitness Evaluation Information Management System of Sports Athletes Based on Artificial Intelligence. Comput. Intell. Neurosci. 2022, 30, 1925757. [CrossRef]
- Kapadia, K.; Abdel-Jaber, H.; Thabtah, F.; Hadi, W. Sport Analytics for Cricket Game Results Using Machine Learning: An Experimental Study. Appl. Comput. Informatics 2020, 18(3/4), 256-266. [CrossRef]
- Sarlis, V.; Tjortjis, C. Sports Analytics—Evaluation of Basketball Players and Team Performance. Inf. Syst. 2020, 93, 101562. [CrossRef]
- Nguyen, N. H.; Nguyen, D. T. A.; Ma, B.; Hu, J. The Application of Machine Learning and Deep Learning in Sport: Predicting NBA Players’ Performance and Popularity. J. Inf. Telecommun. 2022, 6(2), 217-235. [CrossRef]
- Kaggle | Crossfit Athletes. [cited 2023 Feb 2]. Available from: https://www.kaggle.com/datasets/ulrikthygepedersen/crossfit-athletes/data.
- Breiman, L. Random forests. Machine Learning 2001, 45, 5-32. [CrossRef]
- Miah, J.; Mamun, M.; Rahman, M. M.; Mahmud, M. I.; Ahmad, S.; Nasir, M. H. B. Mhfit: Mobile health data for predicting athletics fitness using machine learning models. In 2022 2nd International Seminar on Machine Learning, Optimization, and Data Science (ISMODE), 2022; pp. 584-589. IEEE.
- Breiman, L. Bagging predictors. Machine Learning 1996, 24, 123–140. [CrossRef]
- Apostolou, K.; Tjortjis, C. Sports Analytics algorithms for performance prediction. In 2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA), 2019; pp. 1-4. IEEE.
- Sinnakaudan, S. K.; Ghani, A. A.; Ahmad, M. S.; Zakaria, N. A. Multiple linear regression model for total bed material load prediction. Journal of Hydraulic Engineering 2006, 132 (5), 521-528. [CrossRef]
- Rath, S.; Tripathy, A.; Tripathy, A. R. Prediction of New Active Cases of Coronavirus Disease (COVID-19) Pandemic Using Multiple Linear Regression Model. Diabetes Metab. Syndr. 2020, 14(5), 1467-1474. [CrossRef]
- Xu, D.; Song, Y.; Meng, Y.; István, B.; Gu, Y. Relationship Between Firefighter Physical Fitness and Special Ability Performance: Predictive Research Based on Machine Learning Algorithms. Int. J. Environ. Res. Public Health 2020, 17(20), 7689. [CrossRef]
- Zaras, N.; Stasinaki, A. N.; Methenitis, S.; Karampatsos, G.; Fatouros, I.; Hadjicharalambous, M.; Terzis, G. Track and Field Throwing Performance Prediction: Training Intervention, Muscle Architecture Adaptations and Field Tests Explosiveness Ability. J. Phys. Educ. Sport 2019, 19, 436-443. [CrossRef]
- Hamilton, D. F.; Ghert, M.; Simpson, A. H. R. W. Interpretation of Regression Models in Clinical Outcome Studies Hamilton et al., 2015.
- Roy, B. Optimum Machine Learning Algorithm Selection for Forecasting Vegetation Indices: MODIS NDVI & EVI. Remote Sens. Appl. Soc. Environ. 2021, 23, 100582. [CrossRef]
- Chicco, D.; Warrens, M. J.; Jurman, G. The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE, and RMSE in Regression Analysis Evaluation. PeerJ Comput. Sci. 2021, 7, e623. [CrossRef]
- Gujarati, D. N. Basic Econometrics. Prentice Hall, 2022.
- Saarela, M.; Jauhiainen, S. Comparison of Feature Importance Measures as Explanations for Classification Models. SN Appl. Sci. 2021, 3, 1-12. [CrossRef]
- Gao, T.; Liu, J. Application of Improved Random Forest Algorithm and Fuzzy Mathematics in Physical Fitness of Athletes. J. Intell. Fuzzy Syst. 2021, 40(2), 2041-2053. [CrossRef]
- Khator, D.; Prajapati, D. IPL Prediction Using Machine Learning. [CrossRef]
- Zhu, Y.; Naikar, R. Predicting Tennis Serve Directions with Machine Learning. Int. Workshop Mach. Learn. Data Min. Sports Analytics 2022, 89-100. [CrossRef]
- Zamani Joharestani, M.; Cao, C.; Ni, X.; Bashir, B.; Talebiesfandarani, S. PM2.5 Prediction Based on Random Forest, XGBoost, and Deep Learning Using Multisource Remote Sensing Data. Atmosphere 2019, 10(7), 373. [CrossRef]
- Amendolara, A.; Pfister, D.; Settelmayer, M.; Shah, M.; Wu, V.; Donnelly, S.; Bills, K. An Overview of Machine Learning Applications in Sports Injury Prediction. Cureus 2023, 15(9). [CrossRef]
- Pantzalis, V. C.; Tjortjis, C. Sports Analytics for Football League Table and Player Performance Prediction. 2020 11th Int. Conf. Inf. Intell. Syst. Appl. (IISA) 2020, 1-8.
- Gao, Z.; Kowalczyk, A. Random Forest Model Identifies Serve Strength as a Key Predictor of Tennis Match Outcome. J. Sports Analytics 2021, 7(4), 255-262. [CrossRef]
- Al-Asadi, M. A.; Tasdemır, S. Predict the Value of Football Players Using FIFA Video Game Data and Machine Learning Techniques. IEEE Access 2022, 10, 22631-22645. [CrossRef]
- Sikka, D.; Rajeswari, D. Basketball Win Percentage Prediction using Ensemble-based Machine Learning. 2022 6th Int. Conf. Electron. Commun. Aerosp. Technol. 2022, 885-890.
- Shetty, M.; Rane, S.; Pandita, C.; Salvi, S. Machine Learning-based Selection of Optimal Sports Team based on the Players’ Performance. 2020 5th Int. Conf. Commun. Electron. Syst. (ICCES) 2020, 1267-1272.
- Rodrigues, N.; Sequeira, N.; Rodrigues, S.; Shrivastava, V. Cricket Squad Analysis Using Multiple Random Forest Regression. 2019 1st Int. Conf. Adv. Inf. Technol. (ICAIT) 2019, 104-108.
- Tümer, A. E.; Akyıldız, Z.; Güler, A. H., et al. Prediction of Soccer Clubs’ League Rankings by Machine Learning Methods: The Case of Turkish Super League. Proc. Inst. Mech. Eng., Part P J. Sports Eng. Technol. 2022, 0(0). [CrossRef]
- Li, H. Analysis on the Construction of Sports Match Prediction Model Using Neural Network. Soft Computing 2020, 24(11), 8343-8353. [CrossRef]
- Du, P.; Wang, Y.; Liao, C.; Xian, T. Sports Games Attendance Forecast Using Machine Learning. 2022 IEEE 2nd Int. Conf. Data Sci. Comput. Appl. (ICDSCA) 2022, 181-188.
- Geng, S.; Hu, T. Sports Games Modeling and Prediction Using Genetic Programming. 2020 IEEE Congress Evolutionary Comput. (CEC) 2020, 1-6.
- Lucero, R. A.; Fry, A. C.; LeRoux, C. D.; Hermes, M. J. Relationships Between Barbell Squat Strength and Weightlifting Performance. Int. J. Sports Sci. Coaching 2019, 14(4), 562-568. [CrossRef]
- Charniga, A., The relative value of the back squat in the training of weightlifters. 2018.
- Stone, M. H.; Sands, W. A.; Pierce, K. C., et al. Relationship of Maximum Strength to Weightlifting Performance. Med. Sci. Sports Exerc. 2005, 37, 1037–1043. [CrossRef]
- Moon, Y. J.; Park, T. M. Extraction of Major Training Methods Highly Related to Snatch Record and Jerk Record Improvement. Korean J. Sport Biomechanics 2021, 31(2), 148-153. [CrossRef]
- Jones, M. T.; Jagim, A. R.; Haff, G. G.; Carr, P. J.; Martin, J.; Oliver, J. M. Greater Strength Drives Difference in Power Between Sexes in the Conventional Deadlift Exercise. Sports 2016, 4(3), 43. [CrossRef]
- Lomasney, S.; Lessard, A.; Steitz, A.; Bosse, M., Gender Differences Between Overall Resistance Work and Overall Energy Costs. 2014.
- Zecchin, A.; Puggina, E. F.; Hortobágyi, T.; Granacher, U. Association Between Foundation Strength and Weightlifting Exercises in Highly Trained Weightlifters: Support for a General Strength Component. J. Strength Conditioning Res. 2023, 37(7), 1375-1381. [CrossRef]
- Zecchin, A.; Puggina, E. F.; Hortobágyi, T.; Granacher, U. Association between Foundation Strength and Weightlifting Exercises in Highly Trained Weightlifters: Support for a General Strength Component. J. Strength Conditioning Res. 2022, 10, 1519. [CrossRef]
- Brooks, T.; Cressey, E. Mobility Training for the Young Athlete. Strength Conditioning J. 2013, 35(3), 27-33. [CrossRef]


| Exercise | Metric | Random Forest | Multiple Linear Regression |
|---|---|---|---|
| Clean & jerk | R2 | 0.933 * | 0.929 |
| MSE | 0.063 * | 0.070 | |
| Snatch | R2 | 0.895 | 0.898 * |
| MSE | 0.100 * | 0.101 | |
| Back squat | R2 | 0.847 | 0.862 * |
| MSE | 0.113 * | 0.137 | |
| Deadlift | R2 | 0.805 | 0.819 * |
| MSE | 0.185 | 0.180 * |
| Exercise | Rank | Feature Name | Random Forest | XGBoost | AdaBoost | Mean |
|---|---|---|---|---|---|---|
| 1 | Snatch | 0.892 | 0.607 | 0.734 | 0.744 | |
| Clean & jerk | 2 | Back squat | 0.044 | 0.240 | 0.168 | 0.145 |
| 3 | Deadlift | 0.013 | 0.075 | 0.069 | 0.047 | |
| 1 | Clean & jerk | 0.901 | 0.920 | 0.848 | 0.889 | |
| Snatch | 2 | Back squat | 0.016 | 0.028 | 0.063 | 0.035 |
| 3 | Deadlift | 0.016 | 0.003 | 0.040 | 0.019 | |
| 1 | Clean & jerk | 0.498 | 0.530 | 0.468 | 0.498 | |
| Back squat | 2 | Deadlift | 0.376 | 0.393 | 0.420 | 0.396 |
| 3 | Snatch | 0.020 | 0.027 | 0.047 | 0.031 | |
| 1 | Back squat | 0.794 | 0.502 | 0.519 | 0.605 | |
| Deadlift | 2 | Clean & jerk | 0.034 | 0.289 | 0.245 | 0.189 |
| 3 | Gender | 0.017 | 0.153 | 0.060 | 0.076 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).