Submitted:
22 May 2025
Posted:
23 May 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Research Objectives
3. Data and Methods
3.1. Data Collection
| Variable | Symbol | Description |
|---|---|---|
| sets | St | Number of Sets won by Player 1/2 |
| games | Gm | Number of Games won by Player 1/2 |
| score | Scr | Scores of Player 1/2 |
| serve | Srv | Serve by Player 1/2 |
| points | Pt | Number of Points won by Player 1/2 |
| point_victor | PtVct | Point Victor is Player 1/2 |
| ace | Ace | Ace by Player 1/2 |
| break_pt_won | BPtW | Break points won by Player 1/2 |
| double_fault | DF | Double Fault made by Player 1/2 |
| rally_count | Ra | The number of rallies |
| distance_run | DR | The meters of running distance for Player 1/2 |
3.2. Methods
4. Momentum Quantification
4.1. Mechanism
4.1.1. Real-Time Winning Probability
4.1.2. Leverage Based on Counterfactual Prediction Framework
4.1.3. Momentum and Visualization
- Djokovic’s Dominance
- First Swing
- Alcaraz’s Fight Back
- Second Swing
- Alcaraz’s Victory
4.2. Interpretation of Momentum and Break-Serve Points
4.3. 2024 Paris Olympics Tennis Women's Single
5. Game Winner Prediction
5.1. Model Training
5.1.1. Training and Testing Set Splitting
5.1.2. Feature Engineering
- Momentum: M(t) = M1(t) – M2(t);
- Distance Run Difference (D_DR): p1_distance_run – p2_distance_run;
- Served Score (SrvScr): the cumulative points won when p1 served in the game;
- Received Score (RcvScr): the cumulative points won when p1 received in the game;
- Score Difference (D_Scr): p1_score – p2_score;
- Game won Difference (D_Gm): p1_game – p2_game;
- Others: Serve (Srv), Set number (St), Game number (Gm), Point number (Pt), and Point Victor (PtVct).

5.2. Model Accuracy
5.2.1. Training Set Accuracy
5.2.2. Testing Set Accuracy

6. Discussion
7. Conclusions
Author Contributions
Funding
Acknowledgments
Disclosure Statement
References
- L. Crust and M. Nesti, A review of psychological momentum in sports: Why qualitative research is needed, Athletic Insight 8 (2006), pp. 1-15.
- H. Dietl and C. Nesseler, Momentum in tennis: Controlling the match, UZH Business Working Paper Series 365 (2017). [CrossRef]
- C. A. Depken, J. M. Gandar, and D. A. Shapiro, Set-level strategic and psychological momentum in best-of-three-set professional tennis matches, J. Sports Econ. 23 (2022), pp. 598-623. [CrossRef]
- B. Moss and P. O’Donoghue, Momentum in US Open men’s singles tennis, Int. J. Perform. Anal. Sport 15 (2015), pp. 884-896. [CrossRef]
- P. Meier, R. Flepp, M. Ruedisser, and E. Franck, Separating psychological momentum from strategic momentum: Evidence from men’s professional tennis, J. Econ. Psychol. 78 (2020), p. 102269. [CrossRef]
- P. O’Donoghue and E. Brown, Sequences of service points and the misperception of momentum in elite tennis, Int. J. Perform. Anal. Sport 9 (2009), pp. 113-127. [CrossRef]
- Wimbledon, "Wimbledon Official Website," 2024; website available at https://www.wimbledon.com/index.html.
- R. Seidl and P. Lucey, Live counter-factual analysis in women's tennis using automatic key-moment detection, in Proc. MIT Sloan Sports Anal. Conf, 2022.
- T. Chen and C. Guestrin, XGBoost: A scalable tree boosting system, in Proc. 22nd ACM SIGKDD Int. Conf. Knowl. Discov. Data Min. (KDD'16), Assoc. Comput. Mach., 2016, pp. 785-794. [CrossRef]
- SHAP, "SHAP Documentation," 2024; documentation available at https://shap.readthedocs.io/en/latest/api.html.
- E. Colino, J. García-Unanue, J. L. Felipe, and others, Mechanical properties influencing athlete–surface interaction on tennis court surfaces, Sports Eng. 27 (2024), p. 18. [CrossRef]
- T. K. Kim, T test as a parametric statistic, Korean J. Anesthesiol. 68 (2015), pp. 540–546. [CrossRef]
- C. Goutte and E. Gaussier, A probabilistic interpretation of precision, recall, and F-score, with implications for evaluation, in Advances in Information Retrieval, Vol. 3408, Springer, 2005. [CrossRef]
- J. Huang and C. X. Ling, Using AUC and accuracy in evaluating learning algorithms, IEEE Trans. Knowl. Data Eng. 17 (2005), pp. 299–310. [CrossRef]
- M. W. Browne, Cross-validation methods, J. Math. Psychol. 44 (2000), pp. 108-132. [CrossRef]
- J. Snoek, H. Larochelle, and R. P. Adams, Practical Bayesian optimization of machine learning algorithms, Adv. Neural Inf. Process. Syst. 25 (2012).
- E. Gillet, D. Leroy, R. Thouvarecq, and J. F. Stein, A notational analysis of elite tennis serve and serve-return strategies on slow surface, J. Strength Cond. Res. 23 (2009), pp. 532-539. [CrossRef]
- W. Gu and T. L. Saaty, Predicting the outcome of a tennis tournament: Based on both data and judgments, J. Syst. Sci. Syst. Eng. 28 (2019), pp. 317–343. [CrossRef]
- J. S. Hunter, The Exponentially Weighted Moving Average, J. Qual. Technol. 18 (1986), pp. 203–210. [CrossRef]
- Paris 2024 Olympics; ‘2024 Paris Olympics Tennis Women Single,’ 2024; Website available at https://olympics.com/en/paris-2024.













| Accuracy | Precision | Recall | F1 | AUC |
| 0.852026 | 0.838200 | 0.866872 | 0.852295 | 0.852246 |
| Accuracy | Precision | Recall | F1 | AUC |
| 0.791367 | 0.737113 | 0.869301 | 0.797768 | 0.795306 |
| Multiple Splitting | Accuracy | Precision | Recall | F1 | AUC | |
| Set 1 | Training | 0.831157 | 0.819723 | 0.838059 | 0.82879 | 0.831323 |
| Testing | 0.876129 | 0.878109 | 0.882500 | 0.880299 | 0.875917 | |
| Set 2 | Training | 0.843791 | 0.829513 | 0.858286 | 0.843654 | 0.844046 |
| Testing | 0.812048 | 0.794258 | 0.825871 | 0.809756 | 0.812468 | |
| Set 3 | Training | 0.840432 | 0.825561 | 0.851446 | 0.838304 | 0.840736 |
| Testing | 0.818008 | 0.848921 | 0.816609 | 0.832451 | 0.818176 | |
| Set 4 | Training | 0.835654 | 0.827597 | 0.842481 | 0.834972 | 0.835742 |
| Testing | 0.893191 | 0.881020 | 0.891117 | 0.886040 | 0.893059 | |
| Set 5 | Training | 0.848803 | 0.834629 | 0.860853 | 0.847538 | 0.849081 |
| Testing | 0.788827 | 0.800905 | 0.778022 | 0.789298 | 0.789011 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).