Submitted:
20 May 2026
Posted:
21 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methods
2.1. Study Design and Patient Cohorts
2.2. Fluence Map Generation and ROI Extraction
2.3. MLC Error Simulation and Label Encoding
2.4. Deep Learning Model Architecture and Optimization
2.5. Implementation of Traditional Machine Learning Models for Comparison
2.6. Performance Evaluation
3. Statistical Methods
4. Results
4.1. Model Training Performance
4.2. Cross-Validation Assessment
4.3. Comparative Analysis with Traditional Machine Learning
4.4. Model Reliability Assessment
4.5. External Validation
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
References
- Otto, K. Volumetric modulated arc therapy: IMRT in a single gantry arc. Med. Phys. 2008, 35(1), 310–7. [Google Scholar] [CrossRef] [PubMed]
- Brahme, A. Optimization of stationary and moving beam radiation therapy techniques. Radiother. Oncol. 1988, 12(2), 129–40. [Google Scholar] [CrossRef] [PubMed]
- Ezzell, G.A.; et al. Guidance document on delivery, treatment planning, and clinical implementation of IMRT: report of the IMRT Subcommittee of the AAPM Radiation Therapy Committee. Med. Phys. 2003, 30(8), 2089–115. [Google Scholar] [CrossRef]
- Fredh, A.; et al. Patient QA systems for rotational radiation therapy: a comparative experimental study with intentional errors. Med. Phys. 2013, 40(3), 031716. [Google Scholar] [CrossRef]
- Heo, T.; et al. The effect of beam interruption during VMAT delivery on the delivered dose distribution. Phys. Med. 2015, 31(3), 297–300. [Google Scholar] [CrossRef] [PubMed]
- Park, J.M.; Park, S.Y.; Kim, H. Modulation index for VMAT considering both mechanical and dose calculation uncertainties. Phys. Med. Biol. 2015, 60(18), 7101–25. [Google Scholar] [CrossRef]
- Park, S.Y.; et al. Evaluation of the plan delivery accuracy of intensity-modulated radiation therapy by texture analysis using fluence maps. Phys. Med. 2019, 59, 64–74. [Google Scholar] [CrossRef]
- Heilemann, G.; Poppe, B.; Laub, W. On the sensitivity of common gamma-index evaluation methods to MLC misalignments in Rapidarc quality assurance. Med. Phys. 2013, 40(3), 031702. [Google Scholar] [CrossRef]
- Kim, J.I.; et al. The sensitivity of gamma-index method to the positioning errors of high-definition MLC in patient-specific VMAT QA for SBRT. Radiat. Oncol. 2014, 9, 167. [Google Scholar] [CrossRef]
- Yan, G.; et al. On the sensitivity of patient-specific IMRT QA to MLC positioning errors. J. Appl. Clin. Med. Phys. 2009, 10(1), 120–128. [Google Scholar] [CrossRef]
- Park, J.M.; et al. Modulation indices for volumetric modulated arc therapy. Phys. Med. Biol. 2014, 59(23), 7315–40. [Google Scholar] [CrossRef]
- Park, S.Y.; et al. Textural feature calculated from segmental fluences as a modulation index for VMAT. Phys. Med. 2015, 31(8), 981–990. [Google Scholar] [CrossRef]
- Kim, D.S.; et al. To propose adding index of achievement (IOA) to IMRT QA process. Radiat. Oncol. 2018, 13(1), 112. [Google Scholar] [CrossRef]
- Nyflot, M.J.; et al. Deep learning for patient-specific quality assurance: Identifying errors in radiotherapy delivery by radiomic analysis of gamma images with convolutional neural networks. Med. Phys. 2019, 46(2), 456–464. [Google Scholar] [CrossRef]
- Wootton, L.S.; et al. Error Detection in Intensity-Modulated Radiation Therapy Quality Assurance Using Radiomic Analysis of Gamma Distributions. Int. J. Radiat. Oncol. Biol. Phys. 2018, 102(1), 219–228. [Google Scholar] [CrossRef] [PubMed]
- Kimura, Y.; et al. Error detection using a convolutional neural network with dose difference maps in patient-specific quality assurance for volumetric modulated arc therapy. Phys. Med. 2020, 73, 57–64. [Google Scholar] [CrossRef] [PubMed]
- Nakamura, S.; et al. Deep learning-based detection and classification of multi-leaf collimator modeling errors in volumetric modulated radiation therapy. J. Appl. Clin. Med. Phys. 2023, 24(12), e14136. [Google Scholar] [CrossRef]
- Carlson, J.N.; et al. A machine learning approach to the accurate prediction of multi-leaf collimator positional errors. Phys. Med. Biol. 2016, 61(6), 2514–31. [Google Scholar] [CrossRef] [PubMed]
- Osman, A.F.I.; Maalej, N.M.; Jayesh, K. Prediction of the individual multileaf collimator positional deviations during dynamic IMRT delivery priori with artificial neural network. Med. Phys. 2020, 47(4), 1421–1430. [Google Scholar] [CrossRef]
- Chuang, K.C.; Giles, W.; Adamson, J. A tool for patient-specific prediction of delivery discrepancies in machine parameters using trajectory log files. Med. Phys. 2021, 48(3), 978–990. [Google Scholar] [CrossRef]
- Klein, E.E.; et al. Task Group 142 report: quality assurance of medical accelerators. Med. Phys. 2009, 36(9), 4197–212. [Google Scholar] [CrossRef]






| Cohort | Patients | Purpose | Data Split | Samples per Leaf | Total Samples |
| Prostate (primary) | 20 | Model development | Training 80% / Validation 10% / Test 10% | 2,420 | 48,400 |
| Prostate (additional) | 20 | External validation (same site) | All external test | 2,420 | 48,400 |
| Head & Neck | 10 | External validation (cross-site) | All external test | 1,210 | 24,200 |
| Metric | Value (%) |
| Accuracy | 97.21 |
| Precision | 97.14 |
| Recall | 97.06 |
| F1-score | 97.04 |
| Model Type | Training Accuracy (%) | Test Accuracy (%) |
| CNN | 97.10 | 97.21 |
| XGBoost | 99.92 | 69.86 |
| Random Forest | 94.29 | 22.00 |
| CatBoost | 99.60 | 63.95 |
| Comparison |
p-value (paired t-test) |
Corrected p-value (Holm-Bonferroni) |
Effect Size (Cohen’s d) | Interpretation |
| CNN vs XGBoost | 2.3 × 10-7 | < 0.001 | 44.8 | Extremely large effect |
| CNN vs Random Forest | 9.6 × 10-9 | < 0.001 | 99.6 | Extremely large effect |
| CNN vs CatBoost | 1.4 × 10-7 | < 0.001 | 43.0 | Extremely large effect |
| Metric | Value |
| Total samples | 4,840 |
| Misclassified samples | 145 |
| Misclassification rate | 3.00 % |
| Mean Bank A deviation magnitude | 0.52 mm |
| Mean Bank B deviation magnitude | 0.72 mm |
| Maximum Bank A deviation magnitude | 5 mm |
| Maximum Bank B deviation magnitude | 5 mm |
| Deviation magnitude | Bank A (%) | Bank B (%) |
| 0 mm | 55.1 | 35.9 |
| 1.0 mm | 40.7 | 58.6 |
| 2 mm | 2.1 | 4.1 |
| 3 mm | 1.4 | 0.7 |
| 4 mm | 0.0 | 0.0 |
| 5 mm | 0.7 | 0.7 |
| Metric | Value (%) |
| Accuracy | 96.19 |
| Precision | 96.17 |
| Recall | 96.79 |
| F1-score | 96.11 |
| Metric | Value (%) |
| Accuracy | 93.72 |
| Precision | 94.34 |
| Recall | 93.71 |
| F1-score | 93.78 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.