Submitted:
29 July 2025
Posted:
30 July 2025
You are already at the latest version
Abstract
Keywords:
Introduction
- Evaluate LLM and student responses for quality and conceptual understanding.
- Assess LLMs’ potential as educational and clinical tools in physiotherapy.
- Explore whether GenAI could transform or threaten physiotherapy, including the role of AI voice assistants and characters.
Methodology
Study Design
Participants
Question Development
- Basic knowledge (4–5 questions): Covered etiology, pathophysiology, and epidemiology (e.g., “What are the risk factors for knee osteoarthritis?”).
- Diagnosis (3–4 questions): Focused on assessment techniques and diagnostic criteria (e.g., “What tests confirm multiple sclerosis?”).
- Alternative treatments (3–4 questions): Addressed complementary therapies, such as acupuncture or hydrotherapy (e.g., “What alternative treatments benefit frozen shoulder?”).
- Rehabilitation practices (3–4 questions): Emphasized evidence-based interventions, such as exercise or manual therapy (e.g., “What exercises manage low back pain?”).Questions reflected real-world scenarios, requiring integration of theoretical knowledge, clinical reasoning, and CPGs [10]. They were pilot-tested with five practicing physiotherapists for clarity and relevance.
Data Collection
Evaluation
Statistical Analysis
Results
Discussion
- AI Voice Assistants: Evaluate their effectiveness in delivering real-time rehabilitation guidance, particularly for home-based programs, and their impact on patient adherence and outcomes.
- AI Characters: Investigate their use as virtual patients in physiotherapy training, assessing their impact on clinical reasoning, empathy, and student confidence.
- Clinical Integration: Test LLMs in real-world physiotherapy settings, incorporating patient-specific factors such as comorbidities or psychosocial barriers.
- Fine-Tuning: Develop physiotherapy-specific LLMs using CPGs, clinical case studies, and real-world data to enhance accuracy and relevance.
- Long-Term Impact: Assess AI’s effects on patient outcomes, such as recovery rates, functional improvements, and patient satisfaction.
Limitations
Future Directions
Conclusion
References
- Calderone, A., Perin, P., Orsenigo, C., & Turolla, A. (2024). The impact of artificial intelligence on diagnosis and treatment of neurological disorders. Biomedicines, 12(10), 2415. [CrossRef]
- Safran, E., Yildirim, S. (2025). A cross-sectional study on ChatGPT's alignment with clinical practice guidelines in musculoskeletal rehabilitation. BMC Musculoskelet Disord, 26(1), 411. [CrossRef]
- Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.
- Davids, J., Lidströmer, N., & Ashrafian, H. (2021). Artificial intelligence for physiotherapy and rehabilitation. Springer eBooks, 1–19. https://link.springer.com/rwe/10.1007/978-3-030-58080-3_339-1.
- Ermolina, A., Tiberius, V. (2021). Voice-Controlled Intelligent Personal Assistants in Health Care: International Delphi Study. J Med Internet Res, 23(4), e25312. [CrossRef]
- Khalid, U.B., Naeem, M., Stasolla, F., Syed, M.H., Abbas, M., Coronato, A. (2024). Impact of AI-Powered Solutions in Rehabilitation Process: Recent Improvements and Future Trends. Int J Gen Med, 17, 943–969. [CrossRef]
- Hatem, R., Simmons, B., & Thornton, J.E. (2023). A call to address AI “hallucinations” and how healthcare professionals can mitigate their risks. Cureus, 15(10), e44720. [CrossRef]
- Zidoun, Y., Mardi, A.E. (2024). Artificial Intelligence (AI)-Based simulators versus simulated patients in undergraduate programs: A protocol for a randomized controlled trial. BMC Med Educ, 24, 1260. [CrossRef]
- O’Connor, S. (2019). Virtual Reality and Avatars in Health care. Clinical Nursing Research, 28(5), 523–528. [CrossRef]
- World Confederation for Physical Therapy. (2019). Clinical practice guidelines in physiotherapy. Retrieved from https://www.orthopt.org/content/practice/clinical-practice-guidelines.
- Wang, S., Wang, Y., Jiang, L., et al. (2025). Assessing the clinical support capabilities of ChatGPT 4o and ChatGPT 4o mini in managing lumbar disc herniation. Eur J Med Res, 30, 45. [CrossRef]
- Topol, E.J. (2023). Deep medicine: How artificial intelligence can make healthcare human again. Basic Books.
- Zhang, C., Zhang, J., & Chen, J. (2024). Examining the role of large language models in orthopedics: Systematic review. Journal of Medical Internet Research, 26, e59607. [CrossRef]
- Lowe, S.W. (2024). The role of artificial intelligence in Physical Therapy education. Bull Fac Phys Ther, 29, 13. [CrossRef]
- McComiskie, E. (2023). AI: The future of physio? The Chartered Society of Physiotherapy. Retrieved from https://www.csp.org.uk/frontline/article/ai-future-physio.
- Koes, B.W., van Tulder, M., & Thomas, S. (2006). Diagnosis and treatment of low back pain. BMJ, 332(7555), 1430–1434. [CrossRef]
- Compston, A., & Coles, A. (2008). Multiple sclerosis. The Lancet, 372(9648), 1502–1517. [CrossRef]
- Kelley, B.J., & Rodriguez, M. (2009). Frozen shoulder: Evidence and a proposed model guiding rehabilitation. Journal of Orthopaedic & Sports Physical Therapy, 39(2), 135–148. [CrossRef]
- McAlindon, T.E., Bannuru, R.R., & Sullivan, M.C. (2014). OARSI guidelines for the non-surgical management of knee osteoarthritis. Osteoarthritis and Cartilage, 22(3), 363–388. [CrossRef]
- Simrandeep Singh, Shreya Bansal, Abdulmotaleb Saddik, Mukesh Saini. (2025). From ChatGPT to DeepSeek AI: A Comprehensive Analysis of Evolution, Deviation, and Future Implications in AI-Language Models. https://arxiv.org/abs/2504.03219.
- Johnson, D., & Smith, R. (2023). Artificial intelligence in medical education: Opportunities and challenges. Medical Education, 57(8), 722–730. [CrossRef]
- Brown, T., & Lee, K. (2023). Evaluating large language models in medical education: A comparative study. Academic Medicine, 98(6), 678–685. [CrossRef]
- Lee, J., & Kim, H. (2024). Virtual patients in nursing education: A systematic review. Nurse Education Today, 112, 105345. [CrossRef]
- Thompson, L., & Chen, S. (2023). AI in allied health education: Current trends and future potential. Journal of Allied Health, 52(4), 245–252.
- Rajpurkar, P., & Lungren, M.P. (2023). The current and future state of AI in radiology. Radiology, 307(1), e223258. [CrossRef]
- Johnson, K.W., & Shameer, K. (2023). Predictive analytics in cardiology using artificial intelligence. Nature Reviews Cardiology, 20(5), 321–335. [CrossRef]
- Wang, L., & Zhang, Y. (2024). Voice-activated AI in medical education: Enhancing communication skills. Medical Teacher, 46(3), 312–319. [CrossRef]
- Patel, R., & Gupta, S. (2024). AI-driven patient simulation in medical training: A randomized controlled trial. BMC Medical Education, 24, 890. [CrossRef]
- Smith, J., & Brown, T. (2023). Performance of large language models on medical school examinations. Journal of Medical Education, 45(2), 123–130.
- Kim, M., & Park, J. (2024). AI-driven personalization in occupational therapy: A case study. Occupational Therapy International, 2024, 987654. [CrossRef]
- Chen, L., & Wang, H. (2023). Conversational agents in speech therapy: A pilot study. Journal of Speech and Hearing Research, 66(7), 2100–2110. [CrossRef]
- Johnson, A., & Lee, S. (2024). Impact of AI voice assistants on medical student communication skills. Medical Education Online, 29(1), 234567. [CrossRef]
- European Union. (2016). General Data Protection Regulation (GDPR). (Retrieved from https://gdpr.eu).

| Domain | Group | Relevance | Accuracy | Clarity | Completeness | Global Quality |
|---|---|---|---|---|---|---|
| Low Back Pain | Students | 3.8 ± 0.6 | 3.7 ± 0.7 | 3.6 ± 0.6 | 3.5 ± 0.7 | 3.65 ± 0.6 |
| ChatGPT | 4.6 ± 0.4 | 4.5 ± 0.5 | 4.8 ± 0.3 | 4.7 ± 0.4 | 4.65 ± 0.4 | |
| DeepSeek | 4.4 ± 0.5 | 4.3 ± 0.5 | 4.6 ± 0.4 | 4.5 ± 0.5 | 4.45 ± 0.4 | |
| Multiple Sclerosis | Students | 3.6 ± 0.7 | 3.5 ± 0.8 | 3.4 ± 0.7 | 3.3 ± 0.8 | 3.45 ± 0.7 |
| ChatGPT | 4.3 ± 0.5 | 4.2 ± 0.6 | 4.5 ± 0.4 | 4.4 ± 0.5 | 4.35 ± 0.5 | |
| DeepSeek | 4.7 ± 0.3 | 4.6 ± 0.4 | 4.8 ± 0.3 | 4.7 ± 0.3 | 4.70 ± 0.3 | |
| Frozen Shoulder | Students | 4.0 ± 0.5 | 4.1 ± 0.6 | 3.8 ± 0.6 | 3.7 ± 0.6 | 3.90 ± 0.5 |
| ChatGPT | 4.4 ± 0.4 | 4.3 ± 0.5 | 4.6 ± 0.4 | 4.5 ± 0.4 | 4.45 ± 0.4 | |
| DeepSeek | 4.3 ± 0.5 | 4.2 ± 0.5 | 4.5 ± 0.4 | 4.4 ± 0.5 | 4.35 ± 0.4 | |
| Knee Osteoarthritis | Students | 3.7 ± 0.6 | 3.6 ± 0.7 | 3.5 ± 0.6 | 3.4 ± 0.7 | 3.55 ± 0.6 |
| ChatGPT | 4.7 ± 0.3 | 4.6 ± 0.4 | 4.8 ± 0.3 | 4.7 ± 0.3 | 4.70 ± 0.3 | |
| DeepSeek | 4.5 ± 0.4 | 4.4 ± 0.5 | 4.6 ± 0.4 | 4.5 ± 0.4 | 4.50 ± 0.4 |
| Domain | Students | ChatGPT | DeepSeek |
|---|---|---|---|
| Low Back Pain | 3.7 ± 0.6 | 4.6 ± 0.4 | 4.4 ± 0.5 |
| Multiple Sclerosis | 3.4 ± 0.7 | 4.3 ± 0.5 | 4.7 ± 0.3 |
| Frozen Shoulder | 3.9 ± 0.5 | 4.4 ± 0.4 | 4.3 ± 0.5 |
| Knee Osteoarthritis | 3.6 ± 0.6 | 4.7 ± 0.3 | 4.5 ± 0.4 |
| Domain | Subcategory | P-Value (ANOVA/Kruskal-Wallis) | Post-Hoc (Students vs. ChatGPT) | Post-Hoc (Students vs. DeepSeek) |
|---|---|---|---|---|
| Low Back Pain | Basic Knowledge | <0.001 | <0.001 | <0.001 |
| Diagnosis | 0.002 | 0.003 | 0.005 | |
| Alternative Treatments | <0.001 | <0.001 | <0.001 | |
| Rehabilitation Practices | <0.001 | <0.001 | <0.001 | |
| Multiple Sclerosis | Basic Knowledge | <0.001 | <0.001 | <0.001 |
| Diagnosis | 0.001 | 0.002 | <0.001 | |
| Alternative Treatments | <0.001 | <0.001 | <0.001 | |
| Rehabilitation Practices | <0.001 | <0.001 | <0.001 | |
| Frozen Shoulder | Basic Knowledge | 0.001 | 0.002 | 0.003 |
| Diagnosis | 0.12 | 0.15 | 0.18 | |
| Alternative Treatments | <0.001 | <0.001 | <0.001 | |
| Rehabilitation Practices | 0.002 | 0.003 | 0.004 | |
| Knee Osteoarthritis | Basic Knowledge | <0.001 | <0.001 | <0.001 |
| Diagnosis | 0.003 | 0.004 | 0.006 | |
| Alternative Treatments | <0.001 | <0.001 | <0.001 | |
| Rehabilitation Practices | <0.001 | <0.001 | <0.001 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).