Submitted:
23 January 2025
Posted:
23 January 2025
You are already at the latest version
Abstract
Keywords:
Introduction
Methodology
Research Instrument
Data Sources
Study Design
Statistical Analysis
Results
Evaluation of GPT Model Performance Across Versions in BCM I and II
GPT-4o Was the Only Model to Surpass the Passing Threshold
Approximate 10% Increase in GPT’s Performance per Version Update in BCM I
Over 20% Performance Increase in GPT for BCM II with Each Version Update
Discussion
Conclusion
References
- Sahni, N.R.; Carrus, B. Artificial Intelligence in U.S. Health Care Delivery. New England Journal of Medicine 2023, 389, 348–358. [Google Scholar] [CrossRef]
- Sallam, M. ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns. Healthcare 2023, 11, 887. [Google Scholar] [CrossRef]
- Li, J.; Dada, A.; Puladi, B.; Kleesiek, J.; Egger, J. ChatGPT in healthcare: A taxonomy and systematic review. Computer Methods and Programs in Biomedicine 2024, 245, 108013. [Google Scholar] [CrossRef] [PubMed]
- Huang, J.; Tan, M. The role of ChatGPT in scientific communication: writing better scientific review articles. Am J Cancer Res 2023, 13, 1148–1154. [Google Scholar] [PubMed]
- Dave, T.; Athaluri, S.A.; Singh, S. ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations. Front Artif Intell 2023, 6, 1169595. [Google Scholar] [CrossRef] [PubMed]
- Ray, P.P. ChatGPT: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Internet of Things and Cyber-Physical Systems 2023, 3, 121–154. [Google Scholar] [CrossRef]
- Tong, W.; Guan, Y.; Chen, J.; Huang, X.; Zhong, Y.; Zhang, C.; Zhang, H. Artificial intelligence in global health equity: an evaluation and discussion on the application of ChatGPT, in the Chinese National Medical Licensing Examination. Front Med (Lausanne) 2023, 10, 1237432. [Google Scholar] [CrossRef]
- Liu, M.; Okuhara, T.; Chang, X.; Shirabe, R.; Nishiie, Y.; Okada, H.; Kiuchi, T. Performance of ChatGPT Across Different Versions in Medical Licensing Examinations Worldwide: Systematic Review and Meta-Analysis. J Med Internet Res 2024, 26, e60807. [Google Scholar] [CrossRef] [PubMed]
- Huang, C.H.; Hsiao, H.J.; Yeh, P.C.; Wu, K.C.; Kao, C.H. Performance of ChatGPT on Stage 1 of the Taiwanese medical licensing exam. Digit Health 2024, 10, 20552076241233144. [Google Scholar] [CrossRef]
- Lin, S.Y.; Chan, P.K.; Hsu, W.H.; Kao, C.H. Exploring the proficiency of ChatGPT-4: An evaluation of its performance in the Taiwan advanced medical licensing examination. Digit Health 2024, 10, 20552076241237678. [Google Scholar] [CrossRef] [PubMed]
- Zhu, L.; Mou, W.; Lai, Y.; Lin, J.; Luo, P. Language and cultural bias in AI: comparing the performance of large language models developed in different countries on Traditional Chinese Medicine highlights the need for localized models. Journal of Translational Medicine 2024, 22, 319. [Google Scholar] [CrossRef]
- KONG, Q.; CHEN, L.; YAO, J.; DING, C.; YIN, P. Feasibility and Challenges of Interactive AI for Traditional Chinese Medicine: An Example of ChatGPT. Chinese Medicine and Culture 2024, 7, 174–181. [Google Scholar] [CrossRef]
- Zhang, T.; Wang, Y.; Wang, X.; Yang, Y.; Ye, Y. Constructing fine-grained entity recognition corpora based on clinical records of traditional Chinese medicine. BMC Medical Informatics and Decision Making 2020, 20, 64. [Google Scholar] [CrossRef] [PubMed]
- Tan, Y.; Zhang, Z.; Li, M.; Pan, F.; Duan, H.; Huang, Z.; Deng, H.; Yu, Z.; Yang, C.; Shen, G.; Qi, P.; Yue, C.; Liu, Y.; Hong, L.; Yu, H.; Fan, G.; Tang, Y. MedChatZH: A tuning LLM for traditional Chinese medicine consultations. Computers in Biology and Medicine 2024, 172, 108290. [Google Scholar] [CrossRef] [PubMed]
- Yang, G.; Liu, X.; Shi, J.; Wang, Z.; Wang, G. TCM-GPT: Efficient pre-training of large language models for domain adaptation in Traditional Chinese Medicine. Computer Methods and Programs in Biomedicine Update 2024, 6, 100158. [Google Scholar] [CrossRef]
- Yang, S.; Zhao, H.; Zhu, S.; Zhou, G.; Xu, H.; Jia, Y.; Zan, H. Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-World Multi-Turn Dialogue. Proceedings of the AAAI Conference on Artificial Intelligence 2024, 38, 19368–19376. [Google Scholar] [CrossRef]
- Qi, J.; Wang, X.; Yang, T. In Traditional Chinese Medicine Prescription Recommendation Model Based on Large Language Models and Graph Neural Networks, 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 5-8 Dec. 2023, 2023; 2023; pp 4623-4627.
- Zhou, Z.; Yang, T.; Hu, K. In Traditional Chinese Medicine Epidemic Prevention and Treatment Question-Answering Model Based on LLMs, 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 5-8 Dec. 2023, 2023; 2023; pp 4755-4760.
- Li, W.; Ge, X.; Liu, S.; Xu, L.; Zhai, X.; Yu, L. Opportunities and challenges of traditional Chinese medicine doctors in the era of artificial intelligence. Front Med (Lausanne) 2023, 10, 1336175. [Google Scholar] [CrossRef] [PubMed]





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).