Submitted:
16 June 2026
Posted:
18 June 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction

2. Methods of Medical Large Language Models
2.1. Pre-training and Fine-tuning Methods

- Feature Extraction:
- 2.
- Projection into Joint Embedding Space:
- 3.
- Similarity Computation:
- 4.
- Loss Function:
2.2. Prompting Methods
2.3. Retrieval-Augmented Generation
2.4. Web Search
3. LLMs Applications in Medicine
3.1. Application of LLMs in Medical Text
3.2. Application of LLMs in Medical Images
3.3. Application of Multimodal Model in Medicine
4. Discussion
4.1. Complex Data Processing
4.2. Hallucination and Accuracy
4.3. Ethics and Privacy
5. Conclusions
Author contributions
Data Availability
Conflicts of Interest
References
- Gu, Y.; Tinn, R.; Cheng, H.; Lucas, M.; Usuyama, N.; Liu, X.; Naumann, T.; Gao, J.; Poon, H., Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing. ACM Trans. Comput. Healthcare 2021, 3 (1), Article 2. [CrossRef]
- Huang, K.; Altosaar, J.; Ranganath, R. ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission 2019, p. arXiv:1904.05342. https://ui.adsabs.harvard.edu/abs/2019arXiv190405342H (accessed April 01, 2019).
- Achiam, J.; Adler, S.; Agarwal, S.; Ahmad, L.; Akkaya, I.; Aleman, F. L.; Almeida, D.; Altenschmidt, J.; Altman, S.; Anadkat, S., Gpt-4 technical report. arXiv preprint arXiv:2303.08774 2023.
- Singhal, K.; Azizi, S.; Tu, T.; Mahdavi, S. S.; Wei, J.; Chung, H. W.; Scales, N.; Tanwani, A.; Cole-Lewis, H.; Pfohl, S.; Payne, P.; Seneviratne, M.; Gamble, P.; Kelly, C.; Babiker, A.; Schärli, N.; Chowdhery, A.; Mansfield, P.; Demner-Fushman, D.; Agüera, Y. A. B.; Webster, D.; Corrado, G. S.; Matias, Y.; Chou, K.; Gottweis, J.; Tomasev, N.; Liu, Y.; Rajkomar, A.; Barral, J.; Semturs, C.; Karthikesalingam, A.; Natarajan, V., Large language models encode clinical knowledge. Nature 2023, 620 (7972), 172-180. [CrossRef]
- Kraljevic, Z.; Shek, A.; Bean, D. M.; Bendayan, R.; Teo, J. T. H.; Dobson, R. J. B., MedGPT: Medical Concept Prediction from Clinical Narratives. ArXiv 2021, abs/2107.03134.
- Li, Y.; Li, Z.; Zhang, K.; Dan, R.; Jiang, S.; Zhang, Y., ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge. Cureus 2023, 15 (6), e40895.
- Xiong, H.; Wang, S.; Zhu, Y.; Zhao, Z.; Liu, Y.; Huang, L.; Wang, Q.; Shen, D., DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task. ArXiv 2023, abs/2304.01097.
- Xiong, G.; Jin, Q.; Lu, Z.; Zhang, A., Benchmarking Retrieval-Augmented Generation for Medicine. ArXiv 2024, abs/2402.13178.
- Zhou, H.; Liu, F.; Gu, B.; Zou, X.; Huang, J.; Wu, J.; Li, Y.; Chen, S. S.; Zhou, P.; Liu, J., A survey of large language models in medicine: Principles, applications, and challenges. arXiv preprint arXiv:2311.05112 2023.
- Zhou, H.; Gu, B.; Zou, X.; Li, Y.; Chen, S. S.; Zhou, P.; Liu, J.; Hua, Y.; Mao, C.; Wu, X.; Li, Z.; Liu, F., A Survey of Large Language Models in Medicine: Progress, Application, and Challenge. ArXiv 2023, abs/2311.05112.
- He, K.; Mao, R.; Lin, Q.; Ruan, Y.; Lan, X.; Feng, M.; Cambria, E., A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics. ArXiv 2023, abs/2310.05694.
- Peng, Y.; Yan, S.; Lu, Z. Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets 2019, p. arXiv:1906.05474. https://ui.adsabs.harvard.edu/abs/2019arXiv190605474P (accessed June 01, 2019).
- Lee, J.; Yoon, W.; Kim, S.; Kim, D.; Kim, S.; So, C. H.; Kang, J. BioBERT: a pre-trained biomedical language representation model for biomedical text mining 2019, p. arXiv:1901.08746. https://ui.adsabs.harvard.edu/abs/2019arXiv190108746L (accessed January 01, 2019).
- Beltagy, I.; Lo, K.; Cohan, A., SciBERT: A pretrained language model for scientific text. arXiv preprint arXiv:1903.10676 2019.
- Li, Y.; Rao, S.; Solares, J. R. A.; Hassaine, A.; Ramakrishnan, R.; Canoy, D.; Zhu, Y.; Rahimi, K.; Salimi-Khorshidi, G., BEHRT: Transformer for Electronic Health Records. Scientific Reports 2020, 10, 7155. [CrossRef]
- Michalopoulos, G.; Wang, Y.; Kaka, H.; Chen, H.; Wong, A. UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus 2020, p. arXiv:2010.10391. https://ui.adsabs.harvard.edu/abs/2020arXiv201010391M (accessed October 01, 2020).
- Yang, X.; Chen, A.; PourNejatian, N.; Shin, H. C.; E Smith, K.; Parisien, C.; Compas, C.; Martin, C.; Flores, M. G.; Zhang, Y.; Magoc, T.; Harle, C. A.; Lipori, G.; Mitchell, D. A.; Hogan, W. R.; Shenkman, E. A.; Bian, J.; Wu, Y. GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records 2022, p. arXiv:2203.03540. https://ui.adsabs.harvard.edu/abs/2022arXiv220303540Y (accessed February 01, 2022).
- Chen, Z.; Hernández Cano, A.; Romanou, A.; Bonnet, A.; Matoba, K.; Salvi, F.; Pagliardini, M.; Fan, S.; Köpf, A.; Mohtashami, A.; Sallinen, A.; Sakhaeirad, A.; Swamy, V.; Krawczuk, I.; Bayazit, D.; Marmet, A.; Montariol, S.; Hartley, M.-A.; Jaggi, M.; Bosselut, A. MEDITRON-70B: Scaling Medical Pretraining for Large Language Models 2023, p. arXiv:2311.16079. https://ui.adsabs.harvard.edu/abs/2023arXiv231116079C (accessed November 01, 2023).
- Jin, Q.; Dhingra, B.; Liu, Z.; Cohen, W. W.; Lu, X., Pubmedqa: A dataset for biomedical research question answering. arXiv preprint arXiv:1909.06146 2019.
- Johnson, A. E.; Pollard, T. J.; Shen, L.; Lehman, L.-w. H.; Feng, M.; Ghassemi, M.; Moody, B.; Szolovits, P.; Anthony Celi, L.; Mark, R. G., MIMIC-III, a freely accessible critical care database. Scientific data 2016, 3 (1), 1-9. [CrossRef]
- Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018, p. arXiv:1810.04805. https://ui.adsabs.harvard.edu/abs/2018arXiv181004805D (accessed October 01, 2018).
- Joshi, M.; Chen, D.; Liu, Y.; Weld, D. S.; Zettlemoyer, L.; Levy, O. SpanBERT: Improving Pre-training by Representing and Predicting Spans 2019, p. arXiv:1907.10529. https://ui.adsabs.harvard.edu/abs/2019arXiv190710529J (accessed July 01, 2019).
- Brown, T.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J. D.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A., Language models are few-shot learners. Advances in neural information processing systems 2020, 33, 1877-1901.
- Radford, A.; Wu, J.; Child, R.; Luan, D.; Amodei, D.; Sutskever, I. In Language Models are Unsupervised Multitask Learners, 2019.
- Zhang, S.; Xu, Y.; Usuyama, N.; Xu, H.; Bagga, J.; Tinn, R.; Preston, S.; Rao, R.; Wei, M.; Valluri, N.; Wong, C.; Tupini, A.; Wang, Y.; Mazzola, M.; Shukla, S.; Liden, L.; Gao, J.; Lungren, M. P.; Naumann, T.; Wang, S.; Poon, H. BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs 2023, p. arXiv:2303.00915. https://ui.adsabs.harvard.edu/abs/2023arXiv230300915Z (accessed March 01, 2023).
- Wang, Z.; Wu, Z.; Agarwal, D.; Sun, J. MedCLIP: Contrastive Learning from Unpaired Medical Images and Text 2022, p. arXiv:2210.10163. https://ui.adsabs.harvard.edu/abs/2022arXiv221010163W (accessed October 01, 2022).
- Linear-probe, A. In Learning Transferable Visual Models From Natural Language Supervision, 2021. [CrossRef]
- Chen, Y. In Convolutional Neural Network for Sentence Classification, 2015.
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; Uszkoreit, J.; Houlsby, N., An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. ArXiv 2020, abs/2010.11929.
- Singhal, K.; Tu, T.; Gottweis, J.; Sayres, R.; Wulczyn, E.; Hou, L.; Clark, K.; Pfohl, S.; Cole-Lewis, H.; Neal, D.; Schaekermann, M.; Wang, A.; Amin, M.; Lachgar, S.; Mansfield, P.; Prakash, S.; Green, B.; Dominowska, E.; Arcas, B. A. y.; Tomasev, N.; Liu, Y.; Wong, R.; Semturs, C.; Mahdavi, S. S.; Barral, J.; Webster, D.; Corrado, G. S.; Matias, Y.; Azizi, S.; Karthikesalingam, A.; Natarajan, V., Towards Expert-Level Medical Question Answering with Large Language Models. CoRR 2023, abs/2305.09617.
- Xu, L.; Xie, H.; Qin, S.-Z. J.; Tao, X.; Wang, F. L., Parameter-efficient fine-tuning methods for pretrained language models: A critical review and assessment. arXiv preprint arXiv:2312.12148 2023.
- Hu, E. J.; Shen, Y.; Wallis, P.; Allen-Zhu, Z.; Li, Y.; Wang, S.; Wang, L.; Chen, W., Lora: Low-rank adaptation of large language models. arXiv preprint arXiv:2106.09685 2021.
- Houlsby, N.; Giurgiu, A.; Jastrzebski, S.; Morrone, B.; De Laroussilhe, Q.; Gesmundo, A.; Attariyan, M.; Gelly, S. In Parameter-efficient transfer learning for NLP, International conference on machine learning, PMLR: 2019; pp 2790-2799.
- Lin, Z.; Madotto, A.; Fung, P., Exploring versatile generative language model via parameter-efficient transfer learning. arXiv preprint arXiv:2004.03829 2020.
- He, J.; Zhou, C.; Ma, X.; Berg-Kirkpatrick, T.; Neubig, G., Towards a unified view of parameter-efficient transfer learning. arXiv preprint arXiv:2110.04366 2021.
- Sahoo, P.; Singh, A. K.; Saha, S.; Jain, V.; Mondal, S.; Chadha, A., A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications. arXiv preprint arXiv:2402.07927 2024.
- Heston, T. F.; Khun, C., Prompt Engineering in Medical Education. International Medical Education 2023.
- Radford, A.; Wu, J.; Child, R.; Luan, D.; Amodei, D.; Sutskever, I., Language models are unsupervised multitask learners. OpenAI blog 2019, 1 (8), 9.
- Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; Agarwal, S., Language models are few-shot learners. arXiv preprint arXiv:2005.14165 2020.
- Wei, J.; Wang, X.; Schuurmans, D.; Bosma, M.; Xia, F.; Chi, E.; Le, Q. V.; Zhou, D., Chain-of-thought prompting elicits reasoning in large language models. Advances in neural information processing systems 2022, 35, 24824-24837. [CrossRef]
- Zhang, Z.; Zhang, A.; Li, M.; Smola, A., Automatic chain of thought prompting in large language models. arXiv preprint arXiv:2210.03493 2022.
- Liu, H.; Teng, Z.; Cui, L.; Zhang, C.; Zhou, Q.; Zhang, Y. In LogiCoT: Logical Chain-of-Thought Instruction Tuning, The 2023 Conference on Empirical Methods in Natural Language Processing, 2023.
- Hu, H.; Lu, H.; Zhang, H.; Lam, W.; Zhang, Y., Chain-of-symbol prompting elicits planning in large langauge models. arXiv preprint arXiv:2305.10276 2023.
- Long, J., Large language model guided tree-of-thought. arXiv preprint arXiv:2305.08291 2023.
- Gao, Y.; Xiong, Y.; Gao, X.; Jia, K.; Pan, J.; Bi, Y.; Dai, Y.; Sun, J.; Wang, H. J. a. p. a., Retrieval-augmented generation for large language models: A survey. arXiv preprint arXiv:2312.10997 2023.
- Kumar, M.; Mani, U. A.; Tripathi, P.; Saalim, M.; Roy, S., Artificial Hallucinations by Google Bard: Think Before You Leap. Cureus 2023, 15 (8), e43313. [CrossRef]
- Ilin, I., Advanced rag techniques: an illustrated overview. 2023. [CrossRef]
- Johnson, A. E. W.; Bulgarelli, L.; Shen, L.; Gayles, A.; Shammout, A.; Horng, S.; Pollard, T. J.; Hao, S.; Moody, B.; Gow, B.; Lehman, L.-w. H.; Celi, L. A.; Mark, R. G., MIMIC-IV, a freely accessible electronic health record dataset. Scientific Data 2023, 10 (1), 1. [CrossRef]
- Yu, W.; Iter, D.; Wang, S.; Xu, Y.; Ju, M.; Sanyal, S.; Zhu, C.; Zeng, M.; Jiang, M., Generate rather than retrieve: Large language models are strong context generators. arXiv preprint arXiv:2209.10063 2022.
- Shao, Z.; Gong, Y.; Shen, Y.; Huang, M.; Duan, N.; Chen, W., Enhancing retrieval-augmented large language models with iterative retrieval-generation synergy. arXiv preprint arXiv:2305.15294 2023.
- Wu, J.; Zhu, J.; Qi, Y. In Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation, 2024.
- Jiang, X.; Fang, Y.; Qiu, R.; Zhang, H.; Xu, Y.; Chen, H.; Zhang, W.; Zhang, R.; Fang, Y.; Chu, X.; Zhao, J.; Wang, Y. In TC-RAG:Turing-Complete RAG's Case study on Medical LLM Systems, 2024.
- Long, C.; Liu, Y.; Ouyang, C.; Yu, Y., Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications. ArXiv 2024, abs/2407.21055.
- Qin, Y.; Cai, Z.; Jin, D.; Yan, L.; Liang, S.; Zhu, K.; Lin, Y.; Han, X.; Ding, N.; Wang, H., Webcpm: Interactive web search for chinese long-form question answering. arXiv preprint arXiv:2305.06849 2023.
- Liu, X.; Lai, H.; Yu, H.; Xu, Y.; Zeng, A.; Du, Z.; Zhang, P.; Dong, Y.; Tang, J. In Webglm: Towards an efficient web-enhanced question answering system with human preferences, Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023; pp 4549-4560.
- Nakano, R.; Hilton, J.; Balaji, S.; Wu, J.; Ouyang, L.; Kim, C.; Hesse, C.; Jain, S.; Kosaraju, V.; Saunders, W., Webgpt: Browser-assisted question-answering with human feedback, 2021. URL https://arxiv. org/abs/2112.09332 2021.
- Saab, K.; Tu, T.; Weng, W.-H.; Tanno, R.; Stutz, D.; Wulczyn, E.; Zhang, F.; Strother, T.; Park, C.; Vedadi, E.; Chaves, J. Z.; Hu, S.-Y.; Schaekermann, M.; Kamath, A. B.; Cheng, Y.; Barrett, D. G. T.; Cheung, C.; Mustafa, B.; Palepu, A.; McDuff, D.; Hou, L.; Golany, T.; Liu, L.; Alayrac, J.-B.; Houlsby, N.; Tomašev, N.; Freyberg, J.; Lau, C.; Kemp, J.; Lai, J.; Azizi, S.; Kanada, K.; Man, S.; Kulkarni, K.; Sun, R.; Shakeri, S.; He, L.; Caine, B.; Webson, A.; Latysheva, N.; Johnson, M.; Mansfield, P.; Lu, J.; Rivlin, E.; Anderson, J.; Green, B.; Wong, R.; Krause, J.; Shlens, J.; Dominowska, E.; Eslami, S. M. A.; Cui, C.; Vinyals, O.; Kavukcuoglu, K.; Manyika, J.; Dean, J.; Hassabis, D.; Matias, Y.; Webster, D. R.; Barral, J.; Corrado, G. S.; Semturs, C.; Mahdavi, S. S.; Gottweis, J.; Karthikesalingam, A.; Natarajan, V., Capabilities of Gemini Models in Medicine. ArXiv 2024, abs/2404.18416.
- Ferber, D.; Nahhas, O. S. M. E.; Wölflein, G.; Wiest, I. C.; Clusmann, J.; Lessman, M.-E.; Foersch, S.; Lammert, J.; Tschochohei, M.; Jäger, D.; Salto-Tellez, M.; Schultz, N.; Truhn, D.; Kather, J. N., Autonomous Artificial Intelligence Agents for Clinical Decision Making in Oncology. ArXiv 2024, abs/2404.04667.
- Schopow, N.; Osterhoff, G.; Baur, D., Applications of the Natural Language Processing Tool ChatGPT in Clinical Practice: Comparative Study and Augmented Systematic Review. JMIR medical informatics 2023, 11, e48933. [CrossRef]
- Qureshi, R.; Shaughnessy, D.; Gill, K. A. R.; Robinson, K. A.; Li, T.; Agai, E., Are ChatGPT and large language models "the answer" to bringing us closer to systematic review automation? Systematic reviews 2023, 12 (1), 72. [CrossRef]
- Zangrossi, P.; Martini, M.; Guerrini, F.; P, D. E. B.; Spena, G., Large language model, AI and scientific research: why ChatGPT is only the beginning. Journal of neurosurgical sciences 2024. [CrossRef]
- Tessler, I.; Wolfovitz, A.; Livneh, N.; Gecel, N. A.; Sorin, V.; Barash, Y.; Konen, E.; Klang, E., Advancing Medical Practice with Artificial Intelligence: ChatGPT in Healthcare. The Israel Medical Association journal : IMAJ 2024, 26 (2), 80-85.
- Touvron, H.; Martin, L.; Stone, K. R.; Albert, P.; Almahairi, A.; Babaei, Y.; Bashlykov, N.; Batra, S.; Bhargava, P.; Bhosale, S.; Bikel, D. M.; Blecher, L.; Ferrer, C. C.; Chen, M.; Cucurull, G.; Esiobu, D.; Fernandes, J.; Fu, J.; Fu, W.; Fuller, B.; Gao, C.; Goswami, V.; Goyal, N.; Hartshorn, A. S.; Hosseini, S.; Hou, R.; Inan, H.; Kardas, M.; Kerkez, V.; Khabsa, M.; Kloumann, I. M.; Korenev, A. V.; Koura, P. S.; Lachaux, M.-A.; Lavril, T.; Lee, J.; Liskovich, D.; Lu, Y.; Mao, Y.; Martinet, X.; Mihaylov, T.; Mishra, P.; Molybog, I.; Nie, Y.; Poulton, A.; Reizenstein, J.; Rungta, R.; Saladi, K.; Schelten, A.; Silva, R.; Smith, E. M.; Subramanian, R.; Tan, X.; Tang, B.; Taylor, R.; Williams, A.; Kuan, J. X.; Xu, P.; Yan, Z.; Zarov, I.; Zhang, Y.; Fan, A.; Kambadur, M.; Narang, S.; Rodriguez, A.; Stojnic, R.; Edunov, S.; Scialom, T., Llama 2: Open Foundation and Fine-Tuned Chat Models. ArXiv 2023, abs/2307.09288.
- Leiter, C.; Zhang, R.; Chen, Y.; Belouadi, J.; Larionov, D.; Fresen, V.; Eger, S., ChatGPT: A Meta-Analysis after 2.5 Months. ArXiv 2023, abs/2302.13795.
- Nicholson, A. E.; Korb, K. B.; Nyberg, E. P.; Wybrow, M.; Zukerman, I.; Mascaro, S.; Thakur, S.; Alvandi, A. O.; Riley, J.; Pearson, R.; Morris, S.; Herrmann, M.; Azad, A. K. M.; Bolger, F.; Hahn, U.; Lagnado, D. A., BARD: A Structured Technique for Group Elicitation of Bayesian Networks to Support Analytic Reasoning. Risk Analysis 2020, 42, 1155 - 1178.
- Abi-Rafeh, J.; Mroueh, V. J.; Bassiri-Tehrani, B.; Marks, J.; Kazan, R.; Nahai, F., Complications Following Body Contouring: Performance Validation of Bard, a Novel AI Large Language Model, in Triaging and Managing Postoperative Patient Concerns. Aesthetic plastic surgery 2024.
- Sandmann, S.; Riepenhausen, S.; Plagwitz, L.; Varghese, J., Systematic analysis of ChatGPT, Google search and Llama 2 for clinical decision support tasks. Nature communications 2024, 15 (1), 2050. [CrossRef]
- Touvron, H.; Lavril, T.; Izacard, G.; Martinet, X.; Lachaux, M.-A.; Lacroix, T.; Rozière, B.; Goyal, N.; Hambro, E.; Azhar, F.; Rodriguez, A.; Joulin, A.; Grave, E.; Lample, G., LLaMA: Open and Efficient Foundation Language Models. ArXiv 2023, abs/2302.13971.
- Blease, C.; Torous, J.; McMillan, B.; Hägglund, M.; Mandl, K. D., Generative Language Models and Open Notes: Exploring the Promise and Limitations. JMIR medical education 2024, 10, e51183. [CrossRef]
- Ashraf, H.; Ashfaq, H., The Role of ChatGPT in Medical Research: Progress and Limitations. Annals of biomedical engineering 2024, 52 (3), 458-461.
- Abi-Rafeh, J.; Xu, H. H.; Kazan, R.; Tevlin, R.; Furnas, H., Large Language Models and Artificial Intelligence: A Primer for Plastic Surgeons on the Demonstrated and Potential Applications, Promises, and Limitations of ChatGPT. Aesthetic surgery journal 2024, 44 (3), 329-343. [CrossRef]
- Ufuk, F., The Role and Limitations of Large Language Models Such as ChatGPT in Clinical Settings and Medical Journalism. Radiology 2023, 307 (3), e230276. [CrossRef]
- Melton, G. B.; McDonald, C. J.; Tang, P. C.; Hripcsak, G., Electronic Health Records. In Biomedical Informatics: Computer Applications in Health Care and Biomedicine, Shortliffe, E. H.; Cimino, J. J., Eds. Springer International Publishing: Cham, 2021; pp 467-509.
- Tsai, C. H.; Eghdam, A.; Davoody, N.; Wright, G.; Flowerday, S.; Koch, S., Effects of Electronic Health Record Implementation and Barriers to Adoption and Use: A Scoping Review and Qualitative Analysis of the Content. Life (Basel, Switzerland) 2020, 10 (12). [CrossRef]
- Yang, X.; Chen, A.; PourNejatian, N.; Shin, H. C.; Smith, K. E.; Parisien, C.; Compas, C.; Martin, C.; Costa, A. B.; Flores, M. G.; Zhang, Y.; Magoc, T.; Harle, C. A.; Lipori, G.; Mitchell, D. A.; Hogan, W. R.; Shenkman, E. A.; Bian, J.; Wu, Y., A large language model for electronic health records. NPJ digital medicine 2022, 5 (1), 194. [CrossRef]
- Lin, T.; Wang, Y.; Liu, X.; Qiu, X., A Survey of Transformers. AI Open 2021, 3, 111-132.
- Vaswani, A.; Shazeer, N. M.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A. N.; Kaiser, L.; Polosukhin, I. In Attention is All you Need, Neural Information Processing Systems, 2017.
- Ong, J.; Kedia, N.; Harihar, S.; Vupparaboina, S. C.; Singh, S. R.; Venkatesh, R.; Vupparaboina, K.; Bollepalli, S. C.; Chhablani, J., Applying large language model artificial intelligence for retina International Classification of Diseases (ICD) coding. Journal of Medical Artificial Intelligence 2023, 6. [CrossRef]
- Toma, A.; Lawler, P. R.; Ba, J.; Krishnan, R. G.; Rubin, B. B.; Wang, B. In Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding, 2023.
- Du, Z.; Qian, Y.; Liu, X.; Ding, M.; Qiu, J.; Yang, Z.; Tang, J. In GLM: General Language Model Pretraining with Autoregressive Blank Infilling, Annual Meeting of the Association for Computational Linguistics, 2021.
- Zhang, H.; Chen, J.; Jiang, F.; Yu, F.; Chen, Z.; Li, J.; Chen, G.; Wu, X.; Zhang, Z.; Xiao, Q.; Wan, X.; Wang, B.; Li, H. In HuatuoGPT, towards Taming Language Model to Be a Doctor, Conference on Empirical Methods in Natural Language Processing, 2023.
- Liao, Y.; Jiang, S.; Wang, Y.; Wang, Y., MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Mode ls with Sparse Mixture of Low-Rank Adapter Experts. [CrossRef]
- Zhu, W.; Wang, X., ChatMed: A Chinese Medical Large Language Model. GitHub.
- Wang, H.; Liu, C.-L.; Xi, N.; Qiang, Z.; Zhao, S.; Qin, B.; Liu, T., HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge. ArXiv 2023, abs/2304.06975.
- Yang, S.; Zhao, H.; Zhu, S.; Zhou, G.; Xu, H.; Jia, Y.; Zan, H. In Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue, AAAI Conference on Artificial Intelligence, 2023.
- Zeng, A.; Liu, X.; Du, Z.; Wang, Z.; Lai, H.; Ding, M.; Yang, Z.; Xu, Y.; Zheng, W.; Xia, X.; Tam, W. L.; Ma, Z.; Xue, Y.; Zhai, J.; Chen, W.; Zhang, P.; Dong, Y.; Tang, J., GLM-130B: An Open Bilingual Pre-trained Model. ArXiv 2022, abs/2210.02414.
- Chowdhery, A.; Narang, S.; Devlin, J.; Bosma, M.; Mishra, G.; Roberts, A.; Barham, P.; Chung, H. W.; Sutton, C.; Gehrmann, S.; Schuh, P.; Shi, K.; Tsvyashchenko, S.; Maynez, J.; Rao, A.; Barnes, P.; Tay, Y.; Shazeer, N. M.; Prabhakaran, V.; Reif, E.; Du, N.; Hutchinson, B. C.; Pope, R.; Bradbury, J.; Austin, J.; Isard, M.; Gur-Ari, G.; Yin, P.; Duke, T.; Levskaya, A.; Ghemawat, S.; Dev, S.; Michalewski, H.; García, X.; Misra, V.; Robinson, K.; Fedus, L.; Zhou, D.; Ippolito, D.; Luan, D.; Lim, H.; Zoph, B.; Spiridonov, A.; Sepassi, R.; Dohan, D.; Agrawal, S.; Omernick, M.; Dai, A. M.; Pillai, T. S.; Pellat, M.; Lewkowycz, A.; Moreira, E.; Child, R.; Polozov, O.; Lee, K.; Zhou, Z.; Wang, X.; Saeta, B.; Díaz, M.; Firat, O.; Catasta, M.; Wei, J.; Meier-Hellstern, K. S.; Eck, D.; Dean, J.; Petrov, S.; Fiedel, N., PaLM: Scaling Language Modeling with Pathways. J. Mach. Learn. Res. 2022, 24, 240:1-240:113.
- Anil, R.; Dai, A. M.; Firat, O.; Johnson, M.; Lepikhin, D.; Passos, A. T.; Shakeri, S.; Taropa, E.; Bailey, P.; Chen, Z.; Chu, E.; Clark, J.; Shafey, L. E.; Huang, Y.; Meier-Hellstern, K. S.; Mishra, G.; Moreira, E.; Omernick, M.; Robinson, K.; Ruder, S.; Tay, Y.; Xiao, K.; Xu, Y.; Zhang, Y.; Abrego, G. H.; Ahn, J.; Austin, J.; Barham, P.; Botha, J. A.; Bradbury, J.; Brahma, S.; Brooks, K. M.; Catasta, M.; Cheng, Y.; Cherry, C.; Choquette-Choo, C. A.; Chowdhery, A.; Crépy, C.; Dave, S.; Dehghani, M.; Dev, S.; Devlin, J.; D'iaz, M. C.; Du, N.; Dyer, E.; Feinberg, V.; Feng, F.; Fienber, V.; Freitag, M.; García, X.; Gehrmann, S.; González, L.; Gur-Ari, G.; Hand, S.; Hashemi, H.; Hou, L.; Howland, J.; Hu, A. R.; Hui, J.; Hurwitz, J.; Isard, M.; Ittycheriah, A.; Jagielski, M.; Jia, W. H.; Kenealy, K.; Krikun, M.; Kudugunta, S.; Lan, C.; Lee, K.; Lee, B.; Li, E.; Li, M.-L.; Li, W.; Li, Y.; Li, J. Y.; Lim, H.; Lin, H.; Liu, Z.-Z.; Liu, F.; Maggioni, M.; Mahendru, A.; Maynez, J.; Misra, V.; Moussalem, M.; Nado, Z.; Nham, J.; Ni, E.; Nystrom, A.; Parrish, A.; Pellat, M.; Polacek, M.; Polozov, O.; Pope, R.; Qiao, S.; Reif, E.; Richter, B.; Riley, P.; Ros, A.; Roy, A.; Saeta, B.; Samuel, R.; Shelby, R. M.; Slone, A.; Smilkov, D.; So, D. R.; Sohn, D.; Tokumine, S.; Valter, D.; Vasudevan, V.; Vodrahalli, K.; Wang, X.; Wang, P.; Wang, Z.; Wang, T.; Wieting, J.; Wu, Y.; Xu, K.; Xu, Y.; Xue, L. W.; Yin, P.; Yu, J.; Zhang, Q.; Zheng, S.; Zheng, C.; Zhou, W.; Zhou, D.; Petrov, S.; Wu, Y., PaLM 2 Technical Report. ArXiv 2023, abs/2305.10403. [CrossRef]
- Chen, Y.; Wang, Z.; Xing, X.; Zheng, H.; Xu, Z.; Fang, K.; Wang, J.; Li, S.; Wu, J.; Liu, Q.; Xu, X., BianQue: Balancing the Questioning and Suggestion Ability of Health LLMs with Multi-turn Health Conversations Polished by ChatGPT. ArXiv 2023, abs/2310.15896.
- Xin, S.; Jiashu, W.; Aoping, Z.; Jinlong, S.; Xiao-Jun, W., CSCA U-Net: A channel and space compound attention CNN for medical image segmentation. Artificial Intelligence in Medicine 2024, 150, 102800. [CrossRef]
- Zhou, Y.; Chia, M. A.; Wagner, S. K.; Ayhan, M. S.; Williamson, D. J.; Struyven, R. R.; Liu, T.; Xu, M.; Lozano, M. G.; Woodward-Court, P.; Kihara, Y.; Allen, N.; Gallacher, J. E. J.; Littlejohns, T.; Aslam, T.; Bishop, P.; Black, G.; Sergouniotis, P.; Atan, D.; Dick, A. D.; Williams, C.; Barman, S.; Barrett, J. H.; Mackie, S.; Braithwaite, T.; Carare, R. O.; Ennis, S.; Gibson, J.; Lotery, A. J.; Self, J.; Chakravarthy, U.; Hogg, R. E.; Paterson, E.; Woodside, J.; Peto, T.; McKay, G.; McGuinness, B.; Foster, P. J.; Balaskas, K.; Khawaja, A. P.; Pontikos, N.; Rahi, J. S.; Lascaratos, G.; Patel, P. J.; Chan, M.; Chua, S. Y. L.; Day, A.; Desai, P.; Egan, C.; Fruttiger, M.; Garway-Heath, D. F.; Hardcastle, A.; Khaw, S. P. T.; Moore, T.; Sivaprasad, S.; Strouthidis, N.; Thomas, D.; Tufail, A.; Viswanathan, A. C.; Dhillon, B.; Macgillivray, T.; Sudlow, C.; Vitart, V.; Doney, A.; Trucco, E.; Guggeinheim, J. A.; Morgan, J. E.; Hammond, C. J.; Williams, K.; Hysi, P.; Harding, S. P.; Zheng, Y.; Luben, R.; Luthert, P.; Sun, Z.; McKibbin, M.; O’Sullivan, E.; Oram, R.; Weedon, M.; Owen, C. G.; Rudnicka, A. R.; Sattar, N.; Steel, D.; Stratton, I.; Tapp, R.; Yates, M. M.; Petzold, A.; Madhusudhan, S.; Altmann, A.; Lee, A. Y.; Topol, E. J.; Denniston, A. K.; Alexander, D. C.; Keane, P. A.; Eye, U. K. B.; Vision, C., A foundation model for generalizable disease detection from retinal images. Nature 2023, 622 (7981), 156-163. [CrossRef]
- Hua, S.; Yan, F.; Shen, T.; Zhang, X., PathoDuet: Foundation Models for Pathological Slide Analysis of H&E and IHC Stains. ArXiv 2023, abs/2312.09894.
- Topol, E. J., As artificial intelligence goes multimodal, medical applications multiply. Science (New York, N.Y.) 2023, 381 (6663), adk6139. [CrossRef]
- Wang, X.; Zhang, X.; Wang, G.; He, J.; Li, Z.; Zhu, W.; Guo, Y.; Dou, Q.; Li, X.; Wang, D.; Hong, L.; Lao, Q.; Ruan, T.; Zhou, Y.; Li, Y.; Zhao, J.; Li, K.; Sun, X.; Zhu, L.; Zhang, S., OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine. ArXiv 2024, abs/2402.18028.
- Moor, M.; Banerjee, O.; Abad, Z. S. H.; Krumholz, H. M.; Leskovec, J.; Topol, E. J.; Rajpurkar, P., Foundation models for generalist medical artificial intelligence. Nature 2023, 616 (7956), 259-265. [CrossRef]
- Chang Shu, B. C., Fangyu Liu, Zihao Fu, Ehsan Shareghi, Nigel Collier Visual Med-Alpaca: A Parameter-Efficient Biomedical LLM with Visual Capabilities. https://github.com/cambridgeltl/visual-med-alpaca/tree/main.
- Li, C.; Wong, C.; Zhang, S.; Usuyama, N.; Liu, H.; Yang, J.; Naumann, T.; Poon, H.; Gao, J., LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day. ArXiv 2023, abs/2306.00890.
- Wang, S.; Zhao, Z.; Ouyang, X.; Wang, Q.; Shen, D., ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models. ArXiv 2023, abs/2302.07257.
- Rongsheng, W.; Tan, T., XrayGLM: The first Chinese Medical Multimodal Model that Chest Radiogr aphs Summarization. GitHub.
- Thawakar, O.; Shaker, A. M.; Mullappilly, S. S.; Cholakkal, H.; Anwer, R. M.; Khan, S. S.; Laaksonen, J.; Khan, F. S., XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models. ArXiv 2023, abs/2306.07971.
- Jie, R. T. B. Y. a. Y. G. x., A Review on Research and Application of Medical Large Language Models. Chinese Journal of Health Informatics and Management 2023, 20 (06), 853-861.
- Thirunavukarasu, A. J.; Ting, D. S. J.; Elangovan, K.; Gutierrez, L.; Tan, T. F.; Ting, D. S. W., Large language models in medicine. Nature medicine 2023, 29 (8), 1930-1940. [CrossRef]
- Shin, H.-C.; Zhang, Y.; Bakhturina, E.; Puri, R.; Patwary, M.; Shoeybi, M.; Mani, R. In BioMegatron: Larger Biomedical Domain Language Model, Online, November; Association for Computational Linguistics: Online, 2020; pp 4700-4706.
- Johnson, A. E. W.; Pollard, T. J.; Shen, L.; Lehman, L.-w. H.; Feng, M.; Ghassemi, M.; Moody, B.; Szolovits, P.; Anthony Celi, L.; Mark, R. G., MIMIC-III, a freely accessible critical care database. Scientific Data 2016, 3 (1), 160035. [CrossRef]
- Pampari, A.; Raghavan, P.; Liang, J. J.; Peng, J. In emrQA: A Large Corpus for Question Answering on Electronic Medical Records, Conference on Empirical Methods in Natural Language Processing, 2018.
- Jin, D.; Pan, E.; Oufattole, N.; Weng, W.-H.; Fang, H.; Szolovits, P., What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams. ArXiv 2020, abs/2009.13081.
- Zhang, S.; Zhang, X.; Wang, H.; Guo, L.; Liu, S., Multi-Scale Attentive Interaction Networks for Chinese Medical Question Answer Selection. IEEE Access 2018, 1-1. [CrossRef]
- He, J.; Fu, M.; Tu, M., Applying deep matching networks to Chinese medical question answering: a study and a dataset. BMC medical informatics and decision making 2019, 19 (Suppl 2), 52. [CrossRef]
- Li, J.; Wang, X.; Wu, X.; Zhang, Z.; Xu, X.; Fu, J.; Tiwari, P.; Wan, X.; Wang, B., Huatuo-26M, a Large-scale Chinese Medical QA Dataset. ArXiv 2023, abs/2305.01526.
- BYAMBASUREN Odmaa, Y. Y., SUlZhifang, DAl Damai, CHANG Baobao, LI Suiian, ZAN Hongying, Preliminary Study on the Construction of Chinese Medical Knowledge Graph. Journal of Chinese Information Processing 2019, 33 (10), 1-9.
- Gao, L.; Biderman, S.; Black, S.; Golding, L.; Hoppe, T.; Foster, C.; Phang, J.; He, H.; Thite, A.; Nabeshima, N.; Presser, S.; Leahy, C., The Pile: An 800GB Dataset of Diverse Text for Language Modeling. ArXiv 2020, abs/2101.00027.
- Yuan, S.; Zhao, H.; Du, Z.; Ding, M.; Liu, X.; Cen, Y.; Zou, X.; Yang, Z.; Tang, J., WuDaoCorpora: A super large-scale Chinese corpora for pre-training language models. AI Open 2021, 2, 65-68. [CrossRef]
- Paperno, D.; Kruszewski, G.; Lazaridou, A.; Pham, Q. N.; Bernardi, R.; Pezzelle, S.; Baroni, M.; Boleda, G.; Fernández, R., The LAMBADA dataset: Word prediction requiring a broad discourse context. ArXiv 2016, abs/1606.06031.
- Hendrycks, D.; Burns, C.; Basart, S.; Zou, A.; Mazeika, M.; Song, D. X.; Steinhardt, J., Measuring Massive Multitask Language Understanding. ArXiv 2020, abs/2009.03300.
- Srivastava, A.; Rastogi, A.; Rao, A.; Shoeb, A. A. M.; Abid, A.; Fisch, A.; Brown, A. R.; Santoro, A.; Gupta, A.; Garriga-Alonso, A.; Kluska, A.; Lewkowycz, A.; Agarwal, A.; Power, A.; Ray, A.; Warstadt, A.; Kocurek, A. W.; Safaya, A.; Tazarv, A.; Xiang, A.; Parrish, A.; Nie, A.; Hussain, A.; Askell, A.; Dsouza, A.; Slone, A.; Rahane, A. A.; Iyer, A. S.; Andreassen, A.; Madotto, A.; Santilli, A.; Stuhlmuller, A.; Dai, A. M.; La, A.; Lampinen, A. K.; Zou, A.; Jiang, A.; Chen, A.; Vuong, A.; Gupta, A.; Gottardi, A.; Norelli, A.; Venkatesh, A.; Gholamidavoodi, A.; Tabassum, A.; Menezes, A.; Kirubarajan, A.; Mullokandov, A.; Sabharwal, A.; Herrick, A.; Efrat, A.; Erdem, A.; Karakacs, A.; Roberts, B. R.; Loe, B. S.; Zoph, B.; Bojanowski, B.; Ozyurt, B.; Hedayatnia, B.; Neyshabur, B.; Inden, B.; Stein, B.; Ekmekci, B.; Lin, B. Y.; Howald, B. S.; Orinion, B.; Diao, C.; Dour, C.; Stinson, C.; Argueta, C.; Ram'irez, C. e. F.; Singh, C.; Rathkopf, C.; Meng, C.; Baral, C.; Wu, C.; Callison-Burch, C.; Waites, C.; Voigt, C.; Manning, C. D.; Potts, C.; Ramirez, C.; Rivera, C.; Siro, C.; Raffel, C.; Ashcraft, C.; Garbacea, C.; Sileo, D.; Garrette, D. H.; Hendrycks, D.; Kilman, D.; Roth, D.; Freeman, D.; Khashabi, D.; Levy, D.; Gonz'alez, D. M. i.; Perszyk, D. R.; Hernandez, D.; Chen, D.; Ippolito, D.; Gilboa, D.; Dohan, D.; Drakard, D.; Jurgens, D.; Datta, D.; Ganguli, D.; Emelin, D.; Kleyko, D.; Yuret, D.; Chen, D.; Tam, D.; Hupkes, D.; Misra, D.; Buzan, D.; Coelho Mollo, D.; Yang, D.; Lee, D.-H.; Schrader, D.; Shutova, E.; Cubuk, E. D.; Segal, E.; Hagerman, E.; Barnes, E.; Donoway, E. P.; Pavlick, E.; Rodolà, E.; Lam, E.; Chu, E.; Tang, E.; Erdem, E.; Chang, E.; Chi, E. A.; Dyer, E.; Jerzak, E. J.; Kim, E.; Manyasi, E. E.; Zheltonozhskii, E.; Xia, F.; Siar, F.; Mart'inez-Plumed, F.; Happ'e, F.; Chollet, F.; Rong, F.; Mishra, G.; Winata, G. I.; Melo, G. d.; Kruszewski, G.; Parascandolo, G.; Mariani, G.; Wang, G. X.; Jaimovitch-L'opez, G.; Betz, G.; Gur-Ari, G.; Galijasevic, H.; Kim, H.; Rashkin, H.; Hajishirzi, H.; Mehta, H.; Bogar, H.; Shevlin, H.; Schutze, H.; Yakura, H.; Zhang, H.; Wong, H. M.; Ng, I.; Noble, I.; Jumelet, J.; Geissinger, J.; Kernion, J.; Hilton, J.; Lee, J.; Fisac, J. F.; Simon, J. B.; Koppel, J.; Zheng, J.; Zou, J.; Koco'n, J.; Thompson, J.; Wingfield, J.; Kaplan, J.; Radom, J.; Sohl-Dickstein, J. N.; Phang, J.; Wei, J.; Yosinski, J.; Novikova, J.; Bosscher, J.; Marsh, J.; Kim, J.; Taal, J.; Engel, J.; Alabi, J. O.; Xu, J.; Song, J.; Tang, J.; Waweru, J. W.; Burden, J.; Miller, J.; Balis, J. U.; Batchelder, J.; Berant, J.; Frohberg, J.; Rozen, J.; Hernández-Orallo, J.; Boudeman, J.; Guerr, J.; Jones, J.; Tenenbaum, J.; Rule, J. S.; Chua, J.; Kanclerz, K.; Livescu, K.; Krauth, K.; Gopalakrishnan, K.; Ignatyeva, K.; Markert, K.; Dhole, K. D.; Gimpel, K.; Omondi, K.; Mathewson, K. W.; Chiafullo, K.; Shkaruta, K.; Shridhar, K.; McDonell, K.; Richardson, K.; Reynolds, L.; Gao, L.; Zhang, L.; Dugan, L.; Qin, L.; Contreras-Ochando, L.; Morency, L.-P.; Moschella, L.; Lam, L.; Noble, L.; Schmidt, L.; He, L.; Col'on, L. O.; Metz, L.; cSenel, L. K.; Bosma, M.; Sap, M.; Hoeve, M. t.; Farooqi, M.; Faruqui, M.; Mazeika, M.; Baturan, M.; Marelli, M.; Maru, M.; Quintana, M. J. R. i.; Tolkiehn, M.; Giulianelli, M.; Lewis, M.; Potthast, M.; Leavitt, M. L.; Hagen, M.; Schubert, M. a. a.; Baitemirova, M.; Arnaud, M.; McElrath, M. A.; Yee, M.; Cohen, M.; Gu, M.; Ivanitskiy, M. I.; Starritt, M.; Strube, M.; Swkedrowski, M.; Bevilacqua, M.; Yasunaga, M.; Kale, M.; Cain, M.; Xu, M.; Suzgun, M.; Walker, M.; Tiwari, M.; Bansal, M.; Aminnaseri, M.; Geva, M.; Gheini, M.; MukundVarma, T.; Peng, N.; Chi, N. A.; Lee, N.; Krakover, N. G.-A.; Cameron, N.; Roberts, N.; Doiron, N.; Martinez, N.; Nangia, N.; Deckers, N.; Muennighoff, N.; Keskar, N. S.; Iyer, N.; Constant, N.; Fiedel, N.; Wen, N.; Zhang, O.; Agha, O.; Elbaghdadi, O.; Levy, O.; Evans, O.; Casares, P. A. M.; Doshi, P.; Fung, P.; Liang, P. P.; Vicol, P.; Alipoormolabashi, P.; Liao, P.; Liang, P.; Chang, P.; Eckersley, P.; Htut, P. M.; Hwang, P.-B.; Milkowski, P.; Patil, P. S.; Pezeshkpour, P.; Oli, P.; Mei, Q.; Lyu, Q.; Chen, Q.; Banjade, R.; Rudolph, R. E.; Gabriel, R.; Habacker, R.; Risco, R.; Milliere, R.; Garg, R.; Barnes, R.; Saurous, R. A.; Arakawa, R.; Raymaekers, R.; Frank, R.; Sikand, R.; Novak, R.; Sitelew, R.; Le Bras, R.; Liu, R.; Jacobs, R.; Zhang, R.; Salakhutdinov, R.; Chi, R.; Lee, R.; Stovall, R.; Teehan, R.; Yang, R.; Singh, S.; Mohammad, S. M.; Anand, S.; Dillavou, S.; Shleifer, S.; Wiseman, S.; Gruetter, S.; Bowman, S. R.; Schoenholz, S. S.; Han, S.; Kwatra, S.; Rous, S. A.; Ghazarian, S.; Ghosh, S.; Casey, S.; Bischoff, S.; Gehrmann, S.; Schuster, S.; Sadeghi, S.; Hamdan, S. S.; Zhou, S.; Srivastava, S.; Shi, S.; Singh, S.; Asaadi, S.; Gu, S. S.; Pachchigar, S.; Toshniwal, S.; Upadhyay, S.; Debnath, S.; Shakeri, S.; Thormeyer, S.; Melzi, S.; Reddy, S.; Makini, S. P.; Lee, S.-H.; Torene, S.; Hatwar, S.; Dehaene, S.; Divic, S.; Ermon, S.; Biderman, S.; Lin, S.; Prasad, S.; Piantadosi, S. T.; Shieber, S. M.; Misherghi, S.; Kiritchenko, S.; Mishra, S.; Linzen, T.; Schuster, T.; Li, T.; Yu, T.; Ali, T.; Hashimoto, T.; Wu, T.-L.; Desbordes, T.; Rothschild, T.; Phan, T.; Wang, T.; Nkinyili, T.; Schick, T.; Kornev, T.; Tunduny, T.; Gerstenberg, T.; Chang, T.; Neeraj, T.; Khot, T.; Shultz, T.; Shaham, U.; Misra, V.; Demberg, V.; Nyamai, V.; Raunak, V.; Ramasesh, V. V.; Prabhu, V. U.; Padmakumar, V.; Srikumar, V.; Fedus, W.; Saunders, W.; Zhang, W.; Vossen, W.; Ren, X.; Tong, X.; Zhao, X.; Wu, X.; Shen, X.; Yaghoobzadeh, Y.; Lakretz, Y.; Song, Y.; Bahri, Y.; Choi, Y.; Yang, Y.; Hao, Y.; Chen, Y.; Belinkov, Y.; Hou, Y.; Hou, Y.; Bai, Y.; Seid, Z.; Zhao, Z.; Wang, Z.; Wang, Z. J.; Wang, Z.; Wu, Z., Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. ArXiv 2022, abs/2206.04615. [CrossRef]
- Xu, L.; Zhang, X.; Li, L.; Hu, H.; Cao, C.; Liu, W.; Li, J.; Li, Y.; Sun, K.; Xu, Y.; Cui, Y.; Yu, C.; Dong, Q.; Tian, Y.; Yu, D.; Shi, B.; Zeng, J.-j.; Wang, R.; Xie, W.; Li, Y.; Patterson, Y.; Tian, Z.; Zhang, Y.; Zhou, H.; Liu, S.; Zhao, Q.; Yue, C.; Zhang, X.; Yang, Z.-Y.; Richardson, K.; Lan, Z. In CLUE: A Chinese Language Understanding Evaluation Benchmark, International Conference on Computational Linguistics, 2020.
- Xu, L.; Lu, X.; Yuan, C.; Zhang, X.; Yuan, H.; Xu, H.; Wei, G.; Pan, X.; Hu, H., FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark. ArXiv 2021, abs/2107.07498.
- The winograd schema challenge. Proceedings of the International Conference on Knowledge Representation and Reasoning 2012, 552--561.
- Wang, A.; Pruksachatkun, Y.; Nangia, N.; Singh, A.; Michael, J.; Hill, F.; Levy, O.; Bowman, S. R., SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems. ArXiv 2019, abs/1905.00537.
- Kwiatkowski, T.; Palomaki, J.; Redfield, O.; Collins, M.; Parikh, A. P.; Alberti, C.; Epstein, D.; Polosukhin, I.; Devlin, J.; Lee, K.; Toutanova, K.; Jones, L.; Kelcey, M.; Chang, M.-W.; Dai, A. M.; Uszkoreit, J.; Le, Q. V.; Petrov, S., Natural Questions: A Benchmark for Question Answering Research. Transactions of the Association for Computational Linguistics 2019, 7, 453-466. [CrossRef]
- Geva, M.; Khashabi, D.; Segal, E.; Khot, T.; Roth, D.; Berant, J., Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies. Transactions of the Association for Computational Linguistics 2021, 9, 346-361. [CrossRef]
- Talmor, A.; Herzig, J.; Lourie, N.; Berant, J., CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge. ArXiv 2019, abs/1811.00937.
- Zhou, B.; Khashabi, D.; Ning, Q.; Roth, D., “Going on a vacation” takes longer than “Going for a walk”: A Study of Temporal Commonsense Understanding. ArXiv 2019, abs/1909.03065.
- Jin, D.; Pan, E.; Oufattole, N.; Weng, W.-H.; Fang, H.; Szolovits, P., What Disease Does This Patient Have? A Large-Scale Open Domain Question Answering Dataset from Medical Exams. Applied Sciences 2021, 11 (14), 6421. [CrossRef]
- Pal, A.; Umapathi, L. K.; Sankarasubbu, M. In MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering, ACM Conference on Health, Inference, and Learning, 2022.
- Jin, Q.; Dhingra, B.; Liu, Z.; Cohen, W. W.; Lu, X. In PubMedQA: A Dataset for Biomedical Research Question Answering, Conference on Empirical Methods in Natural Language Processing, 2019.
- Chen, S.; Ju, Z.; Dong, X.; Fang, H.; Wang, S.; Yang, Y.; Zeng, J.; Zhang, R.; Zhang, R.; Zhou, M.; Zhu, P.; Xie, P., MedDialog: A Large-scale Medical Dialogue Dataset. ArXiv 2020, abs/2004.03329.
- Zhang, N.; Chen, M.; Bi, Z.; Liang, X.; Li, L.; Shang, X.; Yin, K.; Tan, C.; Xu, J.; Huang, F.; Si, L.; Ni, Y.; Xie, G.; Sui, Z.; Chang, B.; Zong, H.; Yuan, Z.; Li, L.; Yan, J.; Zan, H.; Zhang, K.; Tang, B.; Chen, Q. In CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark, Dublin, Ireland, May; Association for Computational Linguistics: Dublin, Ireland, 2022; pp 7888-7915.
- Jha, D.; Smedsrud, P. H.; Riegler, M. A.; Halvorsen, P.; de Lange, T.; Johansen, D.; Johansen, H. D. In Kvasir-SEG: A Segmented Polyp Dataset, MultiMedia Modeling, Cham, 2020//; Ro, Y. M.; Cheng, W.-H.; Kim, J.; Chu, W.-T.; Cui, P.; Choi, J.-W.; Hu, M.-C.; De Neve, W., Eds. Springer International Publishing: Cham, 2020; pp 451-462.
- Vázquez, D.; Bernal, J.; Sánchez, F. J.; Fernández-Esparrach, G.; López, A. M.; Romero, A.; Drozdzal, M.; Courville, A., A Benchmark for Endoluminal Scene Segmentation of Colonoscopy Images. Journal of healthcare engineering 2017, 2017, 4037190. [CrossRef]
- Bernal, J.; Sánchez, J.; Vilariño, F., Towards automatic polyp detection with a polyp appearance model. Pattern Recognition 2012, 45 (9), 3166-3182. [CrossRef]
- Silva, J.; Histace, A.; Romain, O.; Dray, X.; Granado, B., Toward embedded detection of polyps in WCE images for early diagnosis of colorectal cancer. International journal of computer assisted radiology and surgery 2014, 9 (2), 283-93.
- Caicedo, J. C.; Goodman, A.; Karhohs, K. W.; Cimini, B. A.; Ackerman, J.; Haghighi, M.; Heng, C.; Becker, T.; Doan, M.; McQuin, C.; Rohban, M.; Singh, S.; Carpenter, A. E., Nucleus segmentation across imaging experiments: the 2018 Data Science Bowl. Nature Methods 2019, 16 (12), 1247-1253. [CrossRef]
- Tschandl, P.; Rosendahl, C.; Kittler, H., The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Scientific Data 2018, 5 (1), 180161. [CrossRef]
- Shu, X.; Chang, F.; Zhang, X.; Shao, C.; Yang, X., ECAU-Net: Efficient channel attention U-Net for fetal ultrasound cerebellum segmentation. Biomedical Signal Processing and Control 2022, 75, 103528. [CrossRef]
- Gulshan, V.; Peng, L.; Coram, M.; Stumpe, M. C.; Wu, D.; Narayanaswamy, A.; Venugopalan, S.; Widner, K.; Madams, T.; Cuadros, J., Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. Jama 2016. [CrossRef]
- Bycroft, C.; Freeman, C.; Petkova, D.; Band, G.; Elliott, L. T.; Sharp, K.; Motyer, A.; Vukcevic, D.; Delaneau, O.; O’Connell, J.; Cortes, A.; Welsh, S.; Young, A.; Effingham, M.; McVean, G.; Leslie, S.; Allen, N.; Donnelly, P.; Marchini, J., The UK Biobank resource with deep phenotyping and genomic data. Nature 2018, 562 (7726), 203-209. [CrossRef]
- Lotz, J.; Weiss, N.; van der Laak, J.; Heldmann, S., Comparison of consecutive and restained sections for image registration in histopathology. Journal of Medical Imaging 2021, 10, 067501 - 067501.
- Liu, S.; Zhu, C.; Xu, F.; Jia, X.; Shi, Z.; Jin, M., BCI: Breast Cancer Immunohistochemical Image Generation through Pyramid Pix2pix. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022, 1814-1823. [CrossRef]
- Ehteshami Bejnordi, B.; Veta, M.; Johannes van Diest, P.; van Ginneken, B.; Karssemeijer, N.; Litjens, G.; van der Laak, J. A. W. M.; Consortium, a. t. C., Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer. JAMA 2017, 318 (22), 2199-2210. [CrossRef]
- Wang, Z.; Liu, C.; Zhang, S.; Dou, Q., Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train. ArXiv 2023, abs/2306.16741.
- Sudlow, C.; Gallacher, J.; Allen, N.; Beral, V.; Burton, P.; Danesh, J.; Downey, P.; Elliott, P.; Green, J.; Landray, M.; Liu, B.; Matthews, P.; Ong, G.; Pell, J.; Silman, A.; Young, A.; Sprosen, T.; Peakman, T.; Collins, R., UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS medicine 2015, 12 (3), e1001779. [CrossRef]
- Irvin, J. A.; Rajpurkar, P.; Ko, M.; Yu, Y.; Ciurea-Ilcus, S.; Chute, C.; Marklund, H.; Haghgoo, B.; Ball, R. L.; Shpanskaya, K. S.; Seekins, J.; Mong, D. A.; Halabi, S. S.; Sandberg, J. K.; Jones, R.; Larson, D. B.; Langlotz, C.; Patel, B. N.; Lungren, M. P.; Ng, A. In CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison, AAAI Conference on Artificial Intelligence, 2019.
- Pavlova, M.; Terhljan, N.; Chung, A. G.; Zhao, A.; Surana, S.; Aboutalebi, H.; Gunraj, H.; Sabri, A.; Alaref, A.; Wong, A., COVID-Net CXR-2: An Enhanced Deep Convolutional Neural Network Design for Detection of COVID-19 Cases From Chest X-ray Images. Frontiers in medicine 2022, 9, 861680. [CrossRef]
- Johnson, A. E. W.; Pollard, T. J.; Berkowitz, S. J.; Greenbaum, N. R.; Lungren, M. P.; Deng, C. Y.; Mark, R. G.; Horng, S., MIMIC-CXR, a de-identified publicly available database of chest radiographs with free-text reports. Sci Data 2019, 6 (1), 317. [CrossRef]
- Shao, H.; Zhong, D.; Du, X., Deep Distillation Hashing for Unconstrained Palmprint Recognition. IEEE Transactions on Instrumentation and Measurement 2021, 70, 1-13. [CrossRef]
- Liu, G.; Liao, Y.; Wang, F.; Zhang, B.; Zhang, L.; Liang, X.; Wan, X.; Li, S.; Li, Z.; Zhang, S.; Cui, S., Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning. IEEE transactions on neural networks and learning systems 2021, 32 (9), 3786-3797. [CrossRef]
- Cohen, J. P.; Morrison, P.; Dao, L., COVID-19 Image Data Collection. ArXiv 2020, abs/2003.11597.
- Iglesia-Vayá, M. d. l.; Saborit, J. M.; Montell, J. A.; Pertusa, A.; Bustos, A.; Cazorla, M.; Galant, J.; Barber, X.; Orozco-Beltrán, D.; García-García, F.; Caparrós, M.; González, G.; Salinas, J. M., BIMCV COVID-19+: a large annotated dataset of RX and CT images from COVID-19 patients. ArXiv 2020, abs/2006.01174.
- Wu, X.; Yang, S.; Qiu, Z.; Ge, S.; Yan, Y.; Wu, X.; Zheng, Y.; Zhou, S. K.; Xiao, L., DeltaNet: Conditional Medical Report Generation for COVID-19 Diagnosis. ArXiv 2022, abs/2211.13229.
- Johnson, A. E.; Pollard, T. J.; Shen, L.; Lehman, L. W.; Feng, M.; Ghassemi, M.; Moody, B.; Szolovits, P.; Celi, L. A.; Mark, R. G., MIMIC-III, a freely accessible critical care database. Sci Data 2016, 3, 160035. [CrossRef]
- Shih, G.; Wu, C. C.; Halabi, S. S.; Kohli, M. D.; Prevedello, L. M.; Cook, T. S.; Sharma, A.; Amorosa, J.; Arteaga, V. A.; Galperin-Aizenberg, M.; Gill, R. R.; Godoy, M. C. B.; Hobbs, S.; Jeudy, J.; Laroia, A.; Shah, P. N.; Vummidi, D. R.; Yaddanapudi, K.; Stein, A., Augmenting the National Institutes of Health Chest Radiograph Dataset with Expert Annotations of Possible Pneumonia. Radiology. Artificial intelligence 2019, 1 1, e180041. [CrossRef]
- Wang, X.; Peng, Y.; Lu, L.; Lu, Z.; Bagheri, M.; Summers, R. M., ChestX-Ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017, 3462-3471. [CrossRef]
- Jaeger, S.; Candemir, S.; Antani, S.; Wáng, Y. X.; Lu, P. X.; Thoma, G., Two public chest X-ray datasets for computer-aided screening of pulmonary diseases. Quantitative imaging in medicine and surgery 2014, 4 (6), 475-7.
- Zhang, S.; Xu, Y.; Usuyama, N.; Xu, H.; Bagga, J.; Tinn, R.; Preston, S.; Rao, R.; Wei, M.; Valluri, N.; Wong, C.; Tupini, A.; Wang, Y.; Mazzola, M.; Shukla, S.; Liden, L.; Gao, J.; Lungren, M. P.; Naumann, T.; Wang, S.; Poon, H., BiomedCLIP: a multimodal biomedical foundation model pretrained from f ifteen million scientific image-text pairs. [CrossRef]
- Lau, J. J.; Gayen, S.; Ben Abacha, A.; Demner-Fushman, D., A dataset of clinically generated visual questions and answers about radiology images. Sci Data 2018, 5, 180251. [CrossRef]
- Liu, B.; Zhan, L.-M.; Xu, L.; Ma, L.; Yang, Y. F.; Wu, X.-M., Slake: A Semantically-Labeled Knowledge-Enhanced Dataset For Medical Visual Question Answering. 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) 2021, 1650-1654. [CrossRef]
- He, X.; Zhang, Y.; Mou, L.; Xing, E. P.; Xie, P., PathVQA: 30000+ Questions for Medical Visual Question Answering. ArXiv 2020, abs/2003.10286.
- Johnson, A. E. W.; Pollard, T. J.; Greenbaum, N. R.; Lungren, M. P.; Deng, C.-y.; Peng, Y.; Lu, Z.; Mark, R. G.; Berkowitz, S. J.; Horng, S. In MIMIC-CXR-JPG, a large publicly available database of labeled chest radiographs, 2019. [CrossRef]
- Demner-Fushman, D.; Kohli, M. D.; Rosenman, M. B.; Shooshan, S. E.; Rodriguez, L.; Antani, S.; Thoma, G. R.; McDonald, C. J., Preparing a collection of radiology examinations for distribution and retrieval. Journal of the American Medical Informatics Association : JAMIA 2016, 23 (2), 304-10.
- Johnson, A. E. W.; Pollard, T. J.; Berkowitz, S. J.; Greenbaum, N. R.; Lungren, M. P.; Deng, C.-y.; Mark, R. G.; Horng, S., MIMIC-CXR, a de-identified publicly available database of chest radiographs with free-text reports. Scientific Data 2019, 6 (1), 317. [CrossRef]
| Mode | Models | Tasks | Params | Datasets |
| Text | GatorTron [75] | Quick access to information to aid clinical diagnosis. | 8.9B | The UF Health Integrated Data Repository (IDR), PubMed [103], Wikipedia [103], Medical Information Mart for Intensive Care III (MIMIC-III) [104], emrQA dataset [105] |
| MedGPT [5] | To predict and diagnose diseases and to design treatment plans for patients. | GPTv2 (7.5B/1.5B/11.7B) | King's College Hospital (KCH), MIMIC-III [104] | |
| LLM-ICD [78] | Automatically generates ICD codes for retinal diseases. | ChatGPT/GPT-3.5 | MIMIC-III [104] | |
| ChatDoctor [6] | Provide accurate advice to patients through self-searching of online and offline medical databases. | 7B | HealthCareMagic-100k (www.healthcaremagic.com), iCliniq (www.icliniq.com) | |
| Clinical Camel [79] | Applied to clinical research, turning medical literature into conversation. | 13B/70B | ShareGPT, Clinical Articles, MedQA [106] | |
| DoctorGLM [7] | Chinese medical dialogue model. | 6.2B | Translated ChatDoctor's [6] database, Chinese Medical Dialogue (CMD), MedDialog | |
| HuatuoGPT [81] | Provide detailed, rich content while interacting and diagnosing like a doctor. | 7B | HealthCareMagic-100k, iCliniq, cMedQA2 [107], webMedQA [108], and Huatuo-26M [109] | |
| Text | MedicalGPT-zh (MING) [82] | Handle complex Chinese medical conversations and apply to a variety of scenarios, such as online medical consultation, patient education, health guidance, etc. | 7B | Chinese medical dialogue dataset constructed based on www.healthcaremagic.com, USMLE cases, and other data |
| ChatMed [83] | Answer patients' daily medical questions online. | 7B | ChatMed Consult Dataset, ChatMed TCM Dataset | |
| HuaTuo [84] (BenTsao) | Generate accurate and professional medical information in a Chinese context. | 7B | Chinese Medical Knowledge Graph (CMeKG) [110] | |
| Zhongjing [85] | Integrate TCM knowledge into the LLM to provide patients with personalized TCM advice and treatment plans. | 13B | CMeKG [110], Chinese Multi-turn Medical Question Answering (CMtMedQA), Huatuo-26M [109] | |
| GLM-130B [86] | Intelligent question and answer for medical and health problems to assist diagnosis and treatment. | 130B | Pile [111], Chinese WudaoCorpora [112], LAMBADA [113], MMLU [114], BIG-bench-lite [115], CLUE [116], FewCLUE [117], Winograd Schemas Challenge (WSC) [118], SuperGLUE [119], Natural Questions [120], StrategyQA [121], Commonsense QA [122], Multiple-choice Temporal Commonsense (MC-TACO) [123] | |
| Med-PaLM 2 [30] | Answer open-ended questions from patients. | 340B | MedQA [124], MedMCQA [125], PubMedQA [126], MMLU [114] | |
| Text | BianQue [89] | The model was trained with multiple rounds of dialogue data to improve its ability of asking questions. | 6.2B | BianQueCorpus, MedDialog-CN [127], IMCS-V2 [128], CHIPMDCFNPC [128], MedDG [128] |
| Image | CSCA U-Net [90] | Accurately identify and segment areas of interest in medical images, so that doctors can propose treatment plans suitable for patients. | 35.27M | Kvasir-SEG [129], CVC-ClinicDB [130], CVC-ColonDB [131], ETIS [132], CVC-T [130], 2018 Data Science Bowl (2018 DSB) [133], ISIC 2018 [134], JSUAH-Cerebellum [135] |
| RETFound [91] | The diagnosis and prognosis of eye diseases and the prediction of complex systemic diseases. | 1.6M | Moorfields Diabetic imAge dataSet (MEH-MIDAS), Kaggle EyePACS [136], Moorfields AlzEye study (MEH-AlzEye), UK Biobank [137] | |
| PathoDuet [92] | Understanding and analyzing pathological images. | 1.5M | TCGA, HyReCo [138], BCI [139], NCT-CRC-HE, CAMELYON16 [140], IHC dataset | |
| Multi-mode | OpenMEDLab [94] | The basic medical model can be applied to a variety of medical data and solve a variety of clinical and research problems. | 14M-1.4B | SA-Med2D-20M, SNOW, Endo-FM [141], MedFM |
| GMAI [95] | Use multiple datasets to learn and flexibly interpret different medical data. | 540B | MIMIC [48], UK Biobank [142], UniProt | |
| Multi-mode | Med-MLLM [96] | Learn medical knowledge from unlabeled data for rapid response to rare diseases and outbreaks. | 8.9B | CheXpert [143], COVIDx-CXR-2 [144], MIMIC-CXR [145], COVID-19-CT-CXR [146], COVID-19 CT [147], COVID-CXR [148], BIMCV-COVID-19 [149], COVID-HCH [150], PubMed [103], MIMIC-III [151], RSNA Pneumonia [152], NIH ChestX-ray [153], Shenzhen Tuberculosis [154] |
| Visual Med-Alpaca [96] | Understand visual information and generate biomedical relevant text and image content. | 7B | roco-dataset, MEDIQA RQE, MedQA, MedDialog, MEDIQA QA, PubMedQA | |
| LLaVA-Med [97] | A visual-verbal dialogue assistant that answers open research questions on biomedical images. | 7B | PMC-15M [155], VQA-RAD [156], SLAKE [157], PathVQA [158] | |
| ChatCAD [98] | Convert medical images into text to generate diagnostic reports. | 175B | MIMIC-CXR [145], CheXpert [143] | |
| XrayGLM [99] | Provide diagnostic reports by viewing chest X-rays. | 6B | MIMIC-CXR [159], OpenI [160] | |
| XrayGPT [100] | Conversational medical AI for radiation image analysis. | 7B | MIMIC-CXR [161], OpenI [160] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).