Submitted:
26 June 2025
Posted:
27 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Contributions and Limitations of the Work
1.2. Organization of the Work
2. Related Works
3. Methodology
3.1. Pathology Classification
3.1.1. Dataset Sampling
3.2. Affected Region and Its Anatomical Name
3.3. Report Generation
4. Results and Discussion
4.1. Classification Models Evaluation
4.1.1. Models Comparison
4.1.2. Sample Demographics
4.2. Report Generation Evaluation
4.3. IHRAS Case Studies
5. Conclusion
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| AP | Antero-Posterior |
| CNN | Convolutional Neural Network |
| CXR | Chest X-Ray |
| Grad-CAM | Gradient-weighted Class Activation Mapping |
| DenseNet | Dense Convolutional Network |
| IHRAS | Intelligent Humanized Radiology Analysis System |
| PA | Postero-Anterior |
| ResNet | a Deep Residual Learning Network |
| SAR-Net | Structure-Aware Relation Network |
| SNOMED CT | Systematized Nomenclature of Medicine - Clinical Terms |
| SVM | Support Vector Machine |
References
- Akhter, Y.; Singh, R.; Vatsa, M. AI-based radiodiagnosis using chest X-rays: A review. Frontiers in big data 2023, 6, 1120989. [Google Scholar] [CrossRef] [PubMed]
- O’Reilly, P.; Awwad, D.A.; Lewis, S.; Reed, W.; Ekpo, E. Inter-rater concordance in the classification of COVID-19 in chest X-ray images using the RANZCR template for COVID-19 infection. Journal of Medical Imaging and Radiation Sciences 2025, 56, 101911. [Google Scholar] [CrossRef]
- Lantsman, C.D.; Barash, Y.; Klang, E.; Guranda, L.; Konen, E.; Tau, N. Trend in radiologist workload compared to number of admissions in the emergency department. European Journal of Radiology 2022, 149, 110195. [Google Scholar] [CrossRef] [PubMed]
- Fanni, S.C.; Marcucci, A.; Volpi, F.; Valentino, S.; Neri, E.; Romei, C. Artificial intelligence-based software with CE mark for chest X-ray interpretation: Opportunities and challenges. Diagnostics 2023, 13, 2020. [Google Scholar] [CrossRef] [PubMed]
- Çallı, E.; Sogancioglu, E.; van Ginneken, B.; van Leeuwen, K.G.; Murphy, K. Deep learning for chest X-ray analysis: A survey. Medical image analysis 2021, 72, 102125. [Google Scholar] [CrossRef]
- Rajpoot, R.; Gour, M.; Jain, S.; Semwal, V.B. Integrated ensemble CNN and explainable AI for COVID-19 diagnosis from CT scan and X-ray images. Scientific Reports 2024, 14, 24985. [Google Scholar] [CrossRef]
- Yang, Y.; Liu, Y.; Liu, X.; Gulhane, A.; Mastrodicasa, D.; Wu, W.; Wang, E.J.; Sahani, D.; Patel, S. Demographic bias of expert-level vision-language foundation models in medical imaging. Science Advances 2025, 11, eadq0305. [Google Scholar] [CrossRef]
- Li, J.; Zhu, G.; Hua, C.; Feng, M.; Bennamoun, B.; Li, P.; Lu, X.; Song, J.; Shen, P.; Xu, X.; et al. A systematic collection of medical image datasets for deep learning. ACM Computing Surveys 2023, 56, 1–51. [Google Scholar] [CrossRef]
- Reis, E.P.; De Paiva, J.P.; Da Silva, M.C.; Ribeiro, G.A.; Paiva, V.F.; Bulgarelli, L.; Lee, H.M.; Santos, P.V.; Brito, V.M.; Amaral, L.T.; et al. BRAX, Brazilian labeled chest x-ray dataset. Scientific Data 2022, 9, 487. [Google Scholar] [CrossRef]
- Malik, H.; Anees, T.; Din, M.; Naeem, A. CDC_Net: Multi-classification convolutional neural network model for detection of COVID-19, pneumothorax, pneumonia, lung Cancer, and tuberculosis using chest X-rays. Multimedia Tools and Applications 2023, 82, 13855–13880. [Google Scholar] [CrossRef]
- Sharma, S.; Guleria, K. A deep learning based model for the detection of pneumonia from chest X-ray images using VGG-16 and neural networks. Procedia Computer Science 2023, 218, 357–366. [Google Scholar] [CrossRef]
- Abhishek, A.; Jha, R.K.; Sinha, R.; Jha, K. Automated detection and classification of leukemia on a subject-independent test dataset using deep transfer learning supported by Grad-CAM visualization. Biomedical Signal Processing and Control 2023, 83, 104722. [Google Scholar] [CrossRef]
- Teng, Z.; Li, L.; Xin, Z.; Xiang, D.; Huang, J.; Zhou, H.; Shi, F.; Zhu, W.; Cai, J.; Peng, T.; et al. A literature review of artificial intelligence (AI) for medical image segmentation: from AI and explainable AI to trustworthy AI. Quantitative Imaging in Medicine and Surgery 2024, 14, 9620. [Google Scholar] [CrossRef] [PubMed]
- Tan, Y.; Zhang, Z.; Li, M.; Pan, F.; Duan, H.; Huang, Z.; Deng, H.; Yu, Z.; Yang, C.; Shen, G.; et al. MedChatZH: A tuning LLM for traditional Chinese medicine consultations. Computers in biology and medicine 2024, 172, 108290. [Google Scholar] [CrossRef] [PubMed]
- Umair, M.; Khan, M.S.; Ahmed, F.; Baothman, F.; Alqahtani, F.; Alian, M.; Ahmad, J. Detection of COVID-19 using transfer learning and Grad-CAM visualization on indigenously collected X-ray dataset. Sensors 2021, 21, 5813. [Google Scholar] [CrossRef]
- Maity, A.; Nair, T.R.; Mehta, S.; Prakasam, P. Automatic lung parenchyma segmentation using a deep convolutional neural network from chest X-rays. Biomedical Signal Processing and Control 2022, 73, 103398. [Google Scholar] [CrossRef]
- Kufel, J.; Bielówka, M.; Rojek, M.; Mitręga, A.; Lewandowski, P.; Cebula, M.; Krawczyk, D.; Bielówka, M.; Kondoł, D.; Bargieł-ączek, K.; et al. Multi-label classification of chest X-ray abnormalities using transfer learning techniques. Journal of Personalized Medicine 2023, 13, 1426. [Google Scholar] [CrossRef]
- Nicolson, A.; Dowling, J.; Koopman, B. Improving chest X-ray report generation by leveraging warm starting. Artificial intelligence in medicine 2023, 144, 102633. [Google Scholar] [CrossRef]
- Verma, K.; Sikka, G.; Swaraj, A.; Kumar, S.; Kumar, A. Classification of COVID-19 on Chest X-Ray Images Using Deep Learning Model with Histogram Equalization and Lung Segmentation. SN Computer Science 2024, 5, 379. [Google Scholar] [CrossRef]
- Cirik, E.; Osman, O.; Esen, V. Exploring the Impact of CLAHE Processing on Disease Classes ‘Effusion,’‘Infiltration,’‘Atelectasis,’and ‘Mass’ in the NIH Chest XRay Dataset Using VGG16 and ResNet50 Architectures. In Proceedings of the International Conference on Forthcoming Networks and Sustainability in the AIoT Era. Springer; 2024; pp. 422–429. [Google Scholar]
- Wang, X.; Peng, Y.; Lu, L.; Lu, Z.; Bagheri, M.; Summers, R.M. Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In Proceedings of the Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp.2097–2106.
- Cohen, J.P.; Viviano, J.D.; Bertin, P.; Morrison, P.; Torabian, P.; Guarrera, M.; Lungren, M.P.; Chaudhari, A.; Brooks, R.; Hashir, M.; et al. TorchXRayVision: A library of chest X-ray datasets and models. In Proceedings of the International Conference on Medical Imaging with Deep Learning. PMLR; 2022; pp. 231–249. [Google Scholar]
- Huang, G.; Liu, Z.; van der Maaten, L.; Weinberger, K.Q. Densely Connected Convolutional Networks. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016. [Google Scholar]
- Swapnarekha, H.; Behera, H.S.; Nayak, J.; Naik, B. Deep DenseNet and ResNet Approach for COVID-19 Prognosis: Experiments on Real CT Images. In Proceedings of the Computational Intelligence in Pattern Recognition; Das, A.K.; Nayak, J.; Naik, B.; Dutta, S.; Pelusi, D., Eds., Singapore, 2022; pp. 731–747.
- Suara, S.; Jha, A.; Sinha, P.; Sekh, A.A. Is grad-cam explainable in medical images? In Proceedings of the International Conference on Computer Vision and Image Processing. Springer; 2023; pp. 124–135. [Google Scholar]
- Lian, J.; Liu, J.; Zhang, S.; Gao, K.; Liu, X.; Zhang, D.; Yu, Y. A Structure-Aware Relation Network for Thoracic Diseases Detection and Segmentation. IEEE Transactions on Medical Imaging 2021, 40, 2042–2052. [Google Scholar] [CrossRef]
- Wu, J.; Gu, B.; Zhou, R.; Xie, K.; Snyder, D.; Jiang, Y.; Carducci, V.; Wyss, R.; Desai, R.J.; Alsentzer, E.; et al. BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text. arXiv 2025, arXiv:cs.CL/2504.19467]. [Google Scholar]
- Guo, D.; Yang, D.; Zhang, H.; Song, J.; Zhang, R.; Xu, R.; Zhu, Q.; Ma, S.; Wang, P.; Bi, X.; et al. Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning. arXiv preprint, arXiv:2501.12948.2025. [Google Scholar]
- Zhong, R.; Xu, Y.; Zhang, C.; Yu, J. Leveraging large language model to generate a novel metaheuristic algorithm with CRISPE framework. Cluster Computing 2024, 27, 13835–13869. [Google Scholar] [CrossRef]
- Vuokko, R.; Vakkuri, A.; Palojoki, S. Systematized nomenclature of medicine–clinical terminology (SNOMED CT) clinical use cases in the context of electronic health record systems: systematic literature review. JMIR medical informatics 2023, 11, e43750. [Google Scholar] [CrossRef] [PubMed]
- Li, J.; Li, R.; Liu, Q. Beyond static datasets: A deep interaction approach to llm evaluation. arXiv preprint, arXiv:2309.04369 2023.
- Hayat, N.; Lashen, H.; Shamout, F.E. Multi-label generalized zero shot learning for the classification of disease in chest radiographs. In Proceedings of the Machine learning for healthcare conference. PMLR; 2021; pp. 461–477. [Google Scholar]
- Santomartino, S.M.; Hafezi-Nejad, N.; Parekh, V.S.; Yi, P.H. Performance and usability of code-free deep learning for chest radiograph classification, object detection, and segmentation. Radiology: Artificial Intelligence 2023, 5, e220062. [Google Scholar] [CrossRef]





| Reference | Year | Classification | Explainability | Segmentation | LLM |
|---|---|---|---|---|---|
| [15] | 2021 | ✓ | ✓ | ✗ | ✗ |
| [16] | 2022 | ✗ | ✗ | ✓ | ✗ |
| [17] | 2023 | ✓ | ✓ | ✗ | ✗ |
| [18] | 2023 | ✓ | ✗ | ✗ | ✓ |
| [19] | 2024 | ✓ | ✗ | ✓ | ✗ |
| [20] | 2024 | ✓ | ✗ | ✗ | ✗ |
| Our work | 2025 | ✓ | ✓ | ✓ | ✓ |
| Component | Description |
|---|---|
| Capacity | You have deep knowledge of clinical report writing using SNOMED CT, based exclusively on the informed findings |
| Role | Act as a board-certified clinical radiologist preparing an official diagnostic report, with accurate and clear communication of findings |
| Insight | Patient age and gender, view position, findings and probabilities, most affected region |
| Statement | Write a radiological report using SNOMED CT with: Patient Metadata Section, Findings Interpretation, Anatomical Localization |
| Personality | Professional, concise clinical tone using complete sentences with standard medical report structure |
| Experiment | Provide a single report |
| Category | Value | Precision | Recall | F1-score | Specificity |
|---|---|---|---|---|---|
| Age | 0-25 | 0.23 | 0.52 | 0.32 | 0.84 |
| 25-50 | 0.23 | 0.52 | 0.32 | 0.85 | |
| 50-75 | 0.24 | 0.53 | 0.33 | 0.85 | |
| 75+ | 0.24 | 0.51 | 0.33 | 0.86 | |
| Gender | M | 0.24 | 0.53 | 0.33 | 0.85 |
| F | 0.23 | 0.52 | 0.32 | 0.85 | |
| View Position | PA | 0.24 | 0.52 | 0.33 | 0.85 |
| AP | 0.23 | 0.53 | 0.32 | 0.85 | |
| Findings | No findings | 0.24 | 0.51 | 0.32 | 0.85 |
| Effusion | 0.24 | 0.53 | 0.33 | 0.85 | |
| Atelectasis | 0.24 | 0.53 | 0.33 | 0.85 | |
| Pneumothorax | 0.23 | 0.52 | 0.32 | 0.85 | |
| Edema | 0.23 | 0.54 | 0.32 | 0.85 | |
| Infiltration | 0.24 | 0.53 | 0.33 | 0.85 | |
| Fibrosis | 0.23 | 0.54 | 0.32 | 0.84 | |
| Consolidation | 0.23 | 0.52 | 0.32 | 0.85 | |
| Emphysema | 0.24 | 0.51 | 0.33 | 0.85 | |
| Mass | 0.23 | 0.53 | 0.33 | 0.85 | |
| Pneumonia | 0.24 | 0.55 | 0.33 | 0.84 | |
| Hernia | 0.27 | 0.56 | 0.36 | 0.85 | |
| Cardiomegaly | 0.23 | 0.54 | 0.33 | 0.85 | |
| Pleural Thickening | 0.24 | 0.51 | 0.32 | 0.85 | |
| Nodule | 0.24 | 0.51 | 0.32 | 0.85 |
| Metric | Mean Score | Min Score | Max Score |
|---|---|---|---|
| Faithfulness | 0.99 | 0.89 | 1.00 |
| Answer Relevancy | 0.91 | 0.75 | 1.00 |
| Hallucination | 0.00 | 0.00 | 0.00 |
| Toxicity | 0.00 | 0.00 | 0.00 |
| Bias | 0.00 | 0.00 | 0.00 |
| Prompt Alignment | 0.86 | 0.82 | 0.90 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).