Submitted:
07 April 2025
Posted:
07 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Utilize ChatGPT for chronic disease diagnosis, particularly heart disease and diabetes prediction.
- Explore the impact of feature selection methods and prompt engineering on predictive performance.
- Investigate the influence of different learning strategies, including zero-shot, few-shot, and chain-of-thought reasoning on model outcomes.
- Evaluate ChatGPT’s results against models from the literature.
- Suggest a workflow for ChatGPT as an assistant for ML/ DL models to enhance clinical decision-making.
2. Related Works
2.1. Diabetes Disease Prediction
2.2. Heart Disease Prediction
3. Methodology
3.1. Dataset
3.1.1. Diabetes Disease Dataset
3.2. Data Pre-Processing
3.3. Experiments
3.3.1. Feature Selection
Pima Indian Diabetes Dataset (PIDD)
Frankfurt Hospital Diabetes Dataset (FHDD)
UCI Cleveland Heart Disease Dataset
3.3.2. Learning Strategy Selection
3.3.3. Prompt Engineering
Prompts Formulation and Optimization
3.3.4. API Configuration and Parameter Optimization
3.3.5. Evaluation
4. Results
4.1. Diabetes Prediction
4.1.1. Prediction Results for the Pima Indian Diabetes Dataset (PIDD)
4.1.2. Prediction Results for the Frankfurt Hospital Diabetes Dataset (FHDD)
4.2. Heart Disease Prediction
5. Discussion
5.1. Comparative analysis with Existing ML/DL Models
5.2. Challenges in Using ChatGPT for Disease Prediction and Potential Enhancements
- High sensitivity to prompt engineering: High sensitivity to prompt formulation
- Complexity of medical data: The varying complexity of medical data and disease types such as heart disease.
- Bise: ChatGPT relies on general training data, which introduces bias.
- Regulation issues: ChatGPT is not FDA/CE-approved.
- Privacy concerns: ChatGPT can get access to highly sensitive medical data leading to discrimination or misuse.
5.3. The Role of ChatGPT in Enhancing Clinical Decision-Making
6. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
References
- D. Fernández-edreira, J. Liñares-blanco, and C. Fernandez-lozano, “Machine Learning analysis of the human infant gut microbiome identifies influential species in type 1 diabetes,” Expert Syst. Appl., vol. 185, no. November 2020, p. 115648, 2021. [CrossRef]
- Y. Kumar, A. Koul, R. Singla, and M. F. Ijaz, “Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda,” J. Ambient Intell. Humaniz. Comput., vol. 14, no. 7, pp. 8459–8486, 2023. [CrossRef]
- J. Tom, M. Zsoldos, and A. Thurzo, “AI and Face-Driven Orthodontics : A Scoping Review of Digital Advances in Diagnosis and Treatment Planning,” pp. 158–176, 2024.
- S. Dara, S. Dhamercherla, S. Singh, J. Ch, and M. Babu, Machine Learning in Drug Discovery : A Review, vol. 55, no. 3. Springer Netherlands, 2022.
- Blanco-gonz et al., “The Role of AI in Drug Discovery : Challenges , Opportunities , and Strategies,” pp. 1–11, 2023.
- E. Garreffa et al., “Regional lymphadenopathy following COVID-19 vaccination : Literature review and considerations for patient management in breast cancer care,” Eur. J. Cancer, vol. 159, pp. 38–51, 2021. [CrossRef]
- T. W. Smith, “Intimate Relationships and Coronary Heart Disease : Implications for Risk , Prevention , and Patient Management,” Curr. Cardiol. Rep., pp. 761–774, 2022. [CrossRef]
- Y. Kumar, S. Gupta, R. Singla, and Y. Chen, “A Systematic Review of Artificial Intelligence Techniques in Cancer Prediction and Diagnosis,” Arch. Comput. Methods Eng., vol. 29, no. 4, pp. 2043–2070, 2022. [CrossRef]
- D. Nam, J. Chapiro, V. Paradis, T. P. Seraphin, and J. N. Kather, “Artificial intelligence in liver diseases : Improving diagnostics , prognostics and response prediction,” J. Hepatol., vol. 4, no. 4, p. 100443. [CrossRef]
- R. Sawhney, A. Malik, S. Sharma, and V. Narayan, “A comparative assessment of artificial intelligence models used for early prediction and evaluation of chronic kidney disease,” Decis. Anal. J., vol. 6, no. January, p. 100169, 2023. [CrossRef]
- “National Cancer Institute.”. Available online: https://www.cancer.gov/ (accessed on 2 December 2024).
- “American Heart Association, Classes of Heart Failure, American Heart Association.”. Available online: https://www.heart.org/en/health-topics/heart-failure/what-is-heart-failure/classes-of-heart-failure.
- “World Health Organization, Cardiovascular Diseases, WHO.”. Available online: https://www.who.int/health-topics/cardiovascular-diseases/#tab=tab_1.
- “World Health Organization, Diabetes, WHO,” 2024. Available online: https://www.who.int/news-room/fact-sheets/detail/diabetes.
- E. P. Onakpojeruo, M. T. Mustapha, D. U. Ozsahin, and I. Ozsahin, “A Comparative Analysis of the Novel Conditional Deep Convolutional Neural Network Model, Using Conditional Deep Convolutional Generative Adversarial Network-Generated Synthetic and Augmented Brain Tumor Datasets for Image Classification,” Brain Sci., vol. 14, no. 6, 2024. [CrossRef]
- E. P. Onakpojeruo, M. T. Mustapha, D. U. Ozsahin, and I. Ozsahin, “Enhanced MRI-based brain tumour classification with a novel Pix2pix generative adversarial network augmentation framework,” Brain Commun., vol. 6, no. 6, pp. 1–22, 2024. [CrossRef]
- Alanazi, L. Aldakhil, M. Aldhoayan, and B. Aldosari, “Machine Learning for Early Prediction of Sepsis in Intensive Care Unit (ICU) Patients,” Med., vol. 59, no. 7, pp. 1–9, 2023. [CrossRef]
- H. A. Ganatra, S. Q. Latifi, and O. Baloglu, “Pediatric Intensive Care Unit Length of Stay Prediction by Machine Learning,” Bioengineering, vol. 11, no. 10, pp. 1–11, 2024. [CrossRef]
- “Multicenter, Randomized Trial of a Bionic Pancreas in Type 1 Diabetes,” N. Engl. J. Med., vol. 387, no. 13, pp. 1161–1172, Sep. 2022. [CrossRef]
- M. D. Abràmoff, P. T. Lavin, M. Birch, N. Shah, and J. C. Folk, “Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices,” npj Digit. Med. 2018 11, vol. 1, no. 1, pp. 1–8, Aug. 2018. [CrossRef]
- M. M. Ahsan, S. A. Luna, and Z. Siddique, “Machine-Learning-Based Disease Diagnosis : A Comprehensive Review,” Healthcare, pp. 1–30, 2022.
- E. A. Alomari, “Unlocking the Potential: A Comprehensive Systematic Review of ChatGPT in Natural Language Processing Tasks,” C. - Comput. Model. Eng. Sci., vol. 141, no. 1, pp. 43–85, 2024. [CrossRef]
- L. Caruccio, S. Cirillo, G. Polese, G. Solimando, S. Sundaramurthy, and G. Tortora, “Can ChatGPT provide intelligent diagnoses? A comparative study between predictive models and ChatGPT to define a new medical diagnostic bot,” Expert Syst. Appl., vol. 235, no. July 2023, p. 121186, 2024. [CrossRef]
- M. Gollapalli et al., “A novel stacking ensemble for detecting three types of diabetes mellitus using a Saudi Arabian dataset : Pre-diabetes , T1DM , and T2DM,” Comput. Biol. Med., vol. 147, no. April, p. 105757, 2022. [CrossRef]
- H. F. Ahmad, H. Mukhtar, H. Alaqail, M. Seliaman, and A. Alhumam, “Investigating Health-Related Features and Their Impact on the Prediction of Diabetes Using Machine Learning,” Appl. Sci., 2021.
- H. Qteat and M. Awad, “Using Hybrid Model of Particle Swarm Optimization and Multi-Layer Perceptron Neural Networks for Classification of Diabetes Using Hybrid Model of Particle Swarm Optimization and Multi-Layer Perceptron Neural Networks for Classification of Diabetes,” no. June, pp. 10–22, 2021. [CrossRef]
- N. L. Fitriyani and M. Syafrudin, “Development of Disease Prediction Model Based on Ensemble Learning Approach for Diabetes and Hypertension,” Spec. Sect. DATA-ENABLED Intell. Digit. Heal. Receiv., vol. 7, 2019.
- saif abdulwahed Ameer Ali, mohammed alrubei, “DIABETES CLASSIFICATION BASED ON KNN,” IIUM Eng. Journal, vol. 21, no. 1, pp. 175–181, 2020.
- E. S. Almutairi and M. F. Abbod, “Machine Learning Methods for Diabetes Prevalence Classification in Saudi Arabia,” modelling, pp. 37–55, 2023.
- N. Alsulami, M. Almasre, S. Sarhan, and W. Alsaggaf, “Deep Learning Models for Type 2 Diabetes Detection in Saudi Arabia,” J. Pioneer. Med. Sci., vol. 13, no. 2, pp. 60–73, 2024.
- H. S. T. Khan, “Machine learning-based application for predicting risk of type 2 diabetes mellitus (t2dm) in saudi arabia: A retrospective cross-sectional study,” IEEE Access, vol. 8, 2020.
- T. R. Mahesh et al., “Blended Ensemble Learning Prediction Model for Strengthening Diagnosis and Treatment of Chronic Diabetes Disease,” Comput. Intell. Neurosci., vol. 2022, 2022. [CrossRef]
- R. Krishnamoorthi et al., “A Novel Diabetes Healthcare Disease Prediction Framework Using Machine Learning Techniques,” J. Healthc. Eng., vol. 2022, 2022. [CrossRef]
- R. Patil and S. Tamane, “A Comparative Analysis on the Evaluation of Classification Algorithms in the Prediction of Diabetes,” IJECE, vol. 8, no. 5, pp. 3966–3975, 2018. [CrossRef]
- Q. Wang, W. Cao, J. Guo, J. Ren, Y. Cheng, and D. N. Davis, “DMP _ MI : An Effective Diabetes Mellitus Classification Algorithm on Imbalanced Data With Missing Values,” vol. 7, 2019.
- N. N. R. Delshi Howsalya Devi, Anita Bai, “A novel hybrid approach for diagnosing diabetes mellitus using farthest first and support vector machine algorithms,” Obes. Med., vol. 17, 2020.
- Paramjot Kaur & Ramanpreet Kaur, “Comparative Analysis of Classification Techniques for Diagnosis of Diabetes,” 2020.
- R. D. Joshi and C. K. Dhakal, “Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches,” Int. J. Environ. Res. Public Heal. Artic., 2021.
- S Sivaranjani; S Ananya; J Aravinth; R Karthika, “Diabetes Prediction using Machine Learning Algorithms with Feature Selection and Dimensionality Reduction,” 2021.
- Nour Abdulhadi; Amjed Al-Mousa, “Diabetes Detection Using Machine Learning Classification Methods,” 2021.
- S. Kumari, D. Kumar, and M. Mittal, “International Journal of Cognitive Computing in Engineering An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier,” Int. J. Cogn. Comput. Eng., vol. 2, no. November 2020, pp. 40–46, 2021. [CrossRef]
- S. K. Kalagotla, S. V Gangashetty, and K. Giridhar, “A novel stacking technique for prediction of diabetes,” Comput. Biol. Med., vol. 135, no. February, p. 104554, 2021. [CrossRef]
- M. M. Bukhari, B. F. Alkhamees, S. Hussain, A. Gumaei, A. Assiri, and S. S. Ullah, “An Improved Artificial Neural Network Model for Effective Diabetes Prediction,” Complexity, vol. 2021, 2021. [CrossRef]
- M. S. Al Reshan et al., “An Innovative Ensemble Deep Learning Clinical Decision Support System for Diabetes Prediction,” IEEE Access, vol. 12, no. May, pp. 106193–106210, 2024. [CrossRef]
- “Type 2 Diabetes Dataset | IEEE DataPort.”. Available online: https://ieee-dataport.org/documents/type-2-diabetes-dataset (accessed on 14 February 2025).
- V. Dambra, M. Roccotelli, and M. P. Fanti, “Diabetic Disease Detection using Machine Learning Techniques,” 10th 2024 Int. Conf. Control. Decis. Inf. Technol. CoDIT 2024, pp. 1436–1441, 2024. [CrossRef]
- D. Shrestha, “Comparative Analysis of Machine Learning Algorithms for Heart Disease Prediction,” Adv. Transdiscipl. Eng., vol. 27, pp. 64–69, 2022. [CrossRef]
- K. M. Alfadli and A. O. Almagrabi, “Feature-Limited Prediction on the UCI Heart Disease Dataset,” Comput. Mater. Contin., vol. 74, no. 3, pp. 5871–5883, 2023. [CrossRef]
- Anderies, J. A. R. W. Tchin, P. H. Putro, Y. P. Darmawan, and A. A. S. Gunawan, “Prediction of Heart Disease UCI Dataset Using Machine Learning Algorithms,” Eng. Math. Comput. Sci. J., vol. 4, no. 3, pp. 87–93, 2022. [CrossRef]
- R. Bharti, A. Khamparia, M. Shabaz, G. Dhiman, S. Pande, and P. Singh, “Prediction of Heart Disease Using a Combination of Machine Learning and Deep Learning,” Comput. Intell. Neurosci., vol. 2021, 2021. [CrossRef]
- D. Asif, M. Bibi, and M. S. Arif, “Enhancing Heart Disease Prediction through Ensemble Learning Techniques with Hyperparameter Optimization,” Algorithms, pp. 1–17, 2023.
- Z. Noroozi, A. Orooji, and L. Erfannia, “Analyzing the impact of feature selection methods on machine learning algorithms for heart disease prediction,” Nature Publishing Group UK, 2023. [CrossRef]
- N. C. and S. Peddakrishna, “Enhancing Heart Disease Prediction Accuracy through Machine Learning Techniques and Optimization,” processes, vol. 11, no. 1210, 2023.
- E. Korial, I. I. Gorial, and A. J. Humaidi, “An Improved Ensemble-Based Cardiovascular Disease Detection System with Chi-Square Feature Selection,” Computers, vol. 13, no. 6, pp. 1–17, 2024. [CrossRef]
- “pandas 2.2.3,” NumFOCUS. Available online: https://pandas.pydata.org/.
- R. Harris et al., “Array programming with NumPy,” Nature, vol. 585, no. 7825, pp. 357–362, 2020. [CrossRef]
- T. F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, “Scikit-learn: Machine learning in python,” J. ofMachine Learn. Res., vol. 12, no. 9, pp. 2825–2830, 2011. [CrossRef]
- J. D. Hunter, “Matplotlib: A 2D graphics environment,” Comput. Sci. Eng., vol. 9, no. 3, pp. 90–95, 2007.
- “Pima Indians Diabetes Database,” National Institute of Diabetes and Digestive and Kidney Diseases, 2024. Available online: https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database.
- “UCI Heart Disease Dataset.”. Available online: https://archive.ics.uci.edu/dataset/45/heart+disease.
- F. Amiri, M. Rezaei Yousefi, C. Lucas, A. Shakery, and N. Yazdani, “Mutual information-based feature selection for intrusion detection systems,” J. Netw. Comput. Appl., vol. 34, no. 4, pp. 1184–1199, 2011. [CrossRef]
- Y. Cai, Z. Ding, B. Yang, Z. Peng, and W. Wang, “Zero-Shot Learning Through Cross-Modal Transfer Richard,” Phys. A Stat. Mech. its Appl., vol. 514, pp. 729–740, 2015.
- Y. Wang, Q. Yao, J. T. Kwok, and L. M. Ni, “Generalizing from a Few Examples: A Survey on Few-shot Learning,” ACM Comput. Surv., vol. 53, no. 3, pp. 1–34, 2020. [CrossRef]
- T. Kojima, S. S. Gu, M. Reid, Y. Matsuo, and Y. Iwasawa, “Large Language Models are Zero-Shot Reasoners,” Adv. Neural Inf. Process. Syst., vol. 35, no. NeurIPS, 2022.
- J. Wei et al., “Chain-of-Thought Prompting Elicits Reasoning in Large Language Models,” Adv. Neural Inf. Process. Syst., vol. 35, no. NeurIPS, pp. 1–43, 2022.
- Ferrara, “Should ChatGPT be biased? Challenges and risks of bias in large language models,” First Monday, vol. 28, no. 11, 2023. [CrossRef]
- N. Micheletti, R. Marchesi, N. I.-H. Kuo, S. Barbieri, G. Jurman, and V. Osmani, “Generative AI Mitigates Representation Bias Using Synthetic Health Data,” medRxiv, p. 2023.09.26.23296163, 2024, [Online]. Available online: http://medrxiv.org/content/early/2024/02/01/2023.09.26.23296163.abstract.
- G. Peikos, S. Symeonidis, P. Kasela, and G. Pasi, “Utilizing ChatGPT to Enhance Clinical Trial Enrollment,” 2023, [Online]. Available online: http://arxiv.org/abs/2306.02077.
- Kartchner, I. Al-Hussaini, O. Kronick, S. Ramalingam, and C. Mitchell, “Zero-Shot Information Extraction for Clinical Meta-Analysis using Large Language Models,” in The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks , 2023, pp. 396–405.
- M. Nishio et al., “Zero-shot classification of TNM staging for Japanese radiology report using ChatGPT at RR-TNM subtask of NTCIR-17 MedNLP-SC,” 2023. [CrossRef]



















| Feature name | Description | Data type |
|---|---|---|
| Pregnancies | Number of pregnancies | Integer |
| Glucose | Plasma glucose concentration | Integer |
| BloodPressure | Diastolic blood pressure | Integer |
| SkinThickness | Triceps skin fold thickness | Integer |
| Insulin | 2-Hour serum insulin | Integer |
| BMI | Body mass index | Float |
| DiabetesPedigreeFunction | A function that scores the likelihood of diabetes based on family history | Float |
| Age | Age of the patient | Integer |
| Outcome | Diabetes outcome (0 = No- T2DM , 1 = T2DM). | Integer |
| Feature name | Full Name | Description | Data Type |
|---|---|---|---|
| age | Age | The age of the patient in years. | Integer |
| sex | Sex | The sex of the patient, where 1 = male and 0 = female. | categorical |
| cp | Chest Pain Type | The type of chest pain, where 1 = typical angina, 2 = atypical angina, 3 = non-anginal pain, 4 = asymptomatic. | categorical |
| trestbps | Resting Blood Pressure | The resting blood pressure of the patient in mm Hg. | Integer |
| chol | Serum Cholesterol Level | The serum cholesterol level of the patient in mg/dl. | Integer |
| fbs | Fasting Blood Sugar | A binary feature (1 or 0), where 1 indicates the patient’s fasting blood sugar is greater than 120 mg/dl. | categorical |
| restecg | Resting Electrocardiographic Result | The electrocardiographic result at rest, where 0 = normal, 1= ST-T wave abnormality, 2= left ventricular. | categorical |
| thalach | Maximum Heart Rate Achieved | The maximum heart rate achieved by the patient during exercise. | Float |
| exang | Exercise Induced Angina | A binary feature (1 or 0), where 1 indicating the patient experienced exercise-induced angina. | categorical |
| oldpeak | ST Depression Induced by Exercise | The depression induced by exercise relative to rest. | Float |
| slope | Slope of the Peak Exercise ST Segment | The slope of the peak exercise ST segment, where 1 = upsloping, 2 = flat, 3 = downsloping. | categorical |
| ca | Number of Major Vessels | The number of major vessels (0-3) colored by fluoroscopy. Higher values indicate more significant coronary artery disease. | Integer |
| thal | Thalassemia | The type of thalassemia, where 3 = normal, 6 = fixed defect, 7 = reversible defect. | categorical |
| num | Diagnosis of Heart Disease | The outcome variable, where 0 = no heart disease, 1-4= presence of heart disease. | Integer |
| Learning method | Features selection method | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) | ||||
|---|---|---|---|---|---|---|---|---|---|
| GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | ||
| Zero-shot | None | 45.83 | 75.0 | 39.18 | 62.75 | 100.00 | 69.78 | 56.30 | 66.08 |
| MIFS | 39.97 | 75.52 | 36.76 | 62.99 | 100.00 | 72.39 | 53.76 | 67.36 | |
| CBFS | 45.96 | 75.39 | 39.24 | 63.12 | 100.00 | 70.90 | 56.36 | 66.78 | |
| 3-shot | None | 66.53 | 76.69 | 51.22 | 67.45 | 85.82 | 64.18 | 64.16 | 65.77 |
| MIFS | 66.92 | 76.30 | 51.55 | 67.48 | 86.94 | 61.94 | 64.72 | 61.94 | |
| CBFS | 67.84 | 76.56 | 52.42 | 67.46 | 84.70 | 63.43 | 64.76 | 65.38 | |
| 5-shot | None | 67.83 | 76.69 | 52.44 | 68.94 | 84.33 | 60.45 | 64.66 | 64.41 |
| MIFS | 69.01 | 76.43 | 53.61 | 69.51 | 83.21 | 57.84 | 65.20 | 63.14 | |
| CBFS | 67.70 | 77.08 | 52.29 | 70.00 | 85.07 | 60.07 | 64.77 | 64.66 | |
| 10-shot | None | 69.53 | 75.78 | 54.29 | 64.64 | 80.22 | 67.54 | 64.76 | 66.06 |
| MIFS | 69.14 | 74.87 | 53.90 | 62.46 | 79.85 | 70.15 | 64.36 | 66.08 | |
| CBFS | 68.75 | 73.44 | 53.57 | 60.46 | 78.36 | 69.03 | 63.64 | 64.46 | |
| Learning method | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| Zero-shot CoT | 70.96 | 55.67 | 82.46 | 66.47 |
| Knowledge-Enhanced Zero-shot CoT | 73.04 | 59.16 | 73.51 | 65.56 |
| 3-shot CoT | 72.53 | 58.82 | 70.90 | 64.30 |
| Knowledge-Enhanced 3-shot CoT | 73.69 | 60.31 | 72.01 | 65.65 |
| Learning method | Features selection method | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) | ||||
|---|---|---|---|---|---|---|---|---|---|
| GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | ||
| Zero-shot | None | 41.55 | 74.7 | 36.91 | 60.85 | 100.00 | 72.95 | 53.92 | 66.36 |
| CBFS | 44.75 | 75.2 | 38.11 | 61.90 | 98.68 | 71.49 | 54.99 | 66.35 | |
| 3-shot | None | 61.55 | 74.85 | 46.90 | 60.58 | 94.15 | 75.73 | 62.62 | 67.32 |
| CBFS | 59.59 | 74.7 | 45.63 | 60.55 | 94.74 | 74.71 | 61.60 | 66.88 | |
| 5-shot | None | 66.64 | 75.85 | 50.71 | 63.00 | 88.89 | 71.20 | 64.58 | 66.85 |
| CBFS | 66.14 | 75.65 | 50.29 | 62.33 | 87.28 | 72.81 | 63.82 | 67.16 | |
| 10-shot | None | 66.7 | 75.14 | 50.77 | 61.67 | 86.55 | 72.22 | 64.00 | 66.53 |
| CBFS | 69.19 | 75.5 | 53.15 | 62.28 | 83.77 | 71.93 | 65.04 | 66.76 | |
| Learning method | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| Zero-shot CoT | 68.85 | 52.79 | 84.36 | 64.94 |
| Knowledge-Enhanced Zero-shot CoT | 72.3 | 57.34 | 74.27 | 64.71 |
| 3-shot CoT | 68.75 | 52.59 | 87.43 | 65.68 |
| Knowledge-Enhanced 3-shot CoT | 72.35 | 57.05 | 77.49 | 65.72 |
| Prompts | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) | ||||
|---|---|---|---|---|---|---|---|---|
| GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | |
| 1 | 52.19 | 75.08 | 49.09 | 69.09 | 98.54 | 83.21 | 65.53 | 75.50 |
| 2 | 55.22 | 73.74 | 50.76 | 66.12 | 97.81 | 88.32 | 66.83 | 75.62 |
| 3 | 58.92 | 78.11 | 53.09 | 72.22 | 94.16 | 85.40 | 67.89 | 78.26 |
| 4 | 55.22 | 80.47 | 50.77 | 76.51 | 96.35 | 83.21 | 66.50 | 79.72 |
| Learning method | Features selection method | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) | ||||
|---|---|---|---|---|---|---|---|---|---|
| GPT-4o--mini | GPT-4o | GPT-4omini | GPT-4o | GPT-4o-mini | GPT-4o | GPT-4o-mini | GPT-4o | ||
| Zero-shot | None | 55.22 | 80.47 | 50.77 | 76.51 | 96.35 | 83.21 | 66.50 | 79.72 |
| MIFS | 64.65 | 81.14 | 57.69 | 77.18 | 87.59 | 83.94 | 69.57 | 80.42 | |
| CBFS | 71.04 | 83.84 | 64.57 | 80.69 | 82.48 | 85.40 | 72.44 | 82.98 | |
| 3-shot | None | 69.36 | 83.50 | 64.38 | 81.88 | 75.18 | 82.48 | 69.36 | 82.18 |
| MIFS | 73.74 | 83.84 | 66.86 | 82.96 | 85.40 | 81.75 | 75.00 | 82.35 | |
| CBFS | 75.76 | 85.19 | 76.42 | 85.50 | 68.61 | 81.75 | 72.31 | 83.58 | |
| 5-shot | None | 74.07 | 82.49 | 69.48 | 81.02 | 78.10 | 81.02 | 73.54 | 81.02 |
| MIFS | 74.41 | 82.83 | 69.43 | 86.44 | 69.43 | 74.45 | 74.15 | 80.00 | |
| CBFS | 78.45 | 85.52 | 79.67 | 87.30 | 71.53 | 80.29 | 75.38 | 83.65 | |
| 10-shot | None | 74.07 | 82.83 | 69.23 | 84.68 | 78.83 | 76.64 | 73.72 | 80.46 |
| MIFS | 72.73 | 82.15 | 66.28 | 85.59 | 83.21 | 73.72 | 73.79 | 79.22 | |
| CBFS | 77.44 | 84.18 | 76.12 | 85.71 | 74.45 | 78.83 | 75.28 | 82.13 | |
| Learning method | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| Zero-shot CoT | 75.76 | 67.57 | 91.24 | 77.64 |
| Knowledge-Enhanced zero-shot CoT | 65.66 | 57.58 | 97.08 | 72.28 |
| 3-shot CoT | 76.77 | 70.73 | 84.67 | 77.08 |
| Knowledge-Enhanced 3-shot CoT | 72.05% | 64.21 | 89.05 | 74.62 |
| Dataset | Ref. | Approach | |
|---|---|---|---|
| Pima Indian Diabetes Dataset | Bukhari et al. [43] | Deep learning (ABP-SCGNN) | |
| Alreshan et al. [44] | Deep learning (stack-ANN) | ||
| Frankfurt Hospital Diabetes Dataset | Alreshan et al. [44] | Deep learning (stack-ANN) | |
| Dambra et al. [46] | Random forest | ||
| Heart Disease | Cleveland UCI Heart Disease Dataset | Asif et al. [51] | Extra tree classifier |
| Bharti et al. [50] | Deep learning (ANN) | ||
| Chandrasekhar and Peddakrishna [53] | Soft voting ensemble method with feature scaling to address outliers |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).