Submitted:
26 May 2026
Posted:
27 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methods
2.1. Research Design and Methodology Overview
2.2. Dataset

2.3. Pre-Processing Pipeline
2.3.1. Feature Selection
2.3.2. Categorical Encoding
2.3.3. Outlier Detection and Treatment
2.3.4. Feature Normalisation
2.4. Classification Algorithms
2.4.1. k-Nearest Neighbour (k-NN)
2.4.2. Decision Tree (CART)
2.4.3. Gaussian Naive Bayes (NB)
2.4.4. Multi-Layer Perceptron Neural Network (MLP-NN)
2.5. Validation Strategy
2.6. Evaluation Metrics
3. Results and Discussions
3.1. Descriptive Statistics
3.2. Feature Correlation Analysis
3.3. Classification Performance
3.4. Per-Class Performance of the Best Model
3.5. Effect of Neighbourhood Size on k-NN



3.6. Statistical Significance Testing
3.6.1. Friedman Test (Global Comparison)
| Model | Mean Acc. (%) | SD (%) | Avg. Rank | Rank |
|---|---|---|---|---|
| Neural Network | 90.93 | 0.74 | 1.200 | 1st |
| Decision Tree | 90.93 | 0.72 | 1.800 | 2nd |
| k-NN (k=11) | 86.82 | 0.89 | 3.700 | 3rd |
| k-NN (k=9) | 86.51 | 0.70 | 4.000 | 4th |
| k-NN (k=7) | 86.12 | 0.98 | 4.600 | 5th |
| k-NN (k=5) | 85.52 | 0.73 | 5.700 | 6th |
| k-NN (k=3) | 83.69 | 0.79 | 7.400 | 7th |
| Naive Bayes | 83.15 | 0.93 | 7.600 | 8th |
3.6.2. Wilcoxon Signed-Rank Post-hoc Test

3.6.3. Cohen's Kappa Coefficient
| Model | Accuracy (%) | Cohen's kappa | Interpretation |
|---|---|---|---|
| Neural Network | 91.70 | 0.8726 | Almost perfect |
| Decision Tree | 90.90 | 0.8602 | Almost perfect |
| k-NN (k=11) | 87.70 | 0.8108 | Almost perfect |
| k-NN (k=9) | 87.25 | 0.8041 | Almost perfect |
| k-NN (k=7) | 85.75 | 0.7807 | Substantial |
| k-NN (k=5) | 85.10 | 0.7712 | Substantial |
| k-NN (k=3) | 83.60 | 0.7485 | Substantial |
| Naive Bayes | 82.60 | 0.7338 | Substantial |
3.6.4. McNemar's Test
| Comparison | chi-squared | p-value | Decision |
|---|---|---|---|
| NN vs k-NN (k=3) | 93.917 | < 0.001*** | NN significantly better |
| NN vs k-NN (k=5) | 69.198 | < 0.001*** | NN significantly better |
| NN vs k-NN (k=7) | 59.760 | < 0.001*** | NN significantly better |
| NN vs k-NN (k=9) | 36.701 | < 0.001*** | NN significantly better |
| NN vs k-NN (k=11) | 32.170 | < 0.001*** | NN significantly better |
| NN vs Decision Tree | 1.264 | 0.261 (n.s.) | No significant difference |
| NN vs Naive Bayes | 108.480 | < 0.001*** | NN significantly better |
3.6.5. ROC-AUC Analysis
| Model | AUC (OvR macro) | Interpretation |
|---|---|---|
| Neural Network | 0.9858 | Outstanding |
| k-NN (k=11) | 0.9670 | Excellent |
| k-NN (k=9) | 0.9651 | Excellent |
| Naive Bayes | 0.9622 | Excellent |
| k-NN (k=7) | 0.9596 | Excellent |
| k-NN (k=5) | 0.9526 | Excellent |
| Decision Tree | 0.9489 | Excellent |
| k-NN (k=3) | 0.9351 | Excellent |

3.6.6. Consolidated Statistical Summary
| Model | Avg. Rank | kappa | AUC | McNemar vs NN | Verdict |
|---|---|---|---|---|---|
| Neural Network | 1.200 | 0.8726 | 0.9858 | — | Best / top cluster |
| Decision Tree | 1.800 | 0.8602 | 0.9489 | n.s. | Equiv. to NN |
| k-NN (k=11) | 3.700 | 0.8108 | 0.9670 | p<0.001 | Below top-2 |
| k-NN (k=9) | 4.000 | 0.8041 | 0.9651 | p<0.001 | Below top-2 |
| k-NN (k=7) | 4.600 | 0.7807 | 0.9596 | p<0.001 | Below top-2 |
| k-NN (k=5) | 5.700 | 0.7712 | 0.9526 | p<0.001 | Below top-2 |
| k-NN (k=3) | 7.400 | 0.7485 | 0.9351 | p<0.001 | Below top-2 |
| Naive Bayes | 7.600 | 0.7338 | 0.9622 | p<0.001 | Below top-2 |
3.7. Discussion
4. Conclusions
Acknowledgements
References
- Genitsaridi, I., Salpea, P., Salim, A., et al. (2026). 11th edition of the IDF Diabetes Atlas: Global, regional, and national diabetes prevalence estimates for 2024 and projections for 2050. The Lancet Diabetes & Endocrinology, 14(2), 149-156. [CrossRef]
- IDF (International Diabetes Federation). (2021). IDF Diabetes Atlas (10th ed.). International Diabetes Federation.
- ADA (American Diabetes Association). (2023). Standards of medical care in diabetes-2023. Diabetes Care, 46(Suppl 1), S1-S267. [CrossRef]
- Knowler, W. C., Barrett-Connor, E., Fowler, S. E., et al. (2002). Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin. New England Journal of Medicine, 346(6), 393-403.
- Tuomilehto, J., Lindstrom, J., Eriksson, J. G., et al. (2001). Prevention of type 2 diabetes mellitus by changes in lifestyle among subjects with impaired glucose tolerance. New England Journal of Medicine, 344(18), 1343-1350.
- Swapna, G., Karthik, B., Vinayakumar, R., & Soman, K. P. (2022). Diabetes mellitus disease prediction using machine learning classifiers with oversampling and feature augmentation. Advances in Human-Computer Interaction, 2022, Article 9220560. [CrossRef]
- Iparraguirre-Villanueva, O., Espinola-Linares, K., Flores-Castaneda, R. O., & Cabanillas-Carbonell, M. (2023). Application of machine learning models for early detection and accurate classification of type 2 diabetes. Diagnostics, 13(14), 2383. [CrossRef]
- Al Sadi, K., & Balachandran, W. (2023). Prediction model of type 2 diabetes mellitus for Oman prediabetes patients using artificial neural network and six machine learning classifiers. Applied Sciences, 13(4), 2344. [CrossRef]
- Chen, R., Wang, S., Wang, S., Li, Y., Yang, Y., Liu, M., & Zheng, X. (2023). Comparative study on risk prediction model of type 2 diabetes based on machine learning theory: A cross-sectional study. BMJ Open, 13(8), e069018. [CrossRef]
- Ordonez-Guillen, N. E., Gonzalez-Compean, J. L., Lopez-Arevalo, I., Contreras-Murillo, M., & Aldana-Bobadilla, E. (2023). Machine learning based study for the classification of type 2 diabetes mellitus subtypes. BioData Mining, 16(1), 24. [CrossRef]
- Rashid, J., Batool, S., Kim, J., Nisar, M. W., Hussain, A., Juneja, S., & Kushwaha, R. (2022). An augmented artificial intelligence approach for chronic diseases prediction. Frontiers in Public Health, 10, 860396. [CrossRef]
- Datta, D., Bhattacharya, M., Rajesh, S. S., Shynu, T., Regin, R., & Priscila, S. S. (2023). Development of a predictive model of diabetic using supervised machine learning classification algorithm of ensemble voting. International Journal of Bioinformatics Research and Applications, 19(3), 151-169.
- Cover, T., & Hart, P. (1967). Nearest neighbor pattern classification. IEEE Transactions on Information Theory, 13(1), 21-27.
- Breiman, L., Friedman, J., Stone, C. J., & Olshen, R. A. (1984). Classification and Regression Trees. Wadsworth.
- Russell, S., & Norvig, P. (2020). Artificial Intelligence: A Modern Approach (4th ed.). Pearson.
- LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436-444.
- Kingma, D. P., & Ba, J. (2015). Adam: A method for stochastic optimization. In Proceedings of ICLR 2015. arXiv:1412.6980.
- Demsar, J. (2006). Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research, 7, 1-30.
- Rainio, O., Teuho, J., & Klén, R. (2024). Evaluation metrics and statistical tests for machine learning. Scientific Reports, 14, 6086. [CrossRef]
- Bellman, R. (1961). Adaptive Control Processes: A Guided Tour. Princeton University Press.
- Pedregosa, F., Varoquaux, G., Gramfort, A., et al. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825-2830.
- IDF (International Diabetes Federation). (2025). IDF Diabetes Atlas (11th ed.). International Diabetes Federation. https://diabetesatlas.org.
- Landis, J. R., & Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33(1), 159-174.
- Yuhefizar, R. Watrianthos, and R. Komalasari, "An LDA Analysis for Topic Modeling of the RESTI Journal," in 2024 2nd International Symposium on Information Technology and Digital Innovation (ISITDI), IEEE, Jul. 2024, pp. 46-52. [CrossRef]
- Ronal Watrianthos and Y. Yuhefizar, "Exploring Research Trends and Impact: A Bibliometric Analysis of RESTI Journal from 2018 to 2022," Jurnal RESTI, vol. 7, no. 4, pp. 970-981, Aug. 2023. [CrossRef]
- M. A. Rabbani, "Type 2 Diabetes Risk Dataset," Kaggle, 2026. [Online]. Available: https://www.kaggle.com/datasets/mubasharahmedrabbani/type-2-diabetes-risk-dataset. [Accessed: May 2026].
| Feature | Min | Max | Mean | SD | Q1 | Median | Q3 |
|---|---|---|---|---|---|---|---|
| Age (years) | 19 | 90 | 53.86 | 15.75 | 41.00 | 54.00 | 66.00 |
| BMI (kg/m²) | 11.19 | 50.45 | 26.04 | 6.06 | 22.09 | 25.70 | 29.78 |
| Fasting glucose (mg/dL) | 70 | 250 | 111.86 | 38.07 | 83.31 | 100.81 | 131.73 |
| HbA1c (%) | 1.99 | 12.00 | 5.46 | 1.36 | 4.50 | 5.06 | 6.15 |
| Insulin (μU/mL) | 2.00 | 40.00 | 24.27 | 10.44 | 15.54 | 23.44 | 33.64 |
| HOMA-IR | 0.50 | 5.00 | 4.35 | 1.04 | 3.81 | 5.00 | 5.00 |
| Triglycerides (mg/dL) | 84.59 | 341.12 | 206.16 | 28.36 | 186.91 | 206.02 | 225.21 |
| HDL cholesterol (mg/dL) | 14.01 | 71.41 | 36.11 | 7.67 | 30.85 | 36.10 | 41.31 |
| LDL cholesterol (mg/dL) | 52.77 | 219.62 | 126.07 | 16.14 | 115.17 | 126.02 | 136.96 |
| Systolic BP (mmHg) | 80 | 200 | 136.09 | 16.18 | 125.15 | 136.05 | 146.98 |
| Feature | r | Clinical Significance |
|---|---|---|
| Fasting plasma glucose | +0.67 | Primary ADA diagnostic criterion |
| BMI | +0.66 | Principal modifiable risk factor |
| HOMA-IR | +0.64 | Quantifies insulin resistance |
| HbA1c | +0.59 | ADA diagnostic threshold >= 6.5% |
| Insulin | +0.48 | Compensatory hyperinsulinaemia |
| Age | +0.40 | Progressive decline in insulin sensitivity |
| Triglycerides | +0.29 | Dyslipidaemia of metabolic syndrome |
| LDL cholesterol | +0.26 | Cardiovascular risk co-factor |
| Systolic BP | +0.26 | Hypertension co-pathway with T2DM |
| HDL cholesterol | -0.27 | Inversely associated with insulin resistance |
| Model | 5-Fold CV | 10-Fold CV | 80/20 | 90/10 | Mean |
|---|---|---|---|---|---|
| MLP Neural Network | 90.91 | 91.02 | 91.70 | 91.90 | 91.38 |
| Decision Tree | 91.01 | 91.07 | 90.90 | 91.40 | 91.10 |
| k-NN (k=11) | 86.93 | 86.64 | 87.70 | 87.10 | 87.09 |
| k-NN (k=9) | 86.40 | 86.45 | 87.25 | 86.60 | 86.67 |
| k-NN (k=7) | 86.08 | 86.02 | 85.75 | 85.50 | 85.84 |
| k-NN (k=5) | 85.54 | 85.52 | 85.10 | 84.20 | 85.09 |
| k-NN (k=3) | 84.01 | 84.00 | 83.60 | 83.30 | 83.73 |
| Naive Bayes | 83.20 | 83.20 | 82.60 | 81.80 | 82.70 |
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| Normal (Stage 0) | 0.934 | 0.937 | 0.935 | 555 |
| Pre-diabetes (Stage 2) | 0.852 | 0.861 | 0.856 | 574 |
| Diabetes (Stage 3) | 0.950 | 0.941 | 0.946 | 871 |
| Macro average | 0.912 | 0.913 | 0.912 | 2,000 |
| Weighted average | 0.917 | 0.917 | 0.917 | 2,000 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).