Submitted:
12 January 2024
Posted:
12 January 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Studies
3. Materials and Methods
3.1. Model Architecture
3.2. Data Transformation and Preparation
3.3. Model Training Dynamics
3.4. Performance Evaluation and Metrics
4. Model Evaluation and Results
4.1. Analysis of the Confusion Matrix and Model's Predictive Power
| Actual vs Predicted | Non-Diabetic (0) | Diabetic (1) |
|---|---|---|
| Non-Diabetic (0) | 2424 | 0 |
| Diabetic (1) | 16 | 205 |
- a)
- Specificity (100%): Specificity measures the model's accuracy in identifying non-diabetic cases. It reaches a perfect 100%, indicating that the model correctly identifies all non-diabetic cases. The absence of false positives underscores the model's precision and accuracy. In practical terms, this means that no individual without diabetes is incorrectly diagnosed as diabetic.
- b)
- Precision (100%): Precision assesses the model's accuracy in predicting diabetic cases, with a rate of 100%. This exceptional precision minimizes the chances of false diabetic diagnoses. When the model predicts a positive case (diabetes), it is incredibly accurate, ensuring that individuals identified as diabetic are highly likely to have the condition.
- c)
- Recall (Sensitivity) (100% for Positive Class, 99.34% for Negative Class): Recall evaluates the model's ability to detect actual diabetic cases. High recall rates ensure comprehensive patient care and minimize missed diagnoses. Specifically, for the positive class (diabetic cases), the recall rate is 100%, meaning the model correctly identifies all diabetic individuals. For the negative class (non-diabetic cases), the recall rate is 99.34%, indicating that the model successfully identifies the vast majority of non-diabetic individuals.
- d)
- F1 Score (96.24%): The F1 Score harmonizes precision and recall, signifying a strong balance between identifying diabetic cases accurately and minimizing false positives. This balanced metric is particularly important in medical diagnostics, where both false positives and false negatives can have significant consequences. The F1 Score of 96.24% demonstrates the model's ability to achieve both high precision and recall simultaneously.
- e)
- Accuracy (99.40%): The high accuracy rate reflects the model's reliability in disease classification. An accuracy of 99.40% means that the model correctly classifies nearly all cases (both diabetic and non-diabetic), making it an effective tool for diabetes prediction.
4.2. Remaining Performance Metrics
- f)
- AUC (94.51%): The ROC (Receiver Operating Characteristic) curve, as depicted in Figure 2, illustrates the model's ability to differentiate between diabetic and non-diabetic classes across various thresholds. The high AUC value of 94.51% signifies superior discriminatory power, a vital characteristic for accurate classification in medical diagnostics. A high AUC value means that the model is excellent at distinguishing between individuals with diabetes and those without.
4.3. Interpretation and Clinical Relevance
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- B. Shickel, P. J. Tighe, A. Bihorac, and P. Rashidi, “Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis,” IEEE J. Biomed. Health Inform., vol. 22, no. 5, pp. 1589–1604, 2018. [CrossRef]
- E. Choi, A. Schuetz, W. F. Stewart, and J. Sun, “Doctor AI: Predicting Clinical Events via Recurrent Neural Networks,” in Machine Learning for Healthcare Conference, 2016, pp. 301–318.
- Al Sadi, K., & Balachandran, W. (2023). Prediction Model of Type 2 Diabetes Mellitus for Oman Prediabetes Patients Using Artificial Neural Network and Six Machine Learning Classifiers. Applied Sciences, 13(4), 2344. https://www.mdpi.com/2076-3417/13/4/2344. [CrossRef]
- Al Sadi, K., & Balachandran, W. (2023). Revolutionizing Early Disease Detection: A High-Accuracy 4D CNN Model for Type 2 Diabetes Screening in Oman. Bioengineering, 10(12), 1420. https://www.mdpi.com/2306-5354/10/12/1420. [CrossRef]
- Massaro et al, "LSTM DSS Automatism and Dataset Optimization for Diabetes Prediction," Applied Sciences, vol. 9, no. 17, pp. 3532, 2019. [CrossRef]
- M. Rahman, D. Islam, R. J. Mukti, and I. Saha, "A deep learning approach based on convolutional LSTM for detecting diabetes," Computational Biology and Chemistry, vol. 88, 107329, 2020. [CrossRef]
- P. B. K. Chowdary and R. Udaya, "An Effective Approach for Detecting Diabetes using Deep Learning Techniques based on Convolutional LSTM Networks," International Journal of Advanced Computer Science & Applications, vol. 12, (4), 2021. [CrossRef]
- E. M. S. Rochman, H. Suprajitno, A. Rachmad, R. Nindyasari, and F. H. Rachman, "Comparison of LSTM and GRU in Predicting the Number of Diabetic Patients," in 2022 IEEE 8th Information Technology International Seminar (ITIS), 2022, pp. 145-149. [CrossRef]
- Y. Yang, X. Zheng, and C. Ji, "Disease prediction model based on bilstm and attention mechanism," in 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2019, pp. 1141-1148. [CrossRef]
- S. A. Alex, N. Z. Jhanjhi, M. Humayun, A. O. Ibrahim, and A. W. Abulfaraj, "Deep LSTM Model for Diabetes Prediction with Class Balancing by SMOTE," Electronics, vol. 11, no. 17, pp. 2737, 2022. [CrossRef]
- S. Arora, S. Kumar, and P. Kumar, "Implementation of LSTM for Prediction of Diabetes using CGM," in 2021 10th International Conference on System Modelling & Advancement in Research Trends (SMART), 2021, pp. 718-722. [CrossRef]
- U. M. Butt, S. Letchmunan, M. Ali, F. H. Hassan, A. Baqir, and H. H. R. Sherazi, "Research Article Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications," 2021. [CrossRef]
- F. Iacono, L. Magni, and C. Toffanin, "Personalized LSTM models for glucose prediction in Type 1 diabetes subjects," in 2022 30th Mediterranean Conference on Control and Automation (MED), 2022, pp. 324-329. [CrossRef]
- P. N. Srinivasu et al, "Using Recurrent Neural Networks for Predicting Type-2 Diabetes from Genomic and Tabular Data," Diagnostics (Basel), vol. 12, (12), pp. 3067, 2022. [CrossRef]
- S. Jaiswal and P. Gupta, "Diabetes Prediction Using Bi-directional Long Short-Term Memory," SN Computer Science, vol. 4, no. 4, pp. 373, 2023. [CrossRef]
- S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
- R. Pascanu, T. Mikolov, and Y. Bengio, "On the difficulty of training recurrent neural networks," in International Conference on Machine Learning, 2013, pp. 1310-1318. [CrossRef]
- J. L. Ba, J. R. Kiros, and G. E. Hinton, "Layer normalization," arXiv preprint arXiv:1607.06450, 2016. [CrossRef]
- S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
- J. L. Ba, J. R. Kiros, and G. E. Hinton, "Layer Normalization," arXiv preprint arXiv:1607.06450, 2016. [Online]. Available: https://arxiv.org/abs/1607.06450. [CrossRef]
- P. Kingma and J. Ba, "Adam: A Method for Stochastic Optimization," arXiv preprint arXiv:1412.6980, 2014. [Online]. Available: https://arxiv.org/abs/1412.6980. [CrossRef]


| Model Description |
Precision | Recall (Positive Class) |
Recall (Negative Class) |
Accuracy | AUC | Sensitivity | Specificity | F1 Score | Additional Notes |
|---|---|---|---|---|---|---|---|---|---|
| Conv-LSTM [7] | Not specified | Not specified | Not specified | 97.26% | N/A | Not specified | Not specified | Not specified | Used Pima Indians Diabetes Database |
| LSTM-AR [15] | 75.73% | 83.66%. | 49.38%. | 71.79% | N/A | 83.66% | 49.38% | Not specified | Implemented on ERP platform |
| LSTM vs GRU [8] | Not specified | Not specified | Not specified | GRU better | N/A | Not specified | Not specified | Not specified | RMSE used for comparison |
| LSTM and GRU [13] | Not specified | Not specified | Not specified | Not specified | Sensitivity, Specificity, F1-score, MCC | Not specified | Not specified | Not specified | Genomic data used for prediction |
| BiLSTM with Attention [9] | Higher than traditional | Not specified | Not specified | Not specified | Precision and Recall | Not specified | Not specified | Not specified | Utilized EHRs for prediction |
| SMOTE-based Deep LSTM [10] | Not specified | Not specified | Not specified | 99.64% | N/A | Not specified | Not specified | Not specified | Employed SMOTE for class imbalance |
| LSTM for CGM [11] | Not specified | Not specified | Not specified | Not specified | Average RMSE: 4.02 | Not specified | Not specified | Not specified | Predicted blood glucose trends |
| BLSTM [15] | Not specified | 96% | 91% | 94% | Not specified | 91% | 93% | Sensitivity emphasized in the study | |
| 7-layer LSTM (this study) | 100% | 100% | 99.34% | 99.40% | 94.51% | 100% | 100% | 96.24% | High accuracy and reliability |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).