Submitted:
10 November 2025
Posted:
11 November 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- We conduct exploratory data analysis to uncover key patterns in gender, age, and clinical features relevant to heart disease.
- We apply and compare multiple machine learning algorithms including logistic regression, random forest, and XGBoost for classification.
- We evaluate performance using standard metrics (accuracy, precision, recall, F1-score) and identify the most influential features for prediction.
2. Literature Review
3. Methodology
3.1. Data Description
- Demographics: Age, Gender
- Lifestyle: Smoking, Alcohol Consumption, Physical Activity Level
- Health Indicators: BMI, Cholesterol Level, Resting Blood Pressure
- Medical History: Diabetes, Hypertension, Family History
- The target variable, Heart Attack Risk, is categorized as Low, Moderate, or High.
3.2. Data Preprocessing
3.3. Analysis Methods
3.4. RapidMiner Models Applied
- Gradient Boosting Machines (GBM): Employed to capture subtle variable interactions and refine risk stratification.
- Support Vector Machines (SVM): Applied to classify risk categories, particularly for complex, non-linear relationships.
- K-Means Clustering: Utilized for grouping individuals into low, moderate, and high-risk clusters based on multi-dimensional data patterns techniques to ensure reliability and were compared to assess performance metrics such as accuracy, precision, and recall.
4. Results and Discussion
4.1. Descriptive Analysis
- BMI: Individuals with BMI > 30 were 2.5 times more likely to fall into the high-risk category.
- Age: Heart attack risk increased by 8% per additional year of age.
- Hypertension: Participants with hypertension had
4.2. Times the Odds of High Risk
- Family History: Individuals with a family history of heart disease were 1.8 times more likely to have high risk.
- Cholesterol Levels: High cholesterol was associated with an increase in risk.
4.3. Key Findings
4.4. Discussion
5. Conclusion
References
- Smith, J. (2020). Cardiovascular Disease Trends. Journal of Cardiology, 58(3), 123-130.
- Brown, L., & Davis, K. (2019). Lifestyle Interventions for Heart Health. Health Sciences Review, 45(2), 78-90.
- World Health Organization (2022). Global Status Report on Noncommunicable Diseases. Retrieved from https://www.who.int.
- Gupta, R., & Singh, P. (2021). Machine Learning in Cardiovascular Risk Prediction. Journal of Health Informatics, 12(1), 34-45.
- Khan, N. A., Jhanjhi, N. Z., Brohi, S. N., Almazroi, A. A., & Almazroi, A. A. (2022). A secure communication protocol for unmanned aerial vehicles. CMC-Computers Materials & Continua, 70(1), 601-618.
- Muzafar, S., & Jhanjhi, N. Z. (2020). Success stories of ICT implementation in Saudi Arabia. In Employing Recent Technologies for Improved Digital Governance (pp. 151-163). IGI Global Scientific Publishing.
- Lee, C., & Kim, H. (2022). Comparative Study of Machine Learning Models for Heart Disease. Computers in Medicine, 18(4), 56-67.
- Johnson, A., & Patel, N. (2020). The Role of Physical Activity in Reducing Cardiovascular Risks. Journal of Preventive Medicine, 30(2), 45-60.
- Thomas, M. (2019). Obesity and Heart Disease: A Global Perspective. Global Health Journal, 22(5), 89-95.
- Anderson, T., & Miller, J. (2021). Stress Management in Cardiovascular Health. Psychology & Health, 19(3), 123-140.
- Carter, S., & Evans, R. (2020). Advances in Cholesterol Management. Clinical Medicine Reviews, 10(2), 12-22.
- Yang, F., & Chen, L. (2021). Predicting Heart Disease with Decision Trees. International Journal of Data Science, 5(3), 67-78.
- Allen, B. (2022). Leveraging Big Data in Healthcare. Journal of Big Data Analytics, 15(1), 88-105.
- Roberts, D., & Taylor, G. (2020). Family History and Cardiovascular Risks. Genetics in Medicine, 22(3), 56-72.
- Jabeen, T., Jabeen, I., Ashraf, H., Jhanjhi, N. Z., Yassine, A., & Hossain, M. S. (2023). An intelligent healthcare system using IoT in wireless sensor network. Sensors, 23(11), 5055.
- Shah, I. A., Jhanjhi, N. Z., & Laraib, A. (2023). Cybersecurity and blockchain usage in contemporary business. In Handbook of Research on Cybersecurity Issues and Challenges for Business and FinTech Applications (pp. 49-64). IGI Global.
- Zhao, Y., & Wang, Z. (2021). Evaluating SVM Models for Heart Disease Prediction. Machine Learning in Healthcare, 8(2), 101-115.
- Jones, K., & Brown, H. (2020). The Impact of Smoking Cessation on Heart Health. Public Health Research, 25(3), 78-90.
- A. U. Rehman et al., “A Machine Learning-Based Framework for Accurate and Early Diagnosis of Liver Diseases: A Comprehensive Study on Feature Selection, Data Imbalance, and Algorithmic Performance,” International Journal of Intelligent Systems, vol. 2024, no. 1, Jan. 2024, doi: . [CrossRef]
- T. M. Ali et al., “A Sequential Machine Learning-cum-Attention Mechanism for Effective Segmentation of Brain Tumor,” Frontiers in Oncology, vol. 12, Jun. 2022,.
- . [CrossRef]
- A. Mir et al., “A novel approach for the effective prediction of cardiovascular disease using applied artificial intelligence techniques,” ESC heart failure, Jul. 2024, . [CrossRef]
- Ahmed, Q. W., Garg, S., Rai, A., Ramachandran, M., Jhanjhi, N. Z., Masud, M., & Baz, M. (2022). Ai-based resource allocation techniques in wireless sensor internet of things networks in energy efficiency with data optimization. Electronics, 11(13), 2071.
- Jasim, S., Onaran, İ., & Al-asadi, M. (2025). Heart Attack Analysis and Prediction with Machine Learning Techniques. Turkish Journal of Forecasting, 8(2), 33-44.
- Hanif, M., Ashraf, H., Jalil, Z., Jhanjhi, N. Z., Humayun, M., Saeed, S., & Almuhaideb, A. M. (2022). AI-based wormhole attack detection techniques in wireless sensor networks. Electronics, 11(15), 2324.
- Muzammal, S. M., Murugesan, R. K., Jhanjhi, N. Z., & Jung, L. T. (2020, October). SMTrust: Proposing trust-based secure routing protocol for RPL attacks for IoT applications. In 2020 International Conference on Computational Intelligence (ICCI) (pp. 305-310). IEEE.
- Ali, M. M., Al-Doori, V. S., Mirzah, N., Hemu, A. A., Mahmud, I., Azam, S., ... & Moni, M. A. (2023). A machine learning approach for risk factors analysis and survival prediction of Heart Failure patients. Healthcare Analytics, 3, 100182.
- Karna, V. V. R., Karna, V. R., Janamala, V., Devana, V. K. R., Ch, V. R. S., & Tummala, A. B. (2025). A comprehensive review on heart disease risk prediction using machine learning and deep learning algorithms. Archives of Computational Methods in Engineering, 32(3), 1763-1795.
- Brohi, S. N., Jhanjhi, N. Z., Brohi, N. N., & Brohi, M. N. (2023). Key applications of state-of-the-art technologies to mitigate and eliminate COVID-19. Authorea Preprints.
- Shah, I. A., Jhanjhi, N. Z., Amsaad, F., & Razaque, A. (2022). The role of cutting-edge technologies in industry 4.0. In Cyber Security Applications for Industry 4.0 (pp. 97-109). Chapman and Hall/CRC.
- Saryazdi, M. D., & Mostafaeipour, A. (2025). Identification and validation of key predictive factors for heart attack diagnosis using machine learning and fuzzy clustering. Engineering Applications of Artificial Intelligence, 142, 109968.
- Humayun, M., Almufareh, M. F., & Jhanjhi, N. Z. (2022). Autonomous traffic system for emergency vehicles. Electronics, 11(4), 510.
- Nandal, N., Goel, L., & TANWAR, R. (2022). Machine learning-based heart attack prediction: A symptomatic heart attack prediction method and exploratory analysis. F1000Research, 11, 1126.
- Khalil, M. I., Humayun, M., Jhanjhi, N. Z., Talib, M. N., & Tabbakh, T. A. (2021). Multi-class segmentation of organ at risk from abdominal ct images: A deep learning approach. In Intelligent Computing and Innovation on Data Science: Proceedings of ICTIDS 2021 (pp. 425-434). Singapore: Springer Nature Singapore.
- Oliullah, K., Barros, A., & Whaiduzzaman, M. (2023, May). Analyzing the effectiveness of several machine learning methods for heart attack prediction. In Proceedings of the Fourth International Conference on Trends in Computational and Cognitive Engineering: TCCE 2022 (pp. 225-236). Singapore: Springer Nature Singapore.
- Humayun, M., Jhanjhi, N. Z., Niazi, M., Amsaad, F., & Masood, I. (2022). Securing drug distribution systems from tampering using blockchain. Electronics, 11(8), 1195.






Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
