Submitted:
23 November 2025
Posted:
01 December 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Background and Related Work
3. Methodology
3.1. Dataset Description
- A synthetic hospital dataset of 5 million patient records.
- A real-world EHR dataset obtained from an open-source medical repository, including diagnoses, lab tests, and medication histories.
3.2. Algorithms Evaluated
- Apriori: Candidate generation-based algorithm.
- FP-Growth: Tree-based method using frequent pattern trees.
- ∙ ECLAT: Vertical format mining through itemset intersection.
3.3. Evaluation Metrics
- Scalability: Runtime performance as dataset size increases.
- Memory Efficiency: Peak memory usage during execution.
- Accuracy: Ability to identify clinically valid patterns (measured with support and confidence).
- Processing Overhead: Number of database scans and intermediate structures.
4. Results
4.1. Scalability
- FP-Growth scaled efficiently to millions of transactions with minimal overhead.
- Apriori showed exponential growth in runtime, becoming impractical for datasets larger than 500,000 records.
- ECLAT performed moderately well but experienced slowdowns with dense medical datasets.
4.2. Memory Consumption
- Apriori consumed the most memory due to extensive candidate generation.
- FP-Growth had the most balanced memory usage.
- ECLAT consumed minimal memory in sparse datasets but struggled in dense ones.
4.3. Accuracy
- FP-Growth produced the most patterns with high confidence and lift.
- ECLAT performed best in detecting patterns in sparse datasets (e.g., rare diseases).
- Apriori produced fewer but highly precise associations.
4.4. Pattern Quality
5. Discussion
6. Conclusion
References
- Jhanjhi, N. Z. (2024, November). Comparative analysis of frequent pattern mining algorithms on healthcare data. In 2024 IEEE 9th International Conference on Engineering Technologies and Applied Sciences (ICETAS) (pp. 1-10). IEEE.
- Mishra, M. V. (2025). AI-Driven Personalization: Generative Models in E-Commerce. International Journal of Advanced Research in Science, Communication and Technology, 110-116.
- Karayilmaz, C.; Ozker, A. N. Kamusal Nitelikli Ozel Mallarin Sunumunda Akilli Sehirler Olgusu: Akilli Sehir Uygulamalarinda Küresel Degisimler. Karamanoglu Mehmetbey Universitesi Sosyal ve Ekonomik Arastırmalar Dergisi 2020, 22(38), 82–100. [Google Scholar]
- Mishra, M. V. Data Integration and Feature Engineering for Supply Chain Management: Enhancing Decision Making through Unified Data Processing. International Journal of Advanced Research in Science, Communication and Technology 2025, 5(2), 521–530. [Google Scholar] [CrossRef]
- Mishra, M.; Achanta, P. R. D.; Grover, N.; Chourasia, R.; Sivasamy, S. Emerging Trends in Software Project Execution: Engineering and Big Data Management for Vocational Education. In Integrating AI and Sustainability in Technical and Vocational Education and Training (TVET); 2025; pp. 263–278. [Google Scholar]
- Mannava, M. K.; Gupta, H.; Mishra, M. V.; Banerjee, S. Optimizing Financial Processes Through AI-Enhanced Project Management, Big Data Engineering, and Sustainability. In AI-Enabled Sustainable Innovations in Education and Business; IGI Global Scientific Publishing, 2025; pp. 203–224. [Google Scholar]
- Gupta, H.; Mishra, M. V. M.; Grover, N.; Chourasia, R. Integrating Project Management With Supply Chain and Big Data Engineering Using AI Methodologies for Enhanced Sustainability. In AI-Enabled Sustainable Innovations in Education and Business; IGI Global Scientific Publishing, 2025; pp. 319–352. [Google Scholar]
- Özker, A. N. Factual Changes in In?? lation and National Income: Their Impact on the Tax Burden Within OECD Countries. Pakistan Journal of Life & Social Sciences 2023, 21(1). [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).