Submitted:
26 November 2024
Posted:
27 November 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methodology
2.1. Dataset
2.2. Data Preprocessing
2.2.1. Handling Missing Data
2.2.2. Data Normalization
2.2.3. Train-Test Split
2.3. Structural Learning
2.3.1. Constraint-Based Approach
2.3.2. Score-Based Approach
- BIC and AIC: Both metrics balance model fit with model complexity, penalizing more complex models to avoid overfitting. BIC generally favors simpler models with fewer parameters, while AIC is more lenient, allowing slightly more complex models if they provide better fit.
- Log-Likelihood: This metric focuses solely on the fit of the model to the data, often resulting in more complex networks with potentially better predictive performance but reduced interpretability.
- K2: The K2 score is specific to Bayesian Networks and evaluates the likelihood of the data given the network structure, often leading to highly tailored networks for the dataset at hand.
2.4. Parameter Learning
- Maximum Likelihood Estimation (MLE): MLE was used to estimate the parameters, maximizing the likelihood of the observed data given the model.
- Bayesian Estimation: To enhance robustness, Bayesian Estimation was employed, incorporating prior knowledge through the use of prior distributions.
2.5. Cross-Validation
3. Results
3.1. Naive Bayes Model
3.2. Tree-Augmented Naive Bayes (TAN) Model
3.3. Decision Tree Model
3.4. Random Forest Model
4. Conclusion
5. Discussion
References
- World Health Organization, “Breast cancer,” 2024, accessed: 18-Aug2024. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/breast-cancer.
- Weng, Y.; Cao, Y.; Li, M.; Yang, X. The Application of Big Data and AI in Risk Control Models: Safeguarding User Security. International Journal of Frontiers in Engineering Technology 2024, 6. [Google Scholar] [CrossRef]
- Zhou, T.; Zhao, J.; Luo, Y.; Xie, X.; Wen, W.; Ding, C.; Xu, X. AdaPI: Facilitating DNN Model Adaptivity for Efficient Private Inference in Edge Computing. 2024 IEEE/ACM International Conference on Computer Aided Design (ICCAD). IEEE, 2024.
- Tao, Y.; Jia, Y.; Wang, N.; Wang, H. The FacT: Taming Latent Factor Models for Explainability with Factorization Trees. Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval; Association for Computing Machinery: New York, NY, USA, 2019. [Google Scholar] [CrossRef]
- Zheng, H.; Zhang, Q.; Gong, Y.; Liu, Z.; Chen, S. Identification of Prognostic Biomarkers for Stage III Non-Small Cell Lung Carcinoma in Female Nonsmokers Using Machine Learning. arXiv 2024, arXiv:2408.16068. [Google Scholar]
- Ji, Y.; Yu, Z.; Wang, Y. Assertion Detection Large Language Model In-context Learning LoRA Fine-tuning. arXiv 2024, arXiv:2401.17602. [Google Scholar]
- Yu, C.; Jin, Y.; Xing, Q.; Zhang, Y.; Guo, S.; Meng, S. Advanced User Credit Risk Prediction Model using LightGBM, XGBoost and Tabnet with SMOTEENN. arXiv 2024, arXiv:2408.03497. [Google Scholar]
- Mo, K.; Liu, W.; Xu, X.; Yu, C.; Zou, Y.; Xia, F. Fine-Tuning Gemma-7B for Enhanced Sentiment Analysis of Financial News Headlines. 2024 IEEE 4th International Conference on Electronic Technology, Communication and Information (ICETCI), 2024, pp. 130–135. [CrossRef]
- Xie, X.; Peng, H.; Hasan, A.; Huang, S.; Zhao, J.; Fang, H.; Zhang, W.; Geng, T.; Khan, O.; Ding, C. Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks. 2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD). IEEE, 2023, pp. 01–09.
- Dan, H.C.; Yan, P.; Tan, J.; Zhou, Y.; Lu, B. Multiple distresses detection for Asphalt Pavement using improved you Only Look Once Algorithm based on convolutional neural network. International Journal of Pavement Engineering 2024, 25, 2308169. [Google Scholar] [CrossRef]
- Ji, Y.; Li, Z.; Meng, R.; He, D. ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation. arXiv 2024, arXiv:2410.05168. [Google Scholar]
- Song, Y. Deep Learning Applications in the Medical Image Recognition. American Journal of Computer Science and Technology 2019, 2, 22–26. [Google Scholar] [CrossRef]
- Liu, X.; Wang, Z. Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images. arXiv 2024, arXiv:2408.00636. [Google Scholar]
- Ji, Y.; Gao, Y.; Bao, R.; Li, Q.; Liu, D.; Sun, Y.; Ye, Y. Prediction of COVID-19 Patients’ Emergency Room Revisit using Multi-Source Transfer Learning. 2023 IEEE 11th International Conference on Healthcare Informatics (ICHI), 2023, pp. 138–144. [CrossRef]
- Yu, H.; Yu, C.; Wang, Z.; Zou, D.; Qin, H. Enhancing Healthcare through Large Language Models: A Study on Medical Question Answering. arXiv 2024, arXiv:2408.04138. [Google Scholar]
- Zhang, Q.; Qi, W.; Zheng, H.; Shen, X. CU-Net: A U-Net Architecture for Efficient Brain-Tumor Segmentation on BraTS 2019 Dataset. 2024 4th International Conference on Machine Learning and Intelligent Systems Engineering (MLISE), 2024, pp. 255–258. [CrossRef]
- Ji, Y.; Li, Z.; Meng, R.; Sivarajkumar, S.; Wang, Y.; Yu, Z.; Ji, H.; Han, Y.; Zeng, H.; He, D. RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts. arXiv 2024, arXiv:2405.13179. [Google Scholar]
- Ji, Y.; Ma, W.; Sivarajkumar, S.; Zhang, H.; Sadhu, E.M.; Li, Z.; Wu, X.; Visweswaran, S.; Wang, Y. Enhancing Equity in Large Language Models for Medical Applications. arXiv 2024, arXiv:2410.05180. [Google Scholar]
- Gong, Y.; Zhang, Q.; Zheng, H.; Liu, Z.; Chen, S. Graphical Structural Learning of rs-fMRI data in Heavy Smokers. arXiv 2024, arXiv:2409.08395. [Google Scholar]
- Xin, Y.; Luo, S.; Jin, P.; Du, Y.; Wang, C. Self-Training with Label-Feature-Consistency for Domain Adaptation. International Conference on Database Systems for Advanced Applications. Springer, 2023, pp. 84–99.
- Yang, J.; Liu, J.; Yao, Z.; Ma, C. Measuring digitalization capabilities using machine learning. Research in International Business and Finance 2024, 70, 102380. [Google Scholar] [CrossRef]
- Lucas, P.J.; Van der Gaag, L.C.; Abu-Hanna, A. Bayesian networks in biomedicine and health-care, 2004.
- Wolberg, W.H.; Street, W.N.; Mangasarian, O.L. Breast cancer Wisconsin (diagnostic) data set. UCI Machine Learning Repository, 1992.
- Weng, Y.; Wu, J. Leveraging Artificial Intelligence to Enhance Data Security and Combat Cyber Attacks. Journal of Artificial Intelligence General science (JAIGS) ISSN: 3006-4023 2024, 5, 392–399. [Google Scholar] [CrossRef]
- Orak, N.H. A Hybrid Bayesian network framework for risk assessment of arsenic exposure and adverse reproductive outcomes. Ecotoxicology and Environmental Safety 2020, 192, 110270. [Google Scholar] [CrossRef] [PubMed]
- Kharya, S.; Agrawal, S.; Soni, S. Naive Bayes classifiers: a probabilistic detection model for breast cancer. Int. J. Comput. Appl 2014, 92, 26–31. [Google Scholar] [CrossRef]


| Model | Accuracy | Sensitivity | Specificity |
|---|---|---|---|
| Naive Bayes | 0.9708 | 0.9608 | 1.0000 |
| TAN | 0.8321 | 0.9314 | 0.5429 |
| Decision Tree | 0.9489 | 0.9608 | 0.9143 |
| Random Forest | 0.9752 | 0.9706 | 0.9412 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).