Submitted:
16 June 2025
Posted:
17 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Background of the Study
1.2. Statement of the Problem
1.3. Objectives of the Study
1.4. Research Questions
1.5. Significance of the Study
1.6. Scope of the Study
1.7. Limitations of the Study
1.8. Definition of Terms
2. Literature Review
2.1. Introduction
2.2. Overview of Alzheimer’s Disease
2.3. Cognitive Assessment Tools in Alzheimer’s Detection
- Mini-Mental State Examination (MMSE): A 30-point questionnaire assessing orientation, attention, memory, language, and visual-spatial skills.
- Alzheimer’s Disease Assessment Scale-Cognitive Subscale (ADAS-Cog): Measures memory, language, and praxis.
- Clinical Dementia Rating (CDR): Evaluates functional performance in memory, orientation, judgment, and problem-solving.
2.4. MRI-Based Imaging in Alzheimer’s Diagnosis
- Hippocampal Volume: Strongly associated with early AD.
- Cortical Thickness: Particularly in the entorhinal cortex and medial temporal lobes.
- Ventricular Enlargement: Suggests general brain atrophy.
2.5. Machine Learning in Medical Diagnosis
- Support Vector Machines (SVM): Effective in high-dimensional spaces, commonly used for binary classification.
- Random Forests (RF): Ensemble learning technique with good performance and feature importance interpretation.
- Artificial Neural Networks (ANNs): Capable of modeling complex relationships, especially with large datasets.
- Convolutional Neural Networks (CNNs): Designed for image data, CNNs are effective in analyzing MRI slices and volumes.
2.6. Multimodal Learning for Alzheimer’s Detection
- Early Fusion: Combining raw features from both modalities before training.
- Late Fusion: Independent models for each modality whose outputs are then combined.
- Hybrid Fusion: Integration at multiple levels of representation.
2.7. Explainable AI in Medical Diagnostics
- SHAP (SHapley Additive exPlanations): Quantifies the contribution of each feature to model predictions.
- Grad-CAM (Gradient-weighted Class Activation Mapping): Visualizes which regions of an image influenced a CNN’s decision.
2.8. Research Gaps and Motivation for the Study
- Many studies focus on either cognitive scores or imaging, rather than fusing both modalities.
- There is insufficient emphasis on model interpretability and clinical usability.
- Most existing models lack validation across diverse demographic populations.
3. Methodology
3.1. Research Design
3.2. Data Source and Description
- T1-weighted structural MRI scans
- Cognitive assessment scores (MMSE, ADAS-Cog, CDR)
- Demographic and clinical information
3.3. Data Preprocessing
3.3.1. MRI Preprocessing:
- Skull stripping and bias correction
- Spatial normalization to MNI space
- Segmentation into grey matter, white matter, and cerebrospinal fluid
- Feature extraction using FreeSurfer and FSL for metrics such as hippocampal volume, cortical thickness, and surface area
3.3.2. Cognitive Score Cleaning:
- Normalization of scores to remove inter-test variability
- Imputation of missing values using K-Nearest Neighbor (KNN) imputation
3.3.3. Feature Engineering:
- Dimensionality reduction via Principal Component Analysis (PCA) and t-SNE
- Feature selection based on mutual information and correlation analysis
3.4. Machine Learning Models
3.4.1. Support Vector Machine (SVM):
3.4.2. Random Forest (RF):
3.4.3. Multilayer Perceptron (MLP):
3.4.4. Convolutional Neural Network (CNN):
3.5. Model Training and Validation
- Accuracy
- Precision
- Recall
- F1-Score
- Area Under the ROC Curve (AUC-ROC)
3.6. Model Interpretability
- SHAP: Used for tabular cognitive and feature-level interpretation
- Grad-CAM: Used for spatial heatmap visualization in MRI image-based CNNs
3.7. Ethical Considerations
3.8. Summary
4. Results and Analysis
4.1. Introduction
4.2. Dataset Composition and Descriptive Statistics
- Cognitively Normal (CN): 268
- Mild Cognitive Impairment (MCI): 301
- Alzheimer’s Disease (AD): 243
- Mean age: 72.6 years
- Gender: 54% male, 46% female
-
Average MMSE score:
- ○
- CN: 28.9
- ○
- MCI: 25.4
- ○
- AD: 19.2
4.3. Feature Importance Analysis
- Hippocampal volume (left and right)
- Entorhinal cortical thickness
- Ventricular volume
- MMSE and ADAS-Cog scores
4.4. Model Performance Comparison
4.4.1. Support Vector Machine (SVM)
- Accuracy: 86.4%
- Precision: 85.1%
- Recall: 84.3%
- F1-Score: 84.7%
- AUC-ROC: 0.89
4.4.2. Random Forest (RF)
- Accuracy: 88.7%
- Precision: 87.9%
- Recall: 86.8%
- F1-Score: 87.3%
- AUC-ROC: 0.91
4.4.3. Multilayer Perceptron (MLP)
- Accuracy: 84.2%
- Precision: 83.7%
- Recall: 81.2%
- F1-Score: 82.4%
- AUC-ROC: 0.88
4.4.4. Convolutional Neural Network (CNN) (MRI-only input)
- Accuracy: 85.6%
- Precision: 84.2%
- Recall: 83.9%
- F1-Score: 84.0%
- AUC-ROC: 0.90
4.4.5. Multimodal Hybrid Model (Cognitive + MRI Features)
- Accuracy: 91.5%
- Precision: 90.8%
- Recall: 91.0%
- F1-Score: 90.9%
- AUC-ROC: 0.94
4.5. Confusion Matrix and Error Analysis
- CN classification accuracy: 94.1%
- MCI classification accuracy: 89.3%
- AD classification accuracy: 91.2%
4.6. Explainability Analysis
- SHAP Values:
- Grad-CAM Results (CNN MRI Analysis):
4.7. Summary of Key Findings
- Combining cognitive and MRI data significantly improves classification performance.
- Random Forest and CNN performed well individually, but multimodal fusion achieved superior accuracy.
- Explainability tools provided interpretable insights consistent with neuroscience literature.
5. Discussion, Conclusion and Recommendations
5.1. Introduction
5.2. Discussion of Findings
5.2.1. Multimodal Data Integration Enhances Diagnostic Performance
5.2.2. MRI Biomarkers as Early Indicators
5.2.3. Importance of Explainable AI
5.2.4. Addressing Research Questions
- RQ1: The integrated model indeed outperformed individual modalities, supporting the hypothesis.
- RQ2: The Random Forest and CNN models showed optimal performance, with the hybrid approach yielding the best results.
- RQ3: Hippocampal volume, MMSE, and ventricular size were the most informative features.
- RQ4: Explainability techniques revealed clinically relevant insights, enhancing trust and interpretability.
5.3. Conclusion
5.4. Contributions to Knowledge
- Developed and validated a novel hybrid machine learning model for early Alzheimer’s detection.
- Demonstrated the synergistic value of combining cognitive and MRI data.
- Applied state-of-the-art explainability methods to bridge AI and clinical decision-making.
- Provided a reproducible framework applicable to other neurological disorders.
5.5. Limitations of the Study
- The dataset was limited to ADNI, which may not fully represent global population diversity.
- Variability in MRI scanner types and acquisition protocols could affect generalizability.
- Deep learning models require large datasets; the CNN architecture was constrained by dataset size.
- External validation on real-time hospital data was not conducted due to access limitations.
5.6. Recommendations
- Future studies should include longitudinal data to model disease progression over time.
- Model validation on diverse, locally sourced datasets is essential for global applicability.
- Integration of other modalities, such as PET scans and blood biomarkers, may further enhance diagnostic capability.
- Deployment of AI tools in clinical settings should involve physicians, data scientists, and ethicists to ensure responsible adoption.
5.7. Future Research Directions
- Exploration of federated learning for cross-institutional training without compromising patient privacy.
- Use of graph neural networks to model complex brain region interactions.
- Development of mobile-based cognitive assessment tools powered by lightweight ML models.
- Investigation of early prodromal markers in at-risk but asymptomatic individuals.
6. Summary, Conclusion, and Implications
6.1. Introduction
6.2. Summary of the Study
- Chapter One introduced the problem context, research questions, objectives, and significance of the study, emphasizing the need for intelligent, non-invasive, and interpretable diagnostic systems for Alzheimer’s Disease.
- Chapter Two reviewed existing literature on AD pathology, cognitive assessment, neuroimaging biomarkers, and machine learning applications in medical diagnostics. This chapter identified key gaps in existing models, especially regarding modality integration and model transparency.
- Chapter Three outlined the methodology adopted, including the data source (ADNI), preprocessing techniques, machine learning models (SVM, Random Forest, MLP, CNN), multimodal integration strategies, evaluation metrics, and ethical considerations.
- Chapter Four presented the results of model evaluation, highlighting the superior performance of the multimodal hybrid model, and demonstrating the importance of hippocampal volume, MMSE, and ventricular size in classification. Explainability tools (SHAP, Grad-CAM) were employed to reveal the internal decision-making logic of the models.
- Chapter Five discussed these findings in relation to the research questions and prior studies, while acknowledging the study’s limitations and proposing recommendations and future research directions.
6.3. Major Findings and Contributions
6.3.1. Efficacy of Multimodal Machine Learning
6.3.2. Feature Significance Consistent with Clinical Pathology
6.3.3. Explainable AI for Clinical Trust
6.3.4. Bridging the Gap Between Computational and Clinical Research
6.4. Theoretical Implications
- Cognitive and imaging features provide complementary representations of disease progression.
- Machine learning frameworks can be structured to enhance generalizability through data fusion and dimensionality reduction techniques.
- Interpretability must be an integral design component of medical AI to ensure knowledge transfer and trust across disciplinary boundaries.
6.5. Practical and Clinical Implications
6.5.1. Improved Early Diagnosis and Screening
6.5.2. Reduced Subjectivity and Improved Consistency
6.5.3. Foundation for Personalized Medicine
6.5.4. Scalability and Cost Reduction
6.6. Policy Implications
- Health Technology Assessment (HTA): The adoption of AI tools like the one developed in this study should be considered as part of public health screening programs for dementia and neurodegenerative diseases.
- Data Governance and Ethical AI: As more health systems embrace AI, regulatory frameworks must mandate interpretability, fairness, and validation across demographic subgroups.
- Capacity Building: Investments in digital infrastructure, clinician training, and interdisciplinary collaboration are essential to transition from pilot studies to national-level deployment of AI-assisted diagnosis.
6.7. Limitations of the Study
- Dataset Generalizability: The use of ADNI, although comprehensive, may not reflect global population diversity. Future studies should include external validation with multi-ethnic cohorts and varying clinical protocols.
- Cross-Sectional Analysis: The current framework is based on cross-sectional data. Longitudinal modeling would allow for prediction of disease progression and trajectory estimation.
- Model Complexity vs Interpretability Tradeoff: More complex deep learning models (e.g., transformers, graph-based networks) may offer performance gains but were not included due to interpretability constraints.
- Computational Constraints: MRI preprocessing and CNN training require significant computational resources, which may limit real-time applications in low-resource settings without appropriate optimization.
6.8. Recommendations for Future Research
- Incorporating Longitudinal Data: Explore disease progression modeling using temporal data to predict transition from MCI to AD.
- Expanding Modalities: Integrate PET scans, genetic data (e.g., APOE4), and blood-based biomarkers for a more holistic representation.
- Cross-Domain Validation: Apply the model to other neurodegenerative disorders such as Parkinson’s Disease or Lewy Body Dementia to test generalizability.
- Deploying Federated Learning Models: Protect data privacy by training models across distributed sites without data centralization.
- Engaging End-Users in Design: Co-create AI tools with clinicians, patients, and caregivers to ensure usability, acceptance, and ethical alignment.
6.9. Conclusion
References
- Hossain, M. D., Rahman, M. H., & Hossan, K. M. R. (2025). Artificial Intelligence in healthcare: Transformative applications, ethical challenges, and future directions in medical diagnostics and personalized medicine.
- Tayebi Arasteh, S., Lotfinia, M., Nolte, T., Sähn, M. J., Isfort, P., Kuhl, C., ... & Truhn, D. (2023). Securing collaborative medical AI by using differential privacy: Domain transfer for classification of chest radiographs. Radiology: Artificial Intelligence, 6(1), e230212.
- Yoon, J., Mizrahi, M., Ghalaty, N. F., Jarvinen, T., Ravi, A. S., Brune, P., ... & Pfister, T. (2023). EHR-Safe: generating high-fidelity and privacy-preserving synthetic electronic health records. NPJ digital medicine, 6(1), 141.
- Venugopal, R., Shafqat, N., Venugopal, I., Tillbury, B. M. J., Stafford, H. D., & Bourazeri, A. (2022). Privacy preserving generative adversarial networks to model electronic health records. Neural Networks, 153, 339-348.
- Ahmed, T., Aziz, M. M. A., Mohammed, N., & Jiang, X. (2021, August). Privacy preserving neural networks for electronic health records de-identification. In Proceedings of the 12th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics (pp. 1-6).
- Mohammadi, M., Vejdanihemmat, M., Lotfinia, M., Rusu, M., Truhn, D., Maier, A., & Arasteh, S. T. (2025). Differential Privacy for Deep Learning in Medicine. arXiv preprint. arXiv:2506.00660.
- Khalid, N., Qayyum, A., Bilal, M., Al-Fuqaha, A., & Qadir, J. (2023). Privacy-preserving artificial intelligence in healthcare: Techniques and applications. Computers in Biology and Medicine, 158, 106848.
- Libbi, C. A., Trienes, J., Trieschnigg, D., & Seifert, C. (2021). Generating synthetic training data for supervised de-identification of electronic health records. Future Internet, 13(5), 136.
- Manwal, M., & Purohit, K. C. (2024, November). Privacy Preservation of EHR Datasets Using Deep Learning Techniques. In 2024 International Conference on Cybernation and Computation (CYBERCOM) (pp. 25-30). IEEE.
- Yadav, N., Pandey, S., Gupta, A., Dudani, P., Gupta, S., & Rangarajan, K. (2023). Data privacy in healthcare: In the era of artificial intelligence. Indian Dermatology Online Journal, 14(6), 788-792.
- de Arruda, M. S. M. S., & Herr, B. Personal Health Train: Advancing Distributed Machine Learning in Healthcare with Data Privacy and Security.
- Tian, M., Chen, B., Guo, A., Jiang, S., & Zhang, A. R. (2024). Reliable generation of privacy-preserving synthetic electronic health record time series via diffusion models. Journal of the American Medical Informatics Association, 31(11), 2529-2539.
- Ghosheh, G. O., Li, J., & Zhu, T. (2024). A survey of generative adversarial networks for synthesizing structured electronic health records. ACM Computing Surveys, 56(6), 1-34.
- Nowrozy, R., Ahmed, K., Kayes, A. S. M., Wang, H., & McIntosh, T. R. (2024). Privacy preservation of electronic health records in the modern era: A systematic survey. ACM Computing Surveys, 56(8), 1-37.
- Williamson, S. M., & Prybutok, V. (2024). Balancing privacy and progress: a review of privacy challenges, systemic oversight, and patient perceptions in AI-driven healthcare. Applied Sciences, 14(2), 675.
- Alzubi, J. A., Alzubi, O. A., Singh, A., & Ramachandran, M. (2022). Cloud-IIoT-based electronic health record privacy-preserving by CNN and blockchain-enabled federated learning. IEEE Transactions on Industrial Informatics, 19(1), 1080-1087.
- Sidharth, S. (2015). Privacy-Preserving Generative AI for Secure Healthcare Synthetic Data Generation.
- Mullankandy, S., Mukherjee, S., & Ingole, B. S. (2024, December). Applications of AI in Electronic Health Records, Challenges, and Mitigation Strategies. In 2024 International Conference on Computer and Applications (ICCA) (pp. 1-7). IEEE.
- Seh, A. H., Al-Amri, J. F., Subahi, A. F., Agrawal, A., Pathak, N., Kumar, R., & Khan, R. A. (2022). An analysis of integrating machine learning in healthcare for ensuring confidentiality of the electronic records. Computer Modeling in Engineering & Sciences, 130(3), 1387-1422.
- Lin, W. C., Chen, J. S., Chiang, M. F., & Hribar, M. R. (2020). Applications of artificial intelligence to electronic health record data in ophthalmology. Translational vision science & technology, 9(2), 13-13.
- Ali, M., Naeem, F., Tariq, M., & Kaddoum, G. (2022). Federated learning for privacy preservation in smart healthcare systems: A comprehensive survey. IEEE journal of biomedical and health informatics, 27(2), 778-789.
- Ng, J. C., Yeoh, P. S. Q., Bing, L., Wu, X., Hasikin, K., & Lai, K. W. (2024). A Privacy-Preserving Approach Using Deep Learning Models for Diabetic Retinopathy Diagnosis. IEEE Access.
- Wang, Z., & Sun, J. (2022, December). PromptEHR: Conditional electronic healthcare records generation with prompt learning. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing (Vol. 2022, p. 2873).
- Agrawal, V., Kalmady, S. V., Manoj, V. M., Manthena, M. V., Sun, W., Islam, M. S., ... & Greiner, R. (2024, May). Federated Learning and Differential Privacy Techniques on Multi-hospital Population-scale Electrocardiogram Data. In Proceedings of the 2024 8th International Conference on Medical and Health Informatics (pp. 143-152).
- Adusumilli, S., Damancharla, H., & Metta, A. (2023). Enhancing Data Privacy in Healthcare Systems Using Blockchain Technology. Transactions on Latest Trends in Artificial Intelligence, 4(4).
- Tayefi, M., Ngo, P., Chomutare, T., Dalianis, H., Salvi, E., Budrionis, A., & Godtliebsen, F. (2021). Challenges and opportunities beyond structured data in analysis of electronic health records. Wiley Interdisciplinary Reviews: Computational Statistics, 13(6), e1549.
- Meduri, K., Nadella, G. S., Yadulla, A. R., Kasula, V. K., Maturi, M. H., Brown, S., ... & Gonaygunta, H. (2025). Leveraging federated learning for privacy-preserving analysis of multi-institutional electronic health records in rare disease research. Journal of Economy and Technology, 3, 177-189.
- Ghosheh, G., Li, J., & Zhu, T. (2022). A review of Generative Adversarial Networks for Electronic Health Records: applications, evaluation measures and data sources. arXiv preprint. arXiv:2203.07018.
- Chukwunweike, J. N., Praise, A., & Bashirat, B. A. (2024). Harnessing Machine Learning for Cybersecurity: How Convolutional Neural Networks are Revolutionizing Threat Detection and Data Privacy. International Journal of Research Publication and Reviews, 5(8).
- Tekchandani, P., Bisht, A., Das, A. K., Kumar, N., Karuppiah, M., Vijayakumar, P., & Park, Y. (2024). Blockchain-Enabled Secure Collaborative Model Learning using Differential Privacy for IoT-Based Big Data Analytics. IEEE Transactions on Big Data.
- Tekchandani, P., Bisht, A., Das, A. K., Kumar, N., Karuppiah, M., Vijayakumar, P., & Park, Y. (2024). Blockchain-Enabled Secure Collaborative Model Learning using Differential Privacy for IoT-Based Big Data Analytics. IEEE Transactions on Big Data.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).