Submitted:
17 June 2025
Posted:
17 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Background of the Study
1.2. Problem Statement
1.3. Aim and Objectives of the Study
Aim
Objectives
- To collect and preprocess gait and neuroimaging data from Parkinson’s Disease patients across multiple progression stages.
- To extract and select relevant features from gait metrics (e.g., stride length, gait variability) and neuroimaging modalities (e.g., cortical thickness, basal ganglia volume).
- To train and validate machine learning models, including traditional classifiers and deep learning architectures, on the multimodal dataset.
- To evaluate model performance using standard metrics such as accuracy, precision, recall, F1-score, and AUC-ROC.
- To interpret and visualize feature contributions to model predictions using explainable AI techniques (e.g., SHAP, LIME).
- To assess the potential clinical utility of the developed model in real-world prognostic scenarios.
1.4. Research Questions
- Can gait and neuroimaging features be effectively combined to improve the prediction of PD progression using machine learning models?
- Which machine learning algorithms yield the highest predictive accuracy for PD progression, and how do they compare with traditional clinical scales?
- What are the most significant gait and neuroimaging features contributing to disease progression prediction?
- How can model interpretability be enhanced to support clinical trust and adoption?
1.5. Significance of the Study
1.6. Scope of the Study
1.7. Limitations of the Study
- Data Heterogeneity: Differences in data acquisition protocols, sensor placements, or imaging parameters across institutions may introduce variability that could affect model performance.
- Sample Size: The performance of deep learning models, in particular, may be constrained by limited sample sizes, which is a common challenge in clinical datasets.
- Generalizability: Models trained on specific population cohorts may not generalize well across diverse demographic or clinical subgroups.
- Interpretability vs. Complexity: There is often a trade-off between model complexity and interpretability. While deep models may offer higher accuracy, they are harder to interpret, which may limit clinical acceptance.
1.8. Organization of the Study
- Chapter Two presents a comprehensive review of related literature on Parkinson’s Disease, gait analysis, neuroimaging biomarkers, and the application of machine learning in neurodegenerative disease prediction.
- Chapter Three describes the methodology, including data collection, preprocessing, feature extraction, model development, and evaluation metrics.
- Chapter Four provides a detailed presentation and discussion of the experimental results.
- Chapter Five evaluates the implications of the findings in clinical and technological contexts and discusses model interpretability and limitations.
- Chapter Six concludes the study, outlines key contributions, and offers recommendations for future research.
2. Literature Review
2.1. Introduction
2.2. Overview of Parkinson’s Disease
2.3. Gait as a Biomarker for Parkinson’s Disease
2.3.1. Gait Analysis Techniques
- Wearable Sensors: Inertial Measurement Units (IMUs), including accelerometers and gyroscopes, are commonly attached to the ankles, hips, or trunk to measure kinematic parameters.
- Pressure-sensitive Walkways: Systems such as GAITRite analyze temporal and spatial gait parameters by detecting footfall pressure patterns.
- Optoelectronic Motion Capture: High-resolution motion capture systems offer detailed analysis but are often limited to laboratory environments due to cost and complexity.
2.3.2. Gait Features in PD Research
- Temporal features: step time, stride time, swing and stance durations.
- Spatial features: stride length, step width, path deviation.
- Variability metrics: coefficient of variation (CV) of stride and step parameters.
- Symmetry and Regularity: indicators of motor control and bilateral coordination.
2.4. Neuroimaging in Parkinson’s Disease
2.4.1. Structural Imaging
2.4.2. Functional Imaging
2.4.3. Imaging Biomarkers
- Cortical thickness
- White matter integrity (via diffusion tensor imaging)
- Subcortical structure volumes
- Functional connectivity patterns
2.5. Machine Learning in Parkinson’s Disease Research
2.5.1. Common ML Techniques in PD Studies
- Support Vector Machines (SVM): Used for classification tasks due to their robustness in high-dimensional feature spaces. SVMs have been applied successfully to distinguish PD patients from healthy individuals based on gait or imaging data.
- Random Forests (RF): An ensemble learning method effective in ranking feature importance. It performs well on datasets with mixed-type features.
- K-Nearest Neighbors (KNN): A simple, instance-based method used in some early-stage PD classification studies, though its performance can degrade with high-dimensional data.
- Artificial Neural Networks (ANN) and Deep Learning: Particularly convolutional neural networks (CNNs) have been used for automatic feature extraction from imaging data. Recurrent neural networks (RNNs) and long short-term memory (LSTM) networks are useful in analyzing temporal gait data.
- Unsupervised Learning: Techniques like clustering and dimensionality reduction (e.g., PCA, t-SNE) are used for identifying subtypes or visualizing high-dimensional data.
2.5.2. Multimodal Data Integration
2.6. Feature Selection and Dimensionality Reduction
2.7. Model Evaluation Metrics
- Accuracy: Proportion of correctly predicted instances.
- Precision and Recall: Useful for imbalanced datasets.
- F1-Score: Harmonic mean of precision and recall.
- ROC-AUC: Area under the Receiver Operating Characteristic curve, indicating model discrimination.
- Mean Absolute Error (MAE) and Root Mean Square Error (RMSE): For regression models predicting continuous disease progression scores.
2.8. Interpretability and Explainable AI (XAI)
2.9. Gaps in the Literature
- Limited availability of large, labeled, multimodal datasets for PD progression studies.
- Over-reliance on cross-sectional rather than longitudinal data.
- Lack of external validation and prospective studies.
- Insufficient exploration of deep learning interpretability in clinical applications.
- Underrepresentation of diverse populations, limiting model generalizability.
2.10. Summary
3. Methodology
3.1. Introduction
3.2. Research Design
3.3. Data Sources and Collection
3.3.1. Gait Data Acquisition
- Stride length
- Stride time
- Gait velocity
- Step width
- Cadence
- Variability indices (coefficient of variation of stride/step times)
3.3.2. Neuroimaging Data Acquisition
- Structural MRI (T1-weighted) for volumetric and cortical thickness analysis.
- Resting-state fMRI to assess brain connectivity.
- Diffusion Tensor Imaging (DTI) for white matter integrity.
3.4. Data Preprocessing
3.4.1. Gait Data Preprocessing
3.4.2. Imaging Data Preprocessing
- Skull stripping
- Bias field correction
- Tissue segmentation
- Cortical surface reconstruction
- Parcellation into standard brain atlases (e.g., Desikan–Killiany atlas)
- Slice-timing correction
- Motion correction
- Spatial normalization to MNI space
- Bandpass filtering and nuisance signal regression
3.5. Feature Engineering
3.5.1. Feature Selection Techniques
- Recursive Feature Elimination (RFE): Iteratively removed least significant features based on model weights.
- Lasso Regularization (L1 Penalty): Imposed sparsity to reduce model complexity.
- Mutual Information Analysis: Assessed dependency between features and progression labels.
3.5.2. Dimensionality Reduction
- Principal Component Analysis (PCA): Transformed data to orthogonal components preserving variance.
- t-Distributed Stochastic Neighbor Embedding (t-SNE): Used for visualization of high-dimensional features.
3.6. Model Development
3.6.1. Machine Learning Models
- Support Vector Machines (SVM): Effective for high-dimensional classification problems.
- Random Forest (RF): Robust to overfitting and interpretable through feature importance scores.
- Gradient Boosting Machines (e.g., XGBoost): Employed for high-performance predictive modeling.
- Multilayer Perceptron (MLP): Neural network used to learn nonlinear mappings.
- Convolutional Neural Networks (CNNs): Used to process imaging data and extract spatial patterns.
- Long Short-Term Memory Networks (LSTM): Applied to sequential gait data for temporal pattern learning.
3.6.2. Model Training
3.6.3. Labeling and Ground Truth
- UPDRS Part III motor scores
- Hoehn and Yahr staging
- Progression labels derived from longitudinal follow-up (slow, moderate, fast progressors)
3.7. Model Evaluation
- Accuracy
- Precision, Recall, F1-Score
- Confusion Matrix Analysis
- Receiver Operating Characteristic (ROC) Curve and AUC
- Mean Absolute Error (MAE) for continuous progression score prediction
3.8. Explainability and Interpretability
- SHAP (Shapley Additive Explanations): Identified feature contributions to individual predictions.
- LIME (Local Interpretable Model-Agnostic Explanations): Provided local approximations of model behavior.
- Grad-CAM: Visualized CNN attention in imaging-based models to localize significant brain regions.
3.9. Ethical Considerations
- Data Anonymization: All personal identifiers were removed prior to analysis.
- Informed Consent: For clinical data, consent was obtained per ethical board protocols.
- Data Usage Agreements: Use of open-access datasets like PPMI complied with licensing terms.
- Bias Mitigation: Model training was carefully balanced to avoid gender, age, or race biases in predictions.
3.10. Tools and Software
- Python with scikit-learn, TensorFlow, Keras, XGBoost, SHAP, Nilearn
- MATLAB for some signal processing tasks
- Freesurfer/SPM/FSL for neuroimaging preprocessing
- Jupyter Notebook and Google Colab for experimentation and visualization
3.11. Summary
4. Results and Analysis
4.1. Introduction
4.2. Data Overview
- Participants: 420 PD patients (mean age: 65.3 ± 9.4 years; 58% male)
- Gait Features: 25 temporal, spatial, and variability-based parameters
- Neuroimaging Features: 86 metrics including cortical thickness, subcortical volumes, and connectivity strengths
-
Labels:
- o
- Categorical: Slow, moderate, and fast progressors (based on UPDRS trajectory over 3 years)
- o
- Continuous: Annual rate of UPDRS Part III score change
4.3. Model Performance: Classification Tasks
| Model | Accuracy | Precision | Recall | F1-Score | ROC-AUC |
| SVM (RBF) | 83.5% | 0.81 | 0.83 | 0.82 | 0.89 |
| Random Forest | 86.3% | 0.85 | 0.86 | 0.85 | 0.91 |
| XGBoost | 89.1% | 0.88 | 0.89 | 0.89 | 0.94 |
4.4. Model Performance: Regression Tasks
| Model | MAE | RMSE | R² |
| SVR | 2.35 | 3.02 | 0.74 |
| Random Forest Regressor | 1.89 | 2.51 | 0.82 |
| Gradient Boosting Regressor | 1.61 | 2.20 | 0.86 |
4.5. Ablation Study: Gait vs. Imaging vs. Combined Data
| Feature Set | Accuracy (XGBoost) | MAE (Regressor) |
| Gait Only | 76.8% | 2.34 |
| Imaging Only | 82.4% | 1.89 |
| Gait + Imaging | 89.1% | 1.61 |
4.6. Feature Importance Analysis
- Stride time variability
- Gait velocity
- Step asymmetry index
- Double support duration
- Cadence irregularity
- Volume of the substantia nigra
- Cortical thickness in the prefrontal cortex
- Functional connectivity in the motor network
- Putamen volume
- Anterior cingulate gyrus thickness
4.7. Model Interpretability and Explainability
- SHAP values demonstrated that increased gait variability and reduced subcortical volume strongly contributed to higher progression risk.
- The model’s predictions for individual patients were interpretable, highlighting the most influential features in their progression classification.
- LIME confirmed local decision boundaries, showing that even subtle changes in step regularity or connectivity strength could alter classification outcomes.
- Visual heatmaps revealed focus on motor cortex, basal ganglia, and brainstem, supporting the model’s neuroanatomical validity.
4.8. Statistical Significance
4.9. Summary of Findings
- XGBoost and Gradient Boosting achieved the best results for classification and regression, respectively.
- Multimodal integration significantly enhanced performance over unimodal models.
- Important features correspond with known biomarkers and clinical understanding of PD.
- Interpretability tools increased trustworthiness and clinical relevance of predictions.
5. Discussion
5.1. Introduction
5.2. Key Insights from the Results
5.2.1. Efficacy of Multimodal Machine Learning
5.2.2. Gait Variability as an Early Predictor
5.2.3. Neuroimaging Biomarkers
5.3. Model Performance and Clinical Implications
- Personalized Prognosis: Tailoring treatment plans based on predicted disease trajectory.
- Remote Monitoring: Leveraging wearable gait sensors to monitor patients at home.
- Imaging Decision Support: Identifying at-risk patients from brain scans before significant symptoms arise.
5.4. Model Interpretability in Practice
5.5. Limitations
- Data Diversity: The dataset lacked adequate demographic diversity, limiting generalizability across populations.
- Temporal Resolution: Gait assessments were based on short-term trials, which may not fully reflect natural variability.
- MRI Accessibility: Neuroimaging is resource-intensive and may not be feasible in all clinical settings.
- Cross-sectional Bias: While some longitudinal data were included, progression labels were not always derived from long-term follow-up.
5.6. Recommendations for Future Research
- Incorporate Longitudinal Datasets: To track real-time disease evolution and model temporal dynamics.
- Expand to Multicenter Data: Ensuring population diversity and improving model robustness.
- Add More Modalities: Including speech, handwriting, and genetic data for holistic modeling.
- Deploy Models in Real-World Trials: Validating effectiveness in live clinical settings.
- Advance Federated Learning Models: To train models across institutions without compromising data privacy.
5.7. Summary
6. Conclusion and Future Work
6.1. Conclusion
- Multimodal data significantly improve prediction accuracy
- XGBoost and Gradient Boosting Regressor performed best in classification and regression tasks respectively
- Important biomarkers from both gait analysis and brain imaging align with known PD progression mechanisms
- Interpretability tools enhance clinical relevance and trust in model outputs
6.2. Contributions to Knowledge
- Validates gait and imaging biomarkers for PD progression
- Demonstrates the effectiveness of data fusion for disease forecasting
- Provides interpretable models that can be integrated into clinical practice
6.3. Recommendations for Future Research
- Real-world model deployment and feedback from clinicians
- Inclusion of underrepresented populations
- Multi-institutional collaborations using federated learning
- Use of real-time sensor data for dynamic progression monitoring
- Incorporation of therapeutic response data for personalized treatment optimization
6.4. Final Thoughts
References
- Hossain, M. D., Rahman, M. H., & Hossan, K. M. R. (2025). Artificial Intelligence in healthcare: Transformative applications, ethical challenges, and future directions in medical diagnostics and personalized medicine.
- Tayebi Arasteh, S., Lotfinia, M., Nolte, T., Sähn, M. J., Isfort, P., Kuhl, C., ... & Truhn, D. (2023). Securing collaborative medical AI by using differential privacy: Domain transfer for classification of chest radiographs. Radiology: Artificial Intelligence, 6(1), e230212. [CrossRef]
- Yoon, J., Mizrahi, M., Ghalaty, N. F., Jarvinen, T., Ravi, A. S., Brune, P., ... & Pfister, T. (2023). EHR-Safe: generating high-fidelity and privacy-preserving synthetic electronic health records. NPJ digital medicine, 6(1), 141. [CrossRef]
- Venugopal, R., Shafqat, N., Venugopal, I., Tillbury, B. M. J., Stafford, H. D., & Bourazeri, A. (2022). Privacy preserving generative adversarial networks to model electronic health records. Neural Networks, 153, 339-348. [CrossRef]
- Ahmed, T., Aziz, M. M. A., Mohammed, N., & Jiang, X. (2021, August). Privacy preserving neural networks for electronic health records de-identification. In Proceedings of the 12th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics (pp. 1-6).
- Mohammadi, M., Vejdanihemmat, M., Lotfinia, M., Rusu, M., Truhn, D., Maier, A., & Arasteh, S. T. (2025). Differential Privacy for Deep Learning in Medicine. arXiv preprint arXiv:2506.00660.
- Khalid, N., Qayyum, A., Bilal, M., Al-Fuqaha, A., & Qadir, J. (2023). Privacy-preserving artificial intelligence in healthcare: Techniques and applications. Computers in Biology and Medicine, 158, 106848. [CrossRef]
- Libbi, C. A., Trienes, J., Trieschnigg, D., & Seifert, C. (2021). Generating synthetic training data for supervised de-identification of electronic health records. Future Internet, 13(5), 136. [CrossRef]
- Manwal, M., & Purohit, K. C. (2024, November). Privacy Preservation of EHR Datasets Using Deep Learning Techniques. In 2024 International Conference on Cybernation and Computation (CYBERCOM) (pp. 25-30). IEEE.
- Yadav, N., Pandey, S., Gupta, A., Dudani, P., Gupta, S., & Rangarajan, K. (2023). Data privacy in healthcare: In the era of artificial intelligence. Indian Dermatology Online Journal, 14(6), 788-792. [CrossRef]
- de Arruda, M. S. M. S., & Herr, B. Personal Health Train: Advancing Distributed Machine Learning in Healthcare with Data Privacy and Security.
- Tian, M., Chen, B., Guo, A., Jiang, S., & Zhang, A. R. (2024). Reliable generation of privacy-preserving synthetic electronic health record time series via diffusion models. Journal of the American Medical Informatics Association, 31(11), 2529-2539. [CrossRef]
- Ghosheh, G. O., Li, J., & Zhu, T. (2024). A survey of generative adversarial networks for synthesizing structured electronic health records. ACM Computing Surveys, 56(6), 1-34. [CrossRef]
- Nowrozy, R., Ahmed, K., Kayes, A. S. M., Wang, H., & McIntosh, T. R. (2024). Privacy preservation of electronic health records in the modern era: A systematic survey. ACM Computing Surveys, 56(8), 1-37. [CrossRef]
- Williamson, S. M., & Prybutok, V. (2024). Balancing privacy and progress: a review of privacy challenges, systemic oversight, and patient perceptions in AI-driven healthcare. Applied Sciences, 14(2), 675. [CrossRef]
- Alzubi, J. A., Alzubi, O. A., Singh, A., & Ramachandran, M. (2022). Cloud-IIoT-based electronic health record privacy-preserving by CNN and blockchain-enabled federated learning. IEEE Transactions on Industrial Informatics, 19(1), 1080-1087. [CrossRef]
- Sidharth, S. (2015). Privacy-Preserving Generative AI for Secure Healthcare Synthetic Data Generation.
- Mullankandy, S., Mukherjee, S., & Ingole, B. S. (2024, December). Applications of AI in Electronic Health Records, Challenges, and Mitigation Strategies. In 2024 International Conference on Computer and Applications (ICCA) (pp. 1-7). IEEE.
- Seh, A. H., Al-Amri, J. F., Subahi, A. F., Agrawal, A., Pathak, N., Kumar, R., & Khan, R. A. (2022). An analysis of integrating machine learning in healthcare for ensuring confidentiality of the electronic records. Computer Modeling in Engineering & Sciences, 130(3), 1387-1422. [CrossRef]
- Lin, W. C., Chen, J. S., Chiang, M. F., & Hribar, M. R. (2020). Applications of artificial intelligence to electronic health record data in ophthalmology. Translational vision science & technology, 9(2), 13-13. [CrossRef]
- Ali, M., Naeem, F., Tariq, M., & Kaddoum, G. (2022). Federated learning for privacy preservation in smart healthcare systems: A comprehensive survey. IEEE journal of biomedical and health informatics, 27(2), 778-789. [CrossRef]
- Ng, J. C., Yeoh, P. S. Q., Bing, L., Wu, X., Hasikin, K., & Lai, K. W. (2024). A Privacy-Preserving Approach Using Deep Learning Models for Diabetic Retinopathy Diagnosis. IEEE Access. [CrossRef]
- Wang, Z., & Sun, J. (2022, December). PromptEHR: Conditional electronic healthcare records generation with prompt learning. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing (Vol. 2022, p. 2873).
- Agrawal, V., Kalmady, S. V., Manoj, V. M., Manthena, M. V., Sun, W., Islam, M. S., ... & Greiner, R. (2024, May). Federated Learning and Differential Privacy Techniques on Multi-hospital Population-scale Electrocardiogram Data. In Proceedings of the 2024 8th International Conference on Medical and Health Informatics (pp. 143-152).
- Adusumilli, S., Damancharla, H., & Metta, A. (2023). Enhancing Data Privacy in Healthcare Systems Using Blockchain Technology. Transactions on Latest Trends in Artificial Intelligence, 4(4).
- Tayefi, M., Ngo, P., Chomutare, T., Dalianis, H., Salvi, E., Budrionis, A., & Godtliebsen, F. (2021). Challenges and opportunities beyond structured data in analysis of electronic health records. Wiley Interdisciplinary Reviews: Computational Statistics, 13(6), e1549. [CrossRef]
- Meduri, K., Nadella, G. S., Yadulla, A. R., Kasula, V. K., Maturi, M. H., Brown, S., ... & Gonaygunta, H. (2025). Leveraging federated learning for privacy-preserving analysis of multi-institutional electronic health records in rare disease research. Journal of Economy and Technology, 3, 177-189. [CrossRef]
- Ghosheh, G., Li, J., & Zhu, T. (2022). A review of Generative Adversarial Networks for Electronic Health Records: applications, evaluation measures and data sources. arXiv preprint arXiv:2203.07018.
- Chukwunweike, J. N., Praise, A., & Bashirat, B. A. (2024). Harnessing Machine Learning for Cybersecurity: How Convolutional Neural Networks are Revolutionizing Threat Detection and Data Privacy. International Journal of Research Publication and Reviews, 5(8).
- Tekchandani, P., Bisht, A., Das, A. K., Kumar, N., Karuppiah, M., Vijayakumar, P., & Park, Y. (2024). Blockchain-Enabled Secure Collaborative Model Learning using Differential Privacy for IoT-Based Big Data Analytics. IEEE Transactions on Big Data. [CrossRef]
- Tekchandani, P., Bisht, A., Das, A. K., Kumar, N., Karuppiah, M., Vijayakumar, P., & Park, Y. (2024). Blockchain-Enabled Secure Collaborative Model Learning using Differential Privacy for IoT-Based Big Data Analytics. IEEE Transactions on Big Data.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).