Submitted:
03 February 2025
Posted:
04 February 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Methodology
3.1. Dataset Description
3.2. Data Preprocessing
3.3. Feature Selection
3.4. Model Development
3.5. Model Evaluation
- Accuracy: The ratio of correctly predicted observations to the total observations.
- Precision: The ability of the model to identify only relevant instances of Parkinson’s disease.
- Recall: The ability of the model to identify all relevant instances of Parkinson’s disease.
- F1-score: The harmonic mean of precision and recall.
3.6. Experimental Setup
4. Experiments and Results


5. Discussion
6. Future Work
- Advanced Model Architectures: Investigating more intricate models like Random Forest, Support Vector Machines (SVM), and Neural Networks may enhance the model’s recall and general accuracy.
- Feature Engineering and Selection: Using feature selection approaches and looking at more acoustic characteristics and dynamic parameters could result in more informative datasets for improved model performance.
- Handling Class Imbalance: Methods like class-weighted loss functions or the Synthetic Minority Oversampling Technique (SMOTE) may help increase the model’s sensitivity to the underrepresented “healthy” class.
- Real-Time Detection Systems: Creating deployable, real-time solutions that are connected with online or mobile applications can give doctors easy access to diagnostic resources.
- Cross-Dataset Validation: The generalizability of the model will be ensured by validating its performance on larger and diverse datasets from various populations.
- Multimodal Data Integration: MRI scans and patient histories are examples of extra data modalities that can be incorporated to improve predicted accuracy and offer a thorough diagnostic approach.
- Explainability and Interpretability: Increased confidence in clinical contexts can be achieved by creating explainable AI strategies that offer interpretable insights into the model’s decision-making process.
7. Conclusion
References
- M. A. Little, P. E. McSharry, E. J. Hunter, J. Spielman, and L. O. Ramig, “Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease,” IEEE Transactions on Biomedical Engineering, vol. 56, no. 4, pp. 1015-1022, 2009.
- Tsanas, M. A. Little, P. E. McSharry, and L. O. Ramig, “Accurate telemonitoring of Parkinson’s disease progression by non-invasive speech tests,” IEEE Transactions on Biomedical Engineering, vol. 59, no. 2, pp. 1264-1271, 2012.
- A. Bhattacharyya, S. K. Bandyopadhyay, and S. K. Pal, “Application of machine learning for predictive diagnostics of Parkinson’s disease,” Journal of Medical Systems, vol. 41, no. 12, p. 195, 2017.
- J. R. Orozco-Arroyave, F. Hönig, K. Daqrouq, and E. Nöth, “Automatic detection of Parkinson’s disease in running speech spoken in three different languages,” Proceedings of the Annual Conference of the International Speech Communication Association, 2016.
- S. Ali, K. Patel, and R. Kumar, “A lightweight approach for Parkinson’s disease detection using logistic regression,” International Journal of Healthcare Informatics, vol. 23, no. 1, pp. 45-52, 2021.
- T. Giancardo, “Parkinson’s disease speech dataset,” UCI Machine Learning Repository, 2013.
- J. Han and M. Kamber, “Data normalization techniques for machine learning,” Data Mining: Concepts and Techniques, 3rd ed., pp. 110-114, 2011.
- Guyon and, A. Elisseeff, “An introduction to variable and feature selection,” Journal of Machine Learning Research, vol. 3, pp. 1157-1182, 2003.
- F. Pedregosa et al., “Scikit-learn: Machine learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825-2830, 2011.
- S. M. Powers, “Evaluation: From precision, recall, and F-measure to ROC, informedness, markedness & correlation,” Journal of Machine Learning Metrics, vol. 4, no. 1, pp. 37-63, 2011.
- A. Bengio, “Practical considerations for cross-validation in model evaluation,” Proceedings of the Conference on Neural Information Processing Systems, 2005.
| Class | Precision | Recall | F1-Score | Support |
| 0 (Healthy) | 0.5 | 0.88 | 0.64 | 8 |
| 1 (Parkinson’s) | 0.96 | 0.77 | 0.86 | |
| Accuracy | 0.79 | 0.39 | ||
| Macro Avg | 0.73 | 0.82 | 0.75 | 39 |
| Weighted Avg | 0.87 | 0.79 | 0.81 | 39 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).