Submitted:
09 April 2025
Posted:
09 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Dataset Preparation
2.2. Acoustic Signal Features
2.2.1. Long Term Features:
2.2.2. Pitch Period Entropy (PPE):
2.3. Classification Algorithms
2.2.3. Naive Bayes (NB):
2.2.4. Decision Tree
2.2.5. K-Nearest Neighbor (KNN Classifier):
2.2.6. Support Vector Machine (SVM):
2.2.7. Artificial Neural Network (ANN):
2.2.8. Cross Validation
2.2.9. Evaluation Criteria
3. Results

4. Discussion
5. Limitation
6. Conclusion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| PD | Parkinson disease |
| HS | Directory of open access journals |
| CPPS | smoothed cepstral peak prominence |
| PPE | Pitch Period Entropy |
| RPDE | recurrence period density entropy |
| MFCCs | Mel-frequency cepstral coefficients |
| RF | Random Forest |
| KNN | k-nearest neighbors |
| ML | Machine Learning |
| NB | Naïve Bayes |
| SVM | Support vector machines |
| LR | Logistic Regression |
| ANN | Artificial Neural Networks m |
| ROC | Receiver-operating characteristic curve |
| AUC | Area under the curve |
| RBD | REM sleep behavior disorder |
| REM | Rapid Eye Movement |
| Eq | equation |
| TP | True Positive |
| FP | False Positive |
| TN | True Negative |
| FN | False Negative |
| MSE | Mean Squared Error |
| CV | Cross Validation |
| UPDRS | Unified Parkinson’s Disease Rating Scale |
| PIGD | Postural Instability and Gait Disorders |
References
- Yang, K., Wu, Z., Long, J., Li, W., Wang, X., Hu, N., Zhao, X., Sun, T.: White matter changes in Parkinson’s disease. NPJ Parkinsons Dis. 9, 1–10 (2023). [CrossRef]
- Dorsey, E.R., Sherer, T., Okun, M.S., Bloemd, B.R.: The emerging evidence of the Parkinson pandemic. J Parkinsons Dis. 8, S3–S8 (2018). [CrossRef]
- Chaudhuri, K.R., Azulay, J.P., Odin, P., Lindvall, S., Domingos, J., Alobaidi, A., Kandukuri, P.L., Chaudhari, V.S., Parra, J.C., Yamazaki, T., Oddsdottir, J., Wright, J., Martinez-Martin, P.: Economic Burden of Parkinson’s Disease: A Multinational, Real-World, Cost-of-Illness Study. Drugs Real World Outcomes. 11, 1–11 (2024). [CrossRef]
- Mallamaci, R., Musarò, D., Greco, M., Caponio, A., Castellani, S., Munir, A., Guerra, L., Damato, M., Fracchiolla, G., Coppola, C., Cardone, R.A., Rashidi, M., Tardugno, R., Sergio, S., Trapani, A., Maffia, M.: Dopamine- and Grape-Seed-Extract-Loaded Solid Lipid Nanoparticles: Interaction Studies between Particles and Differentiated SH-SY5Y Neuronal Cell Model of Parkinson’s Disease. Molecules. 29, (2024). [CrossRef]
- Poewe, W., Seppi, K., Tanner, C.M., Halliday, G.M., Brundin, P., Volkmann, J., Schrag, A.E., Lang, A.E.: Parkinson disease. Nat Rev Dis Primers. 3, 1–21 (2017). [CrossRef]
- Magrinelli, F., Picelli, A., Tocco, P., Federico, A., Roncari, L., Smania, N., Zanette, G., Tamburin, S.: Pathophysiology of Motor Dysfunction in Parkinson’s Disease as the Rationale for Drug Treatment and Rehabilitation. Parkinsons Dis. 2016, (2016). [CrossRef]
- Mantri, S., Morley, J.F.: Prodromal and early Parkinson’s disease diagnosis. Pract Neurol. 35, 28–31 (2018).
- Brabenec, L., Mekyska, J., Galaz, Z., Rektorova, I.: Speech disorders in Parkinson’s disease: early diagnostics and effects of medication and brain stimulation. J Neural Transm. 124, 303–334 (2017). [CrossRef]
- Ansari, K.A., Johnson, A.: OLFACTORY FUNCTION IN PATIENTS WITH PARKINSON’S DISEASE. Pergamon Press (1975).
- Ramig, L.O., Fox, C., Sapir, S.: Speech treatment for Parkinson’s disease, (2008).
- Goetz, C.G., Poewe, W., Rascol, O., Sampaio, C., Stebbins, G.T., Counsell, C., Giladi, N., Holloway, R.G., Moore, C.G., Wenning, G.K., Yahr, M.D., Seidl, L.: Movement Disorder Society Task Force report on the Hoehn and Yahr staging scale: Status and recommendations. Movement Disorders. 19, 1020–1028 (2004). [CrossRef]
- Weil, R.S., Morris, H.R.: REM sleep behaviour disorder: An early window for prevention in neurodegeneration?, (2019).
- Doty, R.L.: Olfactory dysfunction in neurodegenerative diseases: is there a common pathological substrate?, (2017).
- AnnalesD997Jokinen.
- O’Sullivan, S.B., Schmitz, T.J.: Physical FIFTH EDITION.
- Ahn, S., Springer, K., Gibson, J.S.: Social withdrawal in Parkinson’s disease: A scoping review. Geriatr Nurs (Minneap). 48, 251–261 (2022). [CrossRef]
- Ma, A., Lau, K.K., Thyagarajan, D.: Voice changes in Parkinson’s disease: What are they telling us? Journal of Clinical Neuroscience. 72, 1–7 (2020). [CrossRef]
- Harel, B., Cannizzaro, M., Snyder, P.J.: Variability in fundamental frequency during speech in prodromal and incipient Parkinson’s disease: A longitudinal case study. Brain Cogn. 56, 24–29 (2004). [CrossRef]
- Rusz, J., Krupička, R., Vítečková, S., Tykalová, T., Novotný, M., Novák, J., Dušek, P., Růžička, E.: Speech and gait abnormalities in motor subtypes of de-novo Parkinson’s disease. CNS Neurosci Ther. 29, 2101–2110 (2023). [CrossRef]
- Skorvanek, M., Martinez-Martin, P., Kovacs, N., Rodriguez-Violante, M., Corvol, J.C., Taba, P., Seppi, K., Levin, O., Schrag, A., Foltynie, T., Alvarez-Sanchez, M., Arakaki, T., Aschermann, Z., Aviles-Olmos, I., Benchetrit, E., Benoit, C., Bergareche-Yarza, A., Cervantes-Arriaga, A., Chade, A., Cormier, F., Datieva, V., Gallagher, D.A., Garretto, N., Gdovinova, Z., Gershanik, O., Grofik, M., Han, V., Huang, J., Kadastik-Eerme, L., Kurtis, M.M., Mangone, G., Martinez-Castrillo, J.C., Mendoza-Rodriguez, A., Minar, M., Moore, H.P., Muldmaa, M., Mueller, C., Pinter, B., Poewe, W., Rallmann, K., Reiter, E., Rodriguez-Blazquez, C., Singer, C., Tilley, B.C., Valkovic, P., Goetz, C.G., Stebbins, G.T.: Differences in MDS-UPDRS Scores Based on Hoehn and Yahr Stage and Disease Duration. Mov Disord Clin Pract. 4, 536–544 (2017). [CrossRef]
- Naranjo, L., Pérez, C.J., Martín, J., Campos-Roca, Y.: A two-stage variable selection and classification approach for Parkinson’s disease detection by using voice recording replications. Comput Methods Programs Biomed. 142, 147–156 (2017). [CrossRef]
- Di Cesare, M.G., Perpetuini, D., Cardone, D., Merla, A.: Assessment of Voice Disorders Using Machine Learning and Vocal Analysis of Voice Samples Recorded through Smartphones. BioMedInformatics. 4, 549–565 (2024). [CrossRef]
- Géron, A.: Hands-On Machine Learning with. O’Reilly Media.
- Zewoudie, A.W., Luque, J., Hernando, J.: The use of long-term features for GMM- and i-vector-based speaker diarization systems. EURASIP J Audio Speech Music Process. 2018, (2018). [CrossRef]
- Little, M., McSharry, P., Hunter, E., Spielman, J., Ramig, L.: Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. Nature Precedings. 1 (2008).
- Jeancolas, L., Benali, H., Benkelfat, B.-E., Mangone, G., Corvol, J.-C., Vidailhet, M., Lehericy, S., Petrovska-Delacrétaz, D.: Automatic detection of early stages of Parkinson’s disease through acoustic voice analysis with mel-frequency cepstral coefficients. In: 2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP). pp. 1–6 (2017).
- Convey, R.B., Laukkanen, A.M., Ylinen, S., Penttilä, N.: Analysis of Voice in Parkinson’s Disease Utilizing the Acoustic Voice Quality Index. Journal of Voice. 1–10 (2024). [CrossRef]
- Hawi, S., Alhozami, J., AlQahtani, R., AlSafran, D., Alqarni, M., Sahmarany, L. El: Automatic Parkinson’s disease detection based on the combination of long-term acoustic features and Mel frequency cepstral coefficients (MFCC). Biomed Signal Process Control. 78, 104013 (2022). [CrossRef]
- Tracey, B., Volfson, D., Glass, J., Haulcy, R., Kostrzebski, M., Adams, J., Kangarloo, T., Brodtmann, A., Dorsey, E.R., Vogel, A.: Towards interpretable speech biomarkers: exploring MFCCs. Sci Rep. 13, (2023). [CrossRef]
- Kotsiantis, S.B., Zaharakis, I.D., Pintelas, P.E.: Machine learning: A review of classification and combining techniques. Artif Intell Rev. 26, 159–190 (2006). [CrossRef]
- Berrar, D.: Cross-validation. Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics. 1–3, 542–545 (2018). [CrossRef]
- Dao, S.V.T., Yu, Z., Tran, L. V., Phan, P.N.K., Huynh, T.T.M., Le, T.M.: An Analysis of Vocal Features for Parkinson’s Disease Classification Using Evolutionary Algorithms. Diagnostics. 12, (2022). [CrossRef]
- Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. (2020). [CrossRef]
- Fawcett, T.: An introduction to ROC analysis. Pattern Recognit Lett. 27, 861–874 (2006). [CrossRef]
- Christen, P., Hand, D.J., Kirielle, N.: A review of the F-measure: its history, properties, criticism, and alternatives. ACM Comput Surv. 56, 1–24 (2023).
- James, M., Hastie, P., Taylor, B.: First Printing : July 5 , 2023. (2023).
- Prior, F., Virmani, T., Iyer, A., Larson-Prior, L., Kemp, A., Rahmatallah, Y., Pillai, L., Glover, A.: Voice Samples for Patients with Parkinson’s Disease and Healthy Controls. https://figshare.com/articles/dataset/Voice_Samples_for_Patients_with_Parkinson_s_Disease_and_Healthy_Controls/23849127, (2023).
- Iyer, A., Kemp, A., Rahmatallah, Y., Pillai, L., Glover, A., Prior, F., Larson-Prior, L., Virmani, T.: A machine learning method to process voice samples for identification of Parkinson’s disease. Sci Rep. 13, (2023). [CrossRef]
- Dejonckere, P.H., Bradley, P., Clemente, P., Cornut, G., Friedrich, G., Heyning, P. Van De: A basic protocol for functional assessment of voice pathology , especially for investigating the efficacy of ( phonosurgical ) treatments and evaluating new assessment techniques Guideline elaborated by the Committee on Phoniatrics. (2001). [CrossRef]
- Boersma, P., van Heuven, V.: Speak and unSpeak with Praat. Glot International. 5, 341–347 (2001).
- Gorriz, J.M., Segovia, F., Ramirez, J., Ortiz, A., Suckling, J.: Is K-fold cross validation the best model selection method for Machine Learning? (2024).
- Oganian, Y., Bhaya-Grossman, I., Johnson, K., Chang, E.F.: Vowel and formant representation in the human auditory speech cortex. Neuron. 111, 2105—-2118.e4 (2023). [CrossRef]
- Little, M.A., McSharry, P.E., Roberts, S.J., Costello, D.A.E., Moroz, I.M.: Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection. Biomed Eng Online. 6, (2007). [CrossRef]
- Davis, S., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans Acoust. 28, 357–366 (1980).
- Tsanas, A., Little, M.A., McSharry, P.E., Spielman, J., Ramig, L.O.: Novel speech signal processing algorithms for high-accuracy classification of Parkinson’s disease. IEEE Trans Biomed Eng. 59, 1264–1271 (2012).
- Jin, Z., Shang, J., Zhu, Q., Ling, C., Xie, W., Qiang, B.: RFRSF: Employee Turnover Prediction Based on Random Forests and Survival Analysis. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 12343 LNCS, 503–515 (2020). [CrossRef]
- Peretz, O., Koren, M., Koren, O.: Naive Bayes classifier – An ensemble procedure for recall and precision enrichment. Eng Appl Artif Intell. 136, (2024). [CrossRef]
- Alalayah, K.M., Senan, E.M., Atlam, H.F., Ahmed, I.A., Shatnawi, H.S.A.: Automatic and Early Detection of Parkinson’s Disease by Analyzing Acoustic Signals Using Classification Algorithms Based on Recursive Feature Elimination Method. Diagnostics. 13, (2023). [CrossRef]
- Evgeniou, T., Pontil, M.: Support vector machines: Theory and applications. In: Advanced course on artificial intelligence. pp. 249–257. Springer (1999).
- Grossi, E., Buscema, M.: Introduction to artificial neural networks. Eur J Gastroenterol Hepatol. 19, 1046–1054 (2007). [CrossRef]
- Vakili, M., Ghamsari, M., Rezaei, M.: Performance Analysis and Comparison of Machine and Deep Learning Algorithms for IoT Data Classification. (2020).
- Breiman, L.: Random Forests. (2001).
- Das, R.: A comparison of multiple classification methods for diagnosis of Parkinson disease. Expert Syst Appl. 37, 1568–1572 (2010). [CrossRef]
- Wroge, T.J., Yasin¨ Yasin¨ozkanca, Y., Demiroglu, C., Si, D., Atkins, D.C., Ghomi, R.H.: Parkinson’s Disease Diagnosis Using Machine Learning and Voice; Parkinson’s Disease Diagnosis Using Machine Learning and Voice. (2018).
- Suppa, A., Costantini, G., Asci, F., Di Leo, P., Al-Wardat, M.S., Di Lazzaro, G., Scalise, S., Pisani, A., Saggio, G.: Voice in Parkinson’s Disease: A Machine Learning Study. Front Neurol. 13, (2022). [CrossRef]
- Wright, H., Postema, M., Aharonson, V.: Towards a voice-based severity scale for Parkinson’s disease monitoring. 10, (2024).
- Hossain, M.A., Amenta, F.: Machine Learning-Based Classification of Parkinson’s Disease Patients Using Speech Biomarkers. J Parkinsons Dis. 14, 95–109 (2024). [CrossRef]



| Features |
Number of Features | Description |
| Jitter | 2 | Measures variability in vocal fold vibration frequency |
| Shimmer | 5 | Measures amplitude fluctuations in vocal cycles |
| NHR | 1 | Noise-to-harmonics ratio |
| HNR | 1 | Harmonics-to-noise ratio |
| Pitch | 1 | Fundamental frequency of vocal fold vibration |
| Intensity | 1 | Overall loudness of the voice |
| Formant | 4 | Resonant frequencies of the vocal tract |
| CPPS | 1 | Measures prominence of spectral peaks |
| PPE | 1 | Measures irregularity in speech pitch to distinguish between natural variations and pathological speech |
| RPDE | 1 | Measures regularity of voice signal |
| MFCC | 12 | Represents spectral envelope of the signal, useful for voice quality analysis |
| Metrics |
Formula |
Description |
|---|---|---|
| Accuracy |
|
Proportion of correctly classified instances |
| Recall |
Proportion of actual positives correctly identified | |
| Precision |
Proportion of predicted positives that are actually positive | |
| F1-Score |
Harmonic means precision and recall | |
| ROC-AUC |
- | Area under the receiver operating characteristic curve. |
| MSE |
Mean squared error between predicted and actual values |
| Resource | Details |
|---|---|
| CPU | i5 Gen6 |
| RAM | 12.67 GB |
| GPU | 4 GB Tesla T4, 15360 MiB |
| Software | Python3. 10.12 and 3.12.8 |
| Models | MSE-Average (5-fold CV) ± SD |
| Random Forest (RF) | 0.17 ± 0.07 |
| Logistic Regression (LR) | 0.27 ± 0.09 |
| Naive Bayes (NB) | 0.26 ± 0.15 |
| Decision Tree (DT) | 0.24 ± 0.11 |
| K-Nearest Neighbors (KNN) | 0.36 ± 0.10 |
| Support Vector Machine (SVM) | 0.24 ± 0.06 |
| Artificial Neural Network (ANN) | 0.50 ± 0.01 |
| Algorithm | Accuracy | Recall | Precision | F1-Score | ROC-AUC |
|---|---|---|---|---|---|
| Random Forest (RF) | 0.8272 ±0.10 | 0.7500±0.15 | 0.8257±0.14 | 0.8251±0.1 | 0.8965±0.07 |
| Logistic Regression (LR) | 0.7529±0.06 | 0.7750±0.09 | 0.7467±0.12 | 0.7487±0.07 | 0.8132±0.09 |
| Support Vector Machine (SVM) | 0.7529±0.06 | 0.7750±0.04 | 0.7529±0.09 | 0.7487±0.04 | 0.8263±0.09 |
| Naive Bayes (NB) | 0.7397±0.15 | 0.8250±0.18 | 0.7312 ±0.16 | 0.7578±0.14 | 0.8181±0.13 |
| Decision Tree (DT) | 0.6801±0.16 | 0.7750±0.09 | 0.7871±0.16 | 0.6589±0.09 | 0.8071±0.11 |
| K-Nearest Neighbors (KNN) | 0.6301±0.1 | 0.7000±0.15 | 0.6222±0.11 | 0.6493±0.09 | 0.6760±0.06 |
| Artificial Neural Network (ANN) | 0.5058±0.01 | 0.2000±0.40 | 0.0941±0.18 | 0.0000±0 | 0.5267±0.08 |
| Study | Featured used | Machine Learning Models | Best performance |
|---|---|---|---|
| Our research |
Long-term features, short-term features, PPE, RPDE | RF, SVM, NB |
89.65%(ROC-AUC), 82.63%(ROC-AUC) 82.50%(Recall) |
| Fred Prior (30) |
long term and short-term features | RF, LR, CNN |
78%(AUC), 78%(AUC), 97%(AUC) |
| Max little (17) |
Long-term features, non-standard measurement | SVM |
90.4%(accuracy) |
| Wroge (44) |
GeMaps features, AVEC features |
Gradient Boosted Decision Tree ANN |
82% (accuracy),65%(recall) 86%(accuracy),82%(recall) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).