Submitted:
06 August 2025
Posted:
07 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Works
| Reference | Year | Dataset | Age | Data type | Model | Accuracy [%] |
|---|---|---|---|---|---|---|
| Ref.[1] | 2019 | Pediatric | Children | Signal | CNN | 96 |
| Ref.[2] | 2019 | Mendelay | Children | Audio | SVM | 94.2 |
| Ref.[7] | 2024 | CirCor | Adult | PCG | CRNN | 99.7 |
| Ref.[8] | 2023 | PASCAL | Adult | PCG | RNN | 90 |
| Ref.[10] | 2023 | CirCor | Adult | PCG | CNN/LSTM | 99 |
| Ref.[11] | 2012 | Proposed | Adult | PCG | Diagnose | 90 |
| Ref.[14] | 2010 | - | Adult | ECG | MLP | 93 |
| Ref.[17] | 2023 | CirCor | Adult | PCG | CNN | 91 |
| Ref.[18] | 2006 | Proposed | Children | ECG | Analysis | 93 |
| Ref.[20] | 2015 | Proposed | Adult | PCG | KNN | 93.3 |
| Ref.[21] | 2021 | ECG | Adult | ECG | HMM | 99 |
| Ref.[19] | 2001 | Proposed | Children | Signal | ANN | 100 |
| Ref.[22] | 2020 | PhysioNet | Adult | ECG/PCG | HSMM | 96 |
| Ref.[24] | 2024 | Proposed | Adult | Audio | Pre-traind | 58.0 |
| Ref.[15] | 2020 | Not specified | Adult | ECG | CNN | 97 |
| Ref.[13] | 2017 | PhysioNet | Adult | PCG | CNN | 83 |
| Ref.[25] | 2020 | PhysioNet | Adult | PCG | CNN/MLP | 98 |
| Ref.[26] | 2023 | PhysioNet | Adult | PCG | CNN | 71 |
3. Methods
3.1. Dataset
3.2. Architecture
3.3. Feature Extraction
3.4. Models
3.4.1. Model 1 for 1D Representations
3.4.2. Model 2 for 2D Representations
3.4.3. Model 3 for 1D and 2D Representations
3.5. Model Evaluation
- TP (true positives): the number of patients correctly identified as patients.
- FN (false negatives): the patients incorrectly classified as healthy.
- FP (false positives): healthy individuals misclassified as patients.
- TN (true negatives): healthy subjects accurately classified as healthy.
4. Results
4.1. Results of Performance
4.2. Results of Cost for Training
5. Discussion
6. Limitations
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| MFCC | Mel Frequency Cepstral Coefficients |
| ECG | Electrocardiograms |
| PCG | Phonocardiograms |
| CHD | Congenital Heart Disease |
| LSTM | Long Short-Term Memory |
| CNN | Convolutional Neural Networ |
| FFT | Fast Fourier Transform |
| DWT | Discrete Wavelet Transform |
| STFT | Short-Term Fourier Transform |
| CVD | Cardiovascular diseases |
| MAcc | Mean Accuracy |
References
- Xiao, B; Xiuli Bi, Y. X.; Li, W.; Ma, Z.; Zhang, J.; Ma, X. Follow the Sound of Children’s Heart: A Deep-Learning-Based Computer-Aided Pediatric CHDs Diagnosis System. IEEE Internet of Things Journal 2020, 7(3), 1994–2004. [CrossRef]
- Islam, R.; Hassan, M.; Raihan, M.; Datto, S. K.; Shahriar, A.; More, A. A Wireless Electronic Stethoscope to Classify Children Heart Sound Abnormalities. 22nd International Conference on Computer and Information Technology (ICCIT) 2019, 18–20.
- Tao, Z.; Ren, Z.; Yang, X.; Liang, Y.; Shi, X. 2D ViT and 1D CRNN-based Heart Sound Signals Detection Model. IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2023, 05–08.
- Lu, K.; Zhao, F.; Gu, P.; Wang, H.; Zang, T.; Wang, H. TetraCVD: A Temporal-Textual Transformer based Model for Cardiovascular Disease Diagnosis. IEEE International Conference on Bioinformatics and Biomedicine 2023, 2129–2132.
- Wang, R.; Duan, Y.; Li, Y.; Zheng, D.; Liu, X.; Lam, C. T.; Tan, T. PCTMF-Net: heart sound classification with parallel CNNs-transformer and second-order spectral analysis. Journal The Visual Computer,Springer 2023, 39(8), 3811–-3822. [CrossRef]
- Ren, Z.; Qiao, Y.; Yuan, Y.; Zhou, Y.; Liang, Y.; & Shi, X. Time and time-frequency features integrated cnn model for heart sound signals detection. IEEE International Conference on Bioinformatics and Biomedicine (BIBM). 2022, 1138–1143.
- Radha, K.; Bansal, M.; Sharma, R. Raw Waveform-Based Custom Scalogram CRNN in Cardiac Abnormality Diagnosis IEEE Access 2024, 12, 13986–14004.
- Habijan, M.; Galić, I.; Pizurica, A. Heart Sound Classification using Deep Learning. IEEE 8th International Conference on Smart and Sustainable Technologies (SpliTech) 2023, 1-–6.
- Islam, Md Riadul; rakib , mahadi hassan ; Raihan, M. Children Heart Sound- Normal & Abnormal. Mendeley Data 2023, v1.
- Nguyen, M. T.; Lin, W. W.; Huang, J. H. Heart sound classification using deep learning techniques based on log-mel spectrogram. Journal Circuits, Systems, and Signal Processing 2023, 42(1), 344–360. [CrossRef]
- Emmanuel, B. S.A Review of signal processing techniques for heart sound analysis in clinical diagnosis. Journal of medical engineering & technology 2012, 36(6), 303–307. [CrossRef]
- Liu, J.; Wang, H.; Yang, Z.; Quan, J.; Liu, L.; Tian, J. Deep learning-based computer-aided heart sound analysis in children with left-to-right shunt congenital heart disease.International Journal of Cardiology 2022,348, 58–64. [CrossRef]
- Rubin, J.; Abreu, R.; Ganguli, A., Nelaturi; S., Matei, I.; Sricharan, K. Recognizing abnormal heart sounds using deep learning. journal arXiv preprint arXiv:1707 2017, 04642.
- Sepehri, A. A.; Gharehbaghi, A.; Dutoit, T.; Kocharian, A.;Kiani, A. A. Novel method for pediatric heart sound segmentation without using the ECG. Computer methods and programs in biomedicine 2010, 99(1), 43–48. [CrossRef]
- Li, Y.; Pang, Y.; Wang, K.; Li, X. Toward improving ECG biometric identification using cascaded convolutional neural networks. journal Neurocomputing, Elsevier 2020, 391, 83–95. [CrossRef]
- Zhao, Q.; Geng, S.; Wang, B.; Sun, Y.; Nie, W.; Bai, B.; Yu, C.; Zhang, F.; Tang, G.; Zhang, D.; Zhou, Y.; Liu, J.; Hong, S. Deep Learning for Heart Sound Analysis: A Literature Review. Journal medRxiv, Cold Spring Harbor Laboratory Press) 2023, 09.
- Chen, J.; Guo, Z.,;Xu, X.; Zhang, L. B.; Teng, Y.; Chen, Y.; More; Wang, W. A robust deep learning framework based on spectrograms for heart sound classification .IEEE/ACM Transactions on Computational Biology and Bioinformatics 2023, 21(4), 936–947. [CrossRef]
- Finley, J. P.; Warren, A. E.; Sharratt, G. P.; Amit, M. Assessing children’s heart sounds at a distance with digital recordings. journal Pediatrics,American Academy of Pediatrics) 2006, 118(6), 2322–2325. [CrossRef]
- DeGroff, C. G.; Bhatikar, S.; Hertzberg, J.; Shandas, R.; Valdes-Cruz L.; Mahajan, R. L.Artificial neural network–based method of screening heart murmurs in children. journal Circulation, Am Heart Assoc 2023, 103(22), 2711–2716. [CrossRef]
- Randhawa, S. K.;Singh, M. Classification of heart sound signals using multi-modal features .journal Procedia Computer Science,Elsevier 2023, 58, 165–171. [CrossRef]
- Altuve, M.;Monroy, N. F. Hidden Markov model-based heartbeat detector using electrocardiogram and arterial pressure signals .Biomedical Engineering Letters, Springer 2021, 11(3), 249–261. [CrossRef]
- Shukla, S.; Singh, S. K.; Mitra, D. An efficient heart sound segmenta tion approach using kurtosis and zero frequency filter features. Biomedical Signal Processing and Control 2020, 57, 101762. [CrossRef]
- Sound, M. H. Murmur Library. University of Michigan Heart Sound and Murmur Library 2012.
- Qian, K.; Bao, Z.; Zhao, Z.; Koike, T.; Dong, F.; Schmitt, M.;More; Yamamoto, Y. Learning Representations from Heart Sound: A Comparative Study on Shallow and Deep Models. Cyborg and Bionic Systems, AAAS 2024, 5, 0075. [CrossRef]
- Dissanayake, T.; Fernando, T.; Denman, S.; Sridharan, S.; Ghaemmaghami, H.; & Fookes, C. Understanding the importance of heart sound segmentation for heart anomaly detection. Clinton, journal arXiv preprint arXiv:2005 2020, 10480.
- Chen, W.; Zhou, Z.; Bao, J.; Wang, C.; Chen, H.; Xu, C.; Wu, H. Classifying heart-sound signals based on cnn trained on melspectrum and log-melspectrum features. Journal Bioengineering 2023, 10(6), 645. [CrossRef]
- Liu, C.; Springer, D.; Li, Q.; Moody, B.; Juan, R. A.; Chorro, F. J.; Castells, F.; Roig, J. M.; Silva, I.; Johnson, A.; Syed, Z.; Schmidt, S. E.; Papadanill, C. D.; Hadjiileontiadis, L.; Naseri, H.; Moukadem, A.; Dieterlen, A.; Brandt, C.; Tang, H.; Samieinasab, M.; Samieinasab, M. R.; Sameni, R.; Mark, R. G.; Clifford, G. D. An open access database for the evaluation of heart sound algorithms.journal Physiological measurement 2016, 37(12), 2181. [CrossRef]
- PhysioBank, P.Physionet: components of a new research resource for complex physiologic signals. journal Circulation 2000, 101(23), e215–e220.
- Chakir, F.; Jilbab, A.; Nacir, C.; Hammouch, A. Phonocardiogram signals processing approach for PASCAL classifying heart sounds challenge. Signal, Image and Video Processing,Springer 2018, 12(6), 1149–1155. [CrossRef]




| Representation | Models | Dataset | Data type | Accuracy [%] |
|---|---|---|---|---|
| 1D representations | Ref.[8] LSTM, RNN | PASCAL | Audio | 90.0 |
| Ref.[24] LSTM | Proposed | Audio | 38.0 | |
| Ref.[2] SVM | Mendelay | Audio | 94.1 | |
| 2D representations | Ref.[3] 2D Vit1D CRNN | PhysioNet | Audio | 97.3 |
| Ref.[24] ResNet 50 | Proposed | Audio | 56.0 | |
| Ref.[5] PCTMF-Net | PhysioNet | Audio | 93.0 | |
| Our proposed models | Model 1 for 1D representations | Mendelay | Audio | 66.7 |
| Model 2 for 2D representations | Mendelay | Audio | 91.7 | |
| Model 3 for 1D and 2D representations | Mendelay | Audio | 98.9 |
| Feature Extractor | Precision | F1-score | Test Accuracy [%] |
|---|---|---|---|
| MFCC | (Ab)0.88 (N)0.80 | (Ab)0.82 (N)0.84 | 0.83 |
| Wavelet | (Ab)0.75 (N)0.70 | (Ab)0.71 (N)0.74 | 0.72 |
| STFT | (Ab)0.60 (N)0.62 | (Ab)0.63 (N)0.59 | 0.61 |
| Models | Acc | Sp | Se | MAcc | Number of Parameters |
|---|---|---|---|---|---|
| Ref.[3] 2D Vit-1D CRNN | 0.9733 | 0.9731 | 0.9735 | 0.9733 | - |
| Ref.[6] TTFI-CNN | 0.9715 | 0.9713 | 0.9717 | 0.9715 | - |
| Our Model 1 | 0.6666 | 0.5000 | 0.5000 | 0.5000 | 331,010 |
| Our Model 2 | 0.9167 | 0.8333 | 1.000 | 0.9167 | 9,914,309 |
| Our Model 3 | 0.9891 | 0.9894 | 0.9894 | 0.9894 | 3,346,370 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).