Submitted:
03 June 2025
Posted:
05 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Research
3. Conceptual Design of ST Elevation Sonification
3.1. Electrocardiography

3.2. Feature Extraction
3.3. Sonification Designs
3.3.1. Sonification Concept 1: Heartbeat-Locked ST Elevation Arpeggio
- a.
- with diatonic sequence [0, 2, 4, 5, 7, 9, 12, 14, 16, 17, 19, 21], i.e., major scale, precordial leads transposed by one octave (i.e. 12 semitones offset): Sound examples1 S1.1a.wav for the 6 STEMI conditions [weak anterior, moderate anterior, severe anterior, weak inferior, moderate inferior, severe inferior], 3 heartbeats for each condition.
- b.
- with pentatonic scale [0, 2, 4, 7, 9, 12, 14, 16, 19, 21, 24, 26]: Sound examples S1.1b.wav for the 6 STEMI conditions as in the previous item.
3.3.2. Sonification Concept 2: Musical Phrase Melody over Several Heartbeats
3.3.3. Sonification Concept 3: Grouped Lead Scans
- on each heartbeat play the QRS tone
- every kth heartbeat (e.g. ) play a sequence of 6 notes for limb leads in order aVR, III, aVF, II, I, aVL, via inferior to lateral, with pitch encoding ST level, relative to the pitch of the QRS tone. In addition, outside a ‘healthy’ range use larger sharpness, or modulation
- on the heartbeat (d = 1, 2, or 3 adjustable) play 6 notes for the precordial leads V1...V6, i.e. from septal via anterior to lateral, using the same mappings as for limb leads, in addition (if necessary) another timbre in case users can’t distinguish the order of the groups.
- continuous mapping: the ST elevations are mapped linearly to pitch, corresponding to an exponential mapping to frequency. A suitable source interval is V, a suitable target interval is a musical fifth, i.e. 7 semitones on the chromatic scale down and up, i.e. , cf. Section 4 for details on the implementation in Python.
- chromatic mapping: starting from the above-described continuous mapping here we round (i.e., discretize) the pitch value (represented in MIDI notes as float) to integer values, resulting in pitches as they occur on the chromatic scale, i.e. tones available on the piano keyboard (though we don’t need to use any specific musical instrument)
- diatonic mapping: here we choose all tones of two octaves (excluding the octave below and above the center tone) and map the source interval V to the index in this array of notes, thus we obtain a musical interval (when relating the tone to the QRS pitch) which we are highly familiar with from Western music and which are rather easy to recognize: tonic, major second, major third, forth, fifth, major sixth and major seventh.
- 5-state mapping: here we discretize the ST elevation into 5 levels: severe suppression, weak suppression, IE, weak elevation, severe elevation) using two thresholds (, ) and the sign of the value for group selection. As pitch mapping we experimented with a harmonic choice, i.e. the tones should fit to a major chord, choosing the tones e, g, c’, e’, g’ from negative to positive, or [-9, -5, 0, 4, 7] as semitone offset to the pitch of the QRS tone.
- 5-state-dissonant mapping: this is a variation of the previous 5-state mapping, but here we chose the extrema of the pitches to be dissonant to elicit a clear feeling of discomfort, i.e. such sounds should draw more attention as they are more disturbing or unpleasant (when being heard together with the QRS tone). Specifically we tried a major 7 down and a minor 6 (augmented 5) up, particularly the major 7 down has much friction with the reference.
-
Tristate mapping: here we discretize the ST elevation in just three niveaus: For each specific ST elevation s we use
- -
- For : a lower pitch, e.g. a forth below the reference
- -
- For : the reference pitch
- -
- For : a higher pitch, e.g. a fifth above the reference.
- The 30 sound examples S3.1–S3.30 are titled in a self-explanatory way, specifying mode, condition, , stride and separation. We recommend to listen to files in blocks of 6 that belong to the same mode and only vary in condition.
- Sound examples S3.31–S3.34 all correspond to a single selected condition (inferior moderate) but with different , stride, and set separation. For instance sonification example S3.34 is a fast and dense and may be the setting of choice if ST elevation changes are expected and should be detected not later than 3 heart beats. S3.31–S3.33 show a slightly faster lead sequence ( ms) with different modes.
4. Implementation of ST Elevation Sonification
5. Sonification Demonstration and Method Selection for Evaluation


- Comprehensibility:
- the 5 pitch levels are few enough to be understood correctly without much training, the low presentation speed allows to follow the pitch contour.
- Dominant Perceptual Quality:
- the presentation speed is fast enough so that the six channels in each block are perceptually grouped in one cluster, forming a pattern or gestalt, similar to how raindrop sounds fuse to the perception of rain, or a sequence of knocking sounds merge to the interaction of a woodpecker with a tree. This grouping facilitates the association of groups to lead groups. Users thus are not challenged to interpret individual leads.
- Aesthetics/pleasantness:
- the sound should be pleasant to hear, e.g., not too frequent, not too harsh or loud. This motivated the selection of dB, musical intervals for the two elevation pitches and the two suppression pitches. The isoelectric pitch is already given by the pitch of the QRS tone.
- Memorizability:
- is an important characteristic optimized with design 3, as it involves only two groups, and the sequence is topologically ordered (Cabrera circle resp. counterclockwise)
- Compatibility with environmental sounds:
- note that the selection of sounds as synthesized sounds instead of using real-world sound samples or physical model-based sounds is useful as it avoids any confusion with ambient sounds. In addition, the sounds are selected to avoid interference with speech / vocal interaction. The pitches are in a middle/upper range of musical frequencies, where the sensitivity is high and low-cost loudspeakers easily project the sounds over several meters.
- Backgrounding and Saliency:
- the sound streams should be regular (e.g., in rhythmic and temporal organization) to facilitate that listeners can habituate to these sound streams. Habituation is a perceptual skill allowing to ignore sound streams (in favor of other tasks or sound streams) yet to remain sensitive to relevant changes in those background streams. An example is the sound of the refrigerator in the kitchen, or the exhaust in out-dated fossil fuel driven automobiles, which drivers clearly hear but can ignore until their sound pattern changes due to malfunction. However, mapping absolute ST values to more extreme pitches and sharpness, changes can be expected to be salient enough to draw the listeners’ attention.
- Universality:
- musical sounds are perceived according to the cultural background. The current choice of intervals, however, is largely driven by consonance, which is a more universal feature: the octave is a preferred interval in most musical communities, and the fifth and fourth are generally preferred consonant intervals.

6. Classification Study Design
6.1. Overview
- Q1
- I had experience with pre-clinical emergencies.
- Q2
- I have participated to more than 3 emergency trainings.
- Q3
- I feel confident when handling emergency situations.
- Q4
- For me, emergency situations cause negative stress.
- SQ1
- Sonification is pleasant to listen to
- SQ2
- The sonification is informative, i.e., it enables to identify ST elevation changes in the ECG
- SQ3
- I can imagine to listen to these sonifications for a longer time period
6.2. Participants
6.3. Classification Task
7. Statistical Methods
8. Results
8.1. Excellent Performance of Sonification-Assisted STEMI Classifications
8.2. Uncertainty in Coping with Emergency Situations Affects Sonification-Based Classification Performance
9. Discussion
10. Conclusion
Supplementary Materials
Author Contributions
Funding
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Abbreviations
| ECG | Electrocardiography |
| EMS | Emergency Medical Services |
| PMSon | Parameter Mapping Sonification |
| STEMI | ST elevation Mycardial Infarction |
References
- Widimsky P, Wijns W, Fajadet J, de Belder M, Knot J, Aaberge L, et al. Reperfusion therapy for ST elevation acute myocardial infarction in Europe: description of the current situation in 30 countries. European Heart Journal. 2010;31(8):943–957. [CrossRef]
- Baradi A, Dinh DT, Brennan A, Stub D, Somaratne J, Palmer S, et al. Prevalence and Predictors of Emergency Medical Service Use in Patients Undergoing Primary Percutaneous Coronary Intervention for ST-Elevation Myocardial Infarction. Heart, Lung and Circulation. 2024;33(7):990–997. [CrossRef]
- Byrne RA, Rossello X, Coughlan JJ, Barbato E, Berry C, Chieffo A, et al. 2023 ESC Guidelines for the management of acute coronary syndromes: Developed by the task force on the management of acute coronary syndromes of the European Society of Cardiology (ESC). European Heart Journal. 2023;44(38):3720–3826. [CrossRef]
- Aldana Blanco AL, Hermann T, Tiesmeier J, Persson J, Grautoff S. Sonfication enables continuous surveillance of the ST segment in the electrocardiogram. American Journal of Emergency Medicine. 2022;58:286–297. [CrossRef]
- Aldana Blanco AL, Grautoff S, Hermann T. ECG sonification to support the diagnosis and monitoring of myocardial infarction. Journal on Multimodal User Interfaces. 2020;14:207–218. [CrossRef]
- Badings EA, Remkes WS, The SH, Dambrink JE, Tjeerdsma G, Rasoul S, et al. Early or late intervention in patients with transient ST-segment elevation acute coronary syndrome: subgroup analysis of the elisa-3 trial: Intervention in Transient ST-Segment Elevation ACS. Catheterization and Cardiovascular Interventions. 2016;88(5):755–764. [CrossRef]
- Väljamäe A, Mealla S, Steffert T, Holland S, Marimon X, Benitez R, et al. A review of real-time EEG sonification research. In: International Conference of Auditory Display 2013 (ICAD 2013); 2013. p. 85–93.
- Pauletto S, Hunt A. The sonification of EMG data. In: Proceedings of the 12th International Conference on Auditory Display; 2006. p. 152–157.
- Gionfrida L, Roginska A. A novel sonification approach to support the diagnosis of alzheimer‘s dementia. Frontiers in Neurology. 2017;8:647. [CrossRef]
- Roginska A, Mohanraj H, Keary J, Friedman K. Sonification method to enhance the diagnosis of dementia. In: International Conference on Auditory Display; 2014.Available from: https://api.semanticscholar.org/CorpusID:20280044.
- Stahl B, Thoshkahna B. Real-time heart rate sonification for athletes. In: Proceedings of the 21st International Conference on Auditory Display. University of Music and Performing Arts Graz. Institute of Electronic Music and Acoustics; 2015.Available from: https://api.semanticscholar.org/CorpusID:114817490.
- Aldana Blanco AL, Grautoff S, Hermann T. CardioSounds: Real-time auditory assistance for supporting cardiac diagnostic and monitoring. In: Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, AM ’17. New York, NY, USA: Association for Computing Machinery; 2017. p. 1–4. Available from: . [CrossRef]
- Ballora M, Pennycook B, Ivanov PC, Glass L, Goldberger AL. Heart Rate Sonification: A New Approach to Medical Diagnosis. Leonardo. 2004;37(1):41–46. [CrossRef]
- Landry S, Sun Y, Slade DL, Jeon M. Tempo-Fit Heart Rate App: Using Heart Rate Sonification as Exercise Performance Feedback. In: Proceedings of the International Conference on Auditory Display; 2016.Available from: https://digitalcommons.mtu.edu/cls-fp/40.
- Aldana Blanco AL, Grautoff S, Hermann T. Heart Alert: ECG Sonification for Supporting the Detection and Diagnosis of ST Segment Deviations. In: Proceedings of the Interactive Sonification Workshop (ISon). Bielefeld: ISon; 2016.Available from: https://interactive-sonification.org/ISon2016.
- Kather JN, Hermann T, Bukschat Y, Kramer T, Schad LR, Zöllner FG. Polyphonic sonification of electrocardiography signals for diagnosis of cardiac pathologies. Scientific Reports. 2017;7:44549. [CrossRef]
- Grond F, Berger J. Parameter Mapping Sonification. In: Hermann T, Hunt A, Neuhoff JG, editors. The Sonification Handbook. Berlin, Germany: Logos Publishing House; 2011. p. 363–397. Available from: http://sonification.de/handbook/chapters/chapter15/.
- Blanco ALA, Grautoff S, Hermann T. CardioSounds: Real-time Auditory Assistance for Supporting Cardiac Diagnostic and Monitoring. In: Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences. AM ’17. New York, NY, USA: Association for Computing Machinery; 2017.Available from: . [CrossRef]
- Hermann T, Hunt A, Neuhoff JG, editors. The Sonification Handbook. Berlin, Germany: Logos Publishing House; 2011. Available from: http://sonification.de/handbook.
- Kramer G. An Introduction to Auditory Display. In: Kramer G, editor. Auditory Display. ICAD. Addison-Wesley; 1994. p. 1–78.
- Hildebrandt T, Hermann T, Rinderle-Ma S. Continuous sonification enhances adequacy of interactions in peripheral process monitoring. International Journal of Human-Computer Studies. 2016;95:54–65.
- Grond F, Bovermann T, Hermann T. A SuperCollider Class for Vowel Synthesis and its Use for Sonification. In: Worall D, Wersényi G, editors. The 17th Annual Conference on Auditory Display, Budapest, Hungary 20-24 June, 2011, Proceedings. OPAKFI; 2011.Available from: https://pub.uni-bielefeld.de/record/2277122.
- Walker BN, Nees MA. Theory of sonification. In: Hermann T, Hunt A, Neuhoff JG, editors. The Sonification Handbook. Berlin, Germany: Logos Verlag Berlin; 2011. p. 9–39.
- Reinsch D, Hermann T. Interacting with sonifications: The mesonic framework for interactive auditory data science. In: Rönnberg N, Lenzi S, Ziemer T, Hermann T, Bresin R, editors. Psychoacoustics in the Loop.Proceedings of the 7th Interactive Sonification Workshop; 2022. p. 65–74.
- Hermann T, Reinsch D. sc3nb: a Python-SuperCollider Interface for Auditory Data Science. In: Proceedings of the 16th International Audio Mostly Conference. AM ’21. New York, NY, USA: Association for Computing Machinery; 2021. p. 208–215. Available from: . [CrossRef]
- Anderson HR, Borgen AC, Christnacht R, Ng J, Weller JG, Davison HN, et al. Stats on the desats: alarm fatigue and the implications for patient safety. BMJ Open Quality. 2023;12(3):e002262. [CrossRef]
- Kuhn M. Building Predictive Models in R Using the caret Package. Journal of Statistical Software. 2008;28(5):1–26. [CrossRef]
- Pinheiro JC, Bates DM. Mixed-Effects Models in S and S-PLUS. Statistics and Computing. Springer New York, NY; 2006.
- R Core Team. R: A Language and Environment for Statistical Computing; 2024. Available from: https://www.R-project.org/.
- Case A. Calm Technology: Principles and Patterns for Non-intrusive Design. O’Reilly Media; 2016. Available from: https://books.google.de/books?id=wQaGjwEACAAJ.
- Elkammash A, Ring L. The effectiveness of cardiology e-training in improving electrocardiogram interpretation skills in junior medical trainees; a pre-post mixed methods study. Future Healthcare Journal. 2023;10(Suppl 3):72–73. [CrossRef]
| 1 | Sound files are provided as supplementary material |






| Characteristic | Female, | Male, | p-value |
|---|---|---|---|
| Age | 23.00 (23.00, 24.00) | 24.00 (23.00, 25.00) | 0.2 |
| Q1 | 4.00 (3.00, 5.00) | 4.00 (4.00, 5.00) | 0.9 |
| Q2 | 4.00 (2.00, 5.00) | 4.00 (2.00, 4.00) | 0.6 |
| Q3 | 4.00 (3.00, 4.00) | 4.00 (3.00, 4.00) | 0.6 |
| Q4 | 2.00 (2.00, 3.00) | 3.00 (2.00, 3.00) | 0.2 |
| SQ1 | 4.00 (3.00, 5.00) | 4.00 (3.00, 5.00) | 0.5 |
| SQ2 | 5.00 (4.00, 5.00) | 4.00 (3.00, 5.00) | 0.4 |
| SQ3 | 3.00 (2.00, 4.00) | 3.00 (2.00, 5.00) | 0.9 |
| Preknowledge | 3 / 31 (9.7%) | 1 / 13 (7.7%) | >0.9 |
| Musicality | 2 / 31 (6.5%) | 1 / 13 (7.7%) | >0.9 |
| Instrument | 14 / 31 (45%) | 7 / 13 (54%) | 0.6 |
| n / N (%) Median (IQR); Fisher’s exact test; | |||
| Welch Two Sample t-test; Pearson’s Chi-squared test | |||
| Gold Standard | ||||||
|---|---|---|---|---|---|---|
| IE | I-moderate | I- severe | A-moderate | A-severe | Total | |
| Prediction | ||||||
| IE | 132 | 0 | 0 | 0 | 0 | 132 |
| I-moderate | 0 | 106 | 14 | 15 | 0 | 135 |
| I-severe | 0 | 8 | 95 | 4 | 13 | 120 |
| A-moderate | 0 | 12 | 4 | 100 | 11 | 127 |
| A-severe | 0 | 6 | 19 | 13 | 108 | 146 |
| Total | 132 | 132 | 132 | 132 | 132 | 660 |
| Class | Sensitivity | Specificity | f1 | Balanced Accuracy |
|---|---|---|---|---|
| IE | 1 | 1 | 1 | 1 |
| I-moderate | 0.8 | 0.95 | 0.79 | 0.87 |
| I-severe | 0.72 | 0.95 | 0.75 | 0.84 |
| A-moderate | 0.76 | 0.95 | 0.77 | 0.85 |
| A-severe | 0.82 | 0.93 | 0.78 | 0.87 |
| Gold Standard | |||
|---|---|---|---|
| 0 | 1 | Total | |
| Prediction | |||
| 0 | 132 | 0 | 132 |
| 1 | 0 | 528 | 528 |
| Total | 132 | 528 | 660 |
| Gold Standard | ||||
|---|---|---|---|---|
| IE | moderate | severe | Total | |
| Prediction | ||||
| IE | 132 | 0 | 0 | 132 |
| moderate | 0 | 233 | 29 | 262 |
| severe | 0 | 31 | 235 | 266 |
| Total | 132 | 264 | 264 | 660 |
| Class | Sensitivity | Specificity | f1 | Balanced Accuracy |
|---|---|---|---|---|
| IE | 1 | 1 | 1 | 1 |
| moderate | 0.88 | 0.93 | 0.89 | 0.9 |
| severe | 0.89 | 0.92 | 0.89 | 0.9 |
| Predictors | Estimate | Std.Error | p-value |
|---|---|---|---|
| Gender female | 0.18 | 0.06 | 0.005 |
| Age | -0.001 | 0.018 | 0.968 |
| Q1 | 0.034 | 0.04 | 0.399 |
| Q2 | 0.024 | 0.031 | 0.442 |
| Q3 | -0.115 | 0.042 | 0.01 |
| Q4 | 0.057 | 0.039 | 0.158 |
| Preknowledge yes | 0.087 | 0.149 | 0.562 |
| Instrument yes | 0.012 | 0.06 | 0.846 |
| Musicality yes | 0.06 | 0.107 | 0.578 |
| SQ1 | -0.011 | 0.024 | 0.663 |
| SQ2 | 0.053 | 0.021 | 0.015 |
| SQ3 | 0.011 | 0.023 | 0.653 |
| Block 2 | 0.05 | 0.032 | 0.114 |
| Block 3 | 0.091 | 0.032 | 0.004 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).