Submitted:
04 April 2025
Posted:
07 April 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. The role of virtual reality in psychoacoustic research
3. Experiments
3.1. Audio Processing and Software Integration
- Session 1: 35 trials (6 minutes)
- Break: Up to 5 minutes
- Session 2: 40 trials (7 minutes)
3.2. Sound stimuli - Preliminary research
4. Results
4.1. Circular mean and Circular standard deviation
- – circular mean
- – consecutive angles expressed in radians
- – circular standard deviation
- – mean resultant length
4.2. Distance between points on a sphere
4.3. Precision versus Accuracy
- IEM – 3 such instances,
- Apple – 1 such instance,
- Dolby – 9 such instances.
- IEM – 5 times,
- Apple – 9 times,
- Dolby – 11 times.
5. Discussion
- Greater response dispersion for front-facing sources: Responses for centrally positioned sound sources at the front (positions 1–5) were more scattered, whereas responses for rear sources (positions 11–13) were more concentrated, as indicated by lower standard deviation values. This effect was particularly noticeable in the azimuth plane (Table 4).
- Correct hemisphere identification with small shifts: When the sound source was shifted slightly in the horizontal plane (e.g., position 6, offset 30° to the left), participants correctly identified the hemisphere, confirming that even minor shifts in azimuth were perceived accurately.
- Increased response precision for rear-located sources: The further behind the listener the virtual sound source was placed, the more concentrated the responses were around it.
- Tendency to localize sound sources behind the head: A striking observation is that 63 out of 75 mean response values were positioned behind the listener (azimuth angle between 120° and 180° or -120° and -180°). This suggests that participants had difficulty localizing sound sources in front of them (Table 3).
- Limited accuracy for front-facing sources
- Smaller variations in elevation perception: Differences between mean values were smaller in the vertical plane (elevation angle) than in the horizontal plane (azimuth angle). This suggests that listeners perceived less variation in the up-down positioning of the sound source (Table 3).
- Largest errors for front-centered positions: The greatest differences between mean listener responses and the reference values occurred for front-central positions (1–5) (Table 3).
6. Conclusions
Acknowledgments
Conflicts of Interest
References
- Lee, H. A Conceptual Model of Immersive Experience in Extended Reality, 2020. [CrossRef]
- Sauer, J. Acoustic Virtual Reality: An Introductory Research for Application. Technical report, Delft University of Technology, 2020.
- Sharma, N.K.; Gaznepoglu, Ü.E.; Robotham, T.; Habets, E.A.P. Two Congruent Cues Are Better Than One: Impact of ITD–ILD Combinations on Reaction Time for Sound Lateralization. JASA Express Lett. 2023, 3, 054401. [Google Scholar] [CrossRef] [PubMed]
- Sagasti, A.; Pietrzak, A.; Martin, R.; Eguinoa, R. Localization of Sound Sources in Binaural Reproduction of First and Third Order Ambisonics. Vib. Phys. Syst. 2023, 33, 2022214. [Google Scholar]
- Mróz, B.; Kostek, B. Pursuing Listeners’ Perceptual Response in Audio-Visual Interactions—Headphones vs. Loudspeakers: A Case Study. Arch. Acoust. 2022, 47, 71–79. [Google Scholar]
- Moraes, A.N.; et al. The Role of Physiological Responses in a VR-Based Sound Localization Task. IEEE Access 2021, 9, 122082–122091. [Google Scholar] [CrossRef]
- Tang, J. Research on the Application of Sound in Virtual Reality. Highlights Sci. Eng. Technol. 2023, 44, 206–212. [Google Scholar] [CrossRef]
- Olivieri, F.; Peters, N.; Sen, D. Scene-Based Audio and Higher Order Ambisonics: A Technology Overview and Application to Next-Generation Audio, VR, and 360 Video. Technical report, EBU Tech., 2019.
- Sochaczewska, K.; Małecki, P.; Piotrowska, M. Evaluation of the Minimum Audible Angle on Horizontal Plane in 3rd Order Ambisonic Spherical Playback System. In Proceedings of the Immersive and 3D Audio: From Architecture to Automotive; 2021. [Google Scholar]
- Bosman, I.; Buruk, O.; Jørgensen, K.; Hamari, J. The Effect of Audio on the Experience in Virtual Reality: A Scoping Review. Behav. Inf. Technol. 2023. [Google Scholar] [CrossRef]
- Gari, S.V.A.; Schissler, C.; Robinson, P. Perceptual Comparison of Efficient Real-Time Geometrical Acoustics Engines in Virtual Reality. In Proceedings of the AES Int. Audio Games Conf. 2024. [Google Scholar]
- Blauert, J.; et al. An Interactive Virtual-Environment Generator for Psychoacoustic Research. I: Architecture and Implementation. Acta Acust. United Acust. 2000, 86, 94–102. [Google Scholar]
- Małecki, P.; Stefańska, J.; Szydłowska, M. Assessing Spatial Audio: A Listener-Centric Case Study on Object-Based and Ambisonic Audio Processing. Arch. Acoust. 2024, 49. [Google Scholar] [CrossRef]
- Bahu, H.; Carpentier, T.; Noisternig, M.; Warusfel, O. Comparison of Different Egocentric Pointing Methods for 3D Sound Localization Experiments. Acta Acust. United Acust. 2016, 102, 107–118. [Google Scholar] [CrossRef]
- Shum, L.C.; Valdés, B.A.; Loos, H.M.V.D. Determining the Accuracy of Oculus Touch Controllers for Motor Rehabilitation Applications Using Quantifiable Upper Limb Kinematics: Validation Study. JMIR Biomed. Eng. 2019, 4, e12291. [Google Scholar] [CrossRef]
- Ahrens, A.; Lund, K.D.; Marschall, M.; Dau, T. Sound Source Localization with Varying Amounts of Visual Information in Virtual Reality. PLoS ONE 2019, 14, e0214603. [Google Scholar] [CrossRef] [PubMed]
- Chandler, D.; Grantham, W. Minimum audible movement angle in the horizontal plane as a function of stimulus frequency and bandwidth, source azimuth, and velocity. J. Acoust. Soc. Am. 1992, 91, 1624–1636. [Google Scholar] [CrossRef] [PubMed]
- Dalgarno, B.; Lee, M.J.W. What are the learning affordances of 3-D Virtual Environments? Br. J. Educ. Technol. 2010, 40, 10–32. [Google Scholar] [CrossRef]
- Kapralos, B.; Jenkin, M.; Milios, E. Virtual Audio Systems. Presence Teleoperators Virtual Environ. 2008, 17, 527–549. [Google Scholar] [CrossRef]
- Shilling, R.; Shinn-Cunningham, B.G. Virtual Auditory Displays. In Handbook of Virtual Environments; Stanney, K., Ed.; Lawrence Erlbaum Associates Inc, 2000. [Google Scholar]
- Hartmann, W.M.; Wittenberg, A. On the externalization of sound images. J. Acoust. Soc. Am. 1996, 99, 3678–3688. [Google Scholar] [CrossRef] [PubMed]
- Zotter, F.; Frank, M. All-Round Ambisonic Panning and Decoding. J. Audio Eng. Soc. 2012, 60, 807–820. [Google Scholar]
- Pfanzagl-Cardone, E. The Dolby ‘Atmo’ System. In The Art and Science of 3D Audio Recording; Springer Int. Publ.: Cham, 2023; pp. 143–188. [Google Scholar]
- Roffler, S.K.; Butler, R.A. Factors that influence the localization of sound in the vertical plane. Journal of the Acoustical Society of America 1968, 43, 1255–1259. [Google Scholar] [CrossRef] [PubMed]
- Circular Statistics Toolbox (Directional Statistics). Available online: https://www.mathworks.com/matlabcentral/fileexchange/10676-circular-statistics-toolbox-directional-statistics (accessed on 21 February 2025).
- Distance between two points on sphere 2025. Accessed: 2025-03-20.
- Freyman, R.L.; Balakrishnan, U.; Zurek, P.M. Lateralization of Noise-Burst Trains Based on Onset and Ongoing Interaural Delays. J. Acoust. Soc. Am. 2010, 128, 320–331. [Google Scholar] [CrossRef] [PubMed]








| No. | Azimuth [°] | Elevation [°] | No. | Azimuth [°] | Elevation [°] |
|---|---|---|---|---|---|
| 1 | 0 | 0 | 14 | 180 | 0 |
| 2 | 0 | 15 | 15 | 180 | 15 |
| 3 | 0 | 30 | 16 | 180 | 30 |
| 4 | 0 | 45 | 17 | 180 | 45 |
| 5 | 0 | 60 | 18 | -150 | 15 |
| 6 | 30 | 0 | 19 | -150 | 45 |
| 7 | 30 | 15 | 20 | -120 | 15 |
| 8 | 30 | 45 | 21 | -90 | 45 |
| 9 | 60 | 15 | 22 | -60 | 15 |
| 10 | 90 | 45 | 23 | -30 | 0 |
| 11 | 120 | 15 | 24 | -30 | 15 |
| 12 | 150 | 15 | 25 | -30 | 45 |
| 13 | 150 | 45 |
| Position | Reference | Noise | Voice | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Az [°] | El [°] | Subject 1 | Subject 2 | Subject 1 | Subject 2 | |||||
| Az [°] | El [°] | Az [°] | El [°] | Az [°] | El [°] | Az [°] | El [°] | |||
| 1 | 0 | 0 | -158,4 | 0,7 | 169,6 | 27,7 | 177,6 | 47,3 | 177,5 | 38,7 |
| 2 | 0 | 15 | -157,6 | -6,1 | -127,5 | 11,0 | 147,4 | 29,6 | 135,8 | 3,1 |
| 3 | 0 | 30 | -151,0 | -0,9 | -46,2 | 3,2 | 84,6 | 10,2 | 136,4 | 0,1 |
| 4 | 0 | 45 | -158,9 | 57,0 | -53,2 | 68,8 | 90,6 | -0,3 | 165,8 | 9,7 |
| 5 | 0 | 60 | 171,5 | 43,6 | -155,8 | 46,7 | 89,4 | -1,0 | 133,9 | 10,1 |
| 6 | 30 | 0 | 155,6 | 13,7 | 44,9 | 7,1 | 24,4 | 23,2 | -176,0 | 43,9 |
| 7 | 30 | 15 | 159,1 | -12,5 | 60,5 | 27,3 | 43,7 | 0,5 | 68,0 | 1,1 |
| 8 | 30 | 45 | 160,6 | 1,8 | 113,0 | 66,7 | 45,1 | 44,5 | 97,8 | 70,7 |
| 9 | 60 | 15 | 162,4 | -7,8 | 116,7 | 1,1 | 0,5 | 14,6 | -1,1 | 22,7 |
| 10 | 90 | 45 | 172,4 | 10,9 | 84,6 | 18,7 | -0,2 | 0,0 | 75,8 | 12,1 |
| 11 | 120 | 15 | 153,3 | 20,0 | 146,7 | 2,9 | 0,3 | 6,3 | -1,1 | 43,0 |
| 12 | 150 | 15 | 155,2 | 31,1 | 81,1 | 26,4 | 44,1 | 32,4 | 134,8 | 53,5 |
| 13 | 150 | 45 | 18,6 | 51,9 | 40,2 | 53,1 | 45,5 | 7,1 | 44,7 | 20,2 |
| 14 | 180 | 0 | 88,3 | -0,4 | 114,4 | 11,6 | 89,1 | 23,6 | 172,8 | 9,8 |
| 15 | 180 | 15 | 143,3 | -16,8 | 116,2 | -4,6 | 135,8 | 7,2 | 137,5 | 7,5 |
| 16 | 180 | 30 | 97,4 | -26,9 | 89,6 | -4,1 | 155,1 | 7,7 | 111,4 | 10,6 |
| 17 | 180 | 45 | 142,2 | -10,0 | 125,2 | 1,0 | 91,1 | -1,1 | 126,0 | 30,7 |
| 18 | -150 | 15 | -10,9 | 18,5 | -35,1 | 19,1 | -138,3 | 2,8 | -60,1 | 44,2 |
| 19 | -150 | 45 | 166,9 | -22,1 | 80,5 | 31,9 | 148,2 | 24,5 | 122,6 | 45,9 |
| 20 | -120 | 15 | -43,7 | -4,3 | -79,8 | 26,8 | -43,6 | 34,2 | -45,5 | 29,8 |
| 21 | -90 | 45 | -21,7 | 5,8 | -117,6 | 19,6 | -0,3 | 15,4 | -44,9 | 55,2 |
| 22 | -60 | 15 | -53,4 | -9,5 | -68,4 | 39,2 | -41,6 | 33,9 | -134,4 | 39,2 |
| 23 | -30 | 0 | -8,1 | -1,1 | -44,6 | 6,8 | -18,0 | 23,2 | -45,1 | 28,3 |
| 24 | -30 | 15 | -1,6 | -0,2 | -31,2 | 3,0 | 0,5 | -0,5 | -84,9 | 52,6 |
| 25 | -30 | 45 | 40,1 | 20,5 | -58,0 | 1,8 | 39,6 | 35,1 | -45,4 | 43,2 |
| Position | Reference | IEM | Apple | Dolby | ||||
|---|---|---|---|---|---|---|---|---|
| Az [°] | El [°] | Az [°] | El [°] | Az [°] | El [°] | Az [°] | El [°] | |
| 1 | 0 | 0 | -177 | 1 | 176 | 7 | 158 | 12 |
| 2 | 0 | 15 | -92 | 7 | 168 | 21 | -180 | 21 |
| 3 | 0 | 30 | -116 | 22 | 173 | 21 | 4 | 23 |
| 4 | 0 | 45 | 174 | 24 | -157 | 31 | -13 | 24 |
| 5 | 0 | 60 | -169 | 34 | 28 | 26 | 179 | 21 |
| 6 | 30 | 0 | 86 | 6 | 110 | 12 | 91 | 6 |
| 7 | 30 | 15 | 103 | 16 | 108 | 7 | 79 | 8 |
| 8 | 30 | 45 | 112 | 32 | 78 | 15 | 75 | 23 |
| 9 | 60 | 15 | 102 | 4 | 99 | 3 | 85 | -1 |
| 10 | 90 | 45 | 107 | 1 | 108 | 7 | 97 | 4 |
| 11 | 120 | 15 | 99 | 6 | 123 | 2 | -159 | 23 |
| 12 | 150 | 15 | 136 | -7 | 121 | 15 | 127 | -8 |
| 13 | 150 | 45 | 118 | 12 | 134 | 20 | 127 | 5 |
| 14 | 180 | 0 | -179 | 1 | 170 | 11 | 177 | -6 |
| 15 | 180 | 15 | 168 | 25 | 178 | 6 | 169 | 1 |
| 16 | 180 | 30 | -150 | 24 | -168 | 13 | -164 | 5 |
| 17 | 180 | 45 | 179 | 13 | -167 | 18 | -167 | 6 |
| 18 | -150 | 15 | -131 | -12 | -148 | 2 | -137 | -8 |
| 19 | -150 | 45 | -119 | 5 | -136 | 3 | -124 | 2 |
| 20 | -120 | 15 | -110 | -2 | -124 | 1 | -132 | -6 |
| 21 | -90 | 45 | -103 | 2 | -111 | 0 | -107 | 1 |
| 22 | -60 | 15 | -95 | -1 | -101 | -4 | -90 | -1 |
| 23 | -30 | 0 | -91 | 7 | -107 | 9 | -86 | 2 |
| 24 | -30 | 15 | -90 | 13 | -123 | 13 | -80 | 6 |
| 25 | -30 | 45 | -103 | 11 | -122 | 13 | -94 | 19 |
| Position | Reference | IEM | Apple | Dolby | ||||
|---|---|---|---|---|---|---|---|---|
| Az [°] | El [°] | Az [°] | El [°] | Az [°] | El [°] | Az [°] | El [°] | |
| 1 | 0 | 0 | 116 | 29 | 52 | 36 | 79 | 30 |
| 2 | 0 | 15 | 120 | 36 | 75 | 30 | 108 | 43 |
| 3 | 0 | 30 | 97 | 35 | 130 | 28 | 89 | 26 |
| 4 | 0 | 45 | 95 | 22 | 109 | 34 | 70 | 20 |
| 5 | 0 | 60 | 107 | 30 | 104 | 31 | 46 | 28 |
| 6 | 30 | 0 | 36 | 15 | 57 | 21 | 33 | 13 |
| 7 | 30 | 15 | 36 | 20 | 37 | 21 | 38 | 16 |
| 8 | 30 | 45 | 38 | 19 | 44 | 18 | 32 | 21 |
| 9 | 60 | 15 | 27 | 16 | 28 | 17 | 23 | 10 |
| 10 | 90 | 45 | 20 | 17 | 30 | 16 | 19 | 11 |
| 11 | 120 | 15 | 17 | 17 | 26 | 20 | 99 | 39 |
| 12 | 150 | 15 | 32 | 17 | 44 | 22 | 17 | 18 |
| 13 | 150 | 45 | 49 | 22 | 58 | 27 | 23 | 21 |
| 14 | 180 | 0 | 36 | 31 | 69 | 36 | 42 | 26 |
| 15 | 180 | 15 | 66 | 32 | 104 | 28 | 38 | 26 |
| 16 | 180 | 30 | 94 | 31 | 55 | 25 | 37 | 26 |
| 17 | 180 | 45 | 97 | 31 | 68 | 21 | 59 | 30 |
| 18 | -150 | 15 | 23 | 15 | 29 | 15 | 29 | 21 |
| 19 | -150 | 45 | 22 | 15 | 27 | 16 | 21 | 20 |
| 20 | -120 | 15 | 21 | 10 | 26 | 15 | 16 | 13 |
| 21 | -90 | 45 | 25 | 13 | 23 | 9 | 18 | 16 |
| 22 | -60 | 15 | 24 | 11 | 26 | 10 | 23 | 11 |
| 23 | -30 | 0 | 34 | 18 | 51 | 23 | 38 | 19 |
| 24 | -30 | 15 | 36 | 19 | 74 | 20 | 33 | 16 |
| 25 | -30 | 45 | 31 | 20 | 51 | 23 | 35 | 18 |
| Position | IEM | Apple | Dolby |
|---|---|---|---|
| 1 | 3.08 | 3.00 | 2.70 |
| 2 | 1.60 | 2.91 | 3.04 |
| 3 | 2.03 | 2.95 | 0.15 |
| 4 | 2.77 | 2.67 | 0.42 |
| 5 | 2.65 | 0.75 | 2.46 |
| 6 | 0.98 | 1.38 | 1.07 |
| 7 | 1.28 | 1.36 | 0.86 |
| 8 | 1.43 | 0.86 | 0.80 |
| 9 | 0.73 | 0.67 | 0.44 |
| 10 | 0.29 | 0.32 | 0.11 |
| 11 | 0.37 | 0.13 | 1.42 |
| 12 | 0.38 | 0.51 | 0.49 |
| 13 | 0.68 | 0.44 | 0.65 |
| 14 | 0.03 | 0.26 | 0.11 |
| 15 | 0.28 | 0.16 | 0.31 |
| 16 | 0.53 | 0.36 | 0.51 |
| 17 | 0.56 | 0.53 | 0.71 |
| 18 | 0.49 | 0.20 | 0.39 |
| 19 | 0.71 | 0.62 | 0.69 |
| 20 | 0.21 | 0.15 | 0.30 |
| 21 | 0.23 | 0.37 | 0.29 |
| 22 | 0.61 | 0.70 | 0.52 |
| 23 | 1.07 | 1.34 | 0.98 |
| 24 | 1.05 | 1.62 | 0.87 |
| 25 | 1.24 | 1.54 | 1.11 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).