Submitted:
22 October 2025
Posted:
27 October 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Debates on categorical versus continuous perception
1.2. Bayesian framework for quantifying perceptual warping
1.3. The present study
2. Method
2.1. Data source and participants
2.2. Stimuli and apparatus
2.3. Tasks and procedure
2.4. Data preprocessing
2.5. Bayesian model of individual perceptual warping
2.6. Psychometric function analysis
2.7. Statistical analysis
3. Results
3.1. Distribution of perceptual profiles across conditions
3.2. Linking computational profiles to behavioral measures
3.3. Stability of Bayesian parameters across conditions
4. Discussion
4.1. Bayesian perceptual profiles dissociate what behavior conflates
4.2. Context-dependent and asymmetric distribution of perceptual profiles
4.3. Categorical perception is a flexible state, not a stable trait
4.4. Implications for theories of speech perception
4.5. Limitations and future directions
5. Conclusions
References
- Abramson, A. S. (1961). Identification and discrimination of phonemic tones. The Journal of the Acoustical Society of America, 33(6_Supplement), 842–842. [CrossRef]
- Abramson, A. S. , & Lisker, L. (1985). Relative power of cues: F0 shift versus voice timing. In V. Fromkin (Ed.), Phonetic linguistics: Essays in honor of Peter Ladefoged (pp. 25–33). Academic Press.
- Angeli, A., Davidoff, J., & Valentine, T. (2008). Face familiarity, distinctiveness, and categorical perception. Quarterly Journal of Experimental Psychology, 61(5), 690–707. [CrossRef]
- Apfelbaum, K. S., Kutlu, E., McMurray, B., & Kapnoula, E. C. (2022). Don’t force it! Gradient speech categorization calls for continuous categorization tasks. The Journal of the Acoustical Society of America, 152(6), 3728–3745. [CrossRef]
- Beale, J. M., & Keil, F. C. (1995). Categorical effects in the perception of faces. Cognition, 57(3), 217–239. [CrossRef]
- Bürkner, P.-C. (2017). brms: An R Package for Bayesian Multilevel Models Using Stan. Journal of Statistical Software, 80(1). [CrossRef]
- Clark, A. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences, 36(3), 181–204. [CrossRef]
- Clayards, M. (2018). Differences in cue weights for speech perception are correlated for individuals within and across contrasts. The Journal of the Acoustical Society of America, 144(3), Article 3. [CrossRef]
- Clayards, M., Tanenhaus, M. K., Aslin, R. N., & Jacobs, R. A. (2008). Perception of speech reflects optimal use of probabilistic speech cues. Cognition, 108(3), Article 3. [CrossRef]
- Crystal, T. H., & House, A. S. (1988). The duration of American-English vowels: An overview. Journal of Phonetics, 16(3), 263–284. [CrossRef]
- Davidoff, J., Davies, I., & Roberson, D. (1999). Colour categories in a stone-age tribe. Nature, 398(6724), 203–204. [CrossRef]
- de Lange, F. P., Heilbron, M., & Kok, P. (2018). How do expectations shape perception? Trends in Cognitive Sciences, 22(9), 764–779. [CrossRef]
- Feldman, H. , & Friston, K. J. (2010). Attention, uncertainty, and free-energy. Frontiers in Human Neuroscience, 4. [CrossRef]
- Feldman, N. H., Griffiths, T. L., & Morgan, J. L. (2009). The influence of categories on perception: Explaining the perceptual magnet effect as optimal statistical inference. Psychological Review, 116(4), Article 4. PubMed. [CrossRef]
- Francis, A. L., Ciocca, V., & Chit Ng, B. K. (2003). On the (non)categorical perception of lexical tones. Perception & Psychophysics, 65(7), 1029–1044. [CrossRef]
- Friston, K. (2009). The free-energy principle: A rough guide to the brain? Trends in Cognitive Sciences, 13(7), 293–301. [CrossRef]
- Friston, K. (2010). The free-energy principle: A unified brain theory? Nature Reviews Neuroscience, 11(2), 127–138. [CrossRef]
- Fry, D. B., Abramson, A. S., Eimas, P. D., & Liberman, A. M. (1962). The identification and discrimination of synthetic vowels. Language and Speech, 5(4), 171–189. [CrossRef]
- Fuhrmeister, P., Phillips, M. C., McCoach, D. B., & Myers, E. B. (2023). Relationships between native and non-native speech perception. Journal of Experimental Psychology: Learning, Memory, and Cognition, 49(7), 1161–1175. [CrossRef]
- Fujimura, T., Matsuda, Y.-T., Katahira, K., Okada, M., & Okanoya, K. (2012). Categorical and dimensional perceptions in decoding emotional facial expressions. Cognition & Emotion, 26(4), 587–601. [CrossRef]
- Gaißert, N., Waterkamp, S., Fleming, R. W., & Bülthoff, I. (2012). Haptic categorical perception of shape. PLoS ONE, 7(8), e43062. [CrossRef]
- Goldinger, S. (1998). Echoes of echoes? An episodic theory of lexical access. Psychological Review, 105(2), 251–279. [CrossRef]
- Hary, J. M., & Massaro, D. W. (1982). Categorical results do not imply categorical perception. Perception & Psychophysics, 32(5), 409–418. [CrossRef]
- Hazan, V., & Rosen, S. (1991). Individual variability in the perception of cues to place contrasts in initial stops. Perception & Psychophysics, 49(2), 187–200. [CrossRef]
- Hillenbrand, J. M., Clark, M. J., & Houde, R. A. (2000). Some effects of duration on vowel recognition. The Journal of the Acoustical Society of America, 108(6), 3013–3022. [CrossRef]
- Holt, L. L., & Lotto, A. J. (2006). Cue weighting in auditory categorization: Implications for first and second language acquisition. The Journal of the Acoustical Society of America, 119(5), 3059–3071. [CrossRef]
- Holt, L. L., Tierney, A. T., Guerra, G., Laffere, A., & Dick, F. (2018). Dimension-selective attention as a possible driver of dynamic, context-dependent re-weighting in speech processing. Hearing Research, 366, 50–64. [CrossRef]
- Honda, C. T., Clayards, M., & Baum, S. R. (2024). Exploring individual differences in native phonetic perception and their link to nonnative phonetic perception. Journal of Experimental Psychology: Human Perception and Performance, 50(4), 370–394. [CrossRef]
- Iverson, P., & Kuhl, P. K. (1995). Mapping the perceptual magnet effect for speech using signal detection theory and multidimensional scaling. The Journal of the Acoustical Society of America, 97(1), Article 1. [CrossRef]
- Joanisse, M. F., Manis, F. R., Keating, P., & Seidenberg, M. S. (2000). Language deficits in dyslexic children: Speech perception, phonology, and morphology. Journal of Experimental Child Psychology, 77(1), 30–60. [CrossRef]
- Kapnoula, E. C., Edwards, J., & McMurray, B. (2021). Gradient activation of speech categories facilitates listeners’ recovery from lexical garden paths, but not perception of speech-in-noise. Journal of Experimental Psychology: Human Perception and Performance, 47(4), 578–595. [CrossRef]
- Kapnoula, E. C., Winn, M. B., Kong, E. J., Edwards, J., & McMurray, B. (2017). Evaluating the sources and functions of gradiency in phoneme categorization: An individual differences approach. Journal of Experimental Psychology: Human Perception and Performance, 43(9), Article 9. PubMed. [CrossRef]
- Kim, H. , McMurray, B., Sorensen, E., & Oleson, J. (2025). The consistency of categorization-consistency in speech perception. Psychonomic Bulletin & Review. [CrossRef]
- Kleinschmidt, D. F., & Jaeger, T. F. (2015). Robust speech perception: Recognize the familiar, generalize to the similar, and adapt to the novel. Psychological Review, 122(2), 148–203. [CrossRef]
- Kong, E. J., & Edwards, J. (2016). Individual differences in categorical perception of speech: Cue weighting and executive function. Journal of Phonetics, 59, 40–57. [CrossRef]
- Koo, T. K., & Li, M. Y. (2016). A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. Journal of Chiropractic Medicine, 15(2), 155–163. [CrossRef]
- Kronrod, Y., Coppess, E., & Feldman, N. H. (2016). A unified account of categorical effects in phonetic perception. Psychonomic Bulletin & Review, 23(6), 1681–1712. [CrossRef]
- Kuhl, P. K. (1991). Human adults and human infants show a “perceptual magnet effect” for the prototypes of speech categories, monkeys do not. Perception & Psychophysics, 50(2), 93–107. [CrossRef]
- Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13), Article 13. [CrossRef]
- Lenth, R. V. (2021). emmeans: Estimated Marginal Means, aka Least-Squares Means (Version R package version 1.7.1-1) [En]. https://CRAN.R-project.org/package=emmeans.
- Liberman, A. M., Harris, K. S., Hoffman, H. S., & Griffith, B. C. (1957). The discrimination of speech sounds within and across phoneme boundaries. Journal of Experimental Psychology, 54(5), 358–368. [CrossRef]
- Lisker, L., & Abramson, A. S. (1964). A cross-language study of voicing in initial stops: Acoustical measurements. WORD, 20(3), Article 3. [CrossRef]
- Manis, F. R., Mcbride-Chang, C., Seidenberg, M. S., Keating, P., Doi, L. M., Munson, B., & Petersen, A. (1997). Are speech perception deficits associated with developmental dyslexia? Journal of Experimental Child Psychology, 66(2), 211–235. [CrossRef]
- Mann, V. A., & Repp, B. H. (1980). Influence of vocalic context on perception of the [∫]-[s] distinction. Perception & Psychophysics, 28(3), Article 3. [CrossRef]
- Massaro, D. W., & Cohen, M. M. (1983). Categorical or continuous speech perception: A new test. Speech Communication, 2(1), 15–35. [CrossRef]
- McMurray, B. (2022). The myth of categorical perception. The Journal of the Acoustical Society of America, 152(6), 3819–3842. [CrossRef]
- McMurray, B., Aslin, R. N., Tanenhaus, M. K., Spivey, M. J., & Subik, D. (2008). Gradient sensitivity to within-category variation in words and syllables. Journal of Experimental Psychology: Human Perception and Performance, 34(6), 1609–1631. [CrossRef]
- Miller, J. L. (1994). On the internal structure of phonetic categories: A progress report. Cognition, 50(1–3), 271–285. [CrossRef]
- Munson, B., Johnson, J. M., & Edwards, J. (2012). The role of experience in the perception of phonetic detail in children’s speech: A comparison between speech-language pathologists and clinically untrained listeners. American Journal of Speech-Language Pathology, 21(2), 124–139. [CrossRef]
- Munson, B., Schellinger, S. K., & Edwards, J. (2017). Bias in the perception of phonetic detail in children’s speech: A comparison of categorical and continuous rating scales. Clinical Linguistics & Phonetics, 31(1), 56–79. [CrossRef]
- Myers, E., Phillips, M., & Skoe, E. (2024). Individual differences in the perception of phonetic category structure predict speech-in-noise performance. The Journal of the Acoustical Society of America, 156(3), 1707–1719. [CrossRef]
- Norris, D., & Mcqueen, J. M. (2008). Shortlist B: A Bayesian model of continuous speech recognition. Psychological Review, 115(2), Article 2. [CrossRef]
- Pisoni, D. B., & Tash, J. (1974). Reaction times to comparisons within and across phonetic categories. Perception & Psychophysics, 15(2), Article 2. [CrossRef]
- Port, R. F. (1981). Linguistic timing factors in combination. The Journal of the Acoustical Society of America, 69, 262–274. [CrossRef]
- R Development Core Team. (2024). R: A Langage and Environment for statistical Computing [En]. R Foundation for Statistical Computing. http://www.R-project.org/.
- Repp, B. H. (1981). Two strategies in fricative discrimination. Perception & Psychophysics, 30(3), 217–227. [CrossRef]
- Repp, B. H. (1984). Categorical perception: Issues, methods, findings. In Speech and Language (Vol. 10, pp. 243–335). Elsevier. [CrossRef]
- Rosen, S. M. (1979). Range and frequency effects in consonant categorization. Journal of Phonetics, 7(4), 393–402. [CrossRef]
- Serniclaes, W., Sprenger-Charolles, L., Carré, R., & Demonet, J.-F. (2001). Perceptual discrimination of speech sounds in developmental dyslexia. Journal of Speech, Language, and Hearing Research, 44(2), 384–399. [CrossRef]
- Serniclaes, W., Ventura, P., Morais, J., & Kolinsky, R. (2005). Categorical perception of speech sounds in illiterate adults. Cognition, 98(2), B35–B44. [CrossRef]
- Stan Development Team. (2025). RStan: The R interface to Stan. https://mc-stan.org/.
- Surprenant, A. M., & Watson, C. S. (2001). Individual differences in the processing of speech and nonspeech sounds by normal-hearing listeners. The Journal of the Acoustical Society of America, 110(4), 2085–2095. [CrossRef]
- Toscano, J. C., & McMurray, B. (2010). Cue integration with categories: Weighting acoustic cues in speech using unsupervised learning and distributional statistics. Cognitive Science, 34(3), 434–464. [CrossRef]
- Toscano, J. C., McMurray, B., Dennhardt, J., & Luck, S. J. (2010). Continuous perception and graded categorization: Electrophysiological evidence for a linear relationship between the acoustic signal and perceptual encoding of speech. Psychological Science, 21(10), Article 10. [CrossRef]
- Werker, J. F., & Tees, R. C. (1987). Speech perception in severely disabled and average reading children. Canadian Journal of Psychology / Revue Canadienne de Psychologie, 41(1), 48–61. [CrossRef]
- Xue, K., Shekhar, M., & Rahnev, D. (2024). Challenging the Bayesian confidence hypothesis in perceptual decision-making. Proceedings of the National Academy of Sciences, 121(48), e2410487121. [CrossRef]
- Zelazo, P. D. (2015). Executive function: Reflection, iterative reprocessing, complexity, and the developing brain. Developmental Review, 38, 55–68. [CrossRef]



| The bet-bat contrast | The dear-tear contrast | |||
|---|---|---|---|---|
| Formant frequencies of spectral steps (Hz) | Duration steps (ms) | Voice onset time steps (ms) | Onset F0 steps (Hz) | |
| F1 | F2 | |||
| 625 | 1677 | 100 | 10 | 185 |
| 647 | 1610 | 140 | 20 | 195 |
| 663 | 1560 | 180 | 30 | 205 |
| 682 | 1546 | 220 | 40 | 215 |
| 740 | 1556 | 260 | 50 | 225 |
| Parameter | Role | Prior Distribution |
|---|---|---|
| , | Category means | Condition-specific normals |
| , | Category standard deviations | Condition-specific truncated normals |
| Sensory noise standard deviation | Condition-specific truncated normal | |
| scale | VAS response scaling factor | N(0.5, 0.25-0.5)* |
| intercept | VAS response intercept | N(50, 20-30)* |
| VAS response noise | Truncated normal |
| Phonetic Condition | ||||
|---|---|---|---|---|
| Vowel formants | N(50,30) | N(150, 30) | N(20, 10)T[1,∞) | N(25, 10)T[1,∞) |
| Vowel duration | N(140, 40) | N(220, 40) | N(30, 15)T[1,∞) | N(35, 15)T[1,∞) |
| Consonant VOT | N(20, 20) | N(40, 20) | N(10, 8)T[1,∞) | N(12, 8)T[1,∞) |
| Consonant F0 | N(195, 15) | N(215, 15) | N(8, 6)T[1,∞) | N(10, 8)T[1,∞) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).