Influence of face masks on recalibration of phonetic categories.

Perceptual learning Speech perception Spoken word recognition

Journal

Attention, perception & psychophysics
ISSN: 1943-393X
Titre abrégé: Atten Percept Psychophys
Pays: United States
ID NLM: 101495384

Informations de publication

Date de publication:
Nov 2023
Historique:
accepted: 16 04 2023
medline: 26 10 2023
pubmed: 16 5 2023
entrez: 15 5 2023
Statut: ppublish

Résumé

Previous research demonstrates listeners dynamically adjust phonetic categories in line with lexical context. While listeners show flexibility in adapting speech categories, recalibration may be constrained when variability can be attributed externally. It has been hypothesized that when listeners attribute atypical speech input to a causal factor, phonetic recalibration is attenuated. The current study investigated this theory directly by examining the influence of face masks, an external factor that affects both visual and articulatory cues, on the magnitude of phonetic recalibration. Across four experiments, listeners completed a lexical decision exposure phase in which they heard an ambiguous sound in either /s/-biasing or /ʃ/-biasing lexical contexts, while simultaneously viewing a speaker with a mask off, mask on the chin, or mask over the mouth. Following exposure, all listeners completed an auditory phonetic categorization test along an /ʃ/-/s/ continuum. In Experiment 1 (when no face mask was present during exposure trials), Experiment 2 (when the face mask was on the chin), Experiment 3 (when the face mask was on the mouth during ambiguous items), and Experiment 4 (when the face mask was on the mouth during the entire exposure phase), listeners showed a robust and equivalent phonetic recalibration effect. Recalibration manifested as greater proportion /s/ responses for listeners in the /s/-biased exposure group, relative to listeners in the /ʃ/-biased exposure group. Results support the notion that listeners do not causally attribute face masks with speech idiosyncrasies, which may reflect a general speech learning adjustment during the COVID-19 pandemic.

Identifiants

pubmed: 37188863
doi: 10.3758/s13414-023-02715-3
pii: 10.3758/s13414-023-02715-3
pmc: PMC10185375
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

2700-2717

Informations de copyright

© 2023. The Psychonomic Society, Inc.

Références

Abaluck, J., Kwong, L. H., Styczynski, A., Haque, A., Kabir, M. A., Bates-Jefferys, E., ... & Mobarak, A. M. (2022). Impact of community masking on COVID-19: a cluster-randomized trial in Bangladesh. Science, 375(6577):eabi9069.
Allen, J. S., Miller, J. L., & DeSteno, D. (2003). Individual talker differences in voice-onset-time. The Journal of the Acoustical Society of America, 113(1), 544–552.
pubmed: 12558290 doi: 10.1121/1.1528172
Anwyl-Irvine, A. L., Massonnié, J., Flitton, A., Kirkham, N., & Evershed, J. K. (2020). Gorilla in our midst: An online behavioral experiment builder. Behavior Research Methods, 52(1), 388–407.
pubmed: 31016684 doi: 10.3758/s13428-019-01237-x
Atcherson, S. R., Mendel, L. L., Baltimore, W. J., Patro, C., Lee, S., Pousson, M., & Spann, M. J. (2017). The effect of conventional and transparent surgical masks on speech understanding in individuals with and without hearing loss. Journal of the American Academy of Audiology, 28(01), 058–067.
doi: 10.3766/jaaa.15151
Baart, M., & Vroomen, J. (2010). Phonetic recalibration does not depend on working memory. Experimental Brain Research, 203(3), 575–582.
pubmed: 20437168 pmcid: 2875474 doi: 10.1007/s00221-010-2264-9
Bachorowski, J. A. (1999). Vocal expression and perception of emotion. Current Directions in Psychological Science, 8(2), 53–57.
doi: 10.1111/1467-8721.00013
Balamurali, B. T., Enyi, T., Clarke, C. J., Harn, S. Y., & Chen, J. M. (2021). Acoustic effect of face mask design and material choice. Acoustics Australia, 49(3), 505–512.
pubmed: 34099950 pmcid: 8172558 doi: 10.1007/s40857-021-00245-2
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48.
doi: 10.18637/jss.v067.i01
Bent, T., Baese-Berk, M., Borrie, S. A., & McKee, M. (2016). Individual differences in the perception of regional, nonnative, and disordered speech varieties. The Journal of the Acoustical Society of America, 140(5), 3775–3786.
pubmed: 27908060 doi: 10.1121/1.4966677
Bottalico, P., Murgia, S., Puglisi, G. E., Astolfi, A., & Kirk, K. I. (2020). Effect of masks on speech intelligibility in auralized classrooms. The Journal of the Acoustical Society of America, 148(5), 2878–2884.
pubmed: 33261397 pmcid: 7857496 doi: 10.1121/10.0002450
Champion, J., & Holt, R. (2000). Dental care for children and young people who have a hearing impairment. British Dental Journal, 189(3), 155–159.
pubmed: 11021033 doi: 10.1038/sj.bdj.4800710a
Clopper, C. G., & Pisoni, D. B. (2004). Effects of talker variability on perceptual learning of dialects. Language and Speech, 47(3), 207–238.
pubmed: 15697151 pmcid: 3434464 doi: 10.1177/00238309040470030101
Clopper, C. G., Pisoni, D. B., & De Jong, K. (2005). Acoustic characteristics of the vowel systems of six regional varieties of American English. The Journal of the Acoustical Society of America, 118(3), 1661–1676.
pubmed: 16240825 doi: 10.1121/1.2000774
Coclite, D., Napoletano, A., Gianola, S., Del Monaco, A., D'Angelo, D., Fauci, A., Iacorossi, L., Latina, R., La Torre, G., Mastroianni, C. M., Renzi, C., Castellini, G., & Iannone, P. (2021). Face mask use in the community for reducing the spread of COVID-19: A systematic review. Frontiers in Medicine, 7, 594269.
pubmed: 33511141 pmcid: 7835129 doi: 10.3389/fmed.2020.594269
Cohn, M., Pycha, A., & Zellou, G. (2021). Intelligibility of face-masked speech depends on speaking style: Comparing casual, clear, and emotional speech. Cognition, 210, 104570.
pubmed: 33450446 doi: 10.1016/j.cognition.2020.104570
Corey, R. M., Jones, U., & Singer, A. C. (2020). Acoustic effects of medical, cloth, and transparent face masks on speech signals. The Journal of the Acoustical Society of America, 148(4), 2371–2375.
pubmed: 33138498 pmcid: 7857499 doi: 10.1121/10.0002279
Crinnion, A. M., Malmskog, B., & Toscano, J. C. (2020). A graph-theoretic approach to identifying acoustic cues for speech sound categorization. Psychonomic Bulletin & Review, 27(6), 1104–1125.
doi: 10.3758/s13423-020-01748-1
Crinnion, A. M., Toscano, J. C., & Toscano, C. M. (2022). Effects of experience on recognition of speech produced with a face mask. Cognitive Research: Principles and Implications, 7(1), 1–13.
Derrick, D., Kabaliuk, N., Longworth, L., Pishyar-Dehkordi, P., & Jermy, M. (2022). Speech air flow with and without face masks. Scientific Reports, 12(1), 1–10.
doi: 10.1038/s41598-021-04745-z
Drouin, J. R., & Theodore, R. M. (2018). Lexically guided perceptual learning is robust to task-based changes in listening strategy. The Journal of the Acoustical Society of America, 144(2), 1089–1099.
pubmed: 30180678 pmcid: 6117182 doi: 10.1121/1.5047672
Drouin, J. R., Theodore, R. M., & Myers, E. B. (2016). Lexically guided perceptual tuning of internal phonetic category structure. The Journal of the Acoustical Society of America, 140(4), EL307–EL313.
pubmed: 27794292 pmcid: 6910001 doi: 10.1121/1.4964468
Eikenberry, S. E., Mancuso, M., Iboi, E., Phan, T., Eikenberry, K., Kuang, Y., Kostelich, E., & Gumel, A. B. (2020). To mask or not to mask: Modeling the potential for face mask use by the general public to curtail the COVID-19 pandemic. Infectious Disease Modelling, 5, 293–308.
pubmed: 32355904 pmcid: 7186508 doi: 10.1016/j.idm.2020.04.001
Eisner, F., & McQueen, J. M. (2006). Perceptual learning in speech: Stability over time. The Journal of the Acoustical Society of America, 119(4), 1950–1953.
pubmed: 16642808 doi: 10.1121/1.2178721
Fecher, N., & Watt, D. (2013). Effects of forensically-realistic facial concealment on auditory-visual consonant recognition in quiet and noise conditions. In Auditory-Visual Speech Processing (AVSP) 2013.
Fleming, J. T., & Winn, M. B. (2022). Strategic perceptual weighting of acoustic cues for word stress in listeners with cochlear implants, acoustic hearing, or simulated bimodal hearing. The Journal of the Acoustical Society of America, 152(3), 1300–1316.
pubmed: 36182279 pmcid: 9439712 doi: 10.1121/10.0013890
Gama, R., Castro, M. E., van Lith-Bijl, J. T., & Desuter, G. (2021). Does the wearing of masks change voice and speech parameters? European Archives of Oto-Rhino-Laryngology. https://doi.org/10.1007/s00405-021-07086-9
doi: 10.1007/s00405-021-07086-9 pubmed: 34550454 pmcid: 8456395
Ganong, W. F. (1980). Phonetic categorization in auditory word perception. Journal of Experimental Psychology: Human Perception and Performance, 6(1), 110–125.
pubmed: 6444985
Georgiou, G. P. (2022). Acoustic markers of vowels produced with different types of face masks. Applied Acoustics, 191, 108691.
pubmed: 35250034 doi: 10.1016/j.apacoust.2022.108691
Goldin, A., Weinstein, B., & Shiman, N. (2020). How do medical masks degrade speech perception. Hearing Review, 27(5), 8–9.
Goldstone, R. L. (1998). Perceptual learning. Annual Review of Psychology, 49(1), 585–612.
pubmed: 9496632 doi: 10.1146/annurev.psych.49.1.585
Jaekl, P., Pesquita, A., Alsius, A., Munhall, K., & Soto-Faraco, S. (2015). The contribution of dynamic visual cues to audiovisual speech perception. Neuropsychologia, 75, 402–410.
pubmed: 26100561 doi: 10.1016/j.neuropsychologia.2015.06.025
Jesse, A., & McQueen, J. M. (2011). Positional effects in the lexical retuning of speech perception. Psychonomic Bulletin & Review, 18(5), 943–950.
doi: 10.3758/s13423-011-0129-2
Joo, T., Takeuchi, M., Liu, F., Rivera, M. P., Barr, J., Blum, E. S., . . . Ng, N. L. (2021). Evaluation of particle filtration efficiency of commercially available materials for homemade face mask usage. Aerosol Science and Technology, 55(8), 930–942.
doi: 10.1080/02786826.2021.1905149
Kraljic, T., & Samuel, A. G. (2005). Perceptual learning for speech: Is there a return to normal? Cognitive Psychology, 51(2), 141–178.
pubmed: 16095588 doi: 10.1016/j.cogpsych.2005.05.001
Kraljic, T., & Samuel, A. G. (2006). Generalization in perceptual learning for speech. Psychonomic Bulletin & Review, 13(2), 262–268.
doi: 10.3758/BF03193841
Kraljic, T., & Samuel, A. G. (2011). Perceptual learning evidence for contextually-specific representations. Cognition, 121(3), 459–465.
pubmed: 21939965 pmcid: 3214006 doi: 10.1016/j.cognition.2011.08.015
Kraljic, T., Samuel, A. G., & Brennan, S. E. (2008). First impressions and last resorts: How listeners adjust to speaker variability. Psychological Science, 19(4), 332–338.
pubmed: 18399885 doi: 10.1111/j.1467-9280.2008.02090.x
Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13).
Lenth, R. (2020). emmeans: Estimated marginal means, aka least-squares means (R Package Version 1.4.7). https://CRAN.R-project.org/package=emmeans
Liberman, A. M., Cooper, F. S., Shankweiler, D. P., & Studdert-Kennedy, M. (1967). Perception of the speech code. Psychological Review, 74(6), 431–461.
pubmed: 4170865 doi: 10.1037/h0020279
Liu, L., & Jaeger, T. F. (2018). Inferring causes during speech perception. Cognition, 174, 55–70.
pubmed: 29425987 pmcid: 6553948 doi: 10.1016/j.cognition.2018.01.003
Liu, L., & Jaeger, T. F. (2019). Talker-specific pronunciation or speech error? Discounting (or not) atypical pronunciations during speech perception. Journal of Experimental Psychology: Human Perception and Performance, 45(12), 1562.
pubmed: 31750716
Luthra, S., Mechtenberg, H., & Myers, E. B. (2021). Perceptual learning of multiple talkers requires additional exposure. Attention, Perception, & Psychophysics, 83(5), 2217–2228.
doi: 10.3758/s13414-021-02261-w
MacIntyre, C. R., Nguyen, P. Y., Chughtai, A. A., Trent, M., Gerber, B., Steinhofel, K., & Seale, H. (2021). Mask use, risk-mitigation behaviours and pandemic fatigue during the COVID-19 pandemic in five cities in Australia, the UK and USA: A cross-sectional survey. International Journal of Infectious Diseases, 106, 199–207.
pubmed: 33771668 pmcid: 7985682 doi: 10.1016/j.ijid.2021.03.056
Magee, M., Lewis, C., Noffs, G., Reece, H., Chan, J. C., Zaga, C. J., . . . Vogel, A. P. (2020). Effects of face masks on acoustic analysis and speech perception: Implications for peripandemic protocols. The Journal of the Acoustical Society of America, 148(6), 3562–3568.
pubmed: 33379897 pmcid: 7857500 doi: 10.1121/10.0002873
McAuliffe, M., & Babel, M. (2016). Stimulus-directed attention attenuates lexically-guided perceptual learning. The Journal of the Acoustical Society of America, 140(3), 1727–1738.
pubmed: 27914402 doi: 10.1121/1.4962529
McClelland, J. L., Mirman, D., & Holt, L. L. (2006). Are there interactive processes in speech perception? Trends in Cognitive Sciences, 10(8), 363–369.
pubmed: 16843037 pmcid: 3523348 doi: 10.1016/j.tics.2006.06.007
McGurk, H., & MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 264(5588), 746–748.
pubmed: 1012311 doi: 10.1038/264746a0
McQueen, J. M., Norris, D., & Cutler, A. (2006). The dynamic nature of speech perception. Language and Speech, 49(1), 101–112.
pubmed: 16922064 doi: 10.1177/00238309060490010601
Mendel, L. L., Gardino, J. A., & Atcherson, S. R. (2008). Speech understanding using surgical masks: A problem in health care? Journal of the American Academy of Audiology, 19(09), 686–695.
pubmed: 19418708 doi: 10.3766/jaaa.19.9.4
Nelson, S., & Durvasula, K. (2021). Lexically-guided perceptual learning does generalize to new phonetic contexts. Journal of Phonetics, 84, 101019.
doi: 10.1016/j.wocn.2020.101019
Norris, D., McQueen, J. M., & Cutler, A. (2003). Perceptual learning in speech. Cognitive Psychology, 47(2), 204–238.
pubmed: 12948518 doi: 10.1016/S0010-0285(03)00006-9
Pillot-Loiseau, C., & Harmegnies, B. (2021). Effect of different face masks on speech and singing: Self-perception and acoustic analysis. Annali di Ca’Foscari. Serie occidentale, 55, 9–28.
Rahne, T., Fröhlich, L., Plontke, S., & Wagner, L. (2021). Influence of surgical and N95 face masks on speech perception and listening effort in noise. PLOS ONE, 16(7), e0253874.
pubmed: 34197513 pmcid: 8248731 doi: 10.1371/journal.pone.0253874
Saeidi, R., Huhtakallio, I., & Alku, P. (2016, September). Analysis of face mask effect on speaker recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association (pp. 1800–1804). Interspeech.
Samuel, A. G. (2016). Lexical representations are malleable for about one second: Evidence for the non-automaticity of perceptual recalibration. Cognitive Psychology, 88, 88–114.
pubmed: 27423485 doi: 10.1016/j.cogpsych.2016.06.007
Samuel, A. G., & Kraljic, T. (2009). Perceptual learning for speech. Attention, Perception, & Psychophysics, 71(6), 1207–1218.
doi: 10.3758/APP.71.6.1207
Scherer, K. R. (2003). Vocal communication of emotion: A review of research paradigms. Speech Communication, 40(1/2), 227–256.
doi: 10.1016/S0167-6393(02)00084-5
Smiljanic, R., Keerstock, S., Meemann, K., & Ransom, S. M. (2021). Face masks and speaking style affect audio-visual word recognition and memory of native and non-native speech. The Journal of the Acoustical Society of America, 149(6), 4013–4023.
pubmed: 34241444 pmcid: 8269755 doi: 10.1121/10.0005191
Story, B. H., Titze, I. R., & Hoffman, E. A. (2001). The relationship of vocal tract shape to three voice qualities. The Journal of the Acoustical Society of America, 109(4), 1651–1667.
pubmed: 11325134 doi: 10.1121/1.1352085
Sumby, W. H., & Pollack, I. (1954). Visual contribution to speech intelligibility in noise. The Journal of the Acoustical Society of America, 26(2), 212–215.
doi: 10.1121/1.1907309
Theodore, R. M., Miller, J. L., & DeSteno, D. (2009). Individual talker differences in voice-onset-time: Contextual influences. The Journal of the Acoustical Society of America, 125(6), 3974–3982.
pubmed: 19507979 pmcid: 2806434 doi: 10.1121/1.3106131
Tiippana, K. (2014). What is the McGurk effect? Frontiers in Psychology, 5, 725.
pubmed: 25071686 pmcid: 4091305 doi: 10.3389/fpsyg.2014.00725
Toscano, J. C., & McMurray, B. (2010). Cue integration with categories: Weighting acoustic cues in speech using unsupervised learning and distributional statistics. Cognitive Science, 34(3), 434–464.
pubmed: 21339861 pmcid: 3039883 doi: 10.1111/j.1551-6709.2009.01077.x
Toscano, J. C., & Toscano, C. M. (2021). Effects of face masks on speech recognition in multi-talker babble noise. PLOS ONE, 16(2), e0246842.
pubmed: 33626073 pmcid: 7904190 doi: 10.1371/journal.pone.0246842
Truong, T. L., & Weber, A. (2021). Intelligibility and recall of sentences spoken by adult and child talkers wearing face masks. The Journal of the Acoustical Society of America, 150(3), 1674–1681.
pubmed: 34598631 pmcid: 8487070 doi: 10.1121/10.0006098
Tzeng, C. Y., Nygaard, L. C., & Theodore, R. M. (2021). A second chance for a first impression: Sensitivity to cumulative input statistics for lexically guided perceptual learning. Psychonomic Bulletin & Review, 28(3), 1003–1014.
doi: 10.3758/s13423-020-01840-6
Van de Rijt, L. P., Roye, A., Mylanus, E. A., Van Opstal, A. J., & Van Wanrooij, M. M. (2019). The principle of inverse effectiveness in audiovisual speech perception. Frontiers in Human Neuroscience, 13, 335.
pubmed: 31611780 pmcid: 6775866 doi: 10.3389/fnhum.2019.00335
Van Linden, S., & Vroomen, J. (2008). Audiovisual speech recalibration in children. Journal of Child Language, 35(4), 809–822.
pubmed: 18838013 doi: 10.1017/S0305000908008817
Woods, K. J., Siegel, M. H., Traer, J., & McDermott, J. H. (2017). Headphone screening to facilitate web-based auditory experiments. Attention, Perception, & Psychophysics, 79(7), 2064–2072.
doi: 10.3758/s13414-017-1361-2
Yi, H., Pingsterhaus, A., & Song, W. (2021). Effects of wearing face masks while using different speaking styles in noise on speech intelligibility during the COVID-19 pandemic. Frontiers in Psychology, 12, 682677.
Zhang, X., & Samuel, A. G. (2014). Perceptual learning of speech under optimal and adverse conditions. Journal of Experimental Psychology: Human Perception and Performance, 40(1), 200.

Auteurs

Julia R Drouin (JR)

Division of Speech and Hearing Sciences, University of North Carolina School of Medicine, Chapel Hill, NC, USA. julia_drouin@med.unc.edu.
Department of Communication Sciences and Disorders, California State University Fullerton, Fullerton, CA, USA. julia_drouin@med.unc.edu.

Jose A Rojas (JA)

Department of Communication Sciences and Disorders, California State University Fullerton, Fullerton, CA, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH