| Literature DB >> 34241444 |
Rajka Smiljanic1, Sandie Keerstock2, Kirsten Meemann1, Sarah M Ransom1.
Abstract
Though necessary, protective mask wearing in response to the COVID-19 pandemic presents communication challenges. The present study examines how signal degradation and loss of visual information due to masks affects intelligibility and memory for native and non-native speech. We also test whether clear speech can alleviate perceptual difficulty for masked speech. One native and one non-native speaker of English recorded video clips in conversational speech without a mask and conversational and clear speech with a mask. Native English listeners watched video clips presented in quiet or mixed with competing speech. The results showed that word recognition and recall of speech produced with a mask can be as accurate as without a mask in optimal listening conditions. Masks affected non-native speech processing at easier noise levels than native speech. Clear speech with a mask significantly improved accuracy in all listening conditions. Speaking clearly, reducing noise, and using surgical masks as well as good signal amplification can help compensate for the loss of intelligibility due to background noise, lack of visual cues, physical distancing, or non-native speech. The findings have implications for communication in classrooms and hospitals where listeners interact with teachers and healthcare providers, oftentimes non-native speakers, through their protective barriers.Entities:
Year: 2021 PMID: 34241444 PMCID: PMC8269755 DOI: 10.1121/10.0005191
Source DB: PubMed Journal: J Acoust Soc Am ISSN: 0001-4966 Impact factor: 1.840
FIG. 1.(Color online) Experiment overview. Each participant heard one of the talkers (native talker or non-native) and one noise condition (quiet, +5 dB SNR, and 0 dB SNR; an additional −5 dB SNR was used for the native talker only) (A). Each listener saw video clips from all three speaking style and mask conditions: conversational speech no mask; conversational speech mask, clear speech mask. Speaking style and mask were counterbalanced across listeners (B). Immediately after each audio-video clip, listeners were instructed to type what they heard to test intelligibility. At the end of each block, they were asked to answer content-related questions to test recall.
FIG. 2.Intelligibility (top panels) and recall (bottom panels) for the native talker in quiet (n = 32), in 6T +5 dB SNR (n = 40), in 6T 0 dB SNR (n = 38), and in 6T −5 dB SNR (n = 32). ***, p < 0.001; **, p < 0.01; *, p < 0.05; ns, no statistical difference (mixed-effect logistic regression simple effects analysis).
FIG. 3.Intelligibility (top panels) and recall (bottom panels) for the non-native talker in quiet (n = 45), in 6T +5 dB SNR (n = 32), and in 6T 0 dB SNR (n = 32). ***, p < 0.001; **, p < 0.01; *, p < 0.05; ns, no statistical difference (mixed-effect logistic regression simple effects analysis).
Summary of analysis of deviance table (type III Wald chi square tests).
| Model | Term | Chi squared | Df | |
|---|---|---|---|---|
| Overall three-way interaction mixed-effect logistic regression model | StyleMask:Noise:Talker | 30.4164 | 4 | <0.001 |
| Simple two-way interaction mixed-effect logistic regression model for native talker | StyleMask:Noise | 76.6756 | 6 | <0.001 |
| Simple two-way interaction mixed-effect logistic regression model for non-native talker | StyleMask:Noise | 82.053 | 4 | <0.001 |
Degrees of freedom (Df).
Summary of simple effects. Conv., Conversational speech; Clear, clear speech; ***, p < 0.001.
| Term | Estimate | Standard error | ||
|---|---|---|---|---|
| Conv. Mask vs Conv. NoMask | 0.11996 | 0.07024 | 1.708 | 0.087670 |
| Clear Mask vs Conv. NoMask | 0.27478 | 0.07137 | 3.850 | 0.000118 |
| Clear Mask vs Conv. Mask | 0.15482 | 0.07288 | 2.124 | 0.0336 |
| Conv. Mask vs Conv. NoMask | −0.43171 | 0.07007 | −6.161 | 7.22E−10 |
| Clear Mask vs Conv. NoMask | 0.14067 | 0.07119 | 1.976 | 0.048170 |
| Clear Mask vs Conv. Mask | 0.57237 | 0.06991 | 8.188 | 2.67E−16 |
| Conv. Mask vs Conv. NoMask | −0.78223 | 0.07177 | −10.899 | <2E−16 |
| Clear Mask vs Conv. NoMask | −0.20675 | 0.06916 | −2.989 | 0.0028 |
| Clear Mask vs Conv. Mask | 0.57549 | 0.07166 | 8.031 | 9.70E−16 |
| Conv. Mask vs Conv. NoMask | −0.02931 | 0.09059 | −0.324 | 0.7462 |
| Clear Mask vs Conv. NoMask | 0.23478 | 0.09358 | 2.509 | 0.0121 |
| Clear Mask vs Conv. Mask | 0.26409 | 0.0935 | 2.824 | 0.00474 |
| Conv. Mask vs Conv. NoMask | −0.14581 | 0.07547 | −1.932 | 0.053352 |
| Clear Mask vs Conv. NoMask | 0.28618 | 0.08044 | 3.558 | 0.000374 |
| Clear Mask vs Conv. Mask | 0.43198 | 0.07827 | 5.519 | 3.41E−08 |
| Conv. Mask vs Conv. NoMask | −0.18207 | 0.06793 | −2.68 | 0.00735 |
| Clear Mask vs Conv. NoMask | 0.35672 | 0.07074 | 5.043 | 4.58E−07 |
| Clear Mask vs Conv. Mask | 0.53879 | 0.07014 | 7.681 | 1.58E−14 |
| Conv. Mask vs Conv. NoMask | −0.81299 | 0.07098 | −11.454 | <2E−16*** |
| Clear Mask vs Conv. NoMask | 0.08543 | 0.07008 | 1.219 | 0.223 |
| Clear Mask vs Conv. Mask | 0.89842 | 0.07158 | 12.552 | < 2E−16*** |