Youri Maryn, Floris L Wuyts, Andrzej Zarowski.
Abstract
BACKGROUND: Worldwide use of nose-and-mouth-covering respiratory protective masks (RPMs) has become ubiquitous during the COVID-19 pandemic. Consequences of wearing RPMs, especially regarding perception and production of spoken communication, are gradually emerging. The present study explored how three prevalent RPMs affect various speech and voice sound properties.
Keywords: Respiratory protection masks–Speech–Voice–Acoustics
Year: 2021 PMID: 33608184 PMCID: PMC7885637 DOI: 10.1016/j.jvoice.2021.01.013
Source DB: PubMed Journal: J Voice ISSN: 0892-1997 Impact factor: 2.009
FIGURE 1. Illustration of the chaining of the original voice recordings in this study. Top oscillogram: sequence of extracted sound signal segments (3-second sustained [a] vowel, and 2 sentences of read text) of the fifty subjects, chained into one long sound chain of 555.39 seconds. Bottom two oscillograms: sixth (left) and forty-third (right) concatenated sound files (pause, text segment, pause, vowel segment, and pause) with their boundaries designated by an imprinted acoustic mark.
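The chaining described in the Figure 1 caption can be sketched roughly as follows. This is only an illustration of the stated segment order (pause, text segment, pause, vowel segment, pause); the function name `chain_segments`, the pause duration, the sample rate, and the toy sample values are assumptions, and the imprinted acoustic mark is represented here simply by recorded boundary indices.

```python
def chain_segments(subjects, sample_rate=44100, pause_s=0.5):
    """Concatenate per-subject segments into one chain and record boundaries.

    `subjects` is a list of (text_samples, vowel_samples) pairs, each a list
    of floats. Returns (chain, boundaries), where boundaries holds one
    (start, end) sample-index pair per subject file.
    """
    # Silence inserted around each segment; the duration is an assumption.
    pause = [0.0] * int(round(pause_s * sample_rate))
    chain = []
    boundaries = []
    for text, vowel in subjects:
        start = len(chain)
        # Order taken from the caption: pause, text, pause, vowel, pause.
        for part in (pause, text, pause, vowel, pause):
            chain.extend(part)
        boundaries.append((start, len(chain)))
    return chain, boundaries


# Toy data standing in for two subjects (not the study's recordings).
subjects = [([0.1] * 8, [0.2] * 4), ([0.3] * 6, [0.4] * 2)]
chain, boundaries = chain_segments(subjects, sample_rate=10, pause_s=0.2)
```

Keeping the boundaries alongside the chain makes it possible to cut the re-recorded chain back into per-subject files for each mask condition.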
FIGURE 2. Photographs of the VESPA model, as it was situated in the sound-treated room. Top (A, B, C): VESPA model without microphone or RPM. To ensure consistent microphone placement relative to VESPA's sound source, blue dashed elliptic markings were applied to indicate the left (B) and right (C) spots where the microphone's behind-the-neck headband should make contact with the model's head. Bottom (D, E, F, G): VESPA with only microphone as control condition (D) and with microphone plus surgical mask (E), FFP2 mask (F) or transparent mask (G).
Mean (ie, M), Standard Deviation (ie, SD), Minimum (ie, Min) and Maximum (ie, Max) of the 26 Acoustic Measures per Recording Condition (C1: No Mask; C2: Surgical Mask; C3: FFP2 Mask; C4: Transparent Mask) and per Difference Between No-Mask and Mask Conditions
| Acoustic marker | C1 M | C1 SD | C1 Min | C1 Max | C2 M | C2 SD | C2 Min | C2 Max | C3 M | C3 SD | C3 Min | C3 Max | C4 M | C4 SD | C4 Min | C4 Max |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| fO | 175 | 62 | 75 | 367 | 178 | 61 | 76 | 365 | 176 | 63 | 62 | 363 | 168 | 53 | 69 | 290 |
| Difference | – | – | – | – | −2 | 16 | −106 | 5 | −1 | 18 | −107 | 61 | 7 | 42 | −107 | 197 |
| IL | 58.3 | 4.6 | 46.5 | 67.1 | 57.9 | 4.6 | 46.7 | 66.7 | 57.0 | 4.7 | 45.7 | 66.0 | 56.8 | 5.7 | 45.6 | 68.4 |
| Difference | – | – | – | – | 0.4 | 0.4 | −0.6 | 1.7 | 1.3 | 0.6 | 0.0 | 2.5 | 1.5 | 3.0 | −4.2 | 9.8 |
| JL | 1.12 | 1.62 | 0.16 | 7.34 | 1.14 | 1.56 | 0.17 | 7.03 | 1.14 | 1.60 | 0.16 | 6.86 | 1.14 | 1.49 | 0.13 | 7.11 |
| Difference | – | – | – | – | −0.02 | 0.34 | −1.17 | 1.63 | −0.02 | 0.25 | −1.07 | 0.68 | −0.02 | 0.41 | −0.96 | 1.83 |
| SL | 0.59 | 0.48 | 0.09 | 2.06 | 0.63 | 0.44 | 0.10 | 2.00 | 0.66 | 0.45 | 0.11 | 1.88 | 0.79 | 0.50 | 0.09 | 2.03 |
| Difference | – | – | – | – | −0.03 | 0.14 | −0.43 | 0.30 | −0.06 | 0.17 | −0.61 | 0.24 | −0.20 | 0.29 | −0.95 | 0.31 |
| HNR | 15.97 | 6.58 | 2.90 | 29.35 | 15.64 | 6.41 | 2.85 | 28.45 | 15.85 | 6.49 | 2.68 | 28.35 | 15.09 | 6.96 | 1.11 | 29.97 |
| Difference | – | – | – | – | 0.33 | 0.71 | −0.84 | 2.12 | 0.12 | 1.02 | −1.71 | 2.74 | 0.88 | 3.15 | −5.20 | 8.32 |
| CPPS | 11.94 | 3.81 | 4.31 | 21.24 | 11.37 | 3.70 | 3.77 | 19.92 | 11.18 | 3.64 | 3.87 | 19.61 | 10.93 | 3.57 | 3.84 | 18.81 |
| Difference | – | – | – | – | 0.57 | 0.24 | 0.08 | 1.32 | 0.76 | 0.29 | 0.00 | 1.63 | 1.01 | 0.71 | −0.91 | 2.42 |
| AVQI | 4.01 | 1.77 | 0.80 | 8.02 | 4.26 | 1.77 | 1.28 | 8.12 | 4.37 | 1.74 | 1.31 | 8.17 | 4.42 | 1.84 | 1.10 | 8.87 |
| Difference | – | – | – | – | −0.25 | 0.31 | −0.85 | 0.65 | −0.36 | 0.39 | −1.66 | 0.56 | −0.41 | 0.71 | −2.40 | 1.14 |
| F1 | 602 | 158 | 330 | 1068 | 617 | 163 | 328 | 1071 | 617 | 164 | 322 | 1028 | 764 | 177 | 326 | 1027 |
| Difference | – | – | – | – | −15 | 25 | −70 | 43 | −15 | 42 | −93 | 158 | −162 | 204 | −550 | 476 |
| BF1 | 330 | 169 | 52 | 859 | 349 | 181 | 61 | 961 | 318 | 162 | 62 | 842 | 302 | 189 | 48 | 995 |
| Difference | – | – | – | – | −19 | 46 | −172 | 98 | 12 | 64 | −152 | 227 | 28 | 197 | −371 | 544 |
| F2 | 1387 | 220 | 1000 | 1868 | 1377 | 208 | 992 | 1825 | 1331 | 190 | 981 | 1818 | 1274 | 156 | 932 | 1576 |
| Difference | – | – | – | – | 10 | 42 | −88 | 152 | 56 | 95 | −75 | 463 | 113 | 195 | −71 | 736 |
| BF2 | 310 | 162 | 34 | 741 | 323 | 260 | 28 | 1570 | 326 | 290 | 20 | 1659 | 159 | 168 | 9 | 1173 |
| Difference | – | – | – | – | −13 | 225 | −1290 | 410 | −16 | 276 | −1378 | 388 | 151 | 210 | −893 | 535 |
| SM1 | 588 | 193 | 322 | 1329 | 574 | 184 | 307 | 1228 | 557 | 160 | 313 | 1028 | 625 | 207 | 230 | 1023 |
| Difference | – | – | – | – | 14 | 28 | −36 | 103 | 31 | 54 | −33 | 301 | −36 | 183 | −387 | 569 |
| SM2 | 406 | 153 | 157 | 821 | 404 | 136 | 170 | 784 | 378 | 120 | 149 | 733 | 384 | 93 | 177 | 605 |
| Difference | – | – | – | – | 2 | 32 | −95 | 78 | 28 | 53 | −140 | 159 | 22 | 131 | −304 | 336 |
| SM3 | 4.1 | 2.8 | 0.5 | 13.8 | 7.1 | 6.8 | 0.8 | 29.4 | 8.6 | 8.7 | 1.2 | 41.7 | 6.4 | 8.7 | −0.7 | 51.1 |
| Difference | – | – | – | – | −2.9 | 4.2 | −15.7 | 0.8 | −4.5 | 6.1 | −27.9 | 1.2 | −2.2 | 6.8 | −41.3 | 4.2 |
| SM4 | 83 | 122 | 0 | 622 | 296 | 474 | 4 | 2281 | 424 | 735 | 8 | 4046 | 311 | 692 | 8 | 4520 |
| Difference | – | – | – | – | −213 | 356 | −1659 | 18 | −341 | 618 | −3424 | 8 | −228 | 625 | −4219 | 36 |
| SS | −19.1 | 5.4 | −30.1 | −8.2 | −19.0 | 5.4 | −30.2 | −8.7 | −19.6 | 5.3 | −30.9 | −8.5 | −16.3 | 5.5 | −31.2 | −3.7 |
| Difference | – | – | – | – | −0.1 | 0.5 | −1.1 | 0.9 | 0.4 | 0.8 | −0.9 | 2.9 | −2.8 | 3.5 | −9.6 | 5.2 |
| FB1 | 27.8 | 4.4 | 16.2 | 36.4 | 27.3 | 4.4 | 16.4 | 35.9 | 26.5 | 4.5 | 15.4 | 35.2 | 25.7 | 5.3 | 14.1 | 35.2 |
| Difference | – | – | – | – | 0.5 | 0.4 | −0.6 | 1.6 | 1.3 | 0.6 | −0.1 | 2.5 | 2.1 | 3.2 | −4.0 | 9.9 |
| FB2 | 17.5 | 7.6 | −0.6 | 32.7 | 17.4 | 7.6 | −0.7 | 32.7 | 16.1 | 7.7 | −1.4 | 32.2 | 18.9 | 8.1 | 1.5 | 37.2 |
| Difference | – | – | – | – | 0.1 | 0.1 | −0.2 | 0.7 | 1.4 | 0.6 | 0.4 | 3.5 | −1.4 | 1.9 | −4.6 | 3.7 |
| FB3 | 6.6 | 7.6 | −7.9 | 24.3 | 3.9 | 7.6 | −9.9 | 21.1 | 1.4 | 7.6 | −12.6 | 18.4 | −8.7 | 7.2 | −20.8 | 6.6 |
| Difference | – | – | – | – | 2.6 | 0.3 | 2.1 | 3.3 | 5.1 | 0.5 | 4.2 | 6.2 | 15.3 | 1.6 | 10.0 | 17.7 |
| FB4 | −0.4 | 8.2 | −22.6 | 14.7 | −2.5 | 8.3 | −26.4 | 12.5 | −6.0 | 8.2 | −28.0 | 9.2 | −12.9 | 7.3 | −26.9 | 1.5 |
| Difference | – | – | – | – | 2.2 | 0.3 | 1.7 | 3.7 | 5.6 | 0.4 | 4.7 | 6.7 | 12.5 | 1.7 | 4.3 | 14.8 |
| FB5 | −15.3 | 5.7 | −27.1 | 1.1 | −16.5 | 5.2 | −26.5 | −0.3 | −19.2 | 4.9 | −27.5 | −4.6 | −20.9 | 4.4 | −28.0 | −7.0 |
| Difference | – | – | – | – | 1.2 | 1.1 | −1.3 | 4.0 | 3.8 | 1.3 | 0.2 | 5.9 | 5.5 | 2.1 | 0.2 | 9.5 |
| FB6 | −19.4 | 7.2 | −31.2 | −1.2 | −16.0 | 4.3 | −21.5 | −2.3 | −16.7 | 3.6 | −21.8 | −5.3 | −17.3 | 2.8 | −22.5 | −8.4 |
| Difference | – | – | – | – | −3.4 | 4.0 | −12.8 | 3.4 | −2.6 | 4.8 | −13.2 | 5.1 | −2.1 | 5.7 | −14.5 | 7.8 |
| FB7 | −24.6 | 6.0 | −32.7 | −12.5 | −18.4 | 2.5 | −22.9 | −12.5 | −18.8 | 1.8 | −21.8 | −16.1 | −19.3 | 2.0 | −22.8 | −15.5 |
| Difference | – | – | – | – | −6.2 | 4.3 | −13.9 | 1.9 | −5.8 | 5.5 | −15.4 | 4.3 | −5.3 | 5.8 | −13.4 | 6.7 |
| FB8 | −27.7 | 4.9 | −33.8 | −11.7 | −28.4 | 4.2 | −33.8 | −13.7 | −29.7 | 3.6 | −34.1 | −17.2 | −30.1 | 3.4 | −33.9 | −22.2 |
| Difference | – | – | – | – | 0.8 | 3.3 | −9.1 | 11.5 | 2.0 | 3.4 | −6.3 | 13.8 | 2.5 | 4.3 | −5.1 | 12.1 |
| FB9 | −29.9 | 4.3 | −35.6 | −17.3 | −29.7 | 3.2 | −33.6 | −18.5 | −30.4 | 2.7 | −33.4 | −22.1 | −30.8 | 2.9 | −33.8 | −24.4 |
| Difference | – | – | – | – | −0.3 | 3.4 | −6.1 | 12.1 | 0.5 | 3.5 | −7.0 | 12.6 | 0.9 | 4.0 | −5.9 | 11.3 |
| FB10 | −27.4 | 6.4 | −35.9 | −9.7 | −24.3 | 3.8 | −27.6 | −11.0 | −25.2 | 2.7 | −27.9 | −14.7 | −26.3 | 1.2 | −27.4 | −22.8 |
| Difference | – | – | – | – | −3.1 | 3.5 | −9.5 | 5.4 | −2.2 | 4.4 | −9.8 | 6.4 | −1.1 | 5.8 | −8.9 | 13.3 |
C1, condition without mask; C2, condition with surgical mask; C3, condition with FFP2 mask; C4, condition with transparent mask; fO, median fundamental frequency; IL, median sound intensity level; JL, jitter local; SL, shimmer local dB; HNR, harmonics-to-noise ratio; CPPS, smoothed cepstral peak prominence; AVQI, Acoustic Voice Quality Index; F1, first formant; BF1, bandwidth of F1; F2, second formant; BF2, bandwidth of F2; SM, spectral moment; SS, spectral slope; FB, mean energy in 1-kHz frequency bands.
Darker grey boxes indicate nonsignificant differences (corresponding with Wilcoxon test results in TABLE 2).
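The summary and difference rows of the table above can be reproduced in principle with a few lines of Python. The sketch below assumes that differences are taken as no-mask minus with-mask per token (which matches the sign of the tabulated values, for example IL: 58.3 minus 57.9 gives 0.4) and that SD is the sample standard deviation; the latter is an assumption, as are the function names and the toy values, which are not the study's data.

```python
from statistics import mean, stdev


def summarize(values):
    """Return M, SD (sample SD, an assumption), Min and Max of a list."""
    return {
        "M": mean(values),
        "SD": stdev(values),
        "Min": min(values),
        "Max": max(values),
    }


def difference_summary(no_mask, with_mask):
    """Summarize per-token differences, taken as no-mask minus with-mask."""
    diffs = [a - b for a, b in zip(no_mask, with_mask)]
    return summarize(diffs)


# Hypothetical per-token intensity levels for C1 and C2 (illustrative only).
c1 = [58.0, 60.0, 62.0]
c2 = [57.5, 59.0, 62.0]
c1_stats = summarize(c1)
c2_diff = difference_summary(c1, c2)
```

Under this convention a positive difference means the marker was lower with the mask than without it.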
FIGURE 3. Multiple line plots illustrating differences per token in seven vocal physiology-related markers between without-mask condition (C1) and three with-mask conditions (C2, C3, and C4).
FIGURE 4. Multiple line plots illustrating differences per token in nine frequency-domain speech signal properties between without-mask condition (C1) and three with-mask conditions (C2, C3, and C4).
FIGURE 5. Multiple line plots illustrating differences per token in mean energy in ten 1-kHz frequency bands between without-mask condition (C1) and three with-mask conditions (C2, C3, and C4).
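As a rough illustration of what "mean energy in 1-kHz frequency bands" (FB1 to FB10) can mean computationally, the sketch below averages the squared DFT magnitude over the bins falling in each 1-kHz band and expresses it in dB. The source does not state how the band levels were computed or referenced, so the function name `band_levels_db`, the naive DFT, the dB floor, and the test signal are all assumptions made for illustration.

```python
import cmath
import math


def band_levels_db(signal, sample_rate, band_hz=1000.0, n_bands=10):
    """Mean spectral power per band, in dB (arbitrary reference)."""
    n = len(signal)
    power = [0.0] * n_bands
    count = [0] * n_bands
    for k in range(1, n // 2):
        freq = k * sample_rate / n
        band = int(freq // band_hz)
        if band >= n_bands:
            break
        # Naive DFT coefficient for bin k; adequate for short signals.
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        power[band] += abs(coeff) ** 2
        count[band] += 1
    levels = []
    for p, c in zip(power, count):
        mean_power = p / c if c else 0.0
        # Small floor avoids log10(0) for empty or silent bands.
        levels.append(10.0 * math.log10(mean_power + 1e-12))
    return levels


# Test tone at 1500 Hz, which should dominate the second band (1 to 2 kHz).
sample_rate = 20000
tone = [math.sin(2 * math.pi * 1500 * t / sample_rate) for t in range(200)]
levels = band_levels_db(tone, sample_rate)
```

A per-token difference such as those plotted in Figure 5 would then be the no-mask band level minus the with-mask band level for the same token.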
Differences in the Acoustic Markers on the Speech and/or Voice Signals Between the No-Mask and the Three With-Mask Recording Conditions
| Acoustic marker | 2-way ANOVA | Dunnett (C1–C2) | Dunnett (C1–C3) | Dunnett (C1–C4) | Acoustic marker | 2-way ANOVA | Dunnett (C1–C2) | Dunnett (C1–C3) | Dunnett (C1–C4) |
|---|---|---|---|---|---|---|---|---|---|
| fO | 0.168 | 0.924 | 0.994 | 0.277 | SM3 | <0.001 | <0.001 | <0.001 | 0.010 |
| IL | <0.001 | 0.349 | <0.001 | <0.001 | SM4 | <0.001 | 0.021 | <0.001 | 0.012 |
| JL | 0.913 | 0.903 | 0.875 | 0.875 | SS | <0.001 | 0.975 | 0.437 | <0.001 |
| SL | <0.001 | 0.595 | 0.102 | <0.001 | FB1 | <0.001 | 0.319 | <0.001 | <0.001 |
| HNR | 0.024 | 0.568 | 0.959 | 0.013 | FB2 | <0.001 | 0.906 | <0.001 | <0.001 |
| CPPS | <0.001 | <0.001 | <0.001 | <0.001 | FB3 | <0.001 | <0.001 | <0.001 | <0.001 |
| AVQI | <0.001 | 0.001 | <0.001 | <0.001 | FB4 | <0.001 | <0.001 | <0.001 | <0.001 |
| F1 | <0.001 | 0.818 | 0.819 | <0.001 | FB5 | <0.001 | <0.001 | <0.001 | <0.001 |
| BF1 | 0.145 | 0.684 | 0.891 | 0.394 | FB6 | <0.001 | <0.001 | <0.001 | 0.001 |
| F2 | <0.001 | 0.908 | 0.011 | <0.001 | FB7 | <0.001 | <0.001 | <0.001 | <0.001 |
| BF2 | <0.001 | 0.945 | 0.918 | <0.001 | FB8 | <0.001 | 0.333 | <0.001 | <0.001 |
| SM1 | 0.002 | 0.770 | 0.197 | 0.107 | FB9 | 0.071 | 0.908 | 0.578 | 0.144 |
| SM2 | 0.058 | 0.996 | 0.066 | 0.196 | FB10 | <0.001 | <0.001 | <0.001 | 0.114 |
C1, condition without mask; C2, condition with surgical mask; C3, condition with FFP2 mask; C4, condition with transparent mask; Z, Wilcoxon test value; fO, median fundamental frequency; IL, median sound intensity level; JL, jitter local; SL, shimmer local dB; HNR, harmonics-to-noise ratio; CPPS, smoothed cepstral peak prominence; AVQI, Acoustic Voice Quality Index; F1, first formant; BF1, bandwidth of F1; F2, second formant; BF2, bandwidth of F2; SM, spectral moment; SS, spectral slope; FB, mean energy in 1-kHz frequency bands.
Darker grey boxes denote nonsignificant (P > .05) findings.
FIGURE 6. Averaged spectra (with frequency bins of 100 Hz) across the 47 vowel (top) and sentence tokens (bottom) for the four recording conditions, and after zeroing relative to the no-mask spectra: no mask (black), surgical mask (red), FFP2 mask (blue), and transparent mask (green). (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
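The "zeroing relative to the no-mask spectra" mentioned in the Figure 6 caption presumably amounts to subtracting the no-mask level from every condition in each frequency bin, so that the no-mask curve becomes a flat line at 0. A minimal sketch under that assumption, with levels taken to be in dB and with a hypothetical function name and made-up values:

```python
def zero_relative_to_reference(spectra_db, reference="no mask"):
    """Subtract the reference spectrum from every condition, bin by bin."""
    ref = spectra_db[reference]
    return {
        name: [level - r for level, r in zip(levels, ref)]
        for name, levels in spectra_db.items()
    }


# Hypothetical averaged levels for three 100-Hz bins (not the study's data).
spectra = {
    "no mask": [60.0, 50.0, 40.0],
    "surgical mask": [59.0, 48.0, 35.0],
}
zeroed = zero_relative_to_reference(spectra)
```

After zeroing, each mask curve reads directly as the attenuation (negative values) or boost (positive values) relative to the no-mask condition.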