| Literature DB >> 35166414 |
Duy Duong Nguyen1,2, Antonia Chacon1, Christopher Payten1, Rebecca Black1, Meet Sheth1, Patricia McCabe1, Daniel Novakovic1,3,4, Catherine Madill1.
Abstract
BACKGROUND: Previous research has found that high-frequency energy of speech signals decreased while wearing face masks. However, no study has examined the specific spectral characteristics of fricative consonants and vowels and the perception of clarity of speech in mask wearing. AIMS: To investigate acoustic-phonetic characteristics of fricative consonants and vowels and auditory perceptual rating of clarity of speech produced with and without wearing a face mask. METHODS & PROCEDURES: A total of 16 healthcare workers read the Rainbow Passage using modal phonation in three conditions: without a face mask, with a standard surgical mask and with a KN95 mask (China GB2626-2006, a medical respirator with higher barrier level than the standard surgical mask). Speech samples were acoustically analysed for root mean square (RMS) amplitude (ARMS ) and spectral moments of four fricatives /f/, /s/, /ʃ/ and /z/; and amplitude of the first three formants (A1, A2 and A3) measured from the reading passage and extracted vowels. Auditory perception of speech clarity was performed. Data were compared across mask and non-mask conditions using linear mixed models. OUTCOMES &Entities:
Keywords: acoustic analysis; formant amplitude; fricative signal; speech clarity; vowel amplitude
Mesh:
Year: 2022 PMID: 35166414 PMCID: PMC9305964 DOI: 10.1111/1460-6984.12705
Source DB: PubMed Journal: Int J Lang Commun Disord ISSN: 1368-2822 Impact factor: 2.909
FIGURE 1Wide‐band spectrogram illustrating the extraction of the fricative /ʃ/ in Praat. Vertical dashed lines indicate boundaries of this consonant [Colour figure can be viewed at wileyonlinelibrary.com]
Intraclass correlation coefficient (ICC) for interrater reliability of spectral measures for two fricatives /s/ and /ʃ/ in non‐mask
|
|
| ||
|---|---|---|---|
|
|
|
|
|
| Root mean square (RMS) | SM | 0.997 (0.991–0.999) | 0.999 (0.997–1.000) |
| AM | 0.998 (0.996–0.999) | 1.000 (0.999–1.000) | |
| Centre of gravity | SM | 0.998 (0.994–0.999) | 0.996 (0.988–0.998) |
| AM | 0.999 (0.997–1.000) | 0.998 (0.994–0.999) | |
| Standard deviation | SM | 0.987 (0.962–0.995) | 0.966 (0.905–0.988) |
| AM | 0.993 (0.981–0.998) | 0.983 (0.950–0.994) | |
| Skewness | SM | 0.968 (0.910–0.989) | 0.991 (0.974–0.997) |
| AM | 0.984 (0.953–0.994) | 0.995 (0.987–0.998) | |
| Kurtosis | SM | 0.991 (0.976–0.997) | 0.998 (0.995–0.999) |
| AM | 0.996 (0.988–0.999) | 0.999 (0.998–1.000) |
Notes: AM, average measures; CI, confidence interval; and SM, single measures.
All p‐values < 0.001.
FIGURE 2Root mean square (RMS) amplitude of all four fricatives in all conditions. The trend line is shown for each fricative. The p‐values represent the significant levels of parameter estimate of the decrease in RMS amplitude in the KN95 mask compared with non‐mask. NM, non‐mask; and SM, surgical mask [Colour figure can be viewed at wileyonlinelibrary.com]
Results of linear mixed model analysis for the four fricatives
|
|
|
|
|
|---|---|---|---|
| /f/ | [ | ( |
|
| /s/ | [ | ( |
|
| /ʃ/ | [ | ( |
|
| /z/ | [ | ( |
|
FIGURE 3Spectral moments of the four fricatives in all conditions including a trend line for each fricative. 0 = Non‐mask, 1 = surgical mask and 2 = KN95 [Colour figure can be viewed at wileyonlinelibrary.com]
Results of type III tests of fixed effects for spectral moments of four fricatives
|
|
|
|
|
|
|---|---|---|---|---|
| /f/ |
|
|
|
|
| /s/ |
|
|
|
|
| /ʃ/ |
|
|
|
|
| /z/ |
|
|
|
|
Notes: *Significance at 0.05. Significant results are shown in bold.
FIGURE 4Normalized formant amplitude of the Rainbow Passage. Error bars indicate standard deviation. Dots represent individual values. NM, no mask; and SM, surgical mask
FIGURE 5Normalized formant amplitude of three extracted vowels. Error bars indicate standard deviation. Dots represent individual values. NM, no mask; and SM, surgical mask
FIGURE 6Rating score of speech clarity in three conditions including a trend line (rating scale 1–100, higher scores represent poorer speech clarity). 0 = Non‐mask, 1 = surgical mask and 2 = KN95