| Literature DB >> 32627053 |
Louise Schubotz1, Judith Holler2,3, Linda Drijvers1,4, Aslı Özyürek1,4,5.
Abstract
When comprehending speech-in-noise (SiN), younger and older adults benefit from seeing the speaker's mouth, i.e. visible speech. Younger adults additionally benefit from manual iconic co-speech gestures. Here, we investigate to what extent younger and older adults benefit from perceiving both visual articulators while comprehending SiN, and whether this is modulated by working memory and inhibitory control. Twenty-eight younger and 28 older adults performed a word recognition task in three visual contexts: mouth blurred (speech-only), visible speech, or visible speech + iconic gesture. The speech signal was either clear or embedded in multitalker babble. Additionally, there were two visual-only conditions (visible speech, visible speech + gesture). Accuracy levels for both age groups were higher when both visual articulators were present compared to either one or none. However, older adults received a significantly smaller benefit than younger adults, although they performed equally well in speech-only and visual-only word recognition. Individual differences in verbal working memory and inhibitory control partly accounted for age-related performance differences. To conclude, perceiving iconic gestures in addition to visible speech improves younger and older adults' comprehension of SiN. Yet, the ability to benefit from this additional visual information is modulated by age and verbal working memory. Future research will have to show whether these findings extend beyond the single word level.Entities:
Mesh:
Year: 2020 PMID: 32627053 PMCID: PMC8289811 DOI: 10.1007/s00426-020-01363-8
Source DB: PubMed Journal: Psychol Res ISSN: 0340-0727
Fig. 1Experimental overview. a Overview of conditions. Action words are in Dutch: lopen (“to walk”), fietsen (“to cycle”), rijden (“to drive”). b Trial structure. Answer options are in Dutch: strijden (“to fight”, phonological competitor), sturen (“to steer”, semantic competitor), afgieten (“to drain”, unrelated foil), rijden (“to drive”, target)
Fig. 2Response accuracy in percent per age group and condition. Error bars represent SE. The dotted line separates the audiovisual trials (left) from the visual-only trials (right). ***p < 0.001, **p < 0.01, *p < 0.05
Model predicting response accuracy in multimodal trials, age group = young and visual modality = visible speech are on the intercept. N = 56
| Intercept | .97 | .07 | 13.49 | < .001 |
| Age group | – .40 | .10 | – 4.07 | < .001 |
| Visual modality | –.83 | .07 | – 11.32 | < .001 |
| Visual modality | 1.17 | .10 | 12.15 | < .001 |
| Age group | .25 | .10 | 2.42 | .02 |
| Age group | –.32 | .13 | – 2.55 | .01 |
Models predicting response accuracy in speech-only (mouth blurred), visible speech, and visible speech + gesture trials, age group = young and noise = SNR -18 are on the intercept. N = 56a
| Intercept | – .75 | .07 | – 11.07 | < .001 | .59 | .13 | 4.60 | < .001 | 1.91 | .15 | 12.40 | < .001 |
| Hearing acuity (PTA) | – .15 | .05 | – 3.12 | .002 | – | – | – | – | – | – | – | – |
| Age group | –b | – | – | – | – .57 | .18 | – 3.13 | .002 | – .99 | .20 | – 4.93 | < .001 |
| Noise | 3.64 | .15 | 24.29 | < .001 | 3.17 | .29 | 11.08 | < .001 | 2.57 | .40 | 6.44 | < .001 |
| Noise | – .24 | .09 | – 2.57 | .01 | – .37 | .13 | – 2.93 | .003 | – .25 | .17 | – 1.49 | .14 |
| Noise | n.a.c | n.a | n.a | n.a | – .51 | .13 | – 4.08 | < .001 | – .46 | .16 | – 2.78 | .006 |
| Age group | – | – | – | – | .60 | .41 | 1.48 | .14 | .41 | .50 | .82 | .41 |
| Age group | – | – | – | – | .01 | .18 | .04 | .97 | .32 | .22 | 1.47 | .14 |
| Age group | n.a | n.a | n.a | n.a | .43 | .18 | 2.47 | .01 | .72 | .21 | 3.33 | < .001 |
aIn the model predicting response accuracy in the speech-only (mouth blurred) condition, N = 55
bA hyphen indicates a non-significant predictor that was eliminated in the model-comparison process
cNote that there were no visual-only trials in the speech-only (mouth blurred) condition
Model predicting the size of the relative visual benefit, age group = young, benefit type = gestural benefit, and noise = SNR -18 are on the intercept. N = 56
| Intercept | .51 | .04 | 14.28 | < .001 |
| Age group | – .14 | .03 | – 4.50 | < .001 |
| Benefit type | – .07 | .04 | – 1.53 | .13 |
| Benefit type | .24 | .04 | 5.68 | < .001 |
| Noise | .11 | .04 | 2.67 | .008 |
| Benefit type | – .20 | .06 | – 3.23 | .001 |
| Benefit type | – .11 | .06 | – 1.80 | .07 |
Fig. 3Relative benefit per age group, noise level, and benefit type. The black line represents the median; the two hinges represent the 1st and 3rd quartile; the whiskers capture the largest and smallest observation but extend no further than 1.5 * IQR (data points outside 1.5 * IQR are represented by dots)