| Literature DB >> 29558514 |
Abstract
We investigated the effect of auditory noise added to speech on patterns of looking at faces in 40 toddlers. We hypothesised that noise would increase the difficulty of processing speech, making children allocate more attention to the mouth of the speaker to gain visual speech cues from mouth movements. We also hypothesised that this shift would cause a decrease in fixation time to the eyes, potentially decreasing the ability to monitor gaze. We found that adding noise increased the number of fixations to the mouth area, at the price of a decreased number of fixations to the eyes. Thus, to our knowledge, this is the first study demonstrating a mouth-eyes trade-off between attention allocated to social cues coming from the eyes and linguistic cues coming from the mouth. We also found that children with higher word recognition proficiency and higher average pupil response had an increased likelihood of fixating the mouth, compared to the eyes and the rest of the screen, indicating stronger motivation to decode the speech.Entities:
Mesh:
Year: 2018 PMID: 29558514 PMCID: PMC5860771 DOI: 10.1371/journal.pone.0194491
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Differential heatmap for patterns of fixations in the high level of noise and no noise conditions.
We estimated a smooth kernel distribution with standard Gaussian bandwidth separately for the empirical distribution of fixations in the no noise and high level of noise conditions. Next, we calculated the difference between corresponding probability density functions of the two estimates. The bounded area shows regions where the number of fixations was higher in the high level of noise condition than in the no noise condition. Warmer colors indicate bigger difference. Rectangles represent the position of dynamic eyes and mouth AOIs used in the task. The frame from the clip was filtered to preserve the anonymity of the person in the video.
The fixed effect estimates of the mixed-effects multinomial logistic regression, with the likelihood of looking at the eyes rather than the mouth and at the rest of the screen rather than the mouth modelled as a function of the noise level, word recognition proficiency score, age and average pupil response.
| b | SE | t | p | 95% CI | |||
|---|---|---|---|---|---|---|---|
| Eyes vs. mouth | |||||||
| Fixed effect | |||||||
| Intercept | 1.90 | 0.67 | 2.84 | .01 | [0.59, 3.21] | ||
| Age | 0.63 | 0.29 | 2.22 | .03 | [0.07, 1.19] | ||
| WRPS | -5.19 | 1.51 | -3.43 | .001 | [-8.15, -2.22] | ||
| Random effect | |||||||
| Noise level = 3 | -0.55 | 0.22 | -2.52 | .01 | [-0.97, -0.12] | ||
| Noise level = 2 | -0.44 | 0.22 | -1.99 | .05 | [-0.88, -0.01] | ||
| APR | -0.18 | 0.07 | -2.71 | .01 | [-0.31, -0.05] | ||
| Rest of the screen vs. mouth | |||||||
| Fixed effect | |||||||
| Intercept | 1.76 | 0.51 | 3.42 | .001 | [0.75, 2.76] | ||
| Age | 0.43 | 0.22 | 2.00 | 0.05 | [0.01, 0.86] | ||
| WRPS | -3.04 | 1.15 | -2.65 | .01 | [-5.28, -0.79] | ||
| Random effect | |||||||
| Noise level = 3 | -0.56 | 0.17 | -3.32 | .001 | [-0.88, -0.23] | ||
| Noise level = 2 | -0.07 | 0.17 | -0.42 | .68 | [-0.40, 0.26] | ||
| APR | -0.28 | 0.05 | -5.33 | <.001 | [-0.37, -0.17] | ||
Notes: WRPS = Word Recognition Proficiency Score, APR = Average Pupil Response.
Fig 2Proportion of fixations to the mouth and to the eyes depending on the noise level.