| Literature DB >> 23227018 |
Pia Knoeferle1, Helene Kreysa.
Abstract
During comprehension, a listener can rapidly follow a frontally seated speaker's gaze to an object before its mention, a behavior which can shorten latencies in speeded sentence verification. However, the robustness of gaze-following, its interaction with core comprehension processes such as syntactic structuring, and the persistence of its effects are unclear. In two "visual-world" eye-tracking experiments participants watched a video of a speaker, seated at an angle, describing transitive (non-depicted) actions between two of three Second Life characters on a computer screen. Sentences were in German and had either subject(NP1)-verb-object(NP2) or object(NP1)-verb-subject(NP2) structure; the speaker either shifted gaze to the NP2 character or was obscured. Several seconds later, participants verified either the sentence referents or their role relations. When participants had seen the speaker's gaze shift, they anticipated the NP2 character before its mention and earlier than when the speaker was obscured. This effect was more pronounced for SVO than OVS sentences in both tasks. Interactions of speaker gaze and sentence structure were more pervasive in role-relations verification: participants verified the role relations faster for SVO than OVS sentences, and faster when they had seen the speaker shift gaze than when the speaker was obscured. When sentence and template role-relations matched, gaze-following even eliminated the SVO-OVS response-time differences. Thus, gaze-following is robust even when the speaker is seated at an angle to the listener; it varies depending on the syntactic structure and thematic role relations conveyed by a sentence; and its effects can extend to delayed post-sentence comprehension processes. These results suggest that speaker gaze effects contribute pervasively to visual attention and comprehension processes and should thus be accommodated by accounts of situated language comprehension.Entities:
Keywords: eye tracking; sentence structure; speaker gaze; visual context effects; visually situated sentence comprehension
Year: 2012 PMID: 23227018 PMCID: PMC3514542 DOI: 10.3389/fpsyg.2012.00538
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Overview of the experimental conditions (congruence is not depicted).
| Condition | Gaze | Sentence structure | Video | Sentence |
|---|---|---|---|---|
| (a) | Gaze | SVO | ||
| (b) | Gaze | OVS | ||
| (c) | No gaze | SVO | ||
| (d) | No gaze | OVS |
The English translation of the SVO sentence is “the waiter congratulates the millionaire outside the shop,” while the OVS sentence implies that the waiter is .
Figure 1Verification templates. (A) Experiment 1, referent verification: Were the circled characters mentioned in the sentence? (B) Experiment 2, thematic role relations verification: Does the arrow reflect the thematic role relations of the sentence?
Figure 2Experiment 1: Proportion of fixations to (A) the target character, (B) the competitor, (C) the NP1 referent, and (D) the speaker region and the background. All graphs begin at the onset of the speaker’s gaze shift. Mean onsets of the NP2 and the ending phrase are marked with vertical gray bars.
Experiment 1, SHIFT time window: Mean log-gaze probability ratios by condition for fixations to the target character (a) over the competitor or (b) over the NP1 referent.
| Gaze | No gaze | Total | ||
|---|---|---|---|---|
| (a) Target vs. | SVO | 0.39 (2.92) | −0.98 (2.65) | −0.3 (2.86) |
| competitor fixations | OVS | 1.34 (2.68) | 0.25 (2.76) | 0.8 (2.77) |
| Total | 0.86 (2.84) | −0.36 (2.77) | 0.25 (2.87) | |
| (b) Target vs. N1 | SVO | −0.27 (2.74) | −1.51 (3.02) | −0.89 (2.94) |
| referent fixations | OVS | 0.38 (2.42) | −0.78 (2.68) | −0.20 (2.62) |
| Total | 0.05 (2.60) | −1.14 (2.87) | −0.55 (2.81) |
A positive number indicates preferred inspection of the target character; negative numbers preferred inspection of the competitor/NP1 referent. .
Experiment 1, SHIFT time window: Coefficients, .
| By participants | by items | |||||
|---|---|---|---|---|---|---|
| Coefficient | Coefficient | |||||
| Intercept | 0.25 | 0.20 | 1.25 | 0.30 | 0.17 | 1.83 |
| Gaze | −1.22 | 0.33 | −1.24 | 0.31 | ||
| Structure | −1.09 | 0.33 | −0.81 | 0.36 | ||
| Time window | 0.17 | 0.06 | 0.16 | 0.05 | ||
| Gaze × structure | −0.28 | 0.06 | −0.43 | −0.52 | 0.69 | −0.75 |
| Gaze × time bin | −0.45 | 0.12 | −0.47 | 0.09 | ||
| Structure × time bin | <0.01 | 0.09 | 0.3 | 0.05 | 0.09 | 0.59 |
| Intercept | −0.54 | 0.23 | −0.61 | 0.16 | ||
| Gaze | −1.20 | 0.39 | −1.15 | 0.30 | ||
| Structure | −0.69 | 0.27 | −0.46 | 0.27 | −1.73 | |
| Time window | 0.22 | 0.05 | 0.18 | 0.03 | ||
| Gaze × structure | −0.09 | 0.63 | −0.14 | −0.57 | 0.45 | −1.27 |
| Gaze × time bin | −0.33 | 0.10 | −0.30 | 0.06 | ||
| Structure × time bin | 0.03 | 0.06 | 0.44 | <0.01 | 0.08 | −0.04 |
t-values in bold indicate a significant effect.
Experiment 1, NP2 time window: Coefficients, .
| By participants | By items | |||||
|---|---|---|---|---|---|---|
| Coefficient | Coefficient | |||||
| Intercept | 1.71 | 0.16 | 1.60 | 0.15 | ||
| Gaze | −2.65 | 0.37 | −2.88 | 0.38 | ||
| Structure | −0.32 | 0.23 | −1.39 | −0.36 | 0.30 | −1.22 |
| Time bin | 0.26 | 0.04 | 0.22 | 0.04 | ||
| Gaze × structure | −0.25 | 0.64 | −0.38 | −0.41 | 0.75 | −0.55 |
| Gaze × time bin | −0.02 | 0.11 | −0.22 | 0.01 | 0.08 | 0.15 |
| Structure × time bin | 0.21 | 0.08 | 0.09 | 0.11 | 0.89 | |
t-values in bold indicate a significant effect.
Figure 3Experiment 2: Proportion of fixations to the target character from the onset of the speaker’s gaze shift. Mean onsets of the NP2 and the ending phrase are marked with vertical gray bars.
Experiment 2, SHIFT time window: Mean log-gaze probability ratios by condition for fixations to the target character (a) over the competitor or (b) over the NP1 referent.
| Gaze | No gaze | Total | ||
|---|---|---|---|---|
| (a) Target vs. | SVO | 0.70 (2.67) | −0.54 (2.69) | 0.08 (2.75) |
| competitor fixations | OVS | 1.01 (2.99) | 0.36 (2.72) | 0.69 (2.87) |
| Total | 0.85 (2.83) | −0.09 (2.74) | 0.38 (2.83) | |
| (b) Target vs. N1 | SVO | −0.16 (2.76) | −1.67 (2.61) | −0.92 (2.79) |
| referent fixations | OVS | −0.53 (2.63) | −0.57 (2.68) | −0.55 (2.65) |
| Total | −0.35 (2.70) | −1.12 (2.70) | −0.73 (2.72) | |
.
Experiment 2, SHIFT time window: Coefficients, .
| By participants | By items | |||||
|---|---|---|---|---|---|---|
| Estimate | Estimate | |||||
| Intercept | 0.38 | 0.17 | 0.34 | 0.21 | 1.65 | |
| Gaze | −0.94 | 0.36 | −1.23 | 0.29 | ||
| Structure | −0.60 | 0.38 | −1.58 | −0.55 | 0.35 | −1.55 |
| Time bin | 0.23 | 0.05 | 0.22 | 0.05 | ||
| Gaze × structure | −0.59 | 0.64 | −0.92 | −0.33 | 0.73 | −0.45 |
| Gaze × time bin | −0.37 | 0.10 | −0.42 | 0.09 | ||
| Structure × time bin | 0.16 | 0.09 | 1.85 | 0.24 | 0.11 | |
| Intercept | −0.73 | 0.25 | −0.84 | 0.18 | ||
| Gaze | −0.77 | 0.26 | −0.87 | 0.25 | ||
| Structure | −0.37 | 0.32 | −1.13 | −0.39 | 0.27 | −1.45 |
| Time bin | 0.24 | 0.04 | 0.22 | 0.03 | ||
| Gaze × structure | 1.48 | 0.63 | −0.65 | 0.58 | −1.12 | |
| Gaze × time bin | −0.39 | 0.08 | −0.28 | 0.05 | ||
| Structure × time bin | 0.02 | 0.08 | 0.28 | 0.12 | 0.08 | 1.45 |
t-values in bold indicate a significant effect.
Experiment 2, NP2 time window: coefficients, .
| By participants | By items | |||||
|---|---|---|---|---|---|---|
| Estimate | Estimate | |||||
| Intercept | 1.78 | 0.18 | 1.78 | 0.13 | ||
| Gaze | −2.76 | 0.37 | −2.73 | 0.27 | ||
| Structure | −0.29 | 0.34 | −0.86 | −0.36 | 0.39 | −0.92 |
| Time Bin | 0.12 | 0.04 | 0.12 | 0.05 | ||
| Gaze × structure | −1.10 | 0.50 | −0.07 | 0.61 | −0.11 | |
| Gaze × time bin | 0.08 | 0.09 | 0.88 | 0.11 | 0.09 | 1.25 |
| Structure × time bin | −0.06 | 0.08 | −0.77 | −0.05 | 0.08 | −0.71 |
Descriptive statistics are included in the main text.
t-values in bold indicate a significant effect.
Figure 4Experiment 2: Interaction effect of gaze-following, sentence structure and congruence on response times. Since we collapsed across the factor speaker, the bars labeled “gaze-following” include both gaze and no gaze trials, whenever the target character was fixated in the SHIFT time window. In these trials, participants’ response times to OVS sentences were shorter than for OVS sentences when they did not follow gaze, but only when the template matched the sentence. Error bars are SE.
| Final model structure | Sigma | |||
|---|---|---|---|---|
| Response times | logrt ∼ match + gaze + struc + (1 + match + gaze + struc | participant) + (1 + item) | 0.61 | 0.08 | |
| logrt ∼ match*gazefollow + (1 + match*gazefollow | participant) + (1 + item) | 0.60 | 0.19 | ||
| Log-gaze probability ratio | Participants | lograt ∼ gaze*struc + gaze*time + struc*time + (1 + gaze*struc + gaze*time + struc*time + participant) | >0.7 | <1.5 |
| items | lograt ∼ gaze*struc + gaze*time + struc*time + (1 + gaze*struc + gaze*time + struc*time + item) | >0.7 | <1.3 | |
| First fixation latency | logfixlat ∼ gaze + struc + (1 + gaze + struc | participant) + (1 + gaze + struc | item) | 0.30 | 0.63 | |
| Response times | logrt ∼ match*gaze + struc + (1 + match + gaze + struc | participant) + (1 | item) | 0.57 | 0.21 | |
| logrt ∼ match*gazefollow*struc + (1 + gazefollow + match*struc ∼ participant) + (1 ∼ item) | 0.56 | 0.21 | ||
| Log-gaze probability ratio | Participants | lograt ∼ gaze*struc + gaze*time + struc*time + (1 + gaze*struc + gaze*time + struc*time | participant) | >0.7 | <1.5 |
| items | lograt ∼ gaze*struc + gaze*time + struc*time + (1 + gaze*struc + gaze*time + struc*time | item) | >0.7 | <1.3 | |
| First fixation latency | logfixlat ∼ gaze*struc + (1 + gaze*struc | participant) + (1 + gaze + struc | item) | 0.24 | 0.69 | |