| Literature DB >> 26696914 |
Alasdair D F Clarke1, Micha Elsner2, Hannah Rohde3.
Abstract
In complex stimuli, there are many different possible ways to refer to a specified target. Previous studies have shown that when people are faced with such a task, the content of their referring expression reflects visual properties such as size, salience, and clutter. Here, we extend these findings and present evidence that (i) the influence of visual perception on sentence construction goes beyond content selection and in part determines the order in which different objects are mentioned and (ii) order of mention influences comprehension. Study 1 (a corpus study of reference productions) shows that when a speaker uses a relational description to mention a salient object, that object is treated as being in the common ground and is more likely to be mentioned first. Study 2 (a visual search study) asks participants to listen to referring expressions and find the specified target; in keeping with the above result, we find that search for easy-to-find targets is faster when the target is mentioned first, while search for harder-to-find targets is facilitated by mentioning the target later, after a landmark in a relational description. Our findings show that seemingly low-level and disparate mental "modules" like perception and sentence planning interact at a high level and in task-dependent ways.Entities:
Keywords: referring expressions; visual salience; visual search
Year: 2015 PMID: 26696914 PMCID: PMC4674625 DOI: 10.3389/fpsyg.2015.01793
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Example stimulus used in the production and comprehension studies. In production, participants had to identify a designated target. In comprehension, the four referring expressions for this trial were (i) “at the upper right, the sphinx” [landmark only]; (ii) “at the upper right, the man holding the red vase with a stripe” [target only]; (iii) “at the upper right, the man holding the red vase with a stripe to the left of the sphinx” [landmark follows target]; (iv) “at the upper right, to the left of the sphinx, the man holding the red vase with a stripe on it” [landmark precedes target].
Figure 2Instructions for the picture description task in Clarke et al. (.
One-vs.-all regression effects predicting order of anchor and landmark in relative descriptions.
| intercept | 2.64 | −3.38 | −2.44 | −5.26 |
| anch area | −0.42** | −0.21 | −0.22** | 0.40** |
| anch centr | 0.16* | −0.13 | ||
| anch deps | −0.19 | −0.77** | 0.26** | 0.11 |
| anch = targ | 0.16 | −0.32 | 0.84** | −0.80** |
| anch sal | −0.09 | 0.00 | 0.00 | 0.05 |
| distance | 0.02 | 0.03 | ||
| sign. lr. dist. | −0.01 | 0.01 | ||
| lmk = reg | 15.68** | −∞ | −∞ | −16.42** |
| lmk area | 3.97** | −0.67 | 1.53** | −4.48** |
| lmk centr | −1.12** | −1.03 | −0.03 | 1.37** |
| lmk deps | 0.07 | 1.31** | −0.57** | −0.75** |
| lmk sal | 0.22** | 0.13 | −0.07 | −0.17* |
Figure 3Notched boxplot of reaction time as a function of referring expression order (red: target first, blue: landmark first) grouped by which object is easier to find. Notches represent 95% confidence interval of the median (computed with GGPlot default settings).
Figure 4Plot of median (target-first-landmark-first) reaction time as a function of median (target-landmark) reaction time. Each point represents a stimulus; fitted regression line uses linear model.