
The effect of spatial organization of targets and distractors on the capacity to selectively memorize objects in visual short-term memory.

Aymen Ben Abbes¹, Emmanuelle Gavault², Thierry Ripoll².

Abstract

We conducted a series of experiments to explore how the spatial configuration of objects influences the selection and the processing of these objects in a visual short-term memory task. We designed a new experiment in which participants had to memorize 4 targets presented among 4 distractors. Targets were cued during the presentation of distractor objects. Their locations varied according to 4 spatial configurations. From the first to the last configuration, the distance between targets' locations was progressively increased. The results revealed a high capacity to select and memorize targets embedded among distractors even when targets were extremely distant from each other. This capacity is discussed in relation to the unitary conception of attention, models of split attention, and the competitive interaction model. Finally, we propose that the spatial dispersion of objects has different effects on attentional allocation and processing stages. Thus, when targets are extremely distant from each other, attentional allocation becomes more difficult while processing becomes easier. This finding implies that these 2 aspects of attention need to be more clearly distinguished in future research.


Keywords: cueing; selective attention; spatial organization; visual short-term memory

Year: 2014 PMID: 25339978 PMCID: PMC4197781 DOI: 10.5709/acp-0160-x

Source DB: PubMed Journal: Adv Cogn Psychol ISSN: 1895-1171

Introduction

The visual working memory system, responsible for the short-term retention and the manipulation of visual information, is subject to severe storage capacity limitations (Cowan, 2001; Jiang, Olson, & Chun, 2000; Pashler, 1988; Phillips, 1974; Rensink, 2000; Vogel, Woodman, & Luck, 2001). To overcome this constraint, our visual system uses mechanisms which enable us to select the relevant information to be stored. In this context, many studies clearly show that this selection and transfer of information in visual short-term memory (VSTM) are controlled by bottom-up and top-down attentional processes (Cowan & Morey, 2006; Henderson & Hollingworth, 1999; Schmidt, Vogel, Woodman, & Luck, 2002; Vogel, Woodman, & Luck, 2005). However, even though we know that attending to a particular location or object improves its transfer into VSTM, we do not know how attentional allocation to multiple locations leads to the transfer of multiple corresponding objects in VSTM. This study was conducted in order to explore this capacity. Indeed, we propose to test whether the VSTM capacity is reduced when there is more than one to-be-stored object and when these objects are interspersed with distractors. Furthermore, we try to explore the effect of the spatial distribution of these objects between distractors on VSTM capacity. Such questions are ecologically important since the to-be-stored objects are frequently combined with other irrelevant objects in natural environment. As a consequence, the evaluation of VSTM capacity in a noisy environment is relevant and informative. Moreover, these questions are theoretically relevant, as they involve the complex relationships which link attention to VSTM (Awh, Vogel, & Oh, 2006). As said above, the spatial organization of objects is one of the variables we investigate since, as Jiang et al. (2000) suggest, spatial configuration is the framework supporting VSTM. 
Thus, any change in the spatial configuration of objects is accompanied by a facilitation or interference in selecting and storing these objects. The predictions we can make concerning the impact of such manipulation are related to a huge theoretical background. Indeed, there is no consensus about the characteristics of selective spatial attention, and there are opposite views that describe the impact of the spatial organization of objects on the capacity to select them. In this context, we can differentiate three theoretical positions that could make different predictions according to the different types of spatial configuration of objects. First, according to the traditional “spotlight,” “zoom lens”, and “gradient” theories, an attentional “beam” facilitates the processing of only the stimuli that are located within its focus (Eriksen & St. James, 1986). Attentional facilitation is limited to a single region, excluding any possibility to prioritize targets dispersed among distractors (Heinze et al., 1994; McCormick & Klein, 1990). A clear prediction from this theoretical approach is that identifying and memorizing targets should be more difficult when dispersion increases: Global performance should be maximal when targets are contiguous and clearly separated from distractors. When targets and distractors are fully intermixed (high level of dispersion), the cueing of targets should not improve significantly the global performance. Although a lot of empirical data are consistent with the unitary conception of attention, several recent studies, taking an approach different from spotlight theory, support the hypothesis that attention can be simultaneously applied over multiple distant locations or objects (Awh & Pashler, 2000; Bundesen, Habekost, & Kyllingsbaeck, 2005; Gobell, Tseng, & Sperling, 2005; McMains & Somers, 2005; Ripoll, Albert, Ben Abbes, & Gavault, 2008). 
If multiple foci of attention at different locations can be allocated simultaneously, it should be possible to take advantage of multiple location cues in a visual memory task. And yet, a clear prediction from this theoretical approach of attention remains difficult for two main reasons (for more details, see Wright & Richard, 2003). First, if we consider experimental paradigms used to study split attention, in most cases, only two attentional locations were cued (and so there is a real lack of empirical and coherent data for visual stimuli involving more than two targets). Second, it is generally accepted (apart from McMains & Somers, 2005) that the division of attention has a cost. Consequently, as in the unitary spotlight approach, the hypothesis of multiple attentional foci would predict that performance decreases as dispersion increases. Given the difficulty of estimating quantitatively how the performance should be affected by the splitting of attention, no clear empirical and crucial prediction can differentiate the unitary from the multi-focal conception of attention. A third interesting approach, the competitive interaction model (Bahcall & Kowler, 1999; Caputo & Guerra, 1998; Cutzu & Tsotsos, 2003; McCarley, Mounts, & Kramer, 2004; Mounts & Gavett, 2004), inspired by the biased competition model of Desimone and Duncan (1995), leads to different predictions. According to this model, objects compete with one another for representation within the visual system. Consequently, attentional selection enhances the processing of the selected objects at the representational expense of other objects in the near visual environment. The competition between two objects is maximal when these objects are in close spatial proximity and decreases when they are distant because these objects compete only to the extent that they draw from the same pool of receptive fields. 
For example, Bahcall and Kowler (1999) showed that the comparison of two targets is both more rapid and accurate when the distance between them increases and when the targets are separated by one or more distractors. Recently, Franconeri, Alvarez, and Enns (2007) observed the same effect, that is, accuracy in a visual search task diminishes as the spatial separation between targets decreases. To summarize, in the following five experiments, we tried to evaluate the VSTM capacity using a visual environment in which targets and distractors were simultaneously present. We investigated how the targets/distractors organization determined both the deployment of attention and the capacity to select and memorize the targets. In our paradigm, the participants were presented with a circular array of eight objects: four distractors and four targets. Their task was to memorize the identities and the locations of the targets while ignoring the distractors. Time of exposure was short (150 ms), limiting the possibility of moving the eyes among the elements of the circular array. Furthermore, targets and distractors were physically similar: No feature properties (such as color or abrupt onsets) allowed for pre-attentively distinguishing targets from distractors. Only spatial information, given at the beginning of each trial, could be used to prioritize targets. Thus, the capacity to select targets among distractors efficiently will only depend on the characteristics of attentional deployment.

Experiment 1

In this first experiment, we investigated how temporal factors and stimulus organization (i.e., the dispersion level of the targets) determined performance in a VSTM task. Participants had to memorize four targets presented among four distractors. Targets differed from distractors only in their location and their history. The four distractors were presented prior to the presentation of targets. Thus, these distractors played the role of cues by informing the participants about the future locations of the targets: Targets appeared where distractors were absent. After a variable delay during which neither distractors nor targets were present, both objects appeared simultaneously for 150 ms. Three main factors were manipulated. The first two factors concerned temporal aspects. First, between subjects, we varied the distractors presentation time (100, 300, or 500 ms). These values have been chosen because they concord with the classical estimation of the time necessary for an endogenous deployment of attention (Müller & Rabbitt, 1989). Such time manipulation from 100 to 500 ms seems to be suited to evaluate the dynamics of the attentional process involved in this task. The second temporal aspect we manipulated involved the inter-stimulus interval (ISI) that we varied between subjects: The time between the offset of distractors and the onset of the entire array in which the four targets and the four distractors appeared. ISI could be of 50 or 900 ms. The last and the most crucial factor concerned the spatial organization of the targets among distractors. Four conditions of spatial organization were manipulated, as a within-subjects factor, defining four increasing levels of dispersion. In the first condition (C1), no distractor was present between the four targets. Consequently, the four targets could be considered as present within one and the same contiguous spatial area. In this case, targets’ dispersion was minimal. 
In the second condition (C2), targets were distributed across two non-contiguous areas, separated by at least one distractor. In the third condition (C3), targets were distributed across three non-contiguous areas; and in the last condition (C4), all targets were separated from one another by one intervening distractor. Thus, the level of dispersion increased from C1 to C4. Finally, two control groups were distinguished. In the first group (control group 4), only the four targets were present whereas in the second group (control group 8), targets and distractors were both present, and participants did not have information about target locations. As a consequence, they could not distinguish targets from distractors.

Method

Participants

A total of 80 undergraduate students (43 male and 37 female; mean age = 24.6, range 19-27) volunteered for this experiment, 10 in each group (“control group 4”, “control group 8”, and “preview groups”: 100/50 [100 ms corresponding to the distractors presentation time and 50 ms corresponding to the ISI], 300/50, 500/50, 100/900, 300/900, and 500/900). All the participants reported normal or corrected-to-normal visual acuity.

Apparatus

The experiment was conducted on a Macintosh computer with a 14” screen and was programmed with PsyScope software® (Cohen, MacWhinney, Flatt, & Provost, 1993). Participants were tested individually in a dimly lit room. They sat at a distance of about 70 cm from the computer screen.

Stimuli

All stimuli were presented in grey on a white background. Memory arrays consisted of eight 1 × 1 cm objects (square, circle, triangle, heart, star, cross, diamond, crescent) evenly spaced on an imaginary circle with a radius of 4.7° that was centred on a fixation cross. The stimuli were arranged in a circular array display with a fixation point in its centre to ensure that retinal resolution was constant for any possible stimulus location. The spatial organization of the targets among distractors was systematically controlled. As indicated previously, four conditions of spatial organization (C1, C2, C3, and C4) were distinguished according to the locations of targets among distractors (Figure 1).
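The geometry described above can be made concrete with a small sketch. It converts the 4.7° radius into on-screen centimetres and the 1 cm object size into degrees of visual angle, assuming the viewing distance of about 70 cm given in the Apparatus section. The starting angle of the first position is not stated in the text, so `start_deg` is a hypothetical parameter, and all function names are illustrative.

```python
import math

VIEWING_DISTANCE_CM = 70.0  # "about 70 cm" (Apparatus section)
RADIUS_DEG = 4.7            # eccentricity of the imaginary circle
OBJECT_SIZE_CM = 1.0        # objects were 1 x 1 cm
N_POSITIONS = 8             # eight evenly spaced objects


def deg_to_cm(angle_deg, distance_cm=VIEWING_DISTANCE_CM):
    """Distance from fixation on the screen for a given eccentricity."""
    return distance_cm * math.tan(math.radians(angle_deg))


def cm_to_deg(size_cm, distance_cm=VIEWING_DISTANCE_CM):
    """Visual angle subtended by an object centred on the line of sight."""
    return math.degrees(2 * math.atan(size_cm / (2 * distance_cm)))


def array_positions(n=N_POSITIONS, radius_deg=RADIUS_DEG, start_deg=0.0):
    """(x, y) screen coordinates in cm, relative to the fixation cross.

    start_deg is an assumption: the text does not say where the first
    position sits on the circle.
    """
    radius_cm = deg_to_cm(radius_deg)
    step = 360.0 / n
    return [
        (radius_cm * math.cos(math.radians(start_deg + i * step)),
         radius_cm * math.sin(math.radians(start_deg + i * step)))
        for i in range(n)
    ]


if __name__ == "__main__":
    print(f"circle radius on screen: {deg_to_cm(RADIUS_DEG):.2f} cm")
    print(f"object size: {cm_to_deg(OBJECT_SIZE_CM):.2f} deg")
    for x, y in array_positions():
        print(f"({x:6.2f}, {y:6.2f})")
```

Because every position lies at the same distance from fixation, eccentricity (and therefore retinal resolution) is matched across the eight locations, which is the property the circular layout is meant to guarantee.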

Figure 1.

Examples of the four types of configuration used in Experiment 1: Configuration 1 (C1) in which targets were presented within a single spatial area, Configuration 2 (C2) in which targets were distributed across two spatially noncontiguous areas, Configuration 3 (C3) in which targets were distributed across three spatially noncontiguous areas, and Configuration 4 (C4) in which targets were distributed across four spatially noncontiguous areas. Note that arrows did not appear on the screen and just indicate the targets.

Eighty entire arrays (targets plus distractors) were built, 20 for each configuration. During the experiment, each array was presented two times as a function of a to-be-recognized object (probe) that appeared at target locations: once with an identical probe and once with a different probe.

Procedure

Each subject completed 24 practice trials and 160 experimental trials randomly presented (see Figure 2). Participants initiated each trial by pressing the space bar. They were asked to focus on the central cross for the entire trial. Each trial began with an articulatory suppression task: Two-digit numbers were presented for 500 ms at fixation, and the participants were asked to repeat them at a rate of 3-4 digits per second for the entire trial (Schmidt et al., 2002). This was followed by a 1,500-ms delay. From that moment, the procedure was quite different for the six preview groups and for the two control groups (control group 4 and control group 8).

Figure 2.

Example of a valid trial in the experimental condition (Experiment 1). Note that the numbers are used for the parallel articulatory suppression which had to be performed throughout stimulus presentation. These numbers were always presented at the same locations. ISI = inter-stimulus interval.

In preview groups, the distractor objects appeared for 100 ms (100 ms condition), 300 ms (300 ms condition), or 500 ms (500 ms condition). Following the offset of the distractors, an ISI of 50 ms or 900 ms occurred. At the end of the ISI period, the memory array consisting of the eight objects appeared for 150 ms. Participants were asked to memorize only the new objects (targets) in the memory array. In control group 8, the full memory array (four targets and four distractors) appeared for 150 ms. In control group 4, only the four targets were presented. The offset of the memory array was followed by a delay of 900 ms to ensure that VSTM rather than sensory memory was tested. The probe object then appeared, always at a target location. The participants responded on a standard keyboard by pressing the “q” button to indicate that the probe object was identical to the target at the same location and the “p” button if probe and target were different. The probe and target shape were identical in half of the trials.
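The sequence of events in a trial can be summarized in a short sketch that lists the events and their durations for each group, using only the timings stated in the Procedure. The function and event names are illustrative, and the probe duration is left as `None` because the text does not state it.

```python
from itertools import product

# Durations taken from the Procedure (ms)
DIGITS_MS = 500          # two-digit number for articulatory suppression
PRE_DELAY_MS = 1500      # delay after the digits
MEMORY_ARRAY_MS = 150    # memory array exposure
RETENTION_MS = 900       # delay between array offset and probe

PREVIEW_TIMES_MS = (100, 300, 500)  # distractors presentation time
ISIS_MS = (50, 900)                 # inter-stimulus interval


def trial_timeline(preview_ms=None, isi_ms=None):
    """Ordered list of (event, duration_ms) for one trial.

    Pass preview_ms=None for the control groups, which saw no preview.
    The probe duration is not given in the text, hence None.
    """
    events = [("digits", DIGITS_MS), ("delay", PRE_DELAY_MS)]
    if preview_ms is not None:
        events += [("distractor preview", preview_ms), ("ISI", isi_ms)]
    events += [
        ("memory array", MEMORY_ARRAY_MS),
        ("retention delay", RETENTION_MS),
        ("probe", None),
    ]
    return events


def time_to_probe(events):
    """Total time (ms) from trial start to probe onset."""
    return sum(d for name, d in events if name != "probe")


if __name__ == "__main__":
    for preview_ms, isi_ms in product(PREVIEW_TIMES_MS, ISIS_MS):
        timeline = trial_timeline(preview_ms, isi_ms)
        print(f"{preview_ms}/{isi_ms}: probe onset at {time_to_probe(timeline)} ms")
    print(f"control: probe onset at {time_to_probe(trial_timeline())} ms")
```

Crossing the three preview durations with the two ISIs yields the six preview groups; the two control groups share the same timeline and differ only in the content of the memory array.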

Results

The percentage of correct responses was calculated for each condition. Mean accuracies are reported in Table 1.

Table 1.

Percentage of Correct Responses as a Function of Configuration Type and Conditions

ISI	Distractors presentation time	C1	C2	C3	C4	Mean
50 ms	100 ms	69 ± 02	69 ± 02	63 ± 04	68 ± 04	67 ± 03
50 ms	300 ms	69 ± 03	70 ± 04	70 ± 03	78 ± 02	71 ± 03
50 ms	500 ms	76 ± 03	68 ± 02	69 ± 02	80 ± 01	73 ± 02
50 ms	Mean	71 ± 03	69 ± 03	67 ± 03	74 ± 03	
900 ms	100 ms	73 ± 03	70 ± 03	68 ± 02	79 ± 02	73 ± 02
900 ms	300 ms	79 ± 02	73 ± 02	69 ± 02	79 ± 02	75 ± 02
900 ms	500 ms	79 ± 01	71 ± 02	75 ± 03	80 ± 02	76 ± 02
900 ms	Mean	77 ± 02	71 ± 02	71 ± 02	79 ± 02	
Control 4		75 ± 02	81 ± 02	76 ± 03	80 ± 02	78 ± 02
Control 8		65 ± 02	61 ± 03	65 ± 02	64 ± 02	64 ± 02

Note. Data are shown ± standard errors. ISI = inter-stimulus interval.

First, data were submitted to an analysis of variance (ANOVA) with Type of Presentation as between-subjects factor (comparison between control group 4, control group 8, and the preview group) and with Type of Configuration (C1, C2, C3, and C4) as within-subject factor. This analysis revealed a significant main effect of the type of presentation, F(2, 77) = 12.89, p < .001, ηp² = .25. In order to analyze this global effect, planned comparisons were conducted. Performance of control group 4 was better than performance of control group 8, F(1, 77) = 24.80, p < .001, ηp² = .24. More interestingly, participants of preview groups who previewed the distractors had better performance than participants of control group 8, F(1, 77) = 16.00, p < .001, ηp² = .17. Nevertheless, the performance of participants of the preview groups did not reach the level of performance of participants of control group 4, F(1, 77) = 6.30, p = .01, ηp² = .07. Second, the effect of the type of configuration was significant, F(3, 231) = 3.10, p = .02, ηp² = .04. This effect was modulated by the type of presentation, F(6, 231) = 2.40, p = .02, for the effect of interaction between the Type of Configuration and the Type of Presentation. Subsequently, in order to analyze the effect of spatial configuration, we conducted an ANOVA, with Type of Configuration (C1, C2, C3, and C4) as within-subject factor, separately for each group. An ANOVA with the Type of Configuration as the only within-subject factor applied to each control group revealed no significant effect of the type of configuration in control group 4 (p = .50) as well as in control group 8 (p = .45). In preview groups, an analysis with the Type of Configuration as within-subject factor, and ISI and Distractors Presentation Time as between-subjects factors was conducted. 
The main effects of distractors presentation time and ISI were significant; F(2, 54) = 3.40, p < .05, ηp² = .11; and F(1, 54) = 5.90, p < .05, ηp² = .09, respectively. The interaction between these two factors was not significant (p = .84). Performance was better with a 900-ms ISI than with a 50-ms ISI. Furthermore, planned comparisons revealed that participants who previewed the distractors for 500 ms performed better than participants who previewed the distractors for 100 ms, F(1, 54) = 6.48, p < .01, ηp² = .10; but neither the difference between the 500 ms preview group and the 300 ms preview group nor the difference between the 300 ms preview group and the 100 ms preview group was significant; F(1, 54) = 0.60, p = .43; and F(1, 54) = 3.00, p = .08, respectively. This ANOVA also revealed a strong main effect of the type of configuration (see Figure 3), F(3, 162) = 22.10, p < .001, ηp² = .29. We conducted a post hoc analysis (Fisher’s LSD) which showed that participants achieved better performances in condition C4 than in conditions C1, C2, and C3 (all ps < .001). Furthermore, performance in condition C1 was better than performance in conditions C2 and C3 (p < .001). No difference was found between performance in conditions C2 and C3 (p = .37). Neither the interaction between Type of Configuration and Distractors Presentation Time (p = .09) nor the interaction between Type of Configuration and ISI (p = .63) was significant.
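For readers who wish to relate the reported statistics to one another, the following sketch uses the standard identity linking partial eta squared to an F ratio and its degrees of freedom, ηp² = (F × df_effect) / (F × df_effect + df_error), together with the usual error degrees of freedom for a mixed design. The function names are illustrative and are not taken from the original analysis; because the F values are themselves reported to two decimals, recomputed effect sizes can differ from the printed ones in the second decimal.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared recovered from an F ratio and its df."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


def between_error_df(n_participants, n_groups):
    """Error df for a between-subjects effect: N minus number of groups."""
    return n_participants - n_groups


def within_error_df(n_levels, between_df):
    """Error df for a within-subject effect in a mixed design."""
    return (n_levels - 1) * between_df


if __name__ == "__main__":
    # Overall analysis: 80 participants, 3 presentation types
    # (control 4, control 8, preview groups pooled), 4 configurations.
    df_b = between_error_df(80, 3)
    print("between error df:", df_b)
    print("within error df:", within_error_df(4, df_b))

    # Preview groups only: 60 participants in 6 groups.
    df_p = between_error_df(60, 6)
    print("preview between error df:", df_p)
    print("preview within error df:", within_error_df(4, df_p))

    # Effect sizes recomputed from reported F values.
    print(f"{partial_eta_squared(12.89, 2, 77):.2f}")
    print(f"{partial_eta_squared(22.10, 3, 162):.2f}")
```

The degrees of freedom obtained this way match those reported for the overall analysis (77 and 231) and for the preview-group analysis (54 and 162).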

Figure 3.

Percentage of correct responses in experimental groups as a function of Configuration Type (Experiment 1).

Discussion

Results clearly show that participants can take advantage of the prior presentation of distractors to selectively process and memorize four targets among distractors. This selective processing is quite remarkable since, if participants are given sufficient time to accurately encode four target locations (e.g., 500 ms in one of the preview conditions), performance is close to that in control group 4. For configurations C1 and C4, the performance of preview groups was as good as performance in control group 4, as if participants could perfectly ignore the distractors. The selective processing of targets is strongly modulated by temporal parameters. After a period as short as 100 ms, participants began to take advantage of the preview. This benefit is maximal for a presentation time of distractors of 300 ms and 500 ms. By increasing distractors presentation duration, participants can extract more information about future target locations and can allocate their attention to these locations more accurately. The ISI manipulation shows that attentional deployment onto target locations cannot result from an automatic capture by target onset. Indeed, an abrupt onset by the targets might have automatically captured attention with a 50-ms ISI but not with a 900-ms ISI (Theeuwes, 1991; Yantis & Hillstrom, 1994). Yet, the results showed a small but significant difference in favor of a 900-ms ISI: Endogenous attention is, thus, clearly involved in this capacity to memorize targets among distractors. This finding is of some importance because it shows a clear difference with a similar phenomenon that Watson and Humphreys (1997) have revealed in visual marking experiments. We need to remember that in visual marking, a preview benefit is observed only when new items onset. Indeed, the preview benefit was abolished when the new items were isoluminant with the background (Donk & Theeuwes, 2001) or when old items disappeared for more than 400 ms before appearing again with new items. 
In our paradigm, targets and distractors onset simultaneously (as is the case in natural environments) preventing any sort of sensory facilitation. Thus, these findings support previous results highlighting the existence of top-down goal-based mechanisms that bias inputs into VSTM (Gavault & Ripoll, 2004; Schmidt et al., 2002). Our findings, thus, complement earlier studies showing that bottom-up factors (peripheral cues, popout, perceptual organization) have an impact on memory storage (e.g., Woodman, Vecera, & Luck, 2003).

Spatial organization effect

The spatial organization effect takes an unexpected and very interesting form. First, there is not a linear relation between accuracy and target dispersion: Accuracy is higher for the minimal and maximal levels of dispersion (conditions C1 and C4) and lower for intermediate levels (conditions C2 and C3). Second, performance is the best when dispersion is maximal. In this case, the performance reaches a very high degree of accuracy in a way that the performance in condition C4 does not differ from performance in control group 4. This last result is especially important because it shows that the presence of distractors between each target does not present a real difficulty for the visual system. We carried out an additional analysis to test whether the accuracy varied with the relative spatial position of the probed target. We did not observe any effect of such factor. In the same line, we did not find any dispersion effect in the two control groups. We could then deduce that the dispersion effect in experimental groups is not a consequence of variation in the ability to identify and memorize targets at different locations along the circular array. Moreover, since the location of targets is exactly the same in preview groups and control group 4, we can conclude that attentional parameters are responsible for the observed dispersion effects. Similarly, Cutzu and Tsotsos (2003), in a quite different visual matching task, found that effects of inter-target separation disappeared when attention was not cued before the onset of the circular array. From a theoretical point of view, the absence of a dispersion effect in control group 4 is particularly interesting because it suggests that this effect is not a consequence of any low-level sensory masking effects (lateral masking or crowding), but is contingent upon the spatial distribution of attention within the display. 
In other words, it is not the intrinsic property of targets’ organization which is responsible for the dispersion effect but the kind of attentional distribution that the targets/distractors organization involves. On the whole, this general pattern of results does not concord with the unitary conception of attention since the performance is the highest when dispersion is maximal. Such results seem relatively compatible with the competitive interaction model. It explains perfectly that the highest level of performance is observed in condition C4. In this condition, targets are not in close spatial proximity and they do not draw on the same pool of receptive fields. As a consequence, their competition and so, their mutual interference were reduced. In this condition, encoding and consolidation in VSTM are optimal because the distance between targets was maximal. Nevertheless, an aspect of these data is not consistent with the competitive interaction model. Indeed, the performance is higher in condition C1 than in conditions C2 and C3 whereas the distance between targets is minimal in condition C1. We will discuss this discrepancy later. A full understanding of this pattern of findings will require some methodological considerations. One of the potentially most important methodological concerns is linked to the circular organization of the eight objects in the final array. Only two variants of condition C4 can be created (diamond organization and square organization) whereas many more different variants are possible for conditions C1, C2, and C3. Consequently, the frequency of the two variants of condition C4 is higher than that of the several possible variants of conditions C1, C2, and C3 in such a way that a simple frequency of spatial pattern effect could explain the surprisingly high level of performance in condition C4. 
Therefore, we designed an experiment for neutralizing this potential bias by using only two variants for each type of configuration (C1, C2, C3, and C4).
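The imbalance in the number of possible layouts can be checked by enumeration. The sketch below assumes positions are numbered 0 to 7 around the circle and classifies each choice of four target positions by the number of non-contiguous target areas it forms (1 for C1 up to 4 for C4). It counts every layout that differs in absolute position as a distinct variant, which is an assumption about how variants are defined; the function names are illustrative.

```python
from collections import Counter
from itertools import combinations

N_POSITIONS = 8  # objects on the circular array
N_TARGETS = 4    # targets among them


def count_target_areas(targets, n_positions=N_POSITIONS):
    """Number of contiguous runs of targets on the circle.

    A run starts at every target whose preceding neighbour
    (going around the circle) is not a target.
    """
    target_set = set(targets)
    if len(target_set) == n_positions:
        return 1  # the whole circle is a single run
    return sum(
        1 for p in target_set if (p - 1) % n_positions not in target_set
    )


def layouts_per_condition(n_positions=N_POSITIONS, n_targets=N_TARGETS):
    """Map number of target areas (1..4, i.e., C1..C4) to layout count."""
    counts = Counter()
    for targets in combinations(range(n_positions), n_targets):
        counts[count_target_areas(targets, n_positions)] += 1
    return dict(counts)


if __name__ == "__main__":
    for areas, n in sorted(layouts_per_condition().items()):
        print(f"C{areas}: {n} possible layouts")
```

Under these assumptions, the enumeration gives 8 layouts for C1, 36 for C2, 24 for C3, and only 2 for C4, so with 20 arrays per configuration each C4 layout necessarily recurs far more often than any single layout in the other conditions.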

Experiment 2

The aim of Experiment 2 was to make sure that the configuration effects found in Experiment 1 were not due to a possible frequency effect. Twenty-four undergraduate students (10 male and 14 female; mean age = 22.9, range 20-26) with normal or corrected-to-normal vision participated in this experiment which concerned only one preview condition (300/900). The same apparatus as in Experiment 1 was used. We chose only two versions for each type of the four conditions of configuration. The procedure was the same as in Experiment 1 but we restricted the comparison to the preview group. We selected an ISI of 900 ms and a distractors presentation time of 300 ms because the configuration effects were very clear with these temporal parameters. The percentage of correct responses was calculated for each type of configuration. Mean accuracies are plotted in Figure 4. A repeated measures ANOVA with Type of Configuration as within-subject factor showed a significant main effect, F(3, 69) = 3.56, p < .05, ηp² = .13. Post hoc tests (Fisher’s LSD) yielded significant differences between performances for Configuration 4 and those for the other configurations (Configurations 1, 2, and 3; ps < .01). No difference was detected when contrasting C1 with C2 and C1 with C3 (p = .397). Finally, performances with C2 did not differ from performances with C3 (p = .95).

Figure 4.

Percentage of correct responses in experimental groups as a function of Configuration Type (Experiment 2).

Taken as a whole, the pattern of results is very similar to the pattern of results found in the previous experiment. Performance is better in condition C4 than in any other condition. Consequently, a possible frequency bias cannot explain the high level of performance observed in condition C4. Nevertheless, there is still another important factor, confounded with the level of dispersion, which could explain the high level of performance in condition C4: The level of dispersion of this condition is maximal but its organization has a special and unique characteristic. Indeed, the two configurations in condition C4 have the status of a good form because the four targets are organized either in a square-object shape or in diamond-object shape with perfect symmetry. Thus, level of dispersion and form goodness are confounded factors. Such a confounding is problematic since previous research has shown that perceptual organization in general can bias the storage of visual information (Woodman et al., 2003) or the capacity to track moving objects (Yantis, 1992). So, performance in condition C4 could be very high not because the level of dispersion is maximal, but because the good form of targets organization allows the subjects to allocate their attention more easily and more accurately to target locations. The aim of the next experiment was to dissociate the impact of the “good form” from that of dispersion of targets. To do so, we contrasted a condition in which targets are organized according to a regular spatial configuration to another one in which this spatial organization is considered as perceptually irregular. Since condition C4 allows only regular configurations, we increased the number of distractors to overcome this problem. 
This manipulation makes it possible to design two types of spatial configuration that either do or do not have the good-form property while maintaining the same high level of dispersion (i.e., at least one distractor between each pair of adjacent targets). In this way, we can dissociate these two factors and evaluate them separately.

Experiment 3

Twelve undergraduate students (seven male and five female; mean age = 24.3, range 21-26) with normal or corrected-to-normal vision volunteered for this experiment. The same apparatus as in Experiment 1 was used. The only difference from the previous experiments is that two additional objects were used: The memory array contained six distractors and four targets. Targets were separated by one or two distractors. Only condition C4 was manipulated and could take two different forms: a regular configuration (good form of targets: square and diamond configurations) or an irregular configuration (Figure 5). A total of 80 entire arrays (targets plus distractors) were built, 40 for each configuration.

Figure 5.

Example of regular and irregular C4 configurations used in Experiment 3. Note that arrows did not appear on the screen and are just used here to indicate the targets.

The procedure was exactly the same as in preview groups of Experiment 1. As in Experiment 2, the ISI was 900 ms, and the distractors presentation time was 300 ms. Mean accuracies were calculated as a function of the type of configuration. A repeated measures ANOVA with Type of Configuration as within-subject factor yielded a main effect of this factor, F(1, 11) = 12.60, p < .01, ηp² = .53. Performance was better when targets were presented in a regular configuration than when they were presented in an irregular configuration. The better performance in the “good form” condition shows that it is easier to select and memorize targets among distractors when these targets can be organized in a good form. Such a finding is consistent with the “chunking” account suggesting that the VSTM capacity can be improved by grouping items into an integrated object or shape. This combination leads to a better selection and then, to a more sophisticated and complete VSTM storage (Jiang et al., 2000; Miller, 1956). Given that attention plays a crucial role in the selection of visual information in VSTM, one can infer that it is easier to allocate attention to different locations when these locations are organized in a good form. However, the high performance observed in condition C4 in the two previous experiments cannot be exclusively explained by this factor. Indeed, performance in the irregular configuration condition remains relatively high (77%). Such a result is all the more surprising as the spatial distance between locations is smaller than in the previous experiments. Consequently, even if Gestalt principles of organization play a role in the capacity to select targets among distractors (e.g., Yantis, 1992), the dispersion level of the targets seems to be a crucial determinant of performance. 
Globally, these results suggest that memorizing targets is easier when they are distant and separated from each other by distractors, whether or not these objects are organized in a good form.
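The effect size reported for Experiment 3 can be related to its F ratio. The following is a minimal sketch, assuming the standard identity between partial eta squared and an F ratio with its degrees of freedom; the function name is hypothetical and is not taken from the authors' analysis.

```python
def partial_eta_squared(f_value: float, df_effect: int, df_error: int) -> float:
    """Recover partial eta squared from an F ratio and its degrees of freedom.

    Uses the identity eta_p^2 = (F * df_effect) / (F * df_effect + df_error).
    """
    if f_value < 0 or df_effect <= 0 or df_error <= 0:
        raise ValueError("F must be non-negative and both df must be positive")
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# Main effect of Type of Configuration in Experiment 3: F(1, 11) = 12.60
eta = partial_eta_squared(12.60, 1, 11)
print(round(eta, 2))  # 0.53
```

The rounded value agrees with the ηp2 = .53 reported for the regular versus irregular comparison.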

Experiment 4

The high performance in condition C4 seems very reliable, and the best way to explain this superiority is most probably linked to the fact that the level of dispersion is maximal in this condition. As the competitive interaction model assumes, the competition between attended objects is inversely related to their spatial distance. In condition C4, this spatial distance is maximal because targets are separated from each other by a distractor. As a consequence, competition is minimal and VSTM storage is facilitated because the encoding of each target does not interfere with that of the other targets. Nevertheless, this theoretical model cannot explain the high performance in condition C1. The first explanation of this discrepancy is methodological. We cannot exclude that an ocular saccade contributed to the performance before the onset of the memory array in condition C1. An ocular movement could facilitate the task only in this condition. One of the aims of this experiment is to evaluate the possible effect of ocular movements. The second explanation of this general pattern involves a more theoretical analysis. Many authors suggested that the concept of attention may involve distinct aspects (Huang & Pashler, 2007; Vogel et al., 2005). Probably the most obvious distinction relates to two different aspects of attention which are frequently confounded: selection and processing. When Desimone and Duncan (1995) laid down the basis of the biased competition model, they described two basic phenomena in relation with the nature of visual attention. The first basic phenomenon was the ability to filter out unwanted information or/and to select relevant information. The second was the limited capacity for processing information. In general, selectivity is conceived as a way to overcome the limited capacity of the visual system (Broadbent, 1958). 
Thus, attention is involved in the selection of relevant objects, this selection being a necessary condition for optimizing the processing of these objects. In our task, performance depends on both the difficulty to orient attention to target locations (selection) and the difficulty to consolidate their visual traces in VSTM (processing). In concrete terms, the participants have first to allocate their attention to the target locations and, once attention has been allocated towards them, every sensory trace of target has to be consolidated to reach a stable state in VSTM. Consequently, the difficulty to allocate attention to target locations and the difficulty to consolidate their visual trace in VSTM could vary differently from condition C1 to condition C4. There is no reason to assume that the spatial dispersion determines the difficulty to allocate attention to target locations and the difficulty to encode and memorize them in the same way. As many previous studies showed (e.g., Heinze et al., 1994; McCormick & Klein, 1990; Posner, Snyder, & Davidson, 1980), it is easier to allocate attention to one contiguous spatial area (as in condition C1) than to allocate attention to several spatially non-contiguous areas (conditions C2 to C4). Conversely, the consolidation process could be more difficult when targets are spatially close because each target can compete with the others for its representation in VSTM as assumed by the competitive interaction model. So, the global pattern we observed could result from the combination of these two different effects of target/distractor spatial organization: the effect on selection and the effect on consolidation. We decided to introduce a strong visual contrast (targets were black and distractors were red) that would allow a very easy distinction between targets and distractors. 
Such a manipulation cannot perfectly neutralize the difficulty to allocate attention in the different conditions of dispersion, but it should reduce this selection difficulty considerably. Having largely neutralized this difficulty, we should theoretically only observe a positive linear relation between dispersion and performance if the targets are better consolidated as the spatial distance between them increases. In the same line, we had already seen in Experiment 3 that the good form improved the attentional capacity. If we obtain the positive linear increase of performance mentioned above, we could not explain such a result only by the good form factor. Indeed, this factor could not explain the superiority of the performance in conditions C2 and C3 (in which no good form effect was suspected) compared to condition C1. Such a finding would be due exclusively to the increase of the spatial distance between targets.

Twenty-four undergraduate students (seven male and 17 female; Mage = 21.7, range 19-23) with normal or corrected-to-normal vision volunteered for this experiment, eight in each group (control group 8 and the 100 ms and 500 ms preview groups). The same apparatus as in Experiment 1 was used. The stimuli were the same as in Experiment 2, unless otherwise noted. Targets differed from distractors by their colors: Targets were black and distractors were red. The procedure was the same as in Experiment 1. We only tested control group 8 and two preview groups distinguished by the presentation time of distractors (100 or 500 ms). The ISI was constant (900 ms). The percentage of correct responses was calculated for each group. Mean accuracies are presented in Figure 6.

Figure 6.

Percentage of correct responses as a function of Configuration Regularity.

First, data were entered in an ANOVA with Distractors Presentation Time as between-subjects factor (0 ms for control group 8, and 100 ms and 500 ms for preview groups) and Type of Configuration as within-subject factor. As in Experiment 1, the main effect of presentation time was significant, F(2, 19) = 10.80, p < .0001, ηp2 = .53. This analysis also showed a strong main effect of the type of configuration, F(3, 63) = 55.00, p < .0001, ηp2 = .72. Planned comparisons revealed that participants performed better: (a) in condition C4 than in conditions C1, F(1, 21) = 115.00, p < .0001, ηp2 = .84; C2, F(1, 21) = 44.60, p < .0001, ηp2 = .67; and C3, F(1, 21) = 67.00, p < .0001, ηp2 = .76; (b) in condition C3 than in conditions C1, F(1, 21) = 67.00, p < .0001, ηp2 = .10; and C2, F(1, 21) = 65.00, p = .02, ηp2 = .75; and (c) in condition C2 than in condition C1, F(1, 21) = 3.90, p < .001, ηp2 = .15. The interaction between the Distractors Presentation Time and the Type of Configuration was not significant (p = .91). In order to evaluate the global impact of the introduction of different colors, we compared these results with those obtained in the absence of distinctive colors (equivalent conditions of Experiment 2). This analysis showed a significant main effect of color, F(1, 32) = 20.40, p < .0001, performance being higher when color distinguishes targets from distractors. The interaction effect between Color and Type of Configuration was significant, F(3, 96) = 15.00, p < .0001. With color, performance increases in all conditions except in condition C1. The presence of color has a strong positive global impact on performance. This result is consistent with previous findings (Cave & Bichot, 1999), showing that the selection of noncontiguous locations is facilitated when targets can be discriminated from distractors by a basic visual property.
This result is also consistent with the data showing that target saliency can reduce classical effects of crowding and lateral-masking (Felisberti, Solomon, & Morgan, 2005). Importantly, when attention can only be feature-driven (control group 8), we found again the dispersion effects observed in the previous experiments: The performance is very good when the level of dispersion is high. Thus, the dispersion effects are not specific to a kind of attentional deployment. Furthermore, the dispersion effect appeared very early when attention was feature-driven (when distractors presentation time is only 100 ms), whereas the dispersion effect appeared later when attention was spatially and endogenously driven (see previous experiments). Nevertheless, whatever the impact of color was, the endogenous control of spatial attention continues to play an important role since performance in preview groups is still better than performance in control group 8. Thus, it is clear that the visual system can take advantage of both spatial information and object feature information. Furthermore, the global pattern of performance associated with the dispersion effect is identical (same pattern in control group 8 and in preview groups) for spatial and object-based ways of controlling attention: The dispersion effect is a very robust and stable effect. From a methodological point of view, such finding is interesting because it shows that ocular movements before the onset of the circular array play a secondary role (if they play any role at all) and that they cannot explain the dispersion effect in our study: Ocular movements before the onset of the final array were possible in preview conditions but not in control group 8, and, yet, the pattern of results was the same in both of these conditions. Another important finding of this experiment is that improvement of performance caused by the introduction of color is observed only in conditions C2, C3, and C4, but not in condition C1. 
Actually, in condition C1, the introduction of color did not improve the performance either in preview groups or in control group 8, as if participants were not able to take advantage of the color whereas they took advantage of this information in conditions C2, C3, and C4. This pattern of results is coherent with the hypothesis that color facilitated attentional deployment but had no real impact on the consolidation process. As allocating attention to one group of targets in condition C1 is already an easy task, the introduction of color did not further improve the performance in this condition. Conversely, when targets are in non-contiguous locations, the deployment of attention is more difficult and the introduction of color helped the participants to allocate attention to these locations more accurately and efficiently. As the introduction of color reduces and neutralizes the difficulty to allocate attention to target locations, the consolidation process becomes the main source of variation and, as predicted by the competitive interaction model, the consolidation is all the more difficult as the objects to-be-memorized are close to one another. In the absence of specific difficulties to orient attention to cued locations, the performance increases linearly with an increasing level of spatial dispersion. As coherent as this interpretation might be, we did not anticipate the following result: Performance of control group 8 in condition C1 was particularly low. Such a result is quite surprising and difficult to explain since in this experiment, participants could take advantage of the color to distinguish targets from distractors. We can only conjecture that the common and contrasting color of targets leads to a fast and strong grouping in a way that the perception of the whole prevails on the separate perception of each target. This interpretation is coherent with two related phenomena. 
First, lateral-masking is known to increase with target-flanker similarity (Kooi, Toet, Tripathy, & Levi, 1994; Polat & Sagi, 1993) and, in this context, each target can be conceived as a competing flanker. Second, the grouping of objects increases their mutual interference (Livne & Sagi, 2007), and the similarity of color contributes to the grouping of targets. More generally, it is possible that the similarity of targets has increased their mutual competition and prevented the participants from identifying and memorizing them as independent objects. Finally, the last result that deserves attention is the remarkably high level of performance (89%) observed in condition C4 in the preview group. This result is quite surprising because, in theory, the presence of distractors between targets should have led to a clear drop in performance. Nevertheless, the preview group and control group 4 are not really comparable since in the first case, participants know where the targets are going to appear whereas they do not have this information in the second case. Thus, control group 4 was not an appropriate reference group to evaluate the capacity of participants to ignore the distractors. The last experiment is designed to overcome this problem. To reach this objective, a very simple and direct solution consisted of cueing targets’ locations in control group 4. In this way, control group 4 and preview groups will be distinguished only by the presence of distractors. Moreover, this new experiment served another interest. We showed in Experiment 1 that the dispersion effect did not appear in control group 4 as if the distance between targets would not be a sufficient precondition for the dispersion effect. The absence of the dispersion effect could result either from the absence of distractors or from the fact that attention was deployed in a diffuse mode when the participants do not have any spatial information about targets location (as in control group 4). 
Given that these two characteristics are confounded in the previous experiments, we cannot dissociate the effects linked to the way participants allocate attention before target onset from the effects associated with the presence of distractors. If the dispersion effect is obtained when target locations are cued, we can therefore underscore the attentional nature of this effect.

Experiment 5

Fifteen undergraduate students (nine male and six female; Mage = 22.5, range 21-26) with normal or corrected-to-normal vision volunteered for this experiment. The same apparatus as in Experiment 1 was used. The stimuli were the same as the stimuli used in control group 4 of Experiment 1. Cues were constructed as follows: They consisted of four 0.3 × 0.3 cm asterisks located at the future locations of the targets on the imaginary circle centered on the fixation cross. The procedure was the same as in control group 4 of Experiment 1, except that four asterisk cues were presented for 300 ms and followed by a blank delay of 900 ms before the presentation of targets. Mean accuracies were calculated as a function of the type of configuration (Figure 7). A repeated measures ANOVA with Type of Configuration as within-subject factor showed a main effect, F(3, 42) = 3.69, p < .05, ηp2 = .20. Planned comparisons indicated that participants performed better with configuration C4 than with configurations C1, C2, and C3; F(1, 14) = 7.80, p < .05, ηp2 = .35; F(1, 14) = 5.60, p < .05, ηp2 = .28; and F(1, 14) = 7.10, p < .05, ηp2 = .33; respectively. No significant difference was found between C1 and C2 (p = .65), between C1 and C3 (p = .30), or between C2 and C3 (p = .67).
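The structure of the one-way repeated measures ANOVA used here can be sketched as follows. This is a minimal illustration, not the authors' analysis script: the function name and the accuracy values are hypothetical, and the sketch assumes complete data with one score per participant and configuration.

```python
def rm_anova_oneway(scores):
    """One-way repeated measures ANOVA.

    scores: list of rows, one row per participant, one column per condition.
    Returns (F, df_effect, df_error, partial_eta_squared).
    """
    n = len(scores)        # number of participants
    k = len(scores[0])     # number of within-subject conditions
    grand = sum(sum(row) for row in scores) / (n * k)
    cond_means = [sum(row[j] for row in scores) / n for j in range(k)]
    subj_means = [sum(row) / k for row in scores]

    # Partition the total variability into condition, subject, and error parts.
    ss_cond = n * sum((m - grand) ** 2 for m in cond_means)
    ss_subj = k * sum((m - grand) ** 2 for m in subj_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_cond - ss_subj

    df_cond = k - 1
    df_error = (n - 1) * (k - 1)
    f_value = (ss_cond / df_cond) / (ss_error / df_error)
    eta_p2 = ss_cond / (ss_cond + ss_error)
    return f_value, df_cond, df_error, eta_p2


# Hypothetical percent-correct scores: 4 participants x 4 configurations (C1..C4).
example = [
    [70, 72, 71, 80],
    [65, 66, 70, 78],
    [80, 79, 82, 88],
    [75, 70, 74, 85],
]
f_value, df1, df2, eta_p2 = rm_anova_oneway(example)
print(f"F({df1}, {df2}) = {f_value:.2f}, partial eta squared = {eta_p2:.2f}")
```

With 15 participants and four configurations, the same formulas give 3 and 42 degrees of freedom, which matches the F(3, 42) reported for this experiment.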

Figure 7.

Percentage of correct responses as a function of Configuration Type and Groups (Experiment 4).

The cueing of target locations strongly improved performance. Even in the absence of distractors, performance was much better when participants could allocate attention only to target locations rather than, in a distributed way, to the entire circular array. This result could be easily anticipated in condition C1 because attention has to be allocated to a unique spatial area which includes the four targets. In this case, the attentional window, limited to the group of four targets, would have had less than half the size of the total area of the circular array. In this way, attentional resources would have been more concentrated in a relatively small area. However, cue facilitation is more interesting in the three other conditions and especially in condition C4 in which the targets occupied the entire circular array and were fully dispersed. The high level of performance in condition C4 suggests that participants can accurately allocate attention to the four different cued locations and ignore the blank locations. The above finding shows that, when several targets have to be processed, the endogenous deployment of attention facilitates the processing of targets even in the absence of distractors. We can explain this result in two non-exclusive ways. First, it could be that the attentional window is reduced when targets have been cued because the attentional system excludes blank locations. In this case, the higher level of performance results from the reduction of the size of the attentional window. Second, the role of attention would be to reduce the mutual interference between targets. Such reduction would be optimal when targets are distant, explaining why the cueing of target locations improves performance more if the distance between targets is large. Indeed, the most interesting result is that we reproduced the same dispersion effect as in the previous experiments.
The performance tends to improve as the dispersion between targets increases. This result is all the more interesting as it contrasts with the total absence of the dispersion effect observed when no cue guides attention deployment as in control group 4 of Experiment 1. Such a finding is yet another clear demonstration that the dispersion effect is not a consequence of a low-level sensory masking, but is contingent on the spatial distribution of attention within a display.

Percentage of correct responses as a function of Configuration Type (Experiment 5).

General discussion

Generally, the results we obtained revealed a very good capacity to select and memorize four targets among distractors. This is consistent with the estimation of VSTM capacity (around four objects; Vogel et al., 2001) and the number of different objects or locations that could be attended during one ocular fixation (Franconeri et al., 2007) or tracked in visual tracking experiments (Pylyshyn, 1989; Shim, Alvarez, & Jiang, 2008). This link between attention and VSTM has been confirmed by fMRI results showing strong cerebral convergences between memory and attentional process (Cavanagh & Alvarez, 2005; Todd & Marois, 2004). In line with some VSTM studies (e.g., Sperling, 1960), these results show that the visual system can process and memorize four objects in VSTM, whatever their location in space and without being strongly impaired by the presence of distractors. Moreover, they are consistent with the findings of previous studies suggesting that the targets’ perceptually “good” spatial organization allows better selection and storage in VSTM (Jiang et al., 2000; Vidal, Gauchou, Tallon-Baudry, & O’Regan, 2005). The superiority of performance when targets are organized in a “good Gestalt” underlines the classical role of chunking as a combinatory process of the attended objects in VSTM (Ripoll, Fiere, & Pelissier, 2005; Woodman et al., 2003). Another important robust result is that the target dispersion does not reduce the capacity to memorize targets. Such a result underlines the capacity to allocate attention to different and non-contiguous locations. At first glance, this capacity seems to contradict the unitary conception of attention. However, it could be argued that certain properties of our material and procedure do not allow us to reject a unitary conception of attention. 
Furthermore, it is generally very difficult to distinguish a true division of attention across non-contiguous areas from a strategy in which a single attentional focus switches rapidly between several targets (Van Rullen, Carlson, & Cavanagh, 2007). In the following analysis of the dispersion effect, we therefore wanted to provide further arguments in favor of the competitive interaction model of attention, implying the possibility to allocate attention to different non-contiguous locations.

Dispersion effects

The dispersion effect we obtained in a visual memory task involving genuine attentional processing constitutes a basic finding showing that (a) the processing of four targets is only weakly affected by the presence of spatially interleaving distractors; (b) the relative spatial location of targets and distractors is a strong determinant of the capacity to selectively encode and memorize targets in VSTM; and (c) the capacity to memorize the targets improves as the spatial dispersion increases. One notable remark is that the dispersion effects can be explained neither by classical crowding and lateral masking nor by any other kind of sensory interactions between objects. Indeed, the dispersion effect was not observed in control groups 4 and 8. This dispersion effect is therefore a consequence of a genuine attentional effect which may be both feature-driven and goal-driven attention. As a whole, these results are consistent with the competitive interaction model (Caputo & Guerra, 1998; Desimone & Duncan, 1995; Mounts, 2000a, 2000b) since we observed that performance improved as distance between targets increased. In other words, when targets are in close spatial proximity, they draw largely or entirely on the same pool of processing structures. As a consequence, targets compete for their representation within the visual system and the capacity to memorize targets decreases as competition between them increases. In the same line, the observed dispersion effect in control group 4 with cueing of targets (Experiment 5) is a clear illustration of the efficiency of the spatial selective attention in our task. If the organism is precued about which locations to attend to, a saliency map may be configured before stimulus exposure. As proposed by Bundesen et al. (2005), neural structures (e.g., receptive fields at different levels of processing) contract around cued stimuli allowing parallel and independent processing of several cued objects.
In the opposite case (without precue), the saliency map is not specifically configured and several (e.g., four) stimuli compete for representation by common neural structures. More generally, these results suggest that selectivity depends on both the spatial distance between targets and the attentional deployment that precedes targets onset. The same dispersion effect was also observed based on feature-driven attentional allocation. In control group 8 (Experiment 4), the dispersion effect was clearly observed where target locations were not cued before target onset. In this condition, a distinctive feature property (color) discriminating targets from distractors has been used to allow for efficient selection and processing of the four targets. Such a finding underlines that the dispersion effect is not specific to only one kind of attentional deployment.
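To make the contrast between the least and the most dispersed configurations concrete, the sketch below describes a circular array by two simple measures: the number of separate target clusters and the smallest step between neighboring targets. It assumes eight equally spaced positions indexed 0 to 7; the indexing, the function names, and the two measures are illustrative assumptions, not quantities computed by the authors. Only C1 (one contiguous group) and C4 (targets alternating with distractors) are encoded, since these are the two layouts described explicitly in the text.

```python
def count_target_clusters(is_target):
    """Count runs of adjacent targets on a circular array of positions."""
    if not any(is_target):
        return 0
    if all(is_target):
        return 1
    # A cluster starts wherever a target follows a non-target (index -1 wraps around).
    return sum(
        1 for i in range(len(is_target)) if is_target[i] and not is_target[i - 1]
    )


def min_target_gap(is_target):
    """Smallest number of steps between neighboring targets around the circle."""
    n = len(is_target)
    positions = [i for i, flag in enumerate(is_target) if flag]
    if len(positions) < 2:
        raise ValueError("at least two targets are needed to define a gap")
    gaps = [
        (positions[(j + 1) % len(positions)] - positions[j]) % n
        for j in range(len(positions))
    ]
    return min(gaps)


# Hypothetical encodings: True marks a target, False marks a distractor.
C1 = [True, True, True, True, False, False, False, False]   # one contiguous group
C4 = [True, False, True, False, True, False, True, False]   # targets alternate with distractors

for name, layout in (("C1", C1), ("C4", C4)):
    print(name, count_target_clusters(layout), min_target_gap(layout))
```

Under these assumptions, C1 forms a single cluster with neighboring targets one step apart, whereas C4 forms four clusters with every pair of neighboring targets two steps apart, which corresponds to the minimal competition between targets that the competitive interaction model predicts for this condition.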

Selection, processing, and dispersion effect

Many models and theories about visual attention do not clearly distinguish the capacity to allocate attention to several different locations (selection) from the capacity to encode and memorize the selected objects (consolidation). These two aspects of visual processing are combined in many visual tasks but it is not always easy to evaluate how each of them contributes to performance. For example, it is obviously more difficult to process two non-contiguous objects than to process one isolated object. However, any cost observed for the processing of two objects results from both the difficulty to deploy two attentional foci simultaneously and to process both objects simultaneously. The results we obtained suggest that targets’ dispersion may have two opposite effects linked to these two sequential stages. For example, the high level of performance obtained in condition C1 of the first experiment can be explained by an easy attentional allocation to a single area grouping the four targets compared to multiple non-contiguous locations. In this case, the decrease of targets’ dispersion seems to improve attentional allocation efficiency. When the difficulty to allocate attention and to select the targets is reduced by the introduction of color (Experiment 4), the process of consolidation in VSTM becomes the main source of variation. In this case, we observed that performance is very low in condition C1 (Experiment 4) and increases linearly as target dispersion increases. To sum up, it is easier to allocate attention to multiple foci when they are in close spatial proximity. On the other hand, the consolidation capacity for multiple objects should be more efficient as the distance between them increases. 
The contribution of selection and consolidation could explain why such dispersion effect has been obtained whereas many previous results showed that participants encounter great difficulties in processing several targets among distractors, as for example, in the study of Palmer, Ames, and Lindsey (1993) which involved visual search of four targets interspersed with four distractors. At this point, it is essential to underline the importance of the task’s requirements and in particular the required perceptual level of processing. For low levels of processing (e.g., during a detection task), the selective component should play a more important role and performance should decrease as the dispersion of targets increases. On the contrary, for high levels of processing (memorization task), the second component (encoding and consolidation) should play a more important role. In this case, performance should increase as spatial distance between targets increases because competition between them is reduced. A related conclusion has been reached recently in independent research (Catena, Castillo, Fuentes, & Milliken, 2006; Vogel et al., 2005). Authors assumed that attention can only be split into discontinuous foci during high levels of processing (e.g., in a memory task). This theoretical conclusion is strongly supported by the empirical data we recently obtained. We tested again the dispersion effect with the same material as in the current experiments but we used a visual search task in which stimuli (four targets and four distractors) were presented for 100 ms and masked. As in the current experiments, targets were cued so that participants could restrict their search to only the target locations. In these conditions, the pattern of performance was the opposite of the pattern we obtained in the current study: Performance decreased as the level of dispersion increased. This clear dissociation suggests that attention can be deployed flexibly depending on the task. 
In the case of a memory-intensive task, the visual system can memorize four targets among distractors efficiently even though such targets are located at non-contiguous locations. Overall, our data are consistent with a multifocal attentional hypothesis as suggested by the attentional division hypothesis and biased competition model. Indeed, the findings reject the proposal of the unitary conception that the system is unable to process non-contiguous targets simultaneously while filtering out embedded distractors (Posner et al., 1980). More importantly, we think that this multifocal capacity is set differently given the requirement of attentional allocation and processing.

1. Storage of features, conjunctions and objects in visual working memory.

2. Attentional capture by abrupt onsets and feature singletons produces inhibitory surrounds.

3. Visuospatial attention: beyond a spotlight model.

4. Reflexive and voluntary orienting of visual attention: time course of activation and resistance to interruption.

5. Relational information in visual short-term memory: the structural gist.

6. Pushing around the locus of selection: evidence for the flexible-selection hypothesis.

7. A Boolean map theory of visual attention.

8. How many locations can be selected at once?

9. Attention to adjacent and separate positions in space: an electrophysiological analysis.

10. Attention and the detection of signals.