| Literature DB >> 30240717 |
Leili Soo1, Ramakrishna Chakravarthi2, Søren K Andersen2.
Abstract
Visual object recognition is essential for adaptive interactions with the environment. It is fundamentally limited by crowding, a breakdown of object recognition in clutter. The spatial extent over which crowding occurs is proportional to the eccentricity of the target object, but nevertheless varies substantially depending on various stimulus factors (e.g. viewing time, contrast). However, a lack of studies jointly manipulating such factors precludes predictions of crowding in more heterogeneous scenes, such as the majority of real life situations. To establish how such co-occurring variations affect crowding, we manipulated combinations of 1) flanker contrast and backward masking, 2) flanker contrast and presentation duration, and 3) flanker preview and pop-out while measuring participants' ability to correctly report the orientation of a target stimulus. In all three experiments, combining two manipulations consistently modulated the spatial extent of crowding in a way that could not be predicted from an additive combination. However, a simple transformation of the measurement scale completely abolished these interactions and all effects became additive. Precise quantitative predictions of the magnitude of crowding when combining multiple manipulations are thus possible when it is expressed in terms of what we label the 'critical resolution'. Critical resolution is proportional to the inverse of the smallest flanker free area surrounding the target object necessary for its unimpaired identification. It offers a more parsimonious description of crowding than the traditionally used critical spacing and may thus constitute a measure of fundamental importance for understanding object recognition. CrownEntities:
Keywords: Critical resolution; Critical spacing; Flanker interference; Object recognition; Psychophysics; Visual crowding; Visual perception
Mesh:
Year: 2018 PMID: 30240717 PMCID: PMC6294650 DOI: 10.1016/j.visres.2018.08.005
Source DB: PubMed Journal: Vision Res ISSN: 0042-6989 Impact factor: 1.886
Fig. 1(A & B) The sequence of events in a single trial in Experiment 1 (A) and Experiment 2 (B). A fixation cross was presented in the middle of the screen throughout the experiment. The target and flankers were presented either to the right or left of the fixation (9° eccentricity). Targets (‘T’) were either presented in isolation or surrounded by equal contrast (Weber contrast of 0.25) or higher contrast (Weber contrast of 1.5) flankers (‘H’) at one of seven different target-flanker distances (closest spacing depicted in the figure). In Experiment 1, the target display was followed by a backward mask (same side as stimulus display) or no mask. In Experiment 2, target display was presented for either 20 ms or 200 ms (no masking). The next trial started immediately after participants had responded to the target orientation (up, down, left or right) by a key press. (C) The sequence of events in a single trial in Experiment 3. While participants fixated on the central cross, bilateral place-holders or flankers were presented for 150 ms at one of nine flanker distances (closest spacing depicted in the figure) and with positive or negative contrast polarity. Subsequently, a target with the same or opposite polarity was presented for 50 ms along with flankers that matched the place-holders’ contrast polarity or in isolation. The following trial started 1000 ms after participants had reported the orientation of the target.
Fig. 2Results of experiment 1, 2 and 3. Mean accuracy as a function of the target-flanker distance (plus the No flanker condition) for each condition in experiments 1 (A), 2 (C) and 3 (E) with exponential fits for each condition. Critical spacing for each condition is computed by determining the target-flanker distance at which performance is at 90% of asymptotic performance. These are depicted by the vertical lines, drawn where the horizontal lines (90% of asymptote) intersect with the psychometric curves. (B, D, & F) Mean and standard error of the mean of the critical spacing overlaid on Violin plots (smoothened histogram with normal Kernel). Black dots depict individual participants’ critical spacing.
Critical spacing with proportion of eccentricity in brackets (e = eccentricity = distance of the target from fixation at 9°), pairwise comparisons of critical spacing and mean change in each condition in degrees of visual angle in each experiment separately. Significant p-values () indicated in bold (Bonferroni-Holm correction was used for post hoc analysis).
| Experiment 1: Masking and flanker contrast | |||||
| Condition | Critical spacing | Equal contrast, no mask | High contrast flankers, no mask | Equal contrast, mask | High contrast flankers, mask |
| Equal contrast, no mask | 4.28° (0.48 e) | +0.76° | +1.42° | +3.40° | |
| High contrast flankers, no mask | 5.04° (0.56 e) | +0.66° | +2.64° | ||
| Equal contrast, mask | 5.70° (0.63 e) | +1.98° | |||
| High contrast flankers, mask | 7.68° (0.85 e) | ||||
| Experiment 2: Display duration and flanker contrast | |||||
| Condition | Critical spacing | Equal contrast, 200 ms | Higher contrast flankers, 200 ms | Equal contrast, 20 ms | Higher contrast flankers, 20 ms |
| Equal contrast, 200 ms | 3.01° (0.33 e) | +0.43° | +1.97° | +4.73° | |
| Higher contrast flankers, 200 ms | 3.44° (0.38 e) | +1.54° | +4.30° | ||
| Equal contrast, 20 ms | 4.98°0.55 e | +2.76° | |||
| Higher contrast flankers, 20 ms | 7.74°0.86 e | ||||
| Experiment 3: Target pop-out and flanker preview | |||||
| Condition | Critical spacing | Target pop-out, flanker preview | No pop-out, flanker preview | Target pop-out, no preview | No pop-out, no preview |
| Target pop-out, flanker preview | 1.58° (0.18 e) | +0.39° | +1.19° | +3.15° | |
| No pop-out, flanker preview | 1.97° (0.22 e) | +0.80° | +2.76° | ||
| Target pop-out, no preview | 2.77° (0.31 e) | +1.96° | |||
| No pop-out, no preview | 4.73° (0.53 e) | ||||
Fig. 3(A) Examples of power functions for different values of parameter γ. The functions are monotonically increasing for γ > 0 and monotonically decreasing for γ < 0. (B) Likelihood of different values of the exponent γ under the null hypothesis of no interaction. The combined likelihood was obtained by multiplying the three likelihoods of the separate experiments. The black bar at the bottom indicates the 14.7% likelihood region for γ. (C) The combined maximum likelihood γ as a function of the criterion (percentage of the asymptote) used to calculate the critical spacing. (D) Estimated proportion of all neurons of receptive field size r processing a target stimulus which are subject to biased competition by a flanker stimulus at distance d (see discussion for details). The displayed receptive field sizes may roughly correspond to neurons in V1 (0.5° and 1.0°), V2 (2.0°) and V4 (4.0°) (Kastner et al., 2001). Although the individual functions for neurons of the same receptive field size are almost linear for d < 2r, the function resulting from averaging over neurons of different receptive field sizes is strongly convex. Thus, a change in distance affects the extent of biased competition much more at smaller distances than at larger distances.
Fig. 4Critical resolution in Experiment 1, 2 and 3. Mean and standard error of the mean critical resolution for each of the four conditions are overlaid on Violin plots (smoothened histogram with normal Kernel). Black dots depict individual participants’ critical resolution. (A) Experiment 1, (B) Experiment 2 and (C) Experiment 3.
Critical resolution (one divided by the squared critical spacing) ANOVA results. Significant p-values () indicated in bold.
| Critical resolution ANOVA results | |||
|---|---|---|---|
| Experiment 1 | Masking: | Flanker contrast: | Interaction: |
| Experiment 2 | Display duration: | Flanker contrast: | Interaction: |
| Experiment 3 | Preview: | Pop-out: | Interaction: |