| Literature DB >> 28887785 |
Nasim Nematzadeh1, David M W Powers2, Trent W Lewis2.
Abstract
Why does our visual system fail to reconstruct reality, when we look at certain patterns? Where do Geometrical illusions start to emerge in the visual pathway? How far should we take computational models of vision with the same visual ability to detect illusions as we do? This study addresses these questions, by focusing on a specific underlying neural mechanism involved in our visual experiences that affects our final perception. Among many types of visual illusion, 'Geometrical' and, in particular, 'Tilt Illusions' are rather important, being characterized by misperception of geometric patterns involving lines and tiles in combination with contrasting orientation, size or position. Over the last decade, many new neurophysiological experiments have led to new insights as to how, when and where retinal processing takes place, and the encoding nature of the retinal representation that is sent to the cortex for further processing. Based on these neurobiological discoveries, we provide computer simulation evidence from modelling retinal ganglion cells responses to some complex Tilt Illusions, suggesting that the emergence of tilt in these illusions is partially related to the interaction of multiscale visual processing performed in the retina. The output of our low-level filtering model is presented for several types of Tilt Illusion, predicting that the final tilt percept arises from multiple-scale processing of the Differences of Gaussians and the perceptual interaction of foreground and background elements. The model is a variation of classical receptive field implementation for simple cells in early stages of vision with the scales tuned to the object/texture sizes in the pattern. Our results suggest that this model has a high potential in revealing the underlying mechanism connecting low-level filtering approaches to mid- and high-level explanations such as 'Anchoring theory' and 'Perceptual grouping'.Entities:
Keywords: Biological neural network; Classical receptive field (CRF) models; Cognitive systems; Difference of Gaussians; Geometrical illusions; Gestalt grouping principles; Pattern recognition; Perceptual grouping; Self-organizing systems; Tilt effects; Visual perception
Year: 2017 PMID: 28887785 PMCID: PMC5709283 DOI: 10.1007/s40708-017-0072-8
Source DB: PubMed Journal: Brain Inform ISSN: 2198-4026
Geometrical and Brightness/Lightness Illusion patterns (Reproduced by permission from [145]). The source and original references of illusions in this table are provided in Table 2
The source and original references of the illusion patterns in Table 1
| Illusion patterns | Appendix | Source/original | References |
|---|---|---|---|
| Mach Bands | Rl-a | Penacchio et al. (2013)/Mach (1865), Fiorentini (1972) | [ |
| Chevruell | Rl-b | Geier (2011)/Chevreul (1890) | [ |
| Pyramid | Rl-c | Troncoso et al. (2005)/Vasarely (1966, 1970) | [ |
| Grating Induction (GI) | Rl-d | Penacchio et al. (2013)/McCourt (1982) | [ |
| Craik–O’Brien Cornsweet (COC) | R2-a | Web Img, Lu & Sperling (1995)/Craik (1940), O’Brien (1958), Cornsweet (1970) | [ |
| Irradiation | R2-b | Westheimer (2007)/Helmholtz (1896) | [ |
| Simultaneous Brightness Contrast (SBC) | R2-c | Adelson (2000)/Heinemann (1955) | [ |
| White’s effect (WE) | R2-d | Blakeslee and McCourt (1999)/White (1979) | [ |
| Herman Grid | R3-a | Blakeslee and McCourt (1997)/Hermann (1870) | [ |
| Café Wall | R3-b | Nematzadeh (2015)/Gregory (1973), Munsterberg (1897), Pierce (1898), Fraser (1908) | [ |
| Fraser | R3-c | Kitaoka (2007)/Fraser (1908) | [ |
| Twisted cord | R3-d | McCourt (1983)/Fraser (1908) | [ |
| Spring | R4-a | Fermuller & Malm (2004)/Kitaoka (2003) | [ |
| Spiral Café Wall | R4-b | Kitaoka (2007)/Kitaoka et al. (2001) | [ |
| Spiral Fraser | R4-c | Kitaoka (2007)/Gregory & Heard (1979), Kitaoka et al. (2001) | [ |
| Sound Wave | R4-d | Stevanov et al. (2012)/Kitaoka (1998) | [ |
| Zöllner | R5-a | Web–Wikipedia/Zöllner (1862) | [ |
| Herring–Wundt | R5-b | Changizi et al. (2008)/Herring (1861), Wundt (1898) | [ |
| Orbison | R5-c | Changizi et al. (2008)/Orbison (1939) | [ |
| Poggendorff | R5-d | Ninio (2014)/Zöllner (1860) | [ |
| Ebbinghaus | R6-a | Ninio (2014)/Ebinghaus (1902) | [ |
| Müller–Lyer | R6-b | Ninio (2014)/Müller-Lyer (1889) | [ |
| Ponzo | R6-c | Ninio (2014)/Ponzo (1910) | [ |
| Kanizsa Triangle | R6-d | Eagleman (2001)/Kanizsa (1974), Frisby & Clatworthy (1975) | [ |
| Face–Vase | R7-a | Sturmberg (2011)/Rubin (1915) | [ |
| Necker Cube | R7-b | Sturmberg (2011)/Necker (1832) | [ |
| Penrose Triangle | R7-c | Web—OpenClipArt/Pappas (1989) | [ |
| Impossible Staircase | R7-d | Web—Wikipedia/Penrose (1958) | [ |
| Knill & Kersten | R8-a | Adelson (2000)/Knill & Kresten (1991) | [ |
| Wall of Block | R8-b | Logvinenko et al. (2002)/Adelson (1993) | [ |
| Crisscross | R8-c | Adelson (2000) | [ |
| Snake | R8-d | Adelson (2000) | [ |
Fig. 1Sample Tile Illusion patterns. Left Trampoline pattern [70], right Spiral Café Wall illusion [71]
Fig. 2Left 3D surface of a Difference of Gaussian filter with the scale of the centre Gaussian equal to 8 (). The Surround ratio is , and the Window ratio is . Right Top view (2D) of the DoG filter. Window size is 65 × 65. Jet white colour map is used to display the graph. (Color figure online)
Fig. 3Top left Café Wall pattern with 200 × 200 px tiles and 8 px mortar. Top right Enlarged DoG output at scale 8 () in the edge map. Centre The binary edge map at six different scales ( with incremental steps of 4). Bottom Jet white colour map of the above edge map. Rather than the centre Gaussian, other parameters of the model are: , and (the Surround and Window ratios, respectively) (Reproduced by permission from [145]). (Color figure online)
Fig. 4Top left Munsterberg pattern with 200 × 200 px tiles. Top right Enlarged DoG output at scale 8 () in the edge map. Bottom The binary edge map at six different scales () with incremental steps of 4. Other parameters of the model are: and similar to Fig. 3 (Reproduced by permission from [145])
Fig. 5Top A crop section of 4 × 5 tiles (enlarged) from a Café Wall of 9 × 14 tiles with 200 × 200 px tiles and 8 px mortar. Bottom left Edge map of the crop section at six scales (), with incremental steps of 4 in jet white colour map. Bottom right Detected houghlines displayed in green on the binary edge map and around four reference orientations of horizontal, vertical and diagonals. Blue lines indicate the longest detected lines at each scale of the edge map (Reproduced by permission from [145]). (Color figure online)
Fig. 6Top left Complex Bulge pattern (574 × 572 px) and top right The DoG output at scale 2 ()—enlarged that highlights the bulge effect in the pattern. Bottom The binary edge map of the pattern at eight different scales () with incremental steps of 1. Other parameters of the DoG model are: , and (the Surround and Window ratios, respectively) (Reproduced by permission from [145])
Fig. 7Detected houghlines (centre) for two scales of 2 and 4 () from the edge map of the Complex Bulge pattern (top). The two images at the bottom of the figure are zoomed-in versions of the two images at the centre row. The parameters of the model and Hough investigation have been provided on the figure (Reproduced by permission from [145]). (Color figure online)