Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 How does the brain rapidly learn and reorganize view-invariant and position-invariant object representations in the inferotemporal cortex?

Literature DB >> 21596523

How does the brain rapidly learn and reorganize view-invariant and position-invariant object representations in the inferotemporal cortex?

Yongqiang Cao¹, Stephen Grossberg, Jeffrey Markowitz.

Abstract

All primates depend for their survival on being able to rapidly learn about and recognize objects. Objects may be visually detected at multiple positions, sizes, and viewpoints. How does the brain rapidly learn and recognize objects while scanning a scene with eye movements, without causing a combinatorial explosion in the number of cells that are needed? How does the brain avoid the problem of erroneously classifying parts of different objects together at the same or different positions in a visual scene? In monkeys and humans, a key area for such invariant object category learning and recognition is the inferotemporal cortex (IT). A neural model is proposed to explain how spatial and object attention coordinate the ability of IT to learn invariant category representations of objects that are seen at multiple positions, sizes, and viewpoints. The model clarifies how interactions within a hierarchy of processing stages in the visual brain accomplish this. These stages include the retina, lateral geniculate nucleus, and cortical areas V1, V2, V4, and IT in the brain's What cortical stream, as they interact with spatial attention processes within the parietal cortex of the Where cortical stream. The model builds upon the ARTSCAN model, which proposed how view-invariant object representations are generated. The positional ARTSCAN (pARTSCAN) model proposes how the following additional processes in the What cortical processing stream also enable position-invariant object representations to be learned: IT cells with persistent activity, and a combination of normalizing object category competition and a view-to-object learning law which together ensure that unambiguous views have a larger effect on object recognition than ambiguous views. The model explains how such invariant learning can be fooled when monkeys, or other primates, are presented with an object that is swapped with another object during eye movements to foveate the original object. The swapping procedure is predicted to prevent the reset of spatial attention, which would otherwise keep the representations of multiple objects from being combined by learning. Li and DiCarlo (2008) have presented neurophysiological data from monkeys showing how unsupervised natural experience in a target swapping experiment can rapidly alter object representations in IT. The model quantitatively simulates the swapping data by showing how the swapping procedure fools the spatial attention mechanism. More generally, the model provides a unifying framework, and testable predictions in both monkeys and humans, for understanding object learning data using neurophysiological methods in monkeys, and spatial attention, episodic learning, and memory retrieval data using functional imaging methods in humans.

Entities: Disease Species

Mesh：

Year: 2011 PMID： 21596523 DOI： 10.1016/j.neunet.2011.04.004

Source DB: PubMed Journal: Neural Netw ISSN： 0893-6080

Keyword Cloud
Cited

11 in total

1. Binocular fusion and invariant category learning due to predictive remapping during scanning of a depthful scene with eye movements.

Authors: Stephen Grossberg; Karthik Srinivasan; Arash Yazdanbakhsh
Journal: Front Psychol Date: 2015-01-14

2. Where's Waldo? How perceptual, cognitive, and emotional brain processes cooperate during learning to categorize and find desired objects in a cluttered scene.

Authors: Hung-Cheng Chang; Stephen Grossberg; Yongqiang Cao
Journal: Front Integr Neurosci Date: 2014-06-17

3. Neural dynamics of object-based multifocal visual spatial attention and priming: object cueing, useful-field-of-view, and crowding.

Authors: Nicholas C Foley; Stephen Grossberg; Ennio Mingolla
Journal: Cogn Psychol Date: 2012-03-14 Impact factor: 3.468

4. Desirability, availability, credit assignment, category learning, and attention: Cognitive-emotional and working memory dynamics of orbitofrontal, ventrolateral, and dorsolateral prefrontal cortices.

Authors: Stephen Grossberg
Journal: Brain Neurosci Adv Date: 2018-05-08

5. A Canonical Laminar Neocortical Circuit Whose Bottom-Up, Horizontal, and Top-Down Pathways Control Attention, Learning, and Prediction.

Authors: Stephen Grossberg
Journal: Front Syst Neurosci Date: 2021-04-23

6. What are the Visual Features Underlying Rapid Object Recognition?

Authors: Sébastien M Crouzet; Thomas Serre
Journal: Front Psychol Date: 2011-11-15

7. Cortical Dynamics of Figure-Ground Separation in Response to 2D Pictures and 3D Scenes: How V2 Combines Border Ownership, Stereoscopic Cues, and Gestalt Grouping Rules.

Authors: Stephen Grossberg
Journal: Front Psychol Date: 2016-01-26

8. Neural Dynamics of Autistic Repetitive Behaviors and Fragile X Syndrome: Basal Ganglia Movement Gating and mGluR-Modulated Adaptively Timed Learning.

Authors: Stephen Grossberg; Devika Kishnan
Journal: Front Psychol Date: 2018-03-13

9. How the venetian blind percept emerges from the laminar cortical dynamics of 3D vision.

Authors: Yongqiang Cao; Stephen Grossberg
Journal: Front Psychol Date: 2014-08-05

10. Neural Computation of Surface Border Ownership and Relative Surface Depth from Ambiguous Contrast Inputs.

Authors: Birgitta Dresp-Langley; Stephen Grossberg
Journal: Front Psychol Date: 2016-07-28