Literature DB >> 21596523

How does the brain rapidly learn and reorganize view-invariant and position-invariant object representations in the inferotemporal cortex?

Yongqiang Cao1, Stephen Grossberg, Jeffrey Markowitz.   

Abstract

All primates depend for their survival on being able to rapidly learn about and recognize objects. Objects may be visually detected at multiple positions, sizes, and viewpoints. How does the brain rapidly learn and recognize objects while scanning a scene with eye movements, without causing a combinatorial explosion in the number of cells that are needed? How does the brain avoid the problem of erroneously classifying parts of different objects together at the same or different positions in a visual scene? In monkeys and humans, a key area for such invariant object category learning and recognition is the inferotemporal cortex (IT). A neural model is proposed to explain how spatial and object attention coordinate the ability of IT to learn invariant category representations of objects that are seen at multiple positions, sizes, and viewpoints. The model clarifies how interactions within a hierarchy of processing stages in the visual brain accomplish this. These stages include the retina, lateral geniculate nucleus, and cortical areas V1, V2, V4, and IT in the brain's What cortical stream, as they interact with spatial attention processes within the parietal cortex of the Where cortical stream. The model builds upon the ARTSCAN model, which proposed how view-invariant object representations are generated. The positional ARTSCAN (pARTSCAN) model proposes how the following additional processes in the What cortical processing stream also enable position-invariant object representations to be learned: IT cells with persistent activity, and a combination of normalizing object category competition and a view-to-object learning law which together ensure that unambiguous views have a larger effect on object recognition than ambiguous views. The model explains how such invariant learning can be fooled when monkeys, or other primates, are presented with an object that is swapped with another object during eye movements to foveate the original object. The swapping procedure is predicted to prevent the reset of spatial attention, which would otherwise keep the representations of multiple objects from being combined by learning. Li and DiCarlo (2008) have presented neurophysiological data from monkeys showing how unsupervised natural experience in a target swapping experiment can rapidly alter object representations in IT. The model quantitatively simulates the swapping data by showing how the swapping procedure fools the spatial attention mechanism. More generally, the model provides a unifying framework, and testable predictions in both monkeys and humans, for understanding object learning data using neurophysiological methods in monkeys, and spatial attention, episodic learning, and memory retrieval data using functional imaging methods in humans.
Copyright © 2011 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 21596523     DOI: 10.1016/j.neunet.2011.04.004

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  11 in total

1.  Binocular fusion and invariant category learning due to predictive remapping during scanning of a depthful scene with eye movements.

Authors:  Stephen Grossberg; Karthik Srinivasan; Arash Yazdanbakhsh
Journal:  Front Psychol       Date:  2015-01-14

2.  Where's Waldo? How perceptual, cognitive, and emotional brain processes cooperate during learning to categorize and find desired objects in a cluttered scene.

Authors:  Hung-Cheng Chang; Stephen Grossberg; Yongqiang Cao
Journal:  Front Integr Neurosci       Date:  2014-06-17

3.  Neural dynamics of object-based multifocal visual spatial attention and priming: object cueing, useful-field-of-view, and crowding.

Authors:  Nicholas C Foley; Stephen Grossberg; Ennio Mingolla
Journal:  Cogn Psychol       Date:  2012-03-14       Impact factor: 3.468

4.  Desirability, availability, credit assignment, category learning, and attention: Cognitive-emotional and working memory dynamics of orbitofrontal, ventrolateral, and dorsolateral prefrontal cortices.

Authors:  Stephen Grossberg
Journal:  Brain Neurosci Adv       Date:  2018-05-08

5.  A Canonical Laminar Neocortical Circuit Whose Bottom-Up, Horizontal, and Top-Down Pathways Control Attention, Learning, and Prediction.

Authors:  Stephen Grossberg
Journal:  Front Syst Neurosci       Date:  2021-04-23

6.  What are the Visual Features Underlying Rapid Object Recognition?

Authors:  Sébastien M Crouzet; Thomas Serre
Journal:  Front Psychol       Date:  2011-11-15

7.  Cortical Dynamics of Figure-Ground Separation in Response to 2D Pictures and 3D Scenes: How V2 Combines Border Ownership, Stereoscopic Cues, and Gestalt Grouping Rules.

Authors:  Stephen Grossberg
Journal:  Front Psychol       Date:  2016-01-26

8.  Neural Dynamics of Autistic Repetitive Behaviors and Fragile X Syndrome: Basal Ganglia Movement Gating and mGluR-Modulated Adaptively Timed Learning.

Authors:  Stephen Grossberg; Devika Kishnan
Journal:  Front Psychol       Date:  2018-03-13

9.  How the venetian blind percept emerges from the laminar cortical dynamics of 3D vision.

Authors:  Yongqiang Cao; Stephen Grossberg
Journal:  Front Psychol       Date:  2014-08-05

10.  Neural Computation of Surface Border Ownership and Relative Surface Depth from Ambiguous Contrast Inputs.

Authors:  Birgitta Dresp-Langley; Stephen Grossberg
Journal:  Front Psychol       Date:  2016-07-28
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.