Literature DB >> 26024454

Spatiotemporal information during unsupervised learning enhances viewpoint invariant object recognition.

Moqian Tian, Kalanit Grill-Spector.   

Abstract

Recognizing objects is difficult because it requires both linking views of an object that can be different and distinguishing objects with similar appearance. Interestingly, people can learn to recognize objects across views in an unsupervised way, without feedback, just from the natural viewing statistics. However, there is intense debate regarding what information during unsupervised learning is used to link among object views. Specifically, researchers argue whether temporal proximity, motion, or spatiotemporal continuity among object views during unsupervised learning is beneficial. Here, we untangled the role of each of these factors in unsupervised learning of novel three-dimensional (3-D) objects. We found that after unsupervised training with 24 object views spanning a 180° view space, participants showed significant improvement in their ability to recognize 3-D objects across rotation. Surprisingly, there was no advantage to unsupervised learning with spatiotemporal continuity or motion information than training with temporal proximity. However, we discovered that when participants were trained with just a third of the views spanning the same view space, unsupervised learning via spatiotemporal continuity yielded significantly better recognition performance on novel views than learning via temporal proximity. These results suggest that while it is possible to obtain view-invariant recognition just from observing many views of an object presented in temporal proximity, spatiotemporal information enhances performance by producing representations with broader view tuning than learning via temporal association. Our findings have important implications for theories of object recognition and for the development of computational algorithms that learn from examples.

Entities:  

Mesh:

Year:  2015        PMID: 26024454      PMCID: PMC4429262          DOI: 10.1167/15.6.7

Source DB:  PubMed          Journal:  J Vis        ISSN: 1534-7362            Impact factor:   2.240


  45 in total

1.  The use of facial motion and facial form during the processing of identity.

Authors:  Barbara Knappmeyer; Ian M Thornton; Heinrich H Bülthoff
Journal:  Vision Res       Date:  2003-08       Impact factor: 1.886

2.  Exploring the role of characteristic motion when learning new faces.

Authors:  Karen Lander; Rebecca Davies
Journal:  Q J Exp Psychol (Hove)       Date:  2007-04       Impact factor: 2.143

3.  A network that learns to recognize three-dimensional objects.

Authors:  T Poggio; S Edelman
Journal:  Nature       Date:  1990-01-18       Impact factor: 49.962

Review 4.  Invariant face and object recognition in the visual system.

Authors:  G Wallis; E T Rolls
Journal:  Prog Neurobiol       Date:  1997-02       Impact factor: 11.685

5.  Performance-optimized hierarchical models predict neural responses in higher visual cortex.

Authors:  Daniel L K Yamins; Ha Hong; Charles F Cadieu; Ethan A Solomon; Darren Seibert; James J DiCarlo
Journal:  Proc Natl Acad Sci U S A       Date:  2014-05-08       Impact factor: 11.205

6.  Recognition-by-components: a theory of human image understanding.

Authors:  Irving Biederman
Journal:  Psychol Rev       Date:  1987-04       Impact factor: 8.934

7.  Recognizing people from dynamic and static faces and bodies: dissecting identity with a fusion approach.

Authors:  Alice J O'Toole; P Jonathon Phillips; Samuel Weimer; Dana A Roark; Julianne Ayyad; Robert Barwick; Joseph Dunlop
Journal:  Vision Res       Date:  2010-10-20       Impact factor: 1.886

Review 8.  Change blindness: past, present, and future.

Authors:  Daniel J Simons; Ronald A Rensink
Journal:  Trends Cogn Sci       Date:  2005-01       Impact factor: 20.229

9.  The representation of object viewpoint in human visual cortex.

Authors:  David R Andresen; Joakim Vinberg; Kalanit Grill-Spector
Journal:  Neuroimage       Date:  2008-11-25       Impact factor: 6.556

10.  Learning sequence of views of three-dimensional objects: the effect of temporal coherence on object memory.

Authors:  Taosheng Liu
Journal:  Perception       Date:  2007       Impact factor: 1.490

View more
  4 in total

1.  Learning the 3-D structure of objects from 2-D views depends on shape, not format.

Authors:  Moqian Tian; Daniel Yamins; Kalanit Grill-Spector
Journal:  J Vis       Date:  2016-05-01       Impact factor: 2.240

Review 2.  The functional neuroanatomy of face perception: from brain measurements to deep neural networks.

Authors:  Kalanit Grill-Spector; Kevin S Weiner; Jesse Gomez; Anthony Stigliani; Vaidehi S Natu
Journal:  Interface Focus       Date:  2018-06-15       Impact factor: 3.906

3.  Neurons in inferior temporal cortex are sensitive to motion trajectory during degraded object recognition.

Authors:  Diana C Burk; David L Sheinberg
Journal:  Cereb Cortex Commun       Date:  2022-08-18

4.  Temporal Contiguity Training Influences Behavioral and Neural Measures of Viewpoint Tolerance.

Authors:  Chayenne Van Meel; Hans P Op de Beeck
Journal:  Front Hum Neurosci       Date:  2018-01-30       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.