Literature DB >> 32325309

Minimal videos: Trade-off between spatial and temporal information in human and machine vision.

Guy Ben-Yosef1, Gabriel Kreiman2, Shimon Ullman3.   

Abstract

Objects and their parts can be visually recognized from purely spatial or purely temporal information but the mechanisms integrating space and time are poorly understood. Here we show that visual recognition of objects and actions can be achieved by efficiently combining spatial and motion cues in configurations where each source on its own is insufficient for recognition. This analysis is obtained by identifying minimal videos: these are short and tiny video clips in which objects, parts, and actions can be reliably recognized, but any reduction in either space or time makes them unrecognizable. Human recognition in minimal videos is invariably accompanied by full interpretation of the internal components of the video. State-of-the-art deep convolutional networks for dynamic recognition cannot replicate human behavior in these configurations. The gap between human and machine vision demonstrated here is due to critical mechanisms for full spatiotemporal interpretation that are lacking in current computational models.
Copyright © 2020 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Comparing deep neural networks and humans; Integration of spatial and temporal visual information; Minimal images; Minimal videos; Visual dynamic recognition

Mesh:

Year:  2020        PMID: 32325309      PMCID: PMC7330814          DOI: 10.1016/j.cognition.2020.104263

Source DB:  PubMed          Journal:  Cognition        ISSN: 0010-0277


  22 in total

1.  POST-RETINAL VISUAL STORAGE.

Authors:  T E PARKS
Journal:  Am J Psychol       Date:  1965-03

2.  Integration of form and motion in the anterior superior temporal polysensory area (STPa) of the macaque monkey.

Authors:  M W Oram; D I Perrett
Journal:  J Neurophysiol       Date:  1996-07       Impact factor: 2.714

3.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

4.  Sensitivity to timing and order in human visual cortex.

Authors:  Jedediah M Singer; Joseph R Madsen; William S Anderson; Gabriel Kreiman
Journal:  J Neurophysiol       Date:  2014-11-26       Impact factor: 2.714

5.  Long-Term Recurrent Convolutional Networks for Visual Recognition and Description.

Authors:  Jeff Donahue; Lisa Anne Hendricks; Marcus Rohrbach; Subhashini Venugopalan; Sergio Guadarrama; Kate Saenko; Trevor Darrell
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2016-09-01       Impact factor: 6.226

6.  Phi movement as a subtraction process.

Authors:  S M Anstis
Journal:  Vision Res       Date:  1970-12       Impact factor: 1.886

7.  Recognition memory for a rapid sequence of pictures.

Authors:  M C Potter; E I Levy
Journal:  J Exp Psychol       Date:  1969-07

8.  Functional neuroanatomy of biological motion perception in humans.

Authors:  L M Vaina; J Solomon; S Chowdhury; P Sinha; J W Belliveau
Journal:  Proc Natl Acad Sci U S A       Date:  2001-09-11       Impact factor: 11.205

9.  Critical features for the recognition of biological motion.

Authors:  Antonino Casile; Martin A Giese
Journal:  J Vis       Date:  2005-04-18       Impact factor: 2.240

10.  Temporal "Bubbles" reveal key features for point-light biological motion perception.

Authors:  Steven M Thurman; Emily D Grossman
Journal:  J Vis       Date:  2008-03-27       Impact factor: 2.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.