Literature DB >> 29951197

Image interpretation above and below the object level.

Guy Ben-Yosef1,2,3, Shimon Ullman2,3.   

Abstract

Computational models of vision have advanced in recent years at a rapid rate, rivalling in some areas human-level performance. Much of the progress to date has focused on analysing the visual scene at the object level-the recognition and localization of objects in the scene. Human understanding of images reaches a richer and deeper image understanding both 'below' the object level, such as identifying and localizing object parts and sub-parts, as well as 'above' the object level, such as identifying object relations, and agents with their actions and interactions. In both cases, understanding depends on recovering meaningful structures in the image, and their components, properties and inter-relations, a process referred here as 'image interpretation'. In this paper, we describe recent directions, based on human and computer vision studies, towards human-like image interpretation, beyond the reach of current schemes, both below the object level, as well as some aspects of image interpretation at the level of meaningful configurations beyond the recognition of individual objects, and in particular, interactions between two people in close contact. In both cases the recognition process depends on the detailed interpretation of so-called 'minimal images', and at both levels recognition depends on combining 'bottom-up' processing, proceeding from low to higher levels of a processing hierarchy, together with 'top-down' processing, proceeding from high to lower levels stages of visual analysis.

Entities:  

Keywords:  interaction recognition; minimal images; social interactions; visual interpretation; visual recognition

Year:  2018        PMID: 29951197      PMCID: PMC6015807          DOI: 10.1098/rsfs.2018.0020

Source DB:  PubMed          Journal:  Interface Focus        ISSN: 2042-8898            Impact factor:   3.906


  22 in total

1.  Object detection with discriminatively trained part-based models.

Authors:  Pedro F Felzenszwalb; Ross B Girshick; David McAllester; Deva Ramanan
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2010-09       Impact factor: 6.226

2.  Atoms of recognition in human and computer vision.

Authors:  Shimon Ullman; Liav Assif; Ethan Fetaya; Daniel Harari
Journal:  Proc Natl Acad Sci U S A       Date:  2016-02-16       Impact factor: 11.205

Review 3.  Distributed hierarchical processing in the primate cerebral cortex.

Authors:  D J Felleman; D C Van Essen
Journal:  Cereb Cortex       Date:  1991 Jan-Feb       Impact factor: 5.357

Review 4.  Learning multiple layers of representation.

Authors:  Geoffrey E Hinton
Journal:  Trends Cogn Sci       Date:  2007-10       Impact factor: 20.229

5.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

Authors:  Liang-Chieh Chen; George Papandreou; Iasonas Kokkinos; Kevin Murphy; Alan L Yuille
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2017-04-27       Impact factor: 6.226

6.  Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge.

Authors:  Oriol Vinyals; Alexander Toshev; Samy Bengio; Dumitru Erhan
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2016-07-07       Impact factor: 6.226

7.  Performance-optimized hierarchical models predict neural responses in higher visual cortex.

Authors:  Daniel L K Yamins; Ha Hong; Charles F Cadieu; Ethan A Solomon; Darren Seibert; James J DiCarlo
Journal:  Proc Natl Acad Sci U S A       Date:  2014-05-08       Impact factor: 11.205

8.  Recognition-by-components: a theory of human image understanding.

Authors:  Irving Biederman
Journal:  Psychol Rev       Date:  1987-04       Impact factor: 8.934

9.  A dedicated network for social interaction processing in the primate brain.

Authors:  J Sliwa; W A Freiwald
Journal:  Science       Date:  2017-05-19       Impact factor: 47.728

10.  Representation of stable social dominance relations by human infants.

Authors:  Olivier Mascaro; Gergely Csibra
Journal:  Proc Natl Acad Sci U S A       Date:  2012-04-16       Impact factor: 11.205

View more
  2 in total

1.  Minimal videos: Trade-off between spatial and temporal information in human and machine vision.

Authors:  Guy Ben-Yosef; Gabriel Kreiman; Shimon Ullman
Journal:  Cognition       Date:  2020-04-20

2.  What Does a Language-And-Vision Transformer See: The Impact of Semantic Information on Visual Representations.

Authors:  Nikolai Ilinykh; Simon Dobnik
Journal:  Front Artif Intell       Date:  2021-12-03
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.