Literature DB >> 36051852

When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes.

Philipp Bomatter1, Mengmi Zhang2,3, Dimitar Karev4, Spandan Madan3,5, Claire Tseng4, Gabriel Kreiman2,3.   

Abstract

Context is of fundamental importance to both human and machine vision; e.g., an object in the air is more likely to be an airplane than a pig. The rich notion of context incorporates several aspects including physics rules, statistical co-occurrences, and relative object sizes, among others. While previous work has focused on crowd-sourced out-of-context photographs from the web to study scene context, controlling the nature and extent of contextual violations has been a daunting task. Here we introduce a diverse, synthetic Out-of-Context Dataset (OCD) with fine-grained control over scene context. By leveraging a 3D simulation engine, we systematically control the gravity, object co-occurrences and relative sizes across 36 object categories in a virtual household environment. We conducted a series of experiments to gain insights into the impact of contextual cues on both human and machine vision using OCD. We conducted psychophysics experiments to establish a human benchmark for out-of-context recognition, and then compared it with state-of-the-art computer vision models to quantify the gap between the two. We propose a context-aware recognition transformer model, fusing object and contextual information via multi-head attention. Our model captures useful information for contextual reasoning, enabling human-level performance and better robustness in out-of-context conditions compared to baseline models across OCD and other out-of-context datasets. All source code and data are publicly available at https://github.com/kreimanlab/WhenPigsFlyContext.

Entities:  

Year:  2022        PMID: 36051852      PMCID: PMC9432425          DOI: 10.1109/iccv48922.2021.00032

Source DB:  PubMed          Journal:  IEEE Int Conf Comput Vis Workshops        ISSN: 2473-9936


  7 in total

Review 1.  The role of context in object recognition.

Authors:  Aude Oliva; Antonio Torralba
Journal:  Trends Cogn Sci       Date:  2007-11-19       Impact factor: 20.229

2.  A tree-based context model for object recognition.

Authors:  Myung Jin Choi; Antonio Torralba; Alan S Willsky
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2012-02       Impact factor: 6.226

3.  Object co-occurrence serves as a contextual cue to guide and facilitate visual search in a natural viewing environment.

Authors:  Stephen C Mack; Miguel P Eckstein
Journal:  J Vis       Date:  2011-08-19       Impact factor: 2.240

4.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

Authors:  Liang-Chieh Chen; George Papandreou; Iasonas Kokkinos; Kevin Murphy; Alan L Yuille
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2017-04-27       Impact factor: 6.226

5.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.

Authors:  Shaoqing Ren; Kaiming He; Ross Girshick; Jian Sun
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2016-06-06       Impact factor: 6.226

6.  Recurrent computations for visual pattern completion.

Authors:  Hanlin Tang; Martin Schrimpf; William Lotter; Charlotte Moerman; Ana Paredes; Josue Ortega Caro; Walter Hardesty; David Cox; Gabriel Kreiman
Journal:  Proc Natl Acad Sci U S A       Date:  2018-08-13       Impact factor: 11.205

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.