Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes.

Literature DB >> 36051852

When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes.

Philipp Bomatter¹, Mengmi Zhang^2,3, Dimitar Karev⁴, Spandan Madan^3,5, Claire Tseng⁴, Gabriel Kreiman^2,3.

Abstract

Context is of fundamental importance to both human and machine vision; e.g., an object in the air is more likely to be an airplane than a pig. The rich notion of context incorporates several aspects including physics rules, statistical co-occurrences, and relative object sizes, among others. While previous work has focused on crowd-sourced out-of-context photographs from the web to study scene context, controlling the nature and extent of contextual violations has been a daunting task. Here we introduce a diverse, synthetic Out-of-Context Dataset (OCD) with fine-grained control over scene context. By leveraging a 3D simulation engine, we systematically control the gravity, object co-occurrences and relative sizes across 36 object categories in a virtual household environment. We conducted a series of experiments to gain insights into the impact of contextual cues on both human and machine vision using OCD. We conducted psychophysics experiments to establish a human benchmark for out-of-context recognition, and then compared it with state-of-the-art computer vision models to quantify the gap between the two. We propose a context-aware recognition transformer model, fusing object and contextual information via multi-head attention. Our model captures useful information for contextual reasoning, enabling human-level performance and better robustness in out-of-context conditions compared to baseline models across OCD and other out-of-context datasets. All source code and data are publicly available at https://github.com/kreimanlab/WhenPigsFlyContext.

Entities: Chemical

Year: 2022 PMID： 36051852 PMCID： PMC9432425 DOI： 10.1109/iccv48922.2021.00032

Source DB: PubMed Journal: IEEE Int Conf Comput Vis Workshops ISSN： 2473-9936

Keyword Cloud
References

7 in total

Review 1. The role of context in object recognition.

Authors: Aude Oliva; Antonio Torralba
Journal: Trends Cogn Sci Date: 2007-11-19 Impact factor: 20.229

2. A tree-based context model for object recognition.

Authors: Myung Jin Choi; Antonio Torralba; Alan S Willsky
Journal: IEEE Trans Pattern Anal Mach Intell Date: 2012-02 Impact factor: 6.226

3. Object co-occurrence serves as a contextual cue to guide and facilitate visual search in a natural viewing environment.

Authors: Stephen C Mack; Miguel P Eckstein
Journal: J Vis Date: 2011-08-19 Impact factor: 2.240

When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes.

Review 1. The role of context in object recognition.

2. A tree-based context model for object recognition.

3. Object co-occurrence serves as a contextual cue to guide and facilitate visual search in a natural viewing environment.

4. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

5. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.

6. Recurrent computations for visual pattern completion.