
Humans, but Not Deep Neural Networks, Often Miss Giant Targets in Scenes.

Miguel P Eckstein, Kathryn Koehler, Lauren E Welbourne, Emre Akbas.

Abstract

Even with great advances in machine vision, animals are still unmatched in their ability to visually search complex scenes. Animals from bees [1, 2] to birds [3] to humans [4-12] learn about the statistical relations in visual environments to guide and aid their search for targets. Here, we investigate a novel manner in which humans utilize rapidly acquired information about scenes by guiding search toward likely target sizes. We show that humans often miss targets when their size is inconsistent with the rest of the scene, even when the targets were made larger and more salient and observers fixated the target. In contrast, we show that state-of-the-art deep neural networks do not exhibit such deficits in finding mis-scaled targets but, unlike humans, can be fooled by target-shaped distractors that are inconsistent with the expected target's size within the scene. Thus, it is not a human deficiency to miss targets when they are inconsistent in size with the scene; instead, it is a byproduct of a useful strategy that the brain has implemented to rapidly discount potential distractors.
Copyright © 2017 Elsevier Ltd. All rights reserved.
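The abstract contrasts humans, who restrict search to scene-consistent target sizes, with detectors that simply test candidate matches at many scales and so find "giant" targets as readily as normal-sized ones. As a toy illustration of that scale-agnostic strategy (a minimal numpy sketch, not the deep networks evaluated in the paper), the following code correlates a target template against an image at several sizes and keeps the best-scoring match, with no prior about which size is plausible:

```python
import numpy as np

def match_at_scale(image, template):
    """Slide `template` over `image`; return the best normalized
    cross-correlation score and its top-left (row, col) location."""
    th, tw = template.shape
    ih, iw = image.shape
    t = template - template.mean()
    tnorm = np.linalg.norm(t)
    best_score, best_loc = -1.0, (0, 0)
    for y in range(ih - th + 1):
        for x in range(iw - tw + 1):
            patch = image[y:y + th, x:x + tw]
            p = patch - patch.mean()
            denom = np.linalg.norm(p) * tnorm
            if denom == 0:          # flat patch, e.g. empty background
                continue
            score = float((p * t).sum() / denom)
            if score > best_score:
                best_score, best_loc = score, (y, x)
    return best_score, best_loc

def rescale(template, factor):
    """Nearest-neighbour resize of a 2-D template by `factor`."""
    h, w = template.shape
    nh, nw = max(1, round(h * factor)), max(1, round(w * factor))
    ys = (np.arange(nh) * h / nh).astype(int)
    xs = (np.arange(nw) * w / nw).astype(int)
    return template[np.ix_(ys, xs)]

def scale_agnostic_search(image, template, factors=(0.5, 1.0, 2.0, 4.0)):
    """Exhaustively test every scale, keeping the best match overall.
    Unlike human search, no size is discounted as implausible."""
    results = [(f, *match_at_scale(image, rescale(template, f)))
               for f in factors]
    return max(results, key=lambda r: r[1])  # (factor, score, (y, x))

# Demo: plant a 2x "giant" copy of the template in an empty scene.
rng = np.random.default_rng(0)
template = rng.random((4, 4))
image = np.zeros((32, 32))
image[10:18, 20:28] = rescale(template, 2.0)
factor, score, loc = scale_agnostic_search(image, template, factors=(1.0, 2.0))
# The mis-scaled (2x) target is found, with a near-perfect score at (10, 20).
```

Because every scale is tested, the search cannot "miss" an oversized target; the flip side, as the abstract notes, is that such a detector is also fooled by target-shaped distractors whose size is inconsistent with the scene.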

Keywords:  computer vision; convolutional neural networks; deep neural networks; guided search; object detection; perception; scene context; search errors; visual attention; visual search

Year:  2017        PMID: 28889976     DOI: 10.1016/j.cub.2017.07.068

Source DB:  PubMed          Journal:  Curr Biol        ISSN: 0960-9822            Impact factor:   10.834


Related articles: 13 in total (10 shown below)

1.  Real-world size coding of solid objects, but not 2-D or 3-D images, in visual agnosia patients with bilateral ventral lesions.

Authors:  Desiree E Holler; Marlene Behrmann; Jacqueline C Snow
Journal:  Cortex       Date:  2019-03-09       Impact factor: 4.027

2. [Review] Beyond the feedforward sweep: feedback computations in the visual cortex.

Authors:  Gabriel Kreiman; Thomas Serre
Journal:  Ann N Y Acad Sci       Date:  2020-02-28       Impact factor: 5.691

3.  Local features and global shape information in object classification by deep convolutional neural networks.

Authors:  Nicholas Baker; Hongjing Lu; Gennady Erlikhman; Philip J Kellman
Journal:  Vision Res       Date:  2020-05-12       Impact factor: 1.886

4.  Saliency-Aware Subtle Augmentation Improves Human Visual Search Performance in VR.

Authors:  Olga Lukashova-Sanz; Siegfried Wahl
Journal:  Brain Sci       Date:  2021-02-25

5.  Cyborg groups enhance face recognition in crowded environments.

Authors:  Davide Valeriani; Riccardo Poli
Journal:  PLoS One       Date:  2019-03-06       Impact factor: 3.240

6.  Qualitative similarities and differences in visual object representations between brains and deep networks.

Authors:  Georgin Jacob; R T Pramod; Harish Katti; S P Arun
Journal:  Nat Commun       Date:  2021-03-25       Impact factor: 14.919

7. [Review] How context changes the neural basis of perception and language.

Authors:  Roel M Willems; Marius V Peelen
Journal:  iScience       Date:  2021-04-02

8.  Under-exploration of Three-Dimensional Images Leads to Search Errors for Small Salient Targets.

Authors:  Miguel A Lago; Aditya Jonnalagadda; Craig K Abbey; Bruno B Barufaldi; Predrag R Bakic; Andrew D A Maidment; Winifred K Leung; Susan P Weinstein; Brian S Englander; Miguel P Eckstein
Journal:  Curr Biol       Date:  2021-01-19       Impact factor: 10.834

9.  Object detection through search with a foveated visual system.

Authors:  Emre Akbas; Miguel P Eckstein
Journal:  PLoS Comput Biol       Date:  2017-10-09       Impact factor: 4.475

10.  Scenes Modulate Object Processing Before Interacting With Memory Templates.

Authors:  Surya Gayet; Marius V Peelen
Journal:  Psychol Sci       Date:  2019-09-16
