Literature DB >> 33096374

Meaning maps and saliency models based on deep convolutional neural networks are insensitive to image meaning when predicting human fixations.

Marek A Pedziwiatr1, Matthias Kümmerer2, Thomas S A Wallis2, Matthias Bethge2, Christoph Teufel3.   

Abstract

Eye movements are vital for human vision, and it is therefore important to understand how observers decide where to look. Meaning maps (MMs), a technique to capture the distribution of semantic information across an image, have recently been proposed to support the hypothesis that meaning rather than image features guides human gaze. MMs have the potential to be an important tool far beyond eye-movements research. Here, we examine central assumptions underlying MMs. First, we compared the performance of MMs in predicting fixations to saliency models, showing that DeepGaze II - a deep neural network trained to predict fixations based on high-level features rather than meaning - outperforms MMs. Second, we show that whereas human observers respond to changes in meaning induced by manipulating object-context relationships, MMs and DeepGaze II do not. Together, these findings challenge central assumptions underlying the use of MMs to measure the distribution of meaning in images.
Copyright © 2020. Published by Elsevier B.V.

Entities:  

Keywords:  Deep neural networks; Eye movements; Meaning maps; Natural scenes; Saliency

Year:  2020        PMID: 33096374     DOI: 10.1016/j.cognition.2020.104465

Source DB:  PubMed          Journal:  Cognition        ISSN: 0010-0277


  4 in total

1.  Disrupted object-scene semantics boost scene recall but diminish object recall in drawings from memory.

Authors:  Wilma A Bainbridge; Wan Y Kwok; Chris I Baker
Journal:  Mem Cognit       Date:  2021-05-24

2.  Salience-based object prioritization during active viewing of naturalistic scenes in young and older adults.

Authors:  Antje Nuthmann; Immo Schütz; Wolfgang Einhäuser
Journal:  Sci Rep       Date:  2020-12-16       Impact factor: 4.379

3.  Semantic object-scene inconsistencies affect eye movements, but not in the way predicted by contextualized meaning maps.

Authors:  Marek A Pedziwiatr; Matthias Kümmerer; Thomas S A Wallis; Matthias Bethge; Christoph Teufel
Journal:  J Vis       Date:  2022-02-01       Impact factor: 2.240

4.  Cognitive load influences oculomotor behavior in natural scenes.

Authors:  Kerri Walter; Peter Bex
Journal:  Sci Rep       Date:  2021-06-11       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.