Literature DB >> 32347528

The Linguistic Analysis of Scene Semantics: LASS.

Dylan Rose1, Peter Bex2.   

Abstract

In this paper, we define a new method for analyzing object-scene contextual relationships using computational linguistics: Linguistic Analysis of Scene Semantics, or LASS. LASS uses linguistic semantic similarity relationships between scene object and context labels embedded in a vector-space language model: Facebook Research's fastText. Importantly, the use of fastText permits semantic similarity score calculation between any set of strings and thus elements of any set of image data for which labels are available. Scene semantic similarity scores are then embedded in object segmentation mask locations in the image, creating a semantic similarity map. LASS can also be fully automated by generating context and object labels, as well as object segmentation masks, using deep learning. We compare semantic similarity maps between human- and neural network-generated annotations on a corpus of images taken from the LabelMe database. Semantic similarity maps produced by the fully automated LASS have a number of desirable properties, while maintaining a high degree of spatial and semantic similarity to them. Finally, we use LASS to evaluate the distribution of semantically consistent scene elements in space. Both show relatively uniform distributions of semantic relatedness to scene context, suggesting that contextually appropriate objects are likely to be found in all image regions. Taken together, these results suggest that LASS is accurate, automatic, flexible, and useful in a number of research contexts such as scene grammar and novelty detection.

Entities:  

Keywords:  Computational linguistics; Natural scenes; Scene semantics

Mesh:

Year:  2020        PMID: 32347528      PMCID: PMC8765594          DOI: 10.3758/s13428-020-01390-8

Source DB:  PubMed          Journal:  Behav Res Methods        ISSN: 1554-351X


  21 in total

1.  Visual memory and motor planning in a natural task.

Authors:  Mary M Hayhoe; Anurag Shrivastava; Ryan Mruczek; Jeff B Pelz
Journal:  J Vis       Date:  2003       Impact factor: 2.240

Review 2.  On the temporal dynamics of language-mediated vision and vision-mediated language.

Authors:  Sarah E Anderson; Eric Chiu; Stephanie Huette; Michael J Spivey
Journal:  Acta Psychol (Amst)       Date:  2010-10-18

Review 3.  In praise of artifice.

Authors:  Nicole C Rust; J Anthony Movshon
Journal:  Nat Neurosci       Date:  2005-12       Impact factor: 24.884

Review 4.  How close are we to understanding v1?

Authors:  Bruno A Olshausen; David J Field
Journal:  Neural Comput       Date:  2005-08       Impact factor: 2.026

5.  Visual saliency and semantic incongruency influence eye movements when inspecting pictures.

Authors:  Geoffrey Underwood; Tom Foulsham
Journal:  Q J Exp Psychol (Hove)       Date:  2006-11       Impact factor: 2.143

6.  SCEGRAM: An image database for semantic and syntactic inconsistencies in scenes.

Authors:  Sabine Öhlschläger; Melissa Le-Hoa Võ
Journal:  Behav Res Methods       Date:  2017-10

7.  Seek and you shall remember: scene semantics interact with visual search to build better memories.

Authors:  Dejan Draschkow; Jeremy M Wolfe; Melissa L H Võ
Journal:  J Vis       Date:  2014-07-11       Impact factor: 2.240

8.  Disentangling stimulus plausibility and contextual congruency: Electro-physiological evidence for differential cognitive dynamics.

Authors:  Moreno I Coco; Susana Araujo; Karl Magnus Petersson
Journal:  Neuropsychologia       Date:  2016-12-11       Impact factor: 3.139

9.  Cognitive determinants of fixation location during picture viewing.

Authors:  G R Loftus; N H Mackworth
Journal:  J Exp Psychol Hum Percept Perform       Date:  1978-11       Impact factor: 3.332

10.  BOiS-Berlin Object in Scene Database: Controlled Photographic Images for Visual Search Experiments with Quantified Contextual Priors.

Authors:  Johannes Mohr; Julia Seyfarth; Andreas Lueschow; Joachim E Weber; Felix A Wichmann; Klaus Obermayer
Journal:  Front Psychol       Date:  2016-05-23
View more
  2 in total

1.  Semantic object-scene inconsistencies affect eye movements, but not in the way predicted by contextualized meaning maps.

Authors:  Marek A Pedziwiatr; Matthias Kümmerer; Thomas S A Wallis; Matthias Bethge; Christoph Teufel
Journal:  J Vis       Date:  2022-02-01       Impact factor: 2.240

2.  Cognitive load influences oculomotor behavior in natural scenes.

Authors:  Kerri Walter; Peter Bex
Journal:  Sci Rep       Date:  2021-06-11       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.