Literature DB >> 22868572

Quantitative analysis of human-model agreement in visual saliency modeling: a comparative study.

Ali Borji1, Dicky N Sihite, Laurent Itti.   

Abstract

Visual attention is a process that enables biological and machine vision systems to select the most relevant regions from a scene. Relevance is determined by two components: 1) top-down factors driven by task and 2) bottom-up factors that highlight image regions that are different from their surroundings. The latter are often referred to as "visual saliency." Modeling bottom-up visual saliency has been the subject of numerous research efforts during the past 20 years, with many successful applications in computer vision and robotics. Available models have been tested with different datasets (e.g., synthetic psychological search arrays, natural images or videos) using different evaluation scores (e.g., search slopes, comparison to human eye tracking) and parameter settings. This has made direct comparison of models difficult. Here, we perform an exhaustive comparison of 35 state-of-the-art saliency models over 54 challenging synthetic patterns, three natural image datasets, and two video datasets, using three evaluation scores. We find that although model rankings vary, some models consistently perform better. Analysis of datasets reveals that existing datasets are highly center-biased, which influences some of the evaluation scores. Computational complexity analysis shows that some models are very fast, yet yield competitive eye movement prediction accuracy. Different models often have common easy/difficult stimuli. Furthermore, several concerns in visual saliency modeling, eye movement datasets, and evaluation scores are discussed and insights for future work are provided. Our study allows one to assess the state-of-the-art, helps to organizing this rapidly growing field, and sets a unified comparison framework for gauging future efforts, similar to the PASCAL VOC challenge in the object recognition and detection domains.

Entities:  

Mesh:

Year:  2012        PMID: 22868572     DOI: 10.1109/TIP.2012.2210727

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  50 in total

1.  Saliency and saccade encoding in the frontal eye field during natural scene search.

Authors:  Hugo L Fernandes; Ian H Stevenson; Adam N Phillips; Mark A Segraves; Konrad P Kording
Journal:  Cereb Cortex       Date:  2013-07-17       Impact factor: 5.357

2.  Modeling peripheral visual acuity enables discovery of gaze strategies at multiple time scales during natural scene search.

Authors:  Pavan Ramkumar; Hugo Fernandes; Konrad Kording; Mark Segraves
Journal:  J Vis       Date:  2015-03-26       Impact factor: 2.240

3.  Scanpath estimation based on foveated image saliency.

Authors:  Yixiu Wang; Bin Wang; Xiaofeng Wu; Liming Zhang
Journal:  Cogn Process       Date:  2016-10-14

4.  Transient pupil response is modulated by contrast-based saliency.

Authors:  Chin-An Wang; Susan E Boehnke; Laurent Itti; Douglas P Munoz
Journal:  J Neurosci       Date:  2014-01-08       Impact factor: 6.167

Review 5.  The what, where, and why of priority maps and their interactions with visual working memory.

Authors:  Gregory J Zelinsky; James W Bisley
Journal:  Ann N Y Acad Sci       Date:  2015-01-07       Impact factor: 5.691

6.  Feature-based attention and spatial selection in frontal eye fields during natural scene search.

Authors:  Pavan Ramkumar; Patrick N Lawlor; Joshua I Glaser; Daniel K Wood; Adam N Phillips; Mark A Segraves; Konrad P Kording
Journal:  J Neurophysiol       Date:  2016-06-01       Impact factor: 2.714

7.  Information-theoretic model comparison unifies saliency metrics.

Authors:  Matthias Kümmerer; Thomas S A Wallis; Matthias Bethge
Journal:  Proc Natl Acad Sci U S A       Date:  2015-12-10       Impact factor: 11.205

8.  What do saliency models predict?

Authors:  Kathryn Koehler; Fei Guo; Sheng Zhang; Miguel P Eckstein
Journal:  J Vis       Date:  2014-03-11       Impact factor: 2.240

9.  Meaning guides attention during scene viewing, even when it is irrelevant.

Authors:  Candace E Peacock; Taylor R Hayes; John M Henderson
Journal:  Atten Percept Psychophys       Date:  2019-01       Impact factor: 2.199

Review 10.  Using multidimensional scaling to quantify similarity in visual search and beyond.

Authors:  Michael C Hout; Hayward J Godwin; Gemma Fitzsimmons; Arryn Robbins; Tamaryn Menneer; Stephen D Goldinger
Journal:  Atten Percept Psychophys       Date:  2016-01       Impact factor: 2.199

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.