Literature DB >> 28113265

Top-Down Visual Saliency via Joint CRF and Dictionary Learning.

Jimei Yang, Ming-Hsuan Yang.   

Abstract

Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.

Entities:  

Year:  2016        PMID: 28113265     DOI: 10.1109/TPAMI.2016.2547384

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  5 in total

Review 1.  Salient Object Detection Techniques in Computer Vision-A Survey.

Authors:  Ashish Kumar Gupta; Ayan Seal; Mukesh Prasad; Pritee Khanna
Journal:  Entropy (Basel)       Date:  2020-10-19       Impact factor: 2.524

2.  Segmentation of multicorrelated images with copula models and conditionally random fields.

Authors:  Jérôme Lapuyade-Lahorgue; Su Ruan
Journal:  J Med Imaging (Bellingham)       Date:  2022-01-08

3.  Finding any Waldo with zero-shot invariant and efficient visual search.

Authors:  Mengmi Zhang; Jiashi Feng; Keng Teck Ma; Joo Hwee Lim; Qi Zhao; Gabriel Kreiman
Journal:  Nat Commun       Date:  2018-09-13       Impact factor: 14.919

4.  3D ear identification based on sparse representation.

Authors:  Lin Zhang; Zhixuan Ding; Hongyu Li; Ying Shen
Journal:  PLoS One       Date:  2014-04-16       Impact factor: 3.240

5.  Joint Learning of Binocularly Driven Saccades and Vergence by Active Efficient Coding.

Authors:  Qingpeng Zhu; Jochen Triesch; Bertram E Shi
Journal:  Front Neurorobot       Date:  2017-11-03       Impact factor: 2.650

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.