Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Top-Down Visual Saliency via Joint CRF and Dictionary Learning.

Literature DB >> 28113265

Top-Down Visual Saliency via Joint CRF and Dictionary Learning.

Abstract

Top-down visual saliency is an important module of visual attention. In this work, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a visual dictionary. The proposed model incorporates a layered structure from top to bottom: CRF, sparse coding and image patches. With sparse coding as an intermediate layer, CRF is learned in a feature-adaptive manner; meanwhile with CRF as the output layer, the dictionary is learned under structured supervision. For efficient and effective joint learning, we develop a max-margin approach via a stochastic gradient descent algorithm. Experimental results on the Graz-02 and PASCAL VOC datasets show that our model performs favorably against state-of-the-art top-down saliency methods for target object localization. In addition, the dictionary update significantly improves the performance of our model. We demonstrate the merits of the proposed top-down saliency model by applying it to prioritizing object proposals for detection and predicting human fixations.

Entities: Disease Species

Year: 2016 PMID： 28113265 DOI： 10.1109/TPAMI.2016.2547384

Source DB: PubMed Journal: IEEE Trans Pattern Anal Mach Intell ISSN： 0098-5589 Impact factor: 6.226

Keyword Cloud
Cited

5 in total

Top-Down Visual Saliency via Joint CRF and Dictionary Learning.

Review 1. Salient Object Detection Techniques in Computer Vision-A Survey.

2. Segmentation of multicorrelated images with copula models and conditionally random fields.

3. Finding any Waldo with zero-shot invariant and efficient visual search.

4. 3D ear identification based on sparse representation.

5. Joint Learning of Binocularly Driven Saccades and Vergence by Active Efficient Coding.