Literature DB >> 26024466

Learning-based saliency model with depth information.

Chih-Yao Ma, Hsueh-Ming Hang.   

Abstract

Most previous studies on visual saliency focused on two-dimensional (2D) scenes. Due to the rapidly growing three-dimensional (3D) video applications, it is very desirable to know how depth information affects human visual attention. In this study, we first conducted eye-fixation experiments on 3D images. Our fixation data set comprises 475 3D images and 16 subjects. We used a Tobii TX300 eye tracker (Tobii, Stockholm, Sweden) to track the eye movement of each subject. In addition, this database contains 475 computed depth maps. Due to the scarcity of public-domain 3D fixation data, this data set should be useful to the 3D visual attention research community. Then, a learning-based visual attention model was designed to predict human attention. In addition to the popular 2D features, we included the depth map and its derived features. The results indicate that the extra depth information can enhance the saliency estimation accuracy specifically for close-up objects hidden in a complex-texture background. In addition, we examined the effectiveness of various low-, mid-, and high-level features on saliency prediction. Compared with both 2D and 3D state-of-the-art saliency estimation models, our methods show better performance on the 3D test images. The eye-tracking database and the MATLAB source codes for the proposed saliency model and evaluation methods are available on our website.

Entities:  

Mesh:

Year:  2015        PMID: 26024466     DOI: 10.1167/15.6.19

Source DB:  PubMed          Journal:  J Vis        ISSN: 1534-7362            Impact factor:   2.240


  5 in total

1.  A proto-object based saliency model in three-dimensional space.

Authors:  Brian Hu; Ralinkae Kane-Jackson; Ernst Niebur
Journal:  Vision Res       Date:  2016-01-19       Impact factor: 1.886

2.  Gaze distribution analysis and saliency prediction across age groups.

Authors:  Onkar Krishna; Andrea Helo; Pia Rämä; Kiyoharu Aizawa
Journal:  PLoS One       Date:  2018-02-23       Impact factor: 3.240

3.  Deep Multimodal Fusion Autoencoder for Saliency Prediction of RGB-D Images.

Authors:  Kengda Huang; Wujie Zhou; Meixin Fang
Journal:  Comput Intell Neurosci       Date:  2021-05-05

4.  Hierarchical Multimodal Adaptive Fusion (HMAF) Network for Prediction of RGB-D Saliency.

Authors:  Ying Lv; Wujie Zhou
Journal:  Comput Intell Neurosci       Date:  2020-11-20

Review 5.  RGB-D salient object detection: A survey.

Authors:  Tao Zhou; Deng-Ping Fan; Ming-Ming Cheng; Jianbing Shen; Ling Shao
Journal:  Comput Vis Media (Beijing)       Date:  2021-01-07
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.