Literature DB >> 31449021

Video Saliency Prediction using Spatiotemporal Residual Attentive Networks.

Qiuxia Lai, Wenguan Wang, Hanqiu Sun, Jianbing Shen.   

Abstract

This paper proposes a novel residual attentive learning network architecture for predicting dynamic eye-fixation maps. The proposed model emphasizes two essential issues, i.e, effective spatiotemporal feature integration and multi-scale saliency learning. For the first problem, appearance and motion streams are tightly coupled via dense residual cross connections, which integrate appearance information with multi-layer, comprehensive motion features in a residual and dense way. Beyond traditional two-stream models learning appearance and motion features separately, such design allows early, multi-path information exchange between different domains, leading to a unified and powerful spatiotemporal learning architecture. For the second one, we propose a composite attention mechanism that learns multi-scale local attentions and global attention priors end-to-end. It is used for enhancing the fused spatiotemporal features via emphasizing important features in multi-scales. A lightweight convolutional Gated Recurrent Unit (convGRU), which is flexible for small training data situation, is used for long-term temporal characteristics modeling. Extensive experiments over four benchmark datasets clearly demonstrate the advantage of the proposed video saliency model over other competitors and the effectiveness of each component of our network. Our code and all the results will be available at https://github.com/ashleylqx/STRA-Net.

Entities:  

Year:  2019        PMID: 31449021     DOI: 10.1109/TIP.2019.2936112

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  1 in total

1.  Towards Making Videos Accessible for Low Vision Screen Magnifier Users.

Authors:  Ali Selman Aydin; Shirin Feiz; Vikas Ashok; I V Ramakrishnan
Journal:  IUI       Date:  2020-03
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.