Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Video Saliency Prediction using Spatiotemporal Residual Attentive Networks.

Literature DB >> 31449021

Video Saliency Prediction using Spatiotemporal Residual Attentive Networks.

Qiuxia Lai, Wenguan Wang, Hanqiu Sun, Jianbing Shen.

Abstract

This paper proposes a novel residual attentive learning network architecture for predicting dynamic eye-fixation maps. The proposed model emphasizes two essential issues, i.e, effective spatiotemporal feature integration and multi-scale saliency learning. For the first problem, appearance and motion streams are tightly coupled via dense residual cross connections, which integrate appearance information with multi-layer, comprehensive motion features in a residual and dense way. Beyond traditional two-stream models learning appearance and motion features separately, such design allows early, multi-path information exchange between different domains, leading to a unified and powerful spatiotemporal learning architecture. For the second one, we propose a composite attention mechanism that learns multi-scale local attentions and global attention priors end-to-end. It is used for enhancing the fused spatiotemporal features via emphasizing important features in multi-scales. A lightweight convolutional Gated Recurrent Unit (convGRU), which is flexible for small training data situation, is used for long-term temporal characteristics modeling. Extensive experiments over four benchmark datasets clearly demonstrate the advantage of the proposed video saliency model over other competitors and the effectiveness of each component of our network. Our code and all the results will be available at https://github.com/ashleylqx/STRA-Net.

Entities: Disease

Year: 2019 PMID： 31449021 DOI： 10.1109/TIP.2019.2936112

Source DB: PubMed Journal: IEEE Trans Image Process ISSN： 1057-7149 Impact factor: 10.856

Keyword Cloud
Cited

1 in total

1. Towards Making Videos Accessible for Low Vision Screen Magnifier Users.

Authors: Ali Selman Aydin; Shirin Feiz; Vikas Ashok; I V Ramakrishnan
Journal: IUI Date: 2020-03

1 in total