Literature DB >> 28945593

Video Salient Object Detection via Fully Convolutional Networks.

Wenguan Wang1, Jianbing Shen1, Ling Shao2.   

Abstract

This paper proposes a deep learning model to efficiently detect salient regions in videos. It addresses two important issues: 1) deep video saliency model training with the absence of sufficiently large and pixel-wise annotated video data and 2) fast video saliency training and detection. The proposed deep video saliency network consists of two modules, for capturing the spatial and temporal saliency information, respectively. The dynamic saliency model, explicitly incorporating saliency estimates from the static saliency model, directly produces spatiotemporal saliency inference without time-consuming optical flow computation. We further propose a novel data augmentation technique that simulates video training data from existing annotated image data sets, which enables our network to learn diverse saliency information and prevents overfitting with the limited number of training videos. Leveraging our synthetic video data (150K video sequences) and real videos, our deep video saliency model successfully learns both spatial and temporal saliency cues, thus producing accurate spatiotemporal saliency estimate. We advance the state-of-the-art on the densely annotated video segmentation data set (MAE of .06) and the Freiburg-Berkeley Motion Segmentation data set (MAE of .07), and do so with much improved speed (2 fps with all steps).This paper proposes a deep learning model to efficiently detect salient regions in videos. It addresses two important issues: 1) deep video saliency model training with the absence of sufficiently large and pixel-wise annotated video data and 2) fast video saliency training and detection. The proposed deep video saliency network consists of two modules, for capturing the spatial and temporal saliency information, respectively. The dynamic saliency model, explicitly incorporating saliency estimates from the static saliency model, directly produces spatiotemporal saliency inference without time-consuming optical flow computation. We further propose a novel data augmentation technique that simulates video training data from existing annotated image data sets, which enables our network to learn diverse saliency information and prevents overfitting with the limited number of training videos. Leveraging our synthetic video data (150K video sequences) and real videos, our deep video saliency model successfully learns both spatial and temporal saliency cues, thus producing accurate spatiotemporal saliency estimate. We advance the state-of-the-art on the densely annotated video segmentation data set (MAE of .06) and the Freiburg-Berkeley Motion Segmentation data set (MAE of .07), and do so with much improved speed (2 fps with all steps).

Keywords:  Computational modeling; Computer vision; Machine learning; Object detection; Optical imaging; Spatiotemporal phenomena; Training

Year:  2018        PMID: 28945593     DOI: 10.1109/TIP.2017.2754941

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  6 in total

1.  Boundary-Weighted Domain Adaptive Neural Network for Prostate MR Image Segmentation.

Authors:  Qikui Zhu; Bo Du; Pingkun Yan
Journal:  IEEE Trans Med Imaging       Date:  2019-08-13       Impact factor: 10.048

2.  Multi-Scale Feature Fusion Convolutional Neural Network for Indoor Small Target Detection.

Authors:  Li Huang; Cheng Chen; Juntong Yun; Ying Sun; Jinrong Tian; Zhiqiang Hao; Hui Yu; Hongjie Ma
Journal:  Front Neurorobot       Date:  2022-05-19       Impact factor: 3.493

Review 3.  RGB-D salient object detection: A survey.

Authors:  Tao Zhou; Deng-Ping Fan; Ming-Ming Cheng; Jianbing Shen; Ling Shao
Journal:  Comput Vis Media (Beijing)       Date:  2021-01-07

4.  A novel fully convolutional network for visual saliency prediction.

Authors:  Bashir Muftah Ghariba; Mohamed S Shehata; Peter McGuire
Journal:  PeerJ Comput Sci       Date:  2020-07-13

5.  CariesNet: a deep learning approach for segmentation of multi-stage caries lesion from oral panoramic X-ray image.

Authors:  Haihua Zhu; Zheng Cao; Luya Lian; Guanchen Ye; Honghao Gao; Jian Wu
Journal:  Neural Comput Appl       Date:  2022-01-07       Impact factor: 5.102

Review 6.  Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review.

Authors:  Dengshan Li; Rujing Wang; Peng Chen; Chengjun Xie; Qiong Zhou; Xiufang Jia
Journal:  Micromachines (Basel)       Date:  2021-12-31       Impact factor: 2.891

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.