Literature DB >> 33232224

Zero-Shot Video Object Segmentation With Co-Attention Siamese Networks.

Xiankai Lu, Wenguan Wang, Jianbing Shen, David Crandall, Jiebo Luo.   

Abstract

We introduce a novel network, called CO-attention siamese network (COSNet), to address the zero-shot video object segmentation task in a holistic fashion. We exploit the inherent correlation among video frames and incorporate a global co-attention mechanism to further improve the state-of-the-art deep learning based solutions that primarily focus on learning discriminative foreground representations over appearance and motion in short-term temporal segments. The co-attention layers in COSNet provide efficient and competent stages for capturing global correlations and scene context by jointly computing and appending co-attention responses into a joint feature space. COSNet is a unified and end-to-end trainable framework where different co-attention variants can be derived for capturing diverse properties of the learned joint feature space. We train COSNet with pairs (or groups) of video frames, and this naturally augments training data and allows increased learning capacity. During the segmentation stage, the co-attention model encodes useful information by processing multiple reference frames together, which is leveraged to infer the frequently reappearing and salient foreground objects better. Our extensive experiments over three large benchmarks demonstrate that COSNet outperforms the current alternatives by a large margin. Our implementations are available at https://github.com/carrierlxk/COSNet.

Entities:  

Year:  2022        PMID: 33232224     DOI: 10.1109/TPAMI.2020.3040258

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  4 in total

1.  An information-rich sampling technique over spatio-temporal CNN for classification of human actions in videos.

Authors:  S H Shabbeer Basha; Viswanath Pulabaigari; Snehasis Mukherjee
Journal:  Multimed Tools Appl       Date:  2022-05-09       Impact factor: 2.577

2.  Deep learning techniques for observing the impact of the global warming from satellite images of water-bodies.

Authors:  Rajdeep Chatterjee; Ankita Chatterjee; Sk Hafizul Islam
Journal:  Multimed Tools Appl       Date:  2022-01-06       Impact factor: 2.577

3.  Dermoscopic image segmentation based on Pyramid Residual Attention Module.

Authors:  Yun Jiang; Tongtong Cheng; Jinkun Dong; Jing Liang; Yuan Zhang; Xin Lin; Huixia Yao
Journal:  PLoS One       Date:  2022-09-16       Impact factor: 3.752

4.  Deep-learning approach for automated thickness measurement of epithelial tissue and scab using optical coherence tomography.

Authors:  Yubo Ji; Shufan Yang; Kanheng Zhou; Holly R Rocliffe; Antonella Pellicoro; Jenna L Cash; Ruikang Wang; Chunhui Li; Zhihong Huang
Journal:  J Biomed Opt       Date:  2022-01       Impact factor: 3.758

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.