Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.

Jingkuan Song, Hanwang Zhang, Xiangpeng Li, Lianli Gao, Meng Wang, Richang Hong.   

Abstract

Existing video hash functions are built on three isolated stages: frame pooling, relaxed learning, and binarization. These stages do not adequately exploit the temporal order of video frames in a joint binary optimization model, which results in severe information loss. In this paper, we propose a novel unsupervised video hashing framework dubbed self-supervised video hashing (SSVH), which captures the temporal nature of videos in an end-to-end learning-to-hash fashion. We specifically address two central problems: 1) how to design an encoder-decoder architecture to generate binary codes for videos and 2) how to equip the binary codes with the ability to retrieve videos accurately. We design a hierarchical binary auto-encoder to model the temporal dependencies in videos at multiple granularities, and embed the videos into binary codes with fewer computations than the stacked architecture. Then, we encourage the binary codes to simultaneously reconstruct the visual content and the neighborhood structure of the videos. Experiments on two real-world data sets show that our SSVH method significantly outperforms the state-of-the-art methods and achieves the current best performance on the task of unsupervised video retrieval.
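
The abstract does not spell out how binary codes are used at retrieval time, so the following is only a minimal sketch of the general idea behind hashing-based retrieval, not the SSVH model itself. It assumes each video has already been mapped to a real-valued embedding (the toy vectors below are made up), binarizes that embedding by sign, and ranks database videos by Hamming distance to the query code. The function names `binarize`, `hamming`, and `rank_by_hamming` are hypothetical and do not come from the paper.

```python
from typing import List, Sequence


def binarize(features: Sequence[float]) -> int:
    """Pack the sign bits of a real-valued vector into an integer code.

    Assumption for illustration: a bit is 1 where the value is >= 0.
    """
    code = 0
    for value in features:
        code = (code << 1) | (1 if value >= 0 else 0)
    return code


def hamming(a: int, b: int) -> int:
    """Count the bit positions where two codes differ."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query: int, database: Sequence[int]) -> List[int]:
    """Return database indices ordered from nearest to farthest code."""
    return sorted(range(len(database)), key=lambda i: hamming(query, database[i]))


# Toy, made-up 4-dimensional embeddings standing in for encoder outputs.
video_features = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.3, -0.1, 0.8],
    [0.7, -0.1, -0.3, -0.6],
]
codes = [binarize(f) for f in video_features]
query_code = binarize([0.8, -0.4, 0.5, -0.9])
ranking = rank_by_hamming(query_code, codes)
print(ranking)  # indices of database videos, nearest first
```

Comparing codes with XOR and a bit count is what makes binary codes attractive for large-scale retrieval: each comparison costs a handful of integer operations, regardless of how the codes were learned.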

Year:  2018        PMID: 29641401     DOI: 10.1109/TIP.2018.2814344

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  5 in total

1.  A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval.

Authors:  Hanqing Chen; Chunyan Hu; Feifei Lee; Chaowei Lin; Wei Yao; Lu Chen; Qiu Chen
Journal:  Sensors (Basel)       Date:  2021-04-29       Impact factor: 3.576

2.  Self-Supervised Learning to Detect Key Frames in Videos.

Authors:  Xiang Yan; Syed Zulqarnain Gilani; Mingtao Feng; Liang Zhang; Hanlin Qin; Ajmal Mian
Journal:  Sensors (Basel)       Date:  2020-12-04       Impact factor: 3.576

3.  Application of region-based video surveillance in smart cities using deep learning.

Authors:  Asma Zahra; Mubeen Ghafoor; Kamran Munir; Ata Ullah; Zain Ul Abideen
Journal:  Multimed Tools Appl       Date:  2021-12-27       Impact factor: 2.757

4.  DNA circuits compatible encoder and demultiplexer based on a single biomolecular platform with DNA strands as outputs.

Authors:  Tianci Xie; Yuhan Deng; Jiarui Zhang; Zhen Zhang; Zhe Hu; Tongbo Wu
Journal:  Nucleic Acids Res       Date:  2022-08-26       Impact factor: 19.160

5.  An Overview of Image Caption Generation Methods (Review).

Authors:  Haoran Wang; Yue Zhang; Xiaosheng Yu
Journal:  Comput Intell Neurosci       Date:  2020-01-09
