Literature DB >> 33946745

A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval.

Hanqing Chen1, Chunyan Hu2, Feifei Lee1, Chaowei Lin1, Wei Yao1, Lu Chen1, Qiu Chen3.   

Abstract

Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, content-based video retrieval (CBVR) has aroused the interest of many researchers. A typical CBVR system mainly contains the following two essential parts: video feature extraction and similarity comparison. Feature extraction of video is very challenging, previous video retrieval methods are mostly based on extracting features from single video frames, while resulting the loss of temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval due to its retrieval efficiency, but most of them are currently only applied to image retrieval. In order to solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatial-temporal features of videos, then train a set of hash functions by supervised hashing to transfer the video features into binary space and get the compact binary codes of videos. Finally, we use triplet loss for network training. We conduct a lot of experiments on three public video datasets UCF-101, JHMDB and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP value of UCF-101 dataset is improved by 9.3%, and the minimum improvement on JHMDB dataset is also increased by 0.3%. At the same time, we also demonstrate the stability of the algorithm in the HMDB-51 dataset.

Entities:  

Keywords:  3D CNN; supervised hashing; triplet loss; video retrieval

Year:  2021        PMID: 33946745     DOI: 10.3390/s21093094

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  4 in total

1.  Iterative quantization: a Procrustean approach to learning binary codes for large-scale image retrieval.

Authors:  Yunchao Gong; Svetlana Lazebnik; Albert Gordo; Florent Perronnin
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2013-12       Impact factor: 6.226

2.  Enhancing Sketch-Based Image Retrieval by CNN Semantic Re-ranking.

Authors:  Luo Wang; Xueming Qian; Yuting Zhang; Jialie Shen; Xiaochun Cao
Journal:  IEEE Trans Cybern       Date:  2019-03-15       Impact factor: 11.448

3.  Unsupervised Topic Hypergraph Hashing for Efficient Mobile Image Retrieval.

Authors:  Lei Zhu; Jialie Shen; Liang Xie; Zhiyong Cheng
Journal:  IEEE Trans Cybern       Date:  2016-10-21       Impact factor: 11.448

4.  Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.

Authors:  Jingkuan Song; Hanwang Zhang; Xiangpeng Li; Lianli Gao; Meng Wang; Richang Hong
Journal:  IEEE Trans Image Process       Date:  2018-07       Impact factor: 10.856

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.