Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Property-Constrained Dual Learning for Video Summarization.

Literature DB >> 31825876

Property-Constrained Dual Learning for Video Summarization.

Abstract

Video summarization is the technique to condense large-scale videos into summaries composed of key-frames or key-shots so that the viewers can browse the video content efficiently. Recently, supervised approaches have achieved great success by taking advantages of recurrent neural networks (RNNs). Most of them focus on generating summaries by maximizing the overlap between the generated summary and the ground truth. However, they neglect the most critical principle, i.e., whether the viewer can infer the original video content from the summary. As a result, existing approaches cannot preserve the summary quality well and usually demand large amounts of training data to reduce overfitting. In our view, video summarization has two tasks, i.e., generating summaries from videos and inferring the original content from summaries. Motivated by this, we propose a dual learning framework by integrating the summary generation (primal task) and video reconstruction (dual task) together, which targets to reward the summary generator under the assistance of the video reconstructor. Moreover, to provide more guidance to the summary generator, two property models are developed to measure the representativeness and diversity of the generated summary. Practically, experiments on four popular data sets (SumMe, TVsum, OVP, and YouTube) have demonstrated that our approach, with compact RNNs as the summary generator, using less training data, and even in the unsupervised setting, can get comparable performance with those supervised ones adopting more complex summary generators and trained on more annotated data.

Entities: Chemical

Year: 2019 PMID： 31825876 DOI： 10.1109/TNNLS.2019.2951680

Source DB: PubMed Journal: IEEE Trans Neural Netw Learn Syst ISSN： 2162-237X Impact factor: 10.451

Keyword Cloud
Cited

2 in total

1. Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network.

Authors: Xin Xiong; Weidong Min; Qing Han; Qi Wang; Cheng Zha
Journal: Comput Intell Neurosci Date: 2022-06-13

2. A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency.

Authors: Xu Wang; Yujie Li; Haoyu Wang; Longzhao Huang; Shuxue Ding
Journal: Sensors (Basel) Date: 2022-10-10 Impact factor: 3.847

2 in total