
RPCA-KFE: Key Frame Extraction for Video Using Robust Principal Component Analysis.

Chinh Dang, Hayder Radha.   

Abstract

Key frame extraction algorithms address the problem of selecting a subset of the most informative frames from a video to summarize its content. Several applications, such as video summarization, search, indexing, and prints from video, can benefit from key frames extracted from the video under consideration. Most approaches in this class of algorithms work directly with the input video data set, without considering its underlying low-rank structure. Other algorithms exploit only the low-rank component, ignoring the other key information in the video. In this paper, a novel key frame extraction framework based on robust principal component analysis (RPCA) is proposed. Furthermore, we target the challenging application of extracting key frames from unstructured consumer videos. The proposed framework is motivated by the observation that RPCA decomposes input data into: 1) a low-rank component that reveals the systematic information across the elements of the data set and 2) a set of sparse components, each of which contains distinct information about one element of the same data set. The two information types are combined into a single ℓ1-norm-based non-convex optimization problem to extract the desired number of key frames. Moreover, we develop a novel iterative algorithm to solve this optimization problem. The proposed RPCA-based framework does not require shot detection, segmentation, or semantic understanding of the underlying video. Finally, experiments are performed on a variety of consumer and other types of videos. A comparison of the results obtained by our method with the ground truth and with related state-of-the-art algorithms illustrates the viability of the proposed RPCA-based framework.
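The low-rank plus sparse decomposition the abstract refers to can be illustrated with a minimal sketch. The code below is not the authors' algorithm: it is a generic principal component pursuit solver (ADMM with a fixed penalty) that splits a data matrix D into a low-rank part L and a sparse part S. The function names, the default weight lam = 1/sqrt(max(m, n)), the penalty mu, and the convention of storing one vectorized frame per column are assumptions made for illustration. The key frame selection step described in the abstract is not reproduced here.

```python
import numpy as np


def soft_threshold(X, tau):
    """Entrywise shrinkage: proximal operator of tau * l1-norm."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)


def svd_threshold(X, tau):
    """Singular value shrinkage: proximal operator of tau * nuclear norm."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    s = np.maximum(s - tau, 0.0)
    return (U * s) @ Vt


def rpca_pcp(D, lam=None, mu=None, tol=1e-7, max_iter=500):
    """Generic RPCA sketch: minimize ||L||_* + lam * ||S||_1 s.t. L + S = D.

    D is assumed to hold one vectorized frame per column (an illustrative
    convention, not taken from the paper).
    """
    D = np.asarray(D, dtype=float)
    m, n = D.shape
    if lam is None:
        lam = 1.0 / np.sqrt(max(m, n))          # common default weight
    if mu is None:
        mu = m * n / (4.0 * np.abs(D).sum())    # common heuristic penalty

    L = np.zeros_like(D)
    S = np.zeros_like(D)
    Y = np.zeros_like(D)                        # Lagrange multipliers
    norm_D = np.linalg.norm(D)

    for _ in range(max_iter):
        L = svd_threshold(D - S + Y / mu, 1.0 / mu)   # update low-rank part
        S = soft_threshold(D - L + Y / mu, lam / mu)  # update sparse part
        R = D - L - S                                 # constraint residual
        Y = Y + mu * R                                # dual ascent step
        if np.linalg.norm(R) <= tol * norm_D:
            break
    return L, S
```

Calling `L, S = rpca_pcp(D)` on a matrix of stacked frames returns the two components. In this reading, the columns of L carry what the frames share and the columns of S carry what is distinct to each frame.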

Year:  2015        PMID: 26087486     DOI: 10.1109/TIP.2015.2445572

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  2 in total

1.  Cross-Modal Reconstruction for Tactile Signal in Human-Robot Interaction.

Authors:  Mingkai Chen; Yu Xie
Journal:  Sensors (Basel)       Date:  2022-08-29       Impact factor: 3.847

2.  Deep Learning Intervention for Health Care Challenges: Some Biomedical Domain Considerations.

Authors:  Igbe Tobore; Jingzhen Li; Liu Yuhang; Yousef Al-Handarish; Abhishek Kandwal; Zedong Nie; Lei Wang
Journal:  JMIR Mhealth Uhealth       Date:  2019-08-02       Impact factor: 4.773

