Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Gaze-enabled Egocentric Video Summarization via Constrained Submodular Maximization.

Literature DB >> 26973428

Gaze-enabled Egocentric Video Summarization via Constrained Submodular Maximization.

Jia Xut¹, Lopamudra Mukherjee², Yin Li³, Jamieson Warner¹, James M Rehg³, Vikas Singht¹.

Abstract

With the proliferation of wearable cameras, the number of videos of users documenting their personal lives using such devices is rapidly increasing. Since such videos may span hours, there is an important need for mechanisms that represent the information content in a compact form (i.e., shorter videos which are more easily browsable/sharable). Motivated by these applications, this paper focuses on the problem of egocentric video summarization. Such videos are usually continuous with significant camera shake and other quality issues. Because of these reasons, there is growing consensus that direct application of standard video summarization tools to such data yields unsatisfactory performance. In this paper, we demonstrate that using gaze tracking information (such as fixation and saccade) significantly helps the summarization task. It allows meaningful comparison of different image frames and enables deriving personalized summaries (gaze provides a sense of the camera wearer's intent). We formulate a summarization model which captures common-sense properties of a good summary, and show that it can be solved as a submodular function maximization with partition matroid constraints, opening the door to a rich body of work from combinatorial optimization. We evaluate our approach on a new gaze-enabled egocentric video dataset (over 15 hours), which will be a valuable standalone resource.

Entities: Chemical Disease Gene Species

Year: 2015 PMID： 26973428 PMCID： PMC4784707 DOI： 10.1109/CVPR.2015.7298836

Source DB: PubMed Journal: Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit ISSN： 1063-6919

2 in total

1. Active visual segmentation.

Authors: Ajay K Mishra; Yiannis Aloimonos; Loong-Fah Cheong; Ashraf A Kassim
Journal: IEEE Trans Pattern Anal Mach Intell Date: 2012-04 Impact factor: 6.226

2. A hierarchical visual model for video object summarization.

Authors: David Liu; Gang Hua; Tsuhan Chen
Journal: IEEE Trans Pattern Anal Mach Intell Date: 2010-12 Impact factor: 6.226

2 in total

1 in total

1. Human Eye Movements Reveal Video Frame Importance.

Authors: Zheng Ma; Jiaxin Wu; Sheng-Hua Zhong; Jianmin Jiang; Stephen J Heinen
Journal: Computer (Long Beach Calif) Date: 2019-05-14 Impact factor: 2.683

1 in total