Literature DB >> 26363682

Scalable gastroscopic video summarization via similar-inhibition dictionary selection.

Shuai Wang1, Yang Cong2, Jun Cao3, Yunsheng Yang4, Yandong Tang2, Huaici Zhao5, Haibin Yu6.   

Abstract

OBJECTIVE: This paper aims at developing an automated gastroscopic video summarization algorithm to assist clinicians to more effectively go through the abnormal contents of the video. METHODS AND MATERIALS: To select the most representative frames from the original video sequence, we formulate the problem of gastroscopic video summarization as a dictionary selection issue. Different from the traditional dictionary selection methods, which take into account only the number and reconstruction ability of selected key frames, our model introduces the similar-inhibition constraint to reinforce the diversity of selected key frames. We calculate the attention cost by merging both gaze and content change into a prior cue to help select the frames with more high-level semantic information. Moreover, we adopt an image quality evaluation process to eliminate the interference of the poor quality images and a segmentation process to reduce the computational complexity.
RESULTS: For experiments, we build a new gastroscopic video dataset captured from 30 volunteers with more than 400k images and compare our method with the state-of-the-arts using the content consistency, index consistency and content-index consistency with the ground truth. Compared with all competitors, our method obtains the best results in 23 of 30 videos evaluated based on content consistency, 24 of 30 videos evaluated based on index consistency and all videos evaluated based on content-index consistency.
CONCLUSIONS: For gastroscopic video summarization, we propose an automated annotation method via similar-inhibition dictionary selection. Our model can achieve better performance compared with other state-of-the-art models and supplies more suitable key frames for diagnosis. The developed algorithm can be automatically adapted to various real applications, such as the training of young clinicians, computer-aided diagnosis or medical report generation.
Copyright © 2015 Elsevier B.V. All rights reserved.

Keywords:  Gastroscopic video; Image attention prior; Key frame; Similar-inhibition dictionary selection; Video summarization

Mesh:

Year:  2015        PMID: 26363682     DOI: 10.1016/j.artmed.2015.08.006

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  1 in total

1.  Improving Temporal Stability and Accuracy for Endoscopic Video Tissue Classification Using Recurrent Neural Networks.

Authors:  Tim Boers; Joost van der Putten; Maarten Struyvenberg; Kiki Fockens; Jelmer Jukema; Erik Schoon; Fons van der Sommen; Jacques Bergman; Peter de With
Journal:  Sensors (Basel)       Date:  2020-07-24       Impact factor: 3.576

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.