| Literature DB >> 27652068 |
Khan Muhammad1, Jamil Ahmad1, Muhammad Sajjad2, Sung Wook Baik1.
Abstract
In clinical practice, diagnostic hysteroscopy (DH) videos are recorded in full which are stored in long-term video libraries for later inspection of previous diagnosis, research and training, and as an evidence for patients' complaints. However, a limited number of frames are required for actual diagnosis, which can be extracted using video summarization (VS). Unfortunately, the general-purpose VS methods are not much effective for DH videos due to their significant level of similarity in terms of color and texture, unedited contents, and lack of shot boundaries. Therefore, in this paper, we investigate visual saliency models for effective abstraction of DH videos by extracting the diagnostically important frames. The objective of this study is to analyze the performance of various visual saliency models with consideration of domain knowledge and nominate the best saliency model for DH video summarization in healthcare systems. Our experimental results indicate that a hybrid saliency model, comprising of motion, contrast, texture, and curvature saliency, is the more suitable saliency model for summarization of DH videos in terms of extracted keyframes and accuracy.Entities:
Keywords: Diagnostic hysteroscopy; Image and video processing; Medical image analysis; Video summarization; Visual saliency models
Year: 2016 PMID: 27652068 PMCID: PMC5013008 DOI: 10.1186/s40064-016-3171-8
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
Fig. 1a Non-important frames, indicating irrelevant DH frames contaminated by lighting and biological effects, b, c important frames representing diagnostically important DH frames from relevant DH video segments
Fig. 2Mechanism of keyframes extraction from a sequence of diagnostic hysteroscopy frames
Performance evaluation of numerous saliency detection models for a sample hysteroscopy video
| Serial no. | Saliency detection model | Total keyframes | Extracted keyframes | Attention curve threshold | Accuracy (%) | Extracted frame numbers |
|---|---|---|---|---|---|---|
| 1 | MS | 10 | 3 | 0.024 | 30 | 1584, 2231, 2977 |
| 2 | MSCM | 10 | 6 | 0.015 | 60 | 1584, 2070, 2289, 2323, 2668, 2977 |
| 3 | TS | 10 | 5 | 0.872 | 50 | 2070, 2264, 2289, 2323, 2977 |
| 4 | CM | 10 | 4 | 0.226 | 40 | 2150, 2386, 2668, 2977 |
| 5 | MS + MSCM + TS | 10 | 6 | 0.571 | 60 | 1584, 2070, 2231, 2323, 2386, 2668 |
| 6 | HSDM | 10 | 7 | 0.847 |
| 1584, 2231, 2264, 2323, 2386, 2668, 2977 |
| 7 | SIM | 10 | 5 | 0.984 | 50 | 1584, 2070, 2231, 2289, 2977 |
The score in italic font represents the best accuracy among the given methods
Fig. 3a The most frequently selected keyframe, b and least recurring keyframe
Fig. 4F-measure based performance evaluation of numerous saliency detection models for summarization of DH videos
Comparison of general video summarization methods, general-purpose and domain-specific saliency detection based summarization schemes for keyframes extraction from a sample hysteroscopy video
| Serial. no | Method name | Category of video summarization | Total keyframes | Feature extraction model | Total number of frames | Accuracy (%) |
|---|---|---|---|---|---|---|
| 1 | Ejaz et al. ( | General-purpose VS | 10 | Low-level features | 3529 | 20 |
| 2 | Ejaz and Baik ( | General-purpose VS | 10 | High-level features | 3529 | 40 |
| 3 | SIM (Bruce and Tsotsos | General-purpose VS | 10 | General-purpose saliency detection | 3529 | 50 |
| 4 | Hybrid saliency detection model | Domain-specific VS | 10 | Domain-specific saliency detection | 3529 |
|
The score in italic font represents the best accuracy among the given methods
Fig. 5Performance evaluation of different summarization methods based on F-measure for DH videos
Fig. 6Execution time analysis for various saliency detection models