| Literature DB >> 23197930 |
Abstract
Wireless capsule endoscopy (WCE) enables a physician to diagnose a patient's digestive system without surgical procedures. However, it takes 1-2 hours for a gastroenterologist to examine the video. To speed up the review process, a number of analysis techniques based on machine vision have been proposed by computer science researchers. In order to train a machine to understand the semantics of an image, the image contents need to be translated into numerical form first. The numerical form of the image is known as image abstraction. The process of selecting relevant image features is often determined by the modality of medical images and the nature of the diagnoses. For example, there are radiographic projection-based images (e.g., X-rays and PET scans), tomography-based images (e.g., MRT and CT scans), and photography-based images (e.g., endoscopy, dermatology, and microscopic histology). Each modality imposes unique image-dependent restrictions for automatic and medically meaningful image abstraction processes. In this paper, we review the current development of machine-vision-based analysis of WCE video, focusing on the research that identifies specific gastrointestinal (GI) pathology and methods of shot boundary detection.Entities:
Year: 2012 PMID: 23197930 PMCID: PMC3502844 DOI: 10.1155/2012/418037
Source DB: PubMed Journal: Diagn Ther Endosc ISSN: 1026-714X
Technical specifications of commercially available small intestine capsules.
| Company | Given imaging Inc. | Olympus | IntroMedic | Jinshan |
|---|---|---|---|---|
| Capsule | PillCam SB/SB2 | EndoCapsule | MiroCam | OMOM |
| Size (diameter × length) | 11 mm × 26 mm | 11 mm × 26 mm | 11 mm × 24 mm | 13 mm × 27.9 mm |
| Image Sensor | CMOS | CCD | CMOS | CCD |
| Resolution | 256 × 256 | NA | 320×320 | 640 × 480 |
| Field of View | 140°/156° | 145° | 150˚ | 140 ± 10° |
| Image Capture Rate | 2 fps | 2 fps | 3 fps | 0.5–2 fps |
| Illumination | 6 LEDs | 6 LEDs | 6 LEDs | 6 LEDs |
| Battery Life | 8 hr | 8+ hr | 11+ hr | 8 ± 1 hr |
| Communication | RF | RF | HBC | RF |
| Approval | FDA 2001/2011 | FDA 2007 | FDA 2012 | PRC FDA 2004 |
CMOS: complementary metal oxide semiconductor, CCD: charge-coupled device, LED: light-emitting diode, RF: radio frequency, HBC: human body communications, NA: not available.
Figure 1Typical images captured by WCE at different organs.
Figure 2Mucosa representations based on a joint histogram of LBP operator (LBP ) and local gray level variance (VAR).
Figure 3Image decomposition using DWT where (a) represents a one-level wavelet decomposition and (b) represents a two-level wavelet decomposition.
Figure 4General steps involving in computer-aided diagnosis (CAD) system where gray boxes may be optional.
Figure 5A shot detection scheme based on event boundary detection approach as described in [3].
Figure 6A bleeding detection scheme as described in [3].
Figure 7Ulcer detection scheme described in [4].
Summary of abnormality types, features and claimed experimental results.
| Paper | Pathology | Feature | Experiment results | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Bleeding | Polyp | Tumor | Ulcer | Color | Texture | Shape | Image count | Sensitivity | Specificity | |
| Chen et al. [ |
|
| 1,857,657 | 71.54% | N/A | |||||
| Lau and Correia [ |
|
| 1,705 | 88.30% | N/A | |||||
| Liu and Yuan [ |
|
| 800 | 99.73% | 98.89% | |||||
| Giritharan et al. [ |
|
| 275,000 | 83.1% | 93.6% | |||||
| Penna et al. [ |
|
| 1,111 | 92% | 88% | |||||
| Karargyris and Bourbakis [ |
|
| N/A | N/A | N/A | |||||
| Li and Meng [ |
|
|
| 200 | 92.6% | 91% | ||||
| Li and Meng [ |
|
| 100 | 93.28% | 91.46% | |||||
| Karargyris and Bourbakis [ |
|
| 50 | 100% | 67.5% | |||||
| Iakovidis et al. [ |
|
| 4,000 | 87.5% | N/A | |||||
| Karargyris and Bourbakis [ |
|
| 50 | 75% | 73.3% | |||||
| Li et al. [ |
|
| 300 | 89.8% | 82.5% | |||||
| Barbosa et al. [ |
|
| 192 | 98.7% | 96.6% | |||||
| Barbosa et al. [ |
|
| 600 | 97.4% | 97.5% | |||||
| Karkanis et al. [ |
|
| N/A | N/A | N/A | |||||
| Li and Meng [ |
|
| 300 | 97.33% | 96% | |||||
Sensitivity = Number of True Postives/(Number of True Postives + Number of False Negatives). Sensitivity of 100% means no positives are incorrectly marked as negative. In other words, the test recognizes all positives.
Specificity = Number of True Negatives/(Number of True Negatives + Number of False Postives). Specificity of 100% means no negatives are incorrectly marked as positive. In other words, the test recognizes all negatives.
N/A: data is not available.