| Literature DB >> 29760397 |
Hirotoshi Takiyama1,2, Tsuyoshi Ozawa3,4, Soichiro Ishihara1,5, Mitsuhiro Fujishiro6, Satoki Shichijo7, Shuhei Nomura8,9, Motoi Miura1,10, Tomohiro Tada1,2.
Abstract
The use of convolutional neural networks (CNNs) has dramatically advanced our ability to recognize images with machine learning methods. We aimed to construct a CNN that could recognize the anatomical location of esophagogastroduodenoscopy (EGD) images in an appropriate manner. A CNN-based diagnostic program was constructed based on GoogLeNet architecture, and was trained with 27,335 EGD images that were categorized into four major anatomical locations (larynx, esophagus, stomach and duodenum) and three subsequent sub-classifications for stomach images (upper, middle, and lower regions). The performance of the CNN was evaluated in an independent validation set of 17,081 EGD images by drawing receiver operating characteristics (ROC) curves and calculating the area under the curves (AUCs). ROC curves showed high performance of the trained CNN to classify the anatomical location of EGD images with AUCs of 1.00 for larynx and esophagus images, and 0.99 for stomach and duodenum images. Furthermore, the trained CNN could recognize specific anatomical locations within the stomach, with AUCs of 0.99 for the upper, middle, and lower stomach. In conclusion, the trained CNN showed robust performance in its ability to recognize the anatomical location of EGD images, highlighting its significant potential for future application as a computer-aided EGD diagnostic system.Entities:
Mesh:
Year: 2018 PMID: 29760397 PMCID: PMC5951793 DOI: 10.1038/s41598-018-25842-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The distribution of the probability score and the accuracy of the CNN system.
| Probability Score | Correct number (%) | Whole number (%) | Accuracy (%) |
|---|---|---|---|
| >99% | 15,168 (91) | 15,265 (89) | 99.4 |
| 99–90% | 980 (6) | 1,101 (6) | 89.0 |
| 90–70% | 336 (2) | 437 (3) | 76.9 |
| 70–50% | 143 (1) | 264 (2) | 54.2 |
| <50% | 5 (0) | 14 (0) | 35.7 |
| Total | 16,632 (100) | 17,081 (100) | 97.4 |
Figure 1(a) The convolutional neural network (CNN) recognized the anatomical location of gastro-intestinal endoscopy images very accurately with area under the curve (AUC) values of 1.00 [95% confidence interval (CI): 1.00-1.00, p < 0.0001] for the larynx, 1.00 (95%CI: 1.00–1.00, p < 0.0001) for the esophagus, 0.99 (95%CI: 0.99–1.00, p < 0.0001) for the stomach, and 0.99 (95%CI: 0.99–0.99, p < 0.0001). (b) The convolutional neural network (CNN) recognized the anatomical location of the stomach images accurately with AUC values of 0.99 (95%CI: 0.99–0.99, p < 0.0001) for the upper stomach, 0.99 (95%CI: 0.99–0.99, p < 0.0001) for the middle stomach, and 0.99 (95%CI: 0.99-0.99, p < 0.0001) for the lower stomach.
Anatomical classification of the images of gastrointestinal scope by CNN.
| Output | Larynx (n = 363) (%) | Esophagus (n = 2,142) (%) | Stomach (n = 13,048) (%) | Duodenum (n = 1,528) (%) |
|---|---|---|---|---|
| Larynx | 341 (94) | 3 (0) | 1 (0) | 0 (0) |
| Esophagus | 5 (1) | 2,053 (96) | 28 (0) | 9 (1) |
| Stomach | 17 (5) | 75 (4) | 12,908 (99) | 189 (12) |
| Duodenum | 0 (0) | 11 (1) | 111 (1) | 1,330 (87) |
| Sensitivity (%) | 93.9 | 95.8 | 98.9 | 87.0 |
| Specificity (%) | 100 | 99.7 | 93.0 | 99.2 |
Anatomical classification of stomach images by CNN.
| Output | Upper (n = 3,532) (%) | Middle (n = 6,379) (%) | Lower (n = 3,137) (%) |
|---|---|---|---|
| Upper | 3,423 (97) | 148 (2) | 15 (0) |
| Middle | 60 (2) | 6,119 (96) | 75 (2) |
| Lower | 8 (0) | 32 (1) | 3,012 (96) |
| The others | 41 (1) | 80 (1) | 35 (1) |
| Sensitivity (%) | 96.9 | 95.9 | 96.0 |
| Specificity (%) | 98.5 | 98.0 | 98.8 |
Figure 2Examples of images which were correctly and incorrectly classified by the convolutional neural network (CNN). PS represents the probability score. (a) Duodenum image, which was incorrectly classified by the CNN as the lower stomach (left) and a correctly classified image from the duodenum (right). (b) Esophagus image, which was incorrectly classified by the CNN as the lower stomach (left) and a correctly classified image from the lower stomach (right). (c) Duodenum image, which was incorrectly classified by the CNN as the middle stomach (left) and a correctly classified image from the upper stomach (right). (d) Larynx image, which was incorrectly classified by the CNN as the esophagus (left) and a correctly classified image from the esophagus (right).
Anatomical classification of the training image set and validation image set.
| Main category | Sub category | Training set (n) (%) | Validation set (n) (%) |
|---|---|---|---|
| Larynx | 663 (2) | 363 (2) | |
| Esophagus | Upper, middle part | 1,460 (5) | 2,142 (13) |
| Lower part | 1,792 (7) | ||
| Upper stomach | Cardia | 1,830 (7) | 3,532 (21) |
| Upper body | 3,649 (13) | ||
| Middle stomach | Angle | 2,247 (8) | 6,379 (37) |
| Middle body | 4,937 (18) | ||
| Lower stomach | Antrum | 2,517 (9) | 3,137 (18) |
| Pylorus | 3,012 (11) | ||
| Lower body | 2,010 (7) | ||
| Duodenum | Bulbs | 1,578 (6) | 1,528 (9) |
| Second portion | 1,640 (6) | ||
| Total | 27,335 (100) | 17,081 (100) |
Figure 3The main anatomical categories and sub-anatomical categories of the stomach according to Japanese guidelines.
Figure 4A flow chart depicting our study design. The lower figure shows a representative image of the output probability score using a convolutional neural network (CNN) for gastro-intestinal images (in this case, an image from the lower stomach).