| Literature DB >> 32282110 |
Yohei Ikenoyama1,2, Toshiaki Hirasawa1,3, Mitsuaki Ishioka1, Ken Namikawa1, Shoichi Yoshimizu1, Yusuke Horiuchi1, Akiyoshi Ishiyama1, Toshiyuki Yoshio1,3, Tomohiro Tsuchida1, Yoshinori Takeuchi4, Satoki Shichijo5, Naoyuki Katayama2, Junko Fujisaki1, Tomohiro Tada6,3.
Abstract
OBJECTIVES: Detecting early gastric cancer is difficult, and it may even be overlooked by experienced endoscopists. Recently, artificial intelligence based on deep learning through convolutional neural networks (CNNs) has enabled significant advancements in the field of gastroenterology. However, it remains unclear whether a CNN can outperform endoscopists. In this study, we evaluated whether the performance of a CNN in detecting early gastric cancer is better than that of endoscopists.Entities:
Keywords: artificial intelligence; convolutional neural network; deep learning; endoscopy; gastric cancer
Mesh:
Year: 2020 PMID: 32282110 PMCID: PMC7818187 DOI: 10.1111/den.13688
Source DB: PubMed Journal: Dig Endosc ISSN: 0915-5635 Impact factor: 6.337
Figure 1Patient recruitment flowchart.
Figure 2Representative gastric cancer and non‐cancer endoscopic images. (a) A slightly reddish and depressed lesion of gastric cancer appears on the lesser curvature of the antrum. [0–IIc, 10 mm, tub1, T1a(M)]. (b) This image shows the Helicobacter pylori uninfected gastric mucosa. There is no cancer.
Figure 3Definition of correct answer. (a) A reddish, depressed lesion of gastric cancer appears on the greater curvature of the lower body. [0–IIc, 9 mm, tub1, T1a(M)]. (b) The correct marking is the red rectangle. The green rectangle is the convolutional neural network (CNN) marking, and the blue rectangle is the endoscopists’ marking. In this case, when the correct marking and the marking of the CNN or endoscopists overlap by 40% or more, they were judged to be correct.
Patient and lesion characteristics of gastric cancer in test image sets
|
| |
|---|---|
| Patient characteristics ( | |
| Sex, | 49/21 |
| Age, median, (range), years | 68 (46–89) |
| Lesion characteristics ( | |
| Number of images | 209 |
| Tumor location (upper/middle/lower) | 12/24/39 |
| Tumor size, median (range), mm | 10 (1.5–20) |
| Depth of tumor (T1a/T1b) | 66/9 |
| Macroscopic type (0‐I/0‐IIa/0‐IIb/0‐IIc/0‐III) | 2/8/0/65/0 |
| Pathology (differentiated/undifferentiated) | 64/11 |
|
| 19/54/2 |
H. pylori, Helicobacter pylori; T1a, mucosa; T1b, submucosa.
Diagnostic performances of CNN and endoscopists for each image
| CNN | Endoscopists | |||
|---|---|---|---|---|
| Certified ( | Non‐certified ( | All ( | ||
|
Diagnostic time (SD) (total) | 45.5 (1.8) s | 172.9 (68.4) min | 173.0 (63.6) min | 173.0 (66.0) min |
| Diagnostic time (per image) | 0.0154 s | 3.53 s | 3.53 s | 3.53 s |
| Sensitivity, % (95% CI) | 58.4 (51.7–65.1) | 37.2 (33.5–40.8) | 26.9 (23.6–30.1) | 31.9 (28.6–35.3) |
| PPV, % (95% CI) | 26.0 (22.0–30.0) | 48.2 (43.4–53.1) | 43.8 (38.6–49.0) | 46.2 (41.3–51.1) |
| Specificity, % (95% CI) | 87.3 (86.0–88.5) | 97.0 (96.7–97.2) | 97.4 (97.1–97.6) | 97.2 (96.9–97.4) |
| NPV, % (95% CI) | 96.5 (95.8–97.2) | 95.3 (94.6–96.0) | 94.6 (93.8–95.3) | 94.9 (94.2–95.6) |
| AUC | 0.757 | — | — | — |
—, not applicable;AUC, area under the curve; CI, confidence interval; CNN, convolutional neural network; NPV, negative predictive value; PPV, positive predictive value; SD, standard deviation.
Figure 4This graph shows the receiver operating characteristic curves for the convolutional neural network (CNN) and predictions of the endoscopists. Each endoscopist's prediction is represented by a single point. The CNN outputs a gastric cancer probability score per image, and the program then calculates a mean square of the probabilities per image. The area under the curve is 75.7%. At a cut‐off value of 0.412, the sensitivity and specificity of the CNN were 58.4% and 87.3, respectively.
Sensitivity of the CNN and endoscopists for each lesion by lesion characteristics
| Characteristics ( | CNN sensitivity, % (95% CI) | Endoscopists sensitivity, % (95% CI) | |
|---|---|---|---|
| Size | <10 mm (36) | 91.7 (82.6–100) | 62.9 (55.8–70.0) |
| ≦10 mm (39) | 69.2 (54.8–83.7) | 44.4 (35.3–53.6) | |
| Depth | T1a (66) | 77.3 (67.2–87.4) | 50.6 (44.2–57.1) |
| T1b (9) | 100 | 72.8 (69.3–76.4) | |
| Macroscopic type | 0‐I (2) | 50 (0–100) | 74.6 (43.6–100) |
| 0‐IIa (8) | 88.9 (68.4–100) | 55.7 (36.1–75.4) | |
| 0‐IIc (65) | 79.7 (69.8–89.5) | 52.3 (45.7–58.9) | |
| Location | Upper (12) | 75.0 (50.5–99.5) | 66.5 (52.5–80.6) |
| Middle (24) | 75.0 (57.7–92.3) | 45.8 (33.8–57.8) | |
| Lower (39) | 84.6 (73.3–99.5) | 53.9 (46.1–61.6) | |
| Histology | Differentiated type (64) | 79.7 (69.8–89.5) | 52.9 (46.2–59.7) |
| Undifferentiated type (11) | 81.8 (59.0–100) | 55.5 (39.9–71.1) | |
|
| Current infection (19) | 79.0 (60.6–97.3) | 56.9 (44.7–69.1) |
| Past infection (54) | 79.6 (68.9–90.4) | 51.8 (44.5–59.1) | |
| No infection (2) | 100 | 59.7 (24.5–94.9) |
CI, confidence interval; CNN, convolutional neural network; T1a, mucosa; T1b, submucosa; H. pylori, Helicobacter pylori.
Details of false‐positive images in the CNN and endoscopists diagnosis
| Cause for false positives | CNN, |
Endoscopists, ( |
|---|---|---|
| Total number | 347 | 5203 |
| Gastritis (redness, atrophy, intestinal metaplasia) | 190 (54.8) | 3823 (73.5) |
| Normal anatomical structure (cardia, pylorus, angulus) | 79 (22.8) | 0 (0.0) |
| Fold | 20 (5.8) | 23 (0.4) |
| Mucus | 13 (3.7) | 243 (4.7) |
| Halation | 13 (3.7) | 21 (0.4) |
| Scar | 12 (3.5) | 252 (4.8) |
| Foam | 5 (1.4) | 8 (0.2) |
| Blood | 4 (1.2) | 6 (0.1) |
| Vessel | 2 (0.6) | 138 (2.7) |
| Extrinsic compression | 2 (0.6) | 14 (0.3) |
| Xanthoma | 2 (0.6) | 103 (2.0) |
| Hyperplastic polyp | 2 (0.6) | 342 (6.6) |
| Submucosal tumor | 1 (0.3) | 178 (3.4) |
| Ulcer | 1 (0.3) | 10 (0.2) |
| Suction mark | 1 (0.3) | 42 (0.8) |
CNN, convolutional neural network.
Details of false negative images in the CNN and endoscopists diagnosis
| Cause for false negatives | CNN, |
Endoscopists, ( |
|---|---|---|
| Total number | 87 | 7885 |
| Small (≦10 mm) | 50 (57.5) | 1284 (16.3) |
| Tangential line | 14 (16.1) | 1089 (13.8) |
| Distant | 9 (10.3) | 1474 (18.7) |
| Inflammation‐like | 8 (9.2) | 3910 (49.6) |
| Blood | 2 (2.3) | 64 (0.8) |
| Halation | 2 (2.3) | 64 (0.8) |
| Scar‐like | 2 (2.3) | 0 (0.0) |
CNN, convolutional neural network.
Figure 5Representative images of false positives. The green rectangular frames show areas that the convolutional neural network misdiagnosed as gastric cancer. (a) Spotty redness associated with Helicobacter pylori (H. pylori) infection (gastritis). (b) Cardia (normal anatomical structure). (c) White scar (S2 stage) at the lesser curvature of the upper body (ulcer scar).
Figure 6Representative images of false negatives. The following cancers were misdiagnosed and the assumed causes for this misdiagnosis were as follows. (a) 0– IIc, 4 mm, tub1, T1a (too small lesion). (b) Images from tangential line (tangential line). (c) Lesion at the angle captured about 7 cm away (too distant lesion).