| Literature DB >> 32549190 |
Bum-Joo Cho1,2,3,4, Chang Seok Bang4,5,6,7, Jae Jun Lee4,8, Chang Won Seo1, Ju Han Kim3.
Abstract
Endoscopic resection is recommended for gastric neoplasms confined to mucosa or superficial submucosa. The determination of invasion depth is based on gross morphology assessed in endoscopic images, or on endoscopic ultrasound. These methods have limited accuracy and pose an inter-observer variability. Several studies developed deep-learning (DL) algorithms classifying invasion depth of gastric cancers. Nevertheless, these algorithms are intended to be used after definite diagnosis of gastric cancers, which is not always feasible in various gastric neoplasms. This study aimed to establish a DL algorithm for accurately predicting submucosal invasion in endoscopic images of gastric neoplasms. Pre-trained convolutional neural network models were fine-tuned with 2899 white-light endoscopic images. The prediction models were subsequently validated with an external dataset of 206 images. In the internal test, the mean area under the curve discriminating submucosal invasion was 0.887 (95% confidence interval: 0.849-0.924) by DenseNet-161 network. In the external test, the mean area under the curve reached 0.887 (0.863-0.910). Clinical simulation showed that 6.7% of patients who underwent gastrectomy in the external test were accurately qualified by the established algorithm for potential endoscopic resection, avoiding unnecessary operation. The established DL algorithm proves useful for the prediction of submucosal invasion in endoscopic images of gastric neoplasms.Entities:
Keywords: artificial intelligence; convolutional neural networks; endoscopy; gastric neoplasms
Year: 2020 PMID: 32549190 PMCID: PMC7356204 DOI: 10.3390/jcm9061858
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Composition of datasets used in the development and testing of the deep-learning algorithm.
| Whole Dataset | Training Set | Internal Test Set | External Test Set | |||||
|---|---|---|---|---|---|---|---|---|
| Number of Images | Number of Patients | Number of Images | Number of Patients | Number of Images | Number of Patients | Number of Images | Number of Patients | |
| Overall | 2899 | 846 | 2590 | 762 | 309 | 85 | 206 | 197 |
| Mucosa-confined lesions | 1900 | 580 | 1693 | 522 | 207 | 58 | 126 | 119 |
| Low-grade dysplasia | 727 | 233 | 630 | 205 | 97 | 28 | 68 | 66 |
| High-grade dysplasia | 421 | 131 | 390 | 123 | 31 | 8 | 21 | 21 |
| EGC | 752 | 230 | 673 | 205 | 79 | 25 | 37 | 37 |
| Submucosa-invaded lesion | 999 | 270 | 897 | 243 | 102 | 27 | 80 | 78 |
| EGC | 282 | 81 | 242 | 71 | 40 | 10 | 23 | 23 |
| AGC | 717 | 189 | 655 | 172 | 62 | 17 | 57 | 55 |
EGC, early gastric cancer; AGC, advanced gastric cancer.
Diagnostic performance of the established algorithm classifying submucosal invasion in the internal test dataset.
| Model | AUC | Accuracy (%) | Sensitivity (%) | Specificity (%) | Positive Predictive Value (%) | Negative Predictive Value (%) |
|---|---|---|---|---|---|---|
| Whole dataset | ||||||
| Inception-Resnet-v2 | 0.786 (0.779–0.793) | 77.4 (76.7–78.0) | 72.5 (71.5–73.6) | 72.9 (71.3–74.6) | 56.9 (55.2–58.7) | 84.4 (83.6–85.1) |
| DenseNet−161 | 0.887 (0.849–0.924) | 84.1 (81.6–86.7) | 78.8 (75.4–82.2) | 80.0 (76.8–83.2) | 66.1 (61.6–70.6) | 88.4 (86.4–90.4) |
| EGC ( | ||||||
| Inception-Resnet-v2 | 0.612 (0.599–0.626) | 66.1 (65.0–67.2) | 56.7 (55.0–58.3) | 57.0 (54.1–59.9) | 40.0 (38.3–41.8) | 72.2 (70.9–73.4) |
| DenseNet−161 | 0.694 (0.607–0.781) | 71.4 (67.1–75.8) | 60.8 (54.9–66.7) | 61.6 (54.2–69.0) | 44.7 (37.7–51.7) | 75.5 (70.4–80.6) |
AUC, area under the curve; EGC, early gastric cancer.
Figure 1Receiver operating characteristic curve of the best performance model. (a): internal test, (b): external test.
Diagnostic performance of the established algorithm for the prediction of submucosal invasion in the external test dataset.
| Model | AUC | Accuracy (%) | Sensitivity (%) | Specificity (%) | Positive Predictive Value (%) | Negative Predictive Value (%) |
|---|---|---|---|---|---|---|
| Whole dataset | ||||||
| Inception-Resnet-v2 | 0.769 (0.755–0.783) | 74.1 (71.0–77.2) | 72.5 (72.5–72.5) | 74.3 (73.0–75.7) | 64.2 (62.9–65.5) | 81.0 (80.7–81.3) |
| DenseNet−161 | 0.887 (0.863–0.910) | 77.3 (75.4–79.3) | 80.4 (79.6–81.3) | 80.7 (78.5–83.0) | 72.6 (70.1–75.1) | 86.6 (85.9–87.4) |
| EGC ( | ||||||
| Inception-Resnet-v2 | 0.609 (0.572–0.647) | 65.0 (61.7–68.3) | 58.0 (55.1–60.8) | 62.2 (56.9–67.5) | 52.2 (40.8–63.6) | 70.4 (67.3–73.4) |
| DenseNet−161 | 0.747 (0.712–0.782) | 67.2 (64.4–70.1) | 65.2 (65.2–65.2) | 70.3 (67.2–73.4) | 57.8 (55.3–60.3) | 76.5 (75.7–77.2) |
AUC, area under the curve; EGC, early gastric cancer.
Figure 2Confusion matrix of the best performance model in the external test. (a): the Inception-Resnet-v2, (b): the DenseNet−161. The vertical axis is true label and the horizontal axis is predicted label.
Figure 3Clinical simulation in the application of the deep-learning algorithm for the determination of a therapeutic strategy of external test dataset; DL, deep-learning.
Figure 4Representative images of correctly or incorrectly determined lesions by the deep-learning (DL) algorithm. (a): Wrong answer case by the DL algorithm in endoscopically resected lesions (moderately differentiated adenocarcinoma with SM2 invasion); (b): wrong answer case by the DL algorithm in endoscopically resected lesions (well differentiated adenocarcinoma with SM3 invasion); (c): correct answer case by the DL algorithm for endoscopic submucosal dissection (ESD) candidate in surgically resected lesions (mucosa-confined signet ring cell carcinoma with 1 cm diameter); (d): wrong answer case by the DL algorithm in surgically resected lesions (mucosa-confined poorly differentiated adenocarcinoma within expanded indication); DL, deep-learning; ESD, endoscopic submucosal dissection.
Figure 5Representative images of the attention map.