| Literature DB >> 33254026 |
Dehua Tang1, Lei Wang1, Tingsheng Ling1, Ying Lv1, Muhan Ni1, Qiang Zhan2, Yiwei Fu3, Duanming Zhuang4, Huimin Guo1, Xiaotan Dou1, Wei Zhang1, Guifang Xu5, Xiaoping Zou6.
Abstract
BACKGROUND: We aimed to develop and validate a real-time deep convolutional neural networks (DCNNs) system for detecting early gastric cancer (EGC).Entities:
Keywords: Artificial intelligence; Convolutional neural network; Detection; Early gastric cancer
Year: 2020 PMID: 33254026 PMCID: PMC7708824 DOI: 10.1016/j.ebiom.2020.103146
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Fig. 1Workflow for the development and validation of the DCNN system for diagnosing EGC. DCNN: Deep convolutional neural networks; EGC: Early gastric cancer.
Fig. 2Architecture and workflow of the DCNN system. DCNN: Deep convolutional neural networks. .
Clinical characteristics of training and validation datasets.
| Characteristics | Training dataset(NJDTH, 1085 cases)January 2016–October 2018 | Temporal validation dataset (NJDTH, 279 cases) November 2018–January 2019 | External validation datasets June 2019–October 2019 | Video dataset(NJDTH, 26 cases)November 2019–December 2019 | ||
|---|---|---|---|---|---|---|
| WXPH (20 cases) | TZPH (20 cases) | GCPH (20 cases) | ||||
| 808 / 277 | 191 / 88 | 13 / 7 | 12 / 8 | 15 / 5 | 16 / 10 | |
| 63.4 (27–90) | 64.0 (34–86) | 66.3 (51–78) | 61.8 (47–73) | 62.6 (54–79) | 62.6 (39–77) | |
| 2.1 (0.2–4.4) | 1.9 (0.3–3.9) | 1.9 (0.6–3.5) | 1.7 (0.5–3.6) | 1.4 (0.5–2.6) | 1.7 (0.4–3.5) | |
| 340 / 8 / 194 / 164 / 379 | 92 / 4 / 41 / 40 / 102 | 12 / 0 / 0 / 3 / 5 | 8 / 0 / 2 / 4 / 6 | 7 / 0 / 0 / 4 / 9 | 8 / 0 / 4 / 6 / 8 | |
| 275 / 183 / 48 / 363 / 150 / 27 / 10 / 29 | 32 / 68 / 16 / 107 / 45 / 5 / 3 / 3 | 0 / 4 / 1 / 8 / 4 / 3 / 0 / 0 | 0 / 6 / 1 / 6 / 3 / 3 / 1 / 0 | 2 / 4 / 0 / 5 / 4 / 3 / 1 / 1 | 0 / 3 / 0 / 17 / 5 / 0 / 1 / 0 | |
| 1002 / 14 / 69 | 256 / 2 / 21 | 19 / 0 / 1 | 17 / 1 / 2 | 19 / 0 / 1 | 24 / 0 / 2 | |
| 471 / 368 / 246 | 99 / 135 / 45 | 4 / 15 / 1 | 3 / 17 / 0 | 6 / 12 / 2 | 3 / 22 / 1 | |
| 799 / 286 | 173 / 106 | 17 / 3 | 13 / 7 | 16 / 4 | 20 / 6 | |
LGD: Low grade dysplasia; HGD: High grad dysplasia; M: Mucosal gastric cancer; SM: Submucosal gastric cancer.
. Performance of the DCNN system in validation datasets.
| NJDTH validation | External validation | |||
|---|---|---|---|---|
| Internal validation | WXPH | TZPH | GCPH | |
| Accuracy (95% CI) | 87.8 (87.1–88.5) | 88.7 (85.2–91.4) | 91.2 (88.5–93.3) | 85.1 (81.9–87.9) |
| Sensitivity (95% CI) | 95.5 (94.8–96.1) | 91.1 (86.1–94.5) | 92.1 (88.1–94.9) | 85.9 (81.0–89.7) |
| Specificity (95% CI) | 81.7 (80.7–82.8) | 86.2(80.5–90.5) | 90.3 (86.0–93.4) | 84.4 (79.5–88.4) |
| Positive predictive value (95% CI) | 80.5 (79.4–81.6) | 86.9 (81.4–90.9) | 90.5 (86.3–93.5) | 84.6 (79.8–88.6) |
| Negative predictive value (95% CI) | 95.9 (95.2–96.4) | 90.7 (85.4–94.2) | 91.9 (87.9–94.8) | 85.7 (80.8–89.5) |
| AUC | 0.940 | 0.906 | 0.925 | 0.887 |
Fig. 3(a) Predictive results of the DCNN system and corresponding positive pathological tissues. (b) Predictive results of the DCNN system and corresponding annotations of experts. DCNN: Deep convolutional neural networks. .
Fig. 4Receiver operating characteristic curves illustrating the ability of the DCNN system to diagnose EGC. Sample size: 4153 cancer images and 5264 non-cancer images in NJDTH; 203 cancer images and 203 non-cancer images in WXPH; 228 cancer images and 228 non-cancer images in TZPH; 226 cancer images and 226 non-cancer images in GCPH. NJDTH: Nanjing University Medical School Affiliated Drum Tower Hospital; WXPH: Wuxi People's Hospital; TZPH: Taizhou People's Hospital; GCPH: Gaochun People's Hospital; DCNN: Deep convolutional neural networks; EGC: Early gastric cancer.
Comparison between the DCNN system and endoscopists.
| Accuracy (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) | Positive predictive value (95% CI) | Negative predictive value (95% CI) | |
|---|---|---|---|---|---|
| 95.3 (93.3–96.8) | 93.0 (89.3–95.5) | 97.7 (95.0–99.0) | 97.6 (94.8–98.9) | 93.3 (89.8–95.7) | |
| 87.3 (85.2–89.3) | 82.7 (75.5–89.9) | 91.9 (87.2–96.6) | 92.1 (88.4–95.7) | 85.4 (80.1–90.7) | |
| 73.6 (71.0–76.3) | 50.2 (44.1–56.4) | 97.1 (95.6–98.5) | 95.1 (93.2–96.9) | 66.7 (63.9–69.5) | |
| 94.3 (91.0–97.5) | 97.4 (95.0–99.8) | 91.1 (83.1–99.1) | 92.1 (85.6–98.5) | 97.9 (96.3–99.4) | |
| 96.2 (95.8–96.7) | 94.7 (93.9–95.6) | 97.7 (96.8–98.6) | 97.7 (96.8–98.5) | 94.9 (94.1–95.7) |
Intra-observer agreement of the testing dataset.
| Expert / trainee | κ |
|---|---|
| Expert 1 | 0.802 |
| Expert 2 | 0.765 |
| Expert 3 | 0.778 |
| Expert 4 | 0.727 |
| Expert 5 | 0.769 |
| Expert 6 | 0.802 |
| Trainee 1 | 0.552 |
| Trainee 2 | 0.534 |
| Trainee 3 | 0.535 |
| Trainee 4 | 0.570 |
| Trainee 5 | 0.419 |
| Trainee 6 | 0.634 |
| Trainee 7 | 0.355 |
| Trainee 8 | 0.744 |
| Trainee 9 | 0.662 |
| Trainee 10 | 0.672 |