Eun Mi Song1, Beomhee Park2, Chun-Ae Ha1, Sung Wook Hwang1, Sang Hyoung Park1, Dong-Hoon Yang1, Byong Duk Ye1, Seung-Jae Myung1, Suk-Kyun Yang1, Namkug Kim3, Jeong-Sik Byeon4.
Abstract
We aimed to develop a computer-aided diagnostic system (CAD) for predicting colorectal polyp histology using deep-learning technology and to validate its performance. Near-focus narrow-band imaging (NBI) pictures of colorectal polyps were retrieved from the database of our institution. Of these, 12480 image patches of 624 polyps were used as a training set to develop the CAD. The CAD performance was validated with two test datasets of 545 polyps. Polyps were classified into three histological groups: serrated polyp (SP), benign adenoma (BA)/mucosal or superficial submucosal cancer (MSMC), and deep submucosal cancer (DSMC). The overall kappa value measuring the agreement between the true polyp histology and the expected histology by the CAD was 0.614-0.642, which was higher than that of trainees (n = 6, endoscopists with experience of 100 NBI colonoscopies in <6 months; 0.368-0.401) and almost comparable with that of the experts (n = 3, endoscopists with experience of 2,500 NBI colonoscopies in ≥5 years) (0.649-0.735). The areas under the receiver operating characteristic curves for CAD were 0.93-0.95, 0.86-0.89, and 0.89-0.91 for SP, BA/MSMC, and DSMC, respectively. The overall diagnostic accuracy of the CAD was 81.3-82.4%, which was significantly higher than that of the trainees (63.8-71.8%, P < 0.01) and comparable with that of experts (82.4-87.3%). The kappa value and diagnostic accuracies of the trainees improved with CAD assistance: that is, the kappa value increased from 0.368 to 0.655, and the overall diagnostic accuracy increased from 63.8-71.8% to 82.7-84.2%. CAD using a deep-learning model can accurately assess polyp histology and may facilitate the diagnosis of colorectal polyps by endoscopists.
Year: 2020 PMID: 31913337 PMCID: PMC6949236 DOI: 10.1038/s41598-019-56697-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Baseline characteristics of the included colorectal polyps.
| Variables | Overall | Training set | Test set I | Test set II | P* |
|---|---|---|---|---|---|
| Number | 1169 | 624 | 182 | 363 | |
| Median size, mm (range) | 10 (2–100) | 8 (2–100) | 10 (4–50) | 12 (3–90) | 0.332 |
| Localization (n, %) | | | | | 0.407 |
| Right colon | 656 (56.1) | 334 (53.5) | 103 (56.6) | 219 (60.3) | |
| Left colon | 513 (43.9) | 290 (46.5) | 79 (43.4) | 144 (39.7) | |
| Macroscopic type (n, %) | | | | | 0.825 |
| Ip | 42 (3.6) | 18 (2.9) | 8 (4.4) | 16 (4.4) | |
| Is | 820 (70.1) | 464 (74.4) | 122 (67.0) | 234 (64.5) | |
| LST | 307 (26.3) | 142 (22.8) | 52 (28.6) | 113 (31.1) | |
| LST-NG | 140 (12.0) | 53 (8.5) | 26 (14.3) | 61 (16.8) | |
| LST-G | 167 (14.3) | 89 (14.3) | 26 (14.3) | 52 (14.3) | |
| Pathology (n, %) | | | | | 0.294 |
| Hyperplastic polyp | 93 (8.0) | 48 (7.7) | 15 (8.2) | 30 (8.3) | |
| Sessile serrated polyp | 170 (14.5) | 76 (12.2) | 24 (13.2) | 70 (19.3) | |
| BA | 705 (60.3) | 393 (63.0) | 106 (58.2) | 206 (56.7) | |
| MSMC | 110 (9.4) | 62 (9.9) | 20 (11.0) | 28 (7.7) | |
| DSMC | 91 (7.8) | 45 (7.2) | 17 (9.3) | 29 (8.0) |
LST, laterally spreading tumor; NG, non-granular; G, granular; BA, benign conventional adenoma; MSMC, mucosal or superficial submucosal cancer; DSMC, deep submucosal cancer.
*P values for the comparison between test sets I and II.
Figure 1. A schematic of the training strategy of the computer-aided diagnostic system (CAD) using a 50-layered convolutional neural network and image patches. SP, serrated polyp; BA, benign conventional adenoma; MSMC, mucosal or superficial submucosal cancer; DSMC, deep submucosal cancer.
Cohen’s kappa value measuring the agreement between true and predicted histopathological diagnoses in test sets I and II.
| Colorectal tumor | Number | Experts, kappa | Experts, 95% CI | CAD, kappa | CAD, 95% CI | Trainees, kappa | Trainees, 95% CI | CAD+trainees, kappa | CAD+trainees, 95% CI |
|---|---|---|---|---|---|---|---|---|---|
| Test set I | | | | | | | | | |
| Overall | 182 | 0.649 | 0.564–0.725 | | | 0.368 | 0.281–0.459 | 0.665 | 0.560–0.758 |
| Tumor size | | | | | | | | | |
| >10mm | 78 | 0.550 | 0.381–0.685 | | | 0.280 | 0.117–0.430 | 0.594 | 0.405–0.744 |
| ≤10mm | 104 | 0.700 | 0.583–0.800 | | | 0.410 | 0.300–0.499 | 0.697 | 0.547–0.825 |
| Tumor location | | | | | | | | | |
| Right colon | 103 | 0.677 | 0.556–0.775 | | | 0.386 | 0.286–0.490 | 0.743 | 0.614–0.856 |
| Left colon | 79 | 0.602 | 0.457–0.729 | | | 0.321 | 0.173–0.455 | 0.555 | 0.375–0.716 |
| Tumor morphology | | | | | | | | | |
| LST type | 52 | 0.583 | 0.382–0.752 | | | 0.225 | 0.038–0.380 | 0.508 | 0.262–0.728 |
| Is type | 122 | 0.671 | 0.564–0.762 | | | 0.436 | 0.536–0.740 | 0.740 | 0.616–0.846 |
| Test set II | | | | | | | | | |
| Overall | 363 | 0.735 | 0.690–0.780 | | | 0.401 | 0.348–0.450 | 0.658 | 0.585–0.729 |
| Tumor size | | | | | | | | | |
| >10mm | 189 | 0.724 | 0.650–0.784 | | | 0.416 | 0.338–0.487 | 0.674 | 0.568–0.773 |
| ≤10mm | 174 | 0.714 | 0.637–0.781 | | | 0.300 | 0.226–0.370 | 0.599 | 0.481–0.693 |
| Tumor location | | | | | | | | | |
| Right colon | 219 | 0.748 | 0.687–0.805 | | | 0.355 | 0.289–0.418 | 0.623 | 0.523–0.715 |
| Left colon | 144 | 0.696 | 0.604–0.776 | | | 0.420 | 0.326–0.509 | 0.687 | 0.575–0.781 |
| Tumor morphology | | | | | | | | | |
| LST type | 113 | 0.744 | 0.654–0.820 | | | 0.385 | 0.276–0.481 | 0.684 | 0.538–0.805 |
| Is type | 234 | 0.712 | 0.650–0.767 | | | 0.394 | 0.324–0.455 | 0.635 | 0.525–0.710 |
CAD, computer-aided diagnostic system; CI, confidence interval; LST, laterally spreading tumor.
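As an illustration of how an agreement statistic of this kind can be computed, the following is a minimal sketch of unweighted Cohen's kappa for a three-class problem. The function name `cohens_kappa` and the toy labels are hypothetical and are not taken from the study; the source does not state which software or kappa variant was used.

```python
from collections import Counter


def cohens_kappa(true_labels, predicted_labels):
    """Unweighted Cohen's kappa for two equal-length label sequences."""
    if len(true_labels) != len(predicted_labels) or not true_labels:
        raise ValueError("label sequences must be non-empty and equal in length")
    n = len(true_labels)
    # Observed agreement: fraction of cases where both labels match.
    observed = sum(t == p for t, p in zip(true_labels, predicted_labels)) / n
    # Chance agreement: sum over classes of the product of marginal proportions.
    true_counts = Counter(true_labels)
    pred_counts = Counter(predicted_labels)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if expected == 1.0:
        return 1.0  # degenerate case: both sequences use one identical class
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy data (NOT study data): true histology vs. predicted histology.
truth = ["SP", "SP", "BA/MSMC", "BA/MSMC", "BA/MSMC", "DSMC", "DSMC", "BA/MSMC"]
preds = ["SP", "BA/MSMC", "BA/MSMC", "BA/MSMC", "BA/MSMC", "DSMC", "BA/MSMC", "BA/MSMC"]
kappa = cohens_kappa(truth, preds)
print(round(kappa, 3))
```

Kappa corrects raw agreement for the agreement expected by chance, which matters here because the BA/MSMC group dominates the class distribution.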
Diagnostic performance of the CAD in each histological group in comparison with the diagnostic performance of endoscopists.
| Measure | Test set I: Experts | Test set I: CAD | Test set I: Trainees | Test set I: CAD+trainees | Test set I: P* | Test set I: P† | Test set I: P‡ | Test set II: Experts | Test set II: CAD | Test set II: Trainees | Test set II: CAD+trainees | Test set II: P* | Test set II: P† | Test set II: P‡ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All polyps | | | | | | | | | | | | | | |
| Accuracy, % (fraction) | 82.4 (450/546) | | 71.8 (392/546) | 84.2 (460/546) | 0.724 | 0.005 | <0.001 | 87.3 (951/1089) | | 63.8 (695/1089) | 82.7 (901/1089) | 0.005 | <0.001 | <0.001 |
| Serrated polyp | | | | | | | | | | | | | | |
| Sensitivity, % (fraction) | 88.9 (104/117) | | 55.6 (65/117) | 82.1 (96/117) | 0.179 | <0.001 | <0.001 | 81.7 (245/300) | | 92.0 (276/300) | 81.3 (244/300) | 0.059 | <0.001 | 0.0003 |
| Specificity, % (fraction) | 92.1 (395/429) | | 90.4 (388/429) | 94.9 (407/429) | 0.498 | 0.210 | 0.057 | 94.6 (746/789) | | 61.0 (481/789) | 89.2 (704/789) | 0.452 | <0.001 | <0.001 |
| PPV, % (fraction) | 75.4 (104/138) | | 61.3 (65/106) | 81.4 (96/118) | 0.666 | 0.018 | 0.003 | 85.1 (245/288) | | 47.3 (276/584) | 74.2 (244/329) | 0.250 | <0.001 | <0.001 |
| NPV, % (fraction) | 96.8 (395/408) | | 88.2 (388/440) | 95.1 (407/428) | 0.193 | 0.001 | 0.001 | 93.1 (746/801) | | 95.2 (481/505) | 92.6 (704/760) | 0.046 | 0.003 | 0.050 |
| BA/MSMC | | | | | | | | | | | | | | |
| Sensitivity, % (fraction) | 83.1 (314/378) | | 81.7 (309/378) | 88.1 (333/378) | 0.775 | 0.508 | 0.040 | 92.2 (647/702) | | 53.0 (372/702) | 85.8 (602/702) | 0.041 | <0.001 | <0.001 |
| Specificity, % (fraction) | 81.0 (136/168) | | 51.8 (87/168) | 75.6 (127/168) | 0.304 | 0.001 | <0.001 | 78.6 (304/387) | | 84.5 (327/387) | 77.8 (301/387) | 0.083 | <0.001 | 0.0167 |
| PPV, % (fraction) | 90.8 (314/346) | | 79.2 (309/390) | 89.0 (333/374) | 0.330 | 0.002 | <0.001 | 88.6 (647/730) | | 86.1 (372/432) | 87.5 (602/688) | 0.043 | 0.655 | 0.429 |
| NPV, % (fraction) | 68.0 (136/200) | | 55.8 (87/156) | 73.8 (123/172) | 0.961 | 0.028 | <0.001 | 84.7 (304/359) | | 49.8 (327/657) | 75.1 (301/401) | 0.016 | <0.001 | <0.001 |
| DSMC | | | | | | | | | | | | | | |
| Sensitivity, % (fraction) | 62.7 (32/51) | | 35.3 (18/51) | 58.8 (30/51) | 0.839 | 0.043 | 0.013 | 67.8 (59/87) | | 54.0 (47/87) | 63.2 (55/87) | 0.532 | 0.268 | 0.161 |
| Specificity, % (fraction) | 93.9 (465/495) | | 93.5 (463/495) | 92.9 (472/495) | 0.771 | >0.999 | 0.678 | 98.8 (990/1002) | | 97.4 (976/1002) | 98.3 (985/1002) | 0.004 | 0.436 | 0.201 |
| PPV, % (fraction) | 51.6 (32/62) | | 36.0 (18/50) | 46.2 (31/54) | 0.655 | 0.215 | 0.274 | 83.1 (59/71) | | 64.4 (47/74) | 76.4 (55/72) | 0.004 | 0.760 | 0.094 |
| NPV, % (fraction) | 96.1 (465/484) | | 93.3 (463/496) | 95.6 (472/492) | 0.742 | 0.114 | 0.117 | 97.2 (990/1018) | | 96.1 (976/1016) | 96.9 (985/1017) | 0.471 | 0.324 | 0.186 |
CAD, computer-aided diagnostic system; PPV, positive predictive value; NPV, negative predictive value; BA, benign conventional adenoma; MSMC, mucosal or superficial submucosal cancer; DSMC, deep submucosal cancer.
The diagnostic performance of the three experts and trainees was evaluated by combining the results of all the endoscopists. Therefore, the total number of examined polyps was 546 (3 times 182) in test set I and 1089 (3 times 363) in test set II.
*P value for the comparison of CAD vs. experts.
†P value for the comparison of CAD vs. trainees.
‡P value for the comparison of trainees vs. CAD+trainees.
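The percentages in the table follow directly from the reported fractions. The sketch below recomputes the four measures from the experts' test set I fractions 104/117, 395/429, 104/138, and 395/408; the denominator 117 appears to correspond to the 39 serrated polyps of test set I read by three experts. The function name `diagnostic_metrics` is hypothetical, and the true/false positive and negative counts are inferred from the fractions rather than stated in the source.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return sensitivity, specificity, PPV, and NPV as percentages."""
    return {
        "sensitivity": 100.0 * tp / (tp + fn),  # detected positives / all positives
        "specificity": 100.0 * tn / (tn + fp),  # correct negatives / all negatives
        "ppv": 100.0 * tp / (tp + fp),          # correct positives / all positive calls
        "npv": 100.0 * tn / (tn + fn),          # correct negatives / all negative calls
    }


# Counts inferred from the reported fractions (an assumption, not given explicitly):
# sensitivity 104/117 and specificity 395/429.
tp, fn = 104, 117 - 104
tn, fp = 395, 429 - 395
metrics = diagnostic_metrics(tp, fn, tn, fp)

# Pooled denominator for test set I: 182 polyps read by 3 endoscopists.
pooled_reads = 3 * 182

for name, value in metrics.items():
    print(f"{name}: {value:.1f}%")
```

The recomputed PPV and NPV denominators (138 and 408) match the reported fractions, which supports this reading of the counts.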
Figure 2. The receiver operating characteristic (ROC) curves evaluating the diagnostic performance of the computer-aided diagnostic system (CAD). The performance of the CAD was evaluated and compared with the performances of three expert endoscopists and three trainees using ROC curves. (A–C) The ROC curves for the CAD in the SP, BA/MSMC, and DSMC groups of test dataset I; (D–F) The ROC curves for the CAD in the SP, BA/MSMC, and DSMC groups of test dataset II. AUC, area under the ROC curve; SP, serrated polyp; BA, benign conventional adenoma; MSMC, mucosal or superficial submucosal cancer (cancer with invasion depth <1000 µm from the muscularis mucosa); DSMC, deep submucosal cancer (cancer with invasion depth ≥1000 µm from the muscularis mucosa).
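For a three-class classifier, a per-group AUC such as those in the panels above is commonly obtained one-vs-rest: the target group is treated as positive and the other two as negative. The sketch below is a minimal illustration using the rank-based (Mann-Whitney) definition of the AUC. The helper names and the scores are hypothetical and are not study data, and the source does not state how the AUCs were computed.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a positive case outscores a negative one.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def one_vs_rest(true_classes, target):
    """Binary labels: 1 for the target histological group, 0 for all others."""
    return [1 if c == target else 0 for c in true_classes]


# Hypothetical toy data (NOT study data): predicted probability of the SP class.
true_classes = ["SP", "BA/MSMC", "DSMC", "SP", "BA/MSMC", "BA/MSMC"]
sp_scores = [0.90, 0.20, 0.05, 0.60, 0.70, 0.10]
auc_sp = roc_auc(one_vs_rest(true_classes, "SP"), sp_scores)
print(auc_sp)
```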
Figure 3. The visualized class activation map images. The figures in small rectangles in each image show the probability of each class being predicted by the computer-aided diagnostic system (CAD). The red area represents the region that the CAD considers to be compatible with the particular histology with high probability. The blue area represents the region that CAD considers to have a low probability for the particular histology. SP, serrated polyp; BA, benign conventional adenoma; MSMC, mucosal or superficial submucosal cancer; DSMC, deep submucosal cancer.
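The source does not specify which class activation map variant was used. As a rough sketch of the idea, the basic formulation computes, for one class, a weighted sum of the final convolutional feature maps using that class's output-layer weights, then rescales the result for display. In the example below the function name, the 2×2 feature maps, and the weights are all hypothetical; a real network would supply many more channels and a larger spatial grid.

```python
def class_activation_map(feature_maps, class_weights):
    """Weighted sum of K feature maps (each H x W), min-max scaled to [0, 1].

    feature_maps: list of K two-dimensional lists of floats.
    class_weights: list of K floats, the output weights for one class.
    """
    height, width = len(feature_maps[0]), len(feature_maps[0][0])
    cam = [
        [
            sum(w * fmap[y][x] for w, fmap in zip(class_weights, feature_maps))
            for x in range(width)
        ]
        for y in range(height)
    ]
    flat = [v for row in cam for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        # Flat map: no region stands out, so return all zeros.
        return [[0.0] * width for _ in range(height)]
    return [[(v - lo) / (hi - lo) for v in row] for row in cam]


# Hypothetical 2-channel, 2x2 feature maps and class weights (NOT from the study).
feature_maps = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 0.0], [0.0, 2.0]],
]
weights = [0.5, 1.0]
heatmap = class_activation_map(feature_maps, weights)
print(heatmap)
```

Values near 1 would be rendered toward the red end of the color scale and values near 0 toward the blue end, matching the convention described in the caption.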
Figure 4. Improvement of the diagnostic performance of trainees with the assistance of the computer-aided diagnostic system (CAD). All empty circles representing trainees’ performance moved to solid circles representing the performance of the CAD+trainees at the left upper side or near the yellow curved line; this suggests that the performance of the CAD+trainees was superior to that of trainees and comparable to that of the CAD (yellow curved line). (A–C) Improved diagnostic performance of the CAD+trainee in the SP, BA/MSMC, and DSMC groups of test dataset I; (D–F) Improved diagnostic performance of the CAD+trainee in the SP, BA/MSMC, and DSMC groups of test dataset II. SP, serrated polyp; BA, benign conventional adenoma; MSMC, mucosal or superficial submucosal cancer; DSMC, deep submucosal cancer.
Diagnostic performance of the CAD according to the size, location, and morphology of the colorectal polyps in test set I.
| | Serrated polyp | | BA/MSMC | | DSMC | |
|---|---|---|---|---|---|---|
| Polyp size | ≤10mm | >10mm | ≤10mm | >10mm | ≤10mm | >10mm |
| Accuracy, % (fraction) | 85.6 (267/312) | 98.7 (231/234) | 79.5 (186/234) | 79.5 (186/234) | 97.1 (303/312) | 80.8 (189/234) |
| Sensitivity, % (fraction) | 77.4 (72/93) | 100.0 (24/24) | 81.5 (132/162) | 81.5 (132/162) | 0.0 (0/3) | 62.5 (30/48) |
| Specificity, % (fraction) | 89.0 (195/219) | 98.6 (207/210) | 75.0 (54/72) | 75.0 (54/72) | 98.1 (303/309) | 85.5 (159/186) |
| PPV, % (fraction) | 75.0 (72/96) | 88.9 (24/27) | 88.0 (132/150) | 88.0 (132/150) | 0.0 (0/6) | 52.6 (30/57) |
| NPV, % (fraction) | 90.3 (195/216) | 100.0 (207/207) | 64.3 (54/84) | 64.3 (54/84) | 99.0 (303/306) | 89.8 (159/177) |
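| Polyp location | Right colon | Left colon | Right colon | Left colon | Right colon | Left colon |
| Accuracy, % (fraction) | 91.3 (282/309) | 91.1 (216/237) | 86.4 (267/309) | 74.7 (177/237) | 95.1 (294/306) | 83.6 (198/237) |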
| Sensitivity, % (fraction) | 89.3 (75/93) | 63.6 (21/33) | 86.5 (192/222) | 80.8 (126/156) | 0.0 (0/3) | 62.5 (30/48) |
| Specificity, % (fraction) | 92.0 (207/225) | 95.6 (195/204) | 86.2 (75/87) | 63.0 (51/81) | 96.1 (294/306) | 88.9 (168/189) |
| PPV, % (fraction) | 80.6 (75/93) | 70.0 (21/30) | 94.1 (192/204) | 80.8 (126/156) | 0.0 (0/12) | 58.8 (30/51) |
| NPV, % (fraction) | 95.8 (207/215) | 94.2 (195/207) | 71.4 (75/105) | 63.0 (51/81) | 99.0 (294/297) | 90.3 (168/186) |
| Polyp morphology | LST type | Is type | LST type | Is type | LST type | Is type |
| Accuracy, % (fraction) | 96.2 (150/156) | 89.3 (327/366) | 76.9 (120/156) | 82.8 (303/366) | 80.8 (126/156) | 93.4 (342/366) |
| Sensitivity, % (fraction) | 80.0 (24/30) | 85.7 (72/84) | 80.0 (84/105) | 84.5 (213/252) | 57.1 (12/21) | 60.0 (18/30) |
| Specificity, % (fraction) | 100.0 (126/126) | 90.4 (255/282) | 70.6 (36/51) | 78.9 (90/114) | 84.4 (114/135) | 96.4 (324/336) |
| PPV, % (fraction) | 100.0 (24/24) | 72.7 (72/99) | 84.8 (84/99) | 89.9 (213/237) | 36.4 (12/33) | 60.0 (18/30) |
| NPV, % (fraction) | 95.5 (126/132) | 95.5 (225/267) | 63.2 (36/57) | 69.8 (90/129) | 92.7 (114/123) | 96.4 (324/336) |
CAD, computer-aided diagnostic system; BA, benign conventional adenoma; MSMC, mucosal or superficial submucosal cancer; DSMC, deep submucosal cancer; PPV, positive predictive value; NPV, negative predictive value; LST, laterally spreading tumor.