| Literature DB >> 31041367 |
Thomas K L Lui1,2,3, Kenneth K Y Wong2, Loey L Y Mak1, Michael K L Ko1, Stephen K K Tsao3, Wai K Leung1.
Abstract
Background and study aims We evaluated use of artificial intelligence (AI) assisted image classifier in determining the feasibility of curative endoscopic resection of large colonic lesion based on non-magnified endoscopic images Methods AI image classifier was trained by 8,000 endoscopic images of large (≥ 2 cm) colonic lesions. The independent validation set consisted of 567 endoscopic images from 76 colonic lesions. Histology of the resected specimens was used as gold standard. Curative endoscopic resection was defined as histology no more advanced than well-differentiated adenocarcinoma, ≤ 1 mm submucosal invasion and without lymphovascular invasion, whereas non-curative resection was defined as any lesion that could not meet the above requirements. Performance of the trained AI image classifier was compared with that of endoscopists. Results In predicting endoscopic curative resection, AI had an overall accuracy of 85.5 %. Images from narrow band imaging (NBI) had significantly higher accuracy (94.3 % vs 76.0 %; P < 0.00001) and area under the ROC curve (AUROC) (0.934 vs 0.758; P = 0.002) than images from white light imaging (WLI). AI was superior to two junior endoscopists in terms of accuracy (85.5 % vs 61.9 % or 82.0 %, P < 0.05), AUROC (0.837 vs 0.638 or 0.717, P < 0.05) and confidence level (90.1 % vs 83.7 % or 78.3 %, P < 0.05). However, there was no statistical difference in accuracy and AUROC between AI and a senior endoscopist. Conclusions The trained AI image classifier based on non-magnified images can accurately predict probability of curative resection of large colonic lesions and is better than junior endoscopists. NBI images have better accuracy than WLI for AI prediction.Entities:
Year: 2019 PMID: 31041367 PMCID: PMC6447402 DOI: 10.1055/a-0849-9548
Source DB: PubMed Journal: Endosc Int Open ISSN: 2196-9736
Fig. 1Diagnostic flow from an endoscopic image to final AI result.
Characteristics of large colonic lesions in the final testing set.
| Endoscopically curable lesion | Endoscopically incurable lesion | Total | |
| Number of lesions | 56 | 20 | 76 |
| Location | |||
Cecum | 5 (8.9 %) | 1 (5 %) | 6 (7.9 %) |
Ascending colon | 10 (17.8 %) | 1 (5 %) | 11 (14.5 %) |
Transverse colon | 8 (14.3 %) | 0 (0 %) | 8 (10.5 %) |
Descending colon | 4 (7.1 %) | 2 (10 %) | 6 (7.9 %) |
Sigmoid | 28 (50 %) | 13 (65 %) | 41 (53.9 %) |
Rectum | 1(1.8 %) | 3 (15 %) | 4 (5.3 %) |
| Histology | |||
Invasive adenocarcinoma (SM2)
| 0 (0 %) | 20 (100 %) | 20 (26.3 %) |
Slightly invasive adenocarcinoma (SM1) | 10 (17.9 %) | 0 (%) | 10 (13.1 %) |
Tubulovillous adenoma | 8 (14.3 %) | 0 (0 %) | 8 (10.5 %) |
Tubular adenoma | 28 (50 %) | 0 (0 %) | 28 (36.8 %) |
Sessile serrated adenoma | 10 (17.9 %) | 0 (0 %) | 10 (13.2 %) |
12 lesions had presence of lymphovascular invasion
Performance of the AI image classifier in predicting ECL based on non-magnified endoscopic images.
| All | NBI | WLI |
| |
| Sensitivity | 88.2 % (95 % CI: 84.7 %- 91.1 %) | 94.6 % (95 % CI:91.0 %-97.0 %) | 78.2 % (95 % CI:71.1 %-84.2 %) | < 0.0001 |
| Specificity | 77.9 % (95 % CI: 70.3 % – 84.4 %) | 92.3 % (95 % CI:79.1 % – 98.4 %) | 72.6 % (95 % CI:63.1 %-80.9 %) | 0.05 |
| PPV | 92.1 % (95 % CI: 89.5 % – 94.1 %) | 98.8 % (95 % CI:96.5 %-99.6 %) | 81.7 % (95 % CI:76.4 %-86.0 %) | < 0.0001 |
| NPV | 69.3 % (95 % CI: 63.2 %-74.8 %) | 72.0 % (95 % CI:60.5 %-81.2 %) | 68.1 % (95 % CI:61.0 %-74.5 %) | 0.41 |
| Accuracy | 85.5 % (95 % CI:82.4 % – 88.3 %) | 94.3 % (95 % CI:91.0 %-96.2 %) | 76.0 % (95 % CI:70.5 %-81.0 %) | < 0.00001 |
| AUROC | 0.837 (95 % CI: 0.794 – 0.880) | 0.934 (95 % CI: 0.883 – 0.985) | 0.758 (95 % CI: 0.697 – 0.819) | 0.002 |
AI, artificial intelligence; ECL, endoscopically curable lesion; CI, confidence interval; NBI, narrow-band imaging; WLI, white light imaging; PPV, positive predictive value; NPV, negative predictive value; AUROC, area under the receiver operator curve
P value: NBI vs WLI
Performance of the AI image classifier in predicting ECL based on non-magnified endoscopic images according to location.
| All n = 76 | Left-sided colon n = 45 | Right sided colon n = 31 |
| |
| Sensitivity | 88.2 % (95 % CI: 84.7 %- 91.1 %) | 84.9 % (95 % CI:79.5 %-89.5 %) | 90.3 % (95 % CI:85.6 %-93.9 %) | 0.213 |
| Specificity | 77.9 % (95 % CI: 70.3 % – 84.4 %) | 79.2 % (95 % CI:68.0 % – 87.8 %) | 80.3 % (95 % CI:68.7 %-89.1 %) | 0.871 |
| PPV | 92.1 % (95 % CI: 89.5 % – 94.1 %) | 92.4 % (95 % CI:88.5 %-95.0 %) | 93.8 % (95 % CI:90.2 %-96.1 %) | 0.403 |
| NPV | 69.3 % (95 % CI: 63.2 %-74.8 %) | 64.0 % (95 % CI:55.9 %-71.5 %) | 71.6 % (95 % CI:62.3 %-79.4 %) | 0.116 |
| Accuracy | 85.5 % (95 % CI:82.4 % – 88.3 %) | 83.5 % (95 % CI:78.7 %-87.6 %) | 88.0 % (95 % CI:83.6 %-91.5 %) | 0.317 |
| AUROC | 0.837 (95 % CI: 0.794 – 0.880) | 0.821 (95 % CI:0.760 – 0.882) | 0.853 (95 % CI: 0.792 – 0.913) | 0.548 |
AI, artificial intelligence; ECL, endoscopically curable lesion; CI, confidence interval; PPV, positive predictive value; NPV, negative predictive value; AUROC, area under the receiver operator curve
P value: left-sided colon vs right-sided colon
Fig. 2 Example of inaccurate results.
Performance of the AI image classifier vs human endoscopists in predicting ECL based on non-magnified endoscopic images.
| Al | Junior Endoscopist A | Junior Endoscopist B | Senior Endoscopist C | |
|
Sensitivity
| 88.2 % (95 % CI: 84.7 %- 91.1 %) | 60.1 % (95 % CI:55.3 %-64.8 %) | 87.6 % (95 % CI:84.2 %-90.6 %) | 91.8 % (95 % CI:89.1 %-94.1 %) |
|
Specificity
| 77.9 % (95 % CI: 70.3 % – 84.4 %) | 67.1 % (95 % CI:58.9 % – 75.1 %) | 56.4 % (95 % CI:47.5 %-64.7 %) | 52.6 % (95 % CI:40.9 %-64.0 %) |
|
PPV
| 92.1 % (95 % CI: 89.5 % – 94.1 %) | 85.1 % (95 % CI:81.6 %-88.6 %) | 86.2 % (95 % CI:83.8 %-88.4 %) | 92.4 % (95 % CI:90.6 %-93.9 %) |
|
NPV
| 69.3 % (95 % CI: 63.2 %-74.8 %) | 35.2 % (95 % CI:31.6 %-39.1 %) | 59.2 % (95 % CI:52.0 %-66.1 %) | 50.6 % (95 % CI:41.5 %-59.6 %) |
|
Accuracy
| 85.5 % (95 % CI:82.4 % – 88.3 %) | 61.9 % (95 % CI:57.8 %-65.9 %) | 80.0 % (95 % CI:76.5 %-82.0 %) | 86.4 % (95 % CI:83.4 %-89.2 %) |
|
AUROC
| 0.837 (95 % CI: 0.794 – 0.880) | 0.638 (95 % CI:0.585 – 0.690) | 0.717 (95 % CI: 0.663 – 0.771) | 0.873 (95 % CI: 0.739 – 0.907) |
|
Mean probability
| 0.901 | 0.837 | 0.783 | 0.803 |
AI, artificial intelligence; ECL, endoscopically curable lesion; CI confidence interval/ PPV, positive predictive value; NPV, negative predictive value; AUROC, area under the receiver operator curve
AI is statistically better than Junior Endoscopist A ( P < 0.05)
AI is statistically better than Junior Endoscopist B ( P < 0.05)
AI is statistically better than Senior Endoscopist C ( P < 0.05)