| Literature DB >> 35984903 |
Silvia Pecere1,2, Giulio Antonelli3,4, Mario Dinis-Ribeiro5, Yuichi Mori6,7, Cesare Hassan8,9, Lorenzo Fuccio10, Raf Bisschops11, Guido Costamagna1,2, Eun Hyo Jin12, Dongheon Lee13, Masashi Misawa7, Helmut Messmann14, Federico Iacopini4, Lucio Petruzziello1,2, Alessandro Repici8, Yutaka Saito15, Prateek Sharma16, Masayoshi Yamada15, Cristiano Spada2,17, Leonardo Frazzoni10.
Abstract
Widespread adoption of optical diagnosis of colorectal neoplasia is prevented by suboptimal endoscopist performance and lack of standardized training and competence evaluation. We aimed to assess diagnostic accuracy of endoscopists in optical diagnosis of colorectal neoplasia in the framework of artificial intelligence (AI) validation studies. Literature searches of databases (PubMed/MEDLINE, EMBASE, Scopus) up to April 2022 were performed to identify articles evaluating accuracy of individual endoscopists in performing optical diagnosis of colorectal neoplasia within studies validating AI against a histologically verified ground-truth. The main outcomes were endoscopists' pooled sensitivity, specificity, positive and negative predictive value (PPV/NPV), positive and negative likelihood ratio (LR) and area under the curve (AUC for sROC) for predicting adenomas versus non-adenomas. Six studies with 67 endoscopists and 2085 (IQR: 115-243,5) patients were evaluated. Pooled sensitivity and specificity for adenomatous histology was respectively 84.5% (95% CI 80.3%-88%) and 83% (95% CI 79.6%-85.9%), corresponding to a PPV, NPV, LR+, LR- of 89.5% (95% CI 87.1%-91.5%), 75.7% (95% CI 70.1%-80.7%), 5 (95% CI 3.9%-6.2%) and 0.19 (95% CI 0.14%-0.25%). The AUC was 0.82 (CI 0.76-0.90). Expert endoscopists showed a higher sensitivity than non-experts (90.5%, [95% CI 87.6%-92.7%] vs. 75.5%, [95% CI 66.5%-82.7%], p < 0.001), and Eastern endoscopists showed a higher sensitivity than Western (85%, [95% CI 80.5%-88.6%] vs. 75.8%, [95% CI 70.2%-80.6%]). Quality was graded high for 3 studies and low for 3 studies. We show that human accuracy for diagnosis of colorectal neoplasia in the setting of AI studies is suboptimal. Educational interventions could benefit by AI validation settings which seem a feasible framework for competence assessment.Entities:
Keywords: artificial intelligence; colonoscopy; endoscopist performance; human factor; polyp characterization; polyp detection
Mesh:
Year: 2022 PMID: 35984903 PMCID: PMC9557953 DOI: 10.1002/ueg2.12285
Source DB: PubMed Journal: United European Gastroenterol J ISSN: 2050-6406 Impact factor: 6.866
FIGURE 1Flow‐chart of included studies
Details of included studies
| First Author, Year | Design | Country | Patients ( | Consecutive Y/N | Images ( | Endoscopists ( | Expert ( | Non expert ( | AI type | Imaging type | Setting |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Chen, 2018 | U | E | 193 | Y | 284 | 6 | 2 | 4 | CAOB | HDWL/magnifying NBI | Experimental images only |
| Renner, 2018 | U | W | 250 | Y | 100 | 2 | 2 | 0 | DNN‐CAD | HDWL/NBI | Experimental images only |
| Mori et al., 2018 | U | E | 320 | Y | 450 | 4 | 2 | 2 | CAD system | EC‐NBI | Real time images and videos |
| Kudo, 2020 | M | E | 89 | N | 100 | 30 | 10 | 20 | EndoBRAIN | WL/EC‐NBI | Experimental images only |
| Jin, 2020 | U | E | 224 | N | 300 | 22 | 15 | 7 | CNN system | NBI/near‐focus | Experimental images only |
| Weigt, 2021 | M | W | 80 | Y | 134 | 3 | 3 | 0 | CAD‐EYE | WL/LCI/BLI | Experimental images and videos |
Abbreviations: BLI, Blue Light Imaging; CAD, Computer Aided Detection; CAOB, computer‐assisted optical biopsy; CNN, convolutional neural network; DNN, deep neural network; E, Eastern; EC, endocitoscopy; HDWL, high definition white light; LCI, Linked Color Imaging; M, multicentric; NBI, narrow band imaging; U, unicentric; W, Western.
Polyps characteristics
| Polyps characteristics | |||
|---|---|---|---|
| Study | Size (mm) | Shape (Paris class) | Location % (right/left colon) |
| Chen, 2018 | <5 | Is – Isp – IIa | 34.8/65.2 |
| Renner, 2018 | <5 | Is – Ip – IIa | 51/49 |
| Mori, 2018 | <5 | Is – Ip – IIa – IIc | 40.4/59.6 |
| Kudo, 2020 | <10 | Is – Isp – IIa | 38/62 |
| Jin, 2020 | <5 | Is – Isp – IIa | 54.3/45.7 |
| Weigt, 2021 | ‐ | Is – IIa | 44/35.5 (20.9 missing) |
FIGURE 2Summary receiver‐operating characteristic curve
FIGURE 3Forest plots for sensitivity and specificity by study
FIGURE 4Forest plots for sensitivity and specificity by experience and country
Subgroup meta‐analyses for summary diagnostic accuracy measures of endoscopists for adenoma characterization at colonoscopy, according to study variables
| Study variable ( | Sensitivity (95% CI) |
| Specificity (95% CI) |
|
|---|---|---|---|---|
| Endoscopists' experience | ||||
| Experienced ( | 90.5 (87.6–92.7) | <0.001 | 84.8 (82.3–87.8) | 0.084 |
| Inexperienced ( | 75.5 (66.5–82.7) | 81.4 (75.1–86.4) | ||
| Country | ||||
| Eastern ( | 85 (80.5–88.6) | 0.436 | 83.6 (80–86.6) | 0.28 |
| Western ( | 75.8 (70.2–80.6) | 76.7 (65.9–84.8) | ||
| Study design | ||||
| Monocenter ( | 90.2 (86.9–92.8) | <0.001 | 78.5 (73.7–82.7) | 0.001 |
| Multicenter ( | 76.2 (67.7–83.1) | 88.8 (84.5–92.1) | ||
| Study quality | ||||
| High ( | 83.6 (78.6–87.6) | 0.359 | 84.6 (80.9–87.8) | 0.051 |
| Low ( | 89 (80.8–93.9) | 75.6 (70.1–80.4) | ||
Meta‐regression analysis for continuous moderators
| Study variable | Coefficient for sensitivity (95% CI) |
| Coefficient for 1‐specificity (95% CI) |
|
|---|---|---|---|---|
| Number of images | 0.004 (0.001–0.006) | 0.002 | 0.002 (0.001–0.004) | 0.032 |
| Percentage of right colon lesions | −0.081 (−0.126–−0.037) | <0.001 | 0.086 (−1.084–1.256) | 0.886 |
| Relative frequency of adenomas | −5.310 (−11.429–0.809) | 0.089 | −1.305 (−6.053–3.443) | 0.590 |
Quality assessment
| Reference standard/Training set | Index test/Validation set | |||||
|---|---|---|---|---|---|---|
| Study | Selection bias | Spectrum bias | Operator bias | Overfitting bias | Operator bias | Overall quality |
| Chen, 2018 |
|
|
|
|
| Low |
| Renner, 2018 |
|
|
|
|
| Low |
| Mori et al., 2018 |
|
|
|
|
| High |
| Kudo, 2020 |
|
|
|
|
| High |
| Jin, 2020 |
|
|
|
|
| High |
| Weigt, 2021 |
|
|
|
|
| Low |
Note: low risk of bias high risk of bias.