| Literature DB >> 35461350 |
Ayaka Tajiri1, Ryu Ishihara2, Yusuke Kato3, Takahiro Inoue1, Katsunori Matsueda1, Muneaki Miyake1, Kotaro Waki1, Yusaku Shimamoto1, Hiromu Fukuda1, Noriko Matsuura1,4, Satoshi Egawa5, Shinjiro Yamaguchi6, Hideharu Ogiyama7, Kiyoshi Ogiso8, Tsutomu Nishida9, Kenji Aoi10, Tomohiro Tada3,11,12.
Abstract
Previous reports have shown favorable performance of artificial intelligence (AI) systems for diagnosing esophageal squamous cell carcinoma (ESCC) compared with endoscopists. However, these findings don't reflect performance in clinical situations, as endoscopists classify lesions based on both magnified and non-magnified videos, while AI systems often use only a few magnified narrow band imaging (NBI) still images. We evaluated the performance of the AI system in simulated clinical situations. We used 25,048 images from 1433 superficial ESCC and 4746 images from 410 noncancerous esophagi to construct our AI system. For the validation dataset, we took NBI videos of suspected superficial ESCCs. The AI system diagnosis used one magnified still image taken from each video, while 19 endoscopists used whole videos. We used 147 videos and still images including 83 superficial ESCC and 64 non-ESCC lesions. The accuracy, sensitivity and specificity for the classification of ESCC were, respectively, 80.9% [95% CI 73.6-87.0], 85.5% [76.1-92.3], and 75.0% [62.6-85.0] for the AI system and 69.2% [66.4-72.1], 67.5% [61.4-73.6], and 71.5% [61.9-81.0] for the endoscopists. The AI system correctly classified all ESCCs invading the muscularis mucosa or submucosa and 96.8% of lesions ≥ 20 mm, whereas even the experts diagnosed some of them as non-ESCCs. Our AI system showed higher accuracy for classifying ESCC and non-ESCC than endoscopists. It may provide valuable diagnostic support to endoscopists.Entities:
Mesh:
Year: 2022 PMID: 35461350 PMCID: PMC9035159 DOI: 10.1038/s41598-022-10739-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flowchart of construction of AI and validation dataset. AI artificial intelligence, ESCC esophageal squamous cell carcinoma, CRT chemoradiotherapy, EGJ esophagogastric junction, ESD endoscopic submucosal dissection, IN intraepithelial neoplasia, LGIN low-grade intraepithelial neoplasia.
Summary of the validation dataset.
| ESCC | 83 |
|---|---|
| Tumor size in mm, < 5/5 ~ 10/10 ~ 20/20 < | 3/20/32/28 |
| Macroscopic type, 0-I, IIa/0-IIb, IIc | 7/7 |
| Depth of tumora, EP-LPM/MM/SM1/SM2–3 | 52/12/2/7 |
| Non-cancerous lesions | 64 |
| lesion size in mm, < 5/5 ~ 10/10 ~ 20/20< | 14/31/16/3 |
| LGIN/Atypical/Esophagitis/Papilloma/Others | 9/10/24/1/20 |
ESCC esophageal squamous cell carcinoma, EP epithelium, LPM lamina propria, MM muscularis mucosa, SM submucosa, LGIN low-grade intraepithelial neoplasia.
aTen ESCCs diagnosed by biopsy were excluded from analysis of invasion depth.
Diagnostic performance of the AI system and the endoscopists.
| AI system [95% CI] | All endoscopists (n = 19) (average) [95% CI] | Experts (n = 13) (average) [95% CI] | Non-experts (n = 6) (average) [95% CI] | |
|---|---|---|---|---|
| Accuracy | 80.9% (119/147) [73.6–87.0] | 69.2% [66.6–71.9] | 69.9% [66.5–73.2] | 67.9% [61.9–73.9] |
| Sensitivity | 85.5% (71/83) [76.1–92.3] | 67.5% [61.9–73.1] | 68.5% [60.9–76.1] | 65.5% [55.0–75.9] |
| Specificity | 75.0% (48/64) [62.6–85.0] | 71.5% [62.6–80.3] | 71.6% [59.7–84.8] | 71.1% [53.5–88.7] |
| Positive predictive value | 81.6% (71/87) [71.9–89.1] | 76.1% [72.8–82.5] | 78.4% [72.0–84.8] | 76.1% [66.1–86.0] |
AI artificial intelligence, CI confidence interval.
Accuracy of the AI system and endoscopist by pathological diagnosis and size of lesion.
| AI system (n), [95% CI] | Endoscopistsa | |||
|---|---|---|---|---|
| All [95% CI] | Experts [95% CI] | Non-experts [95% CI] | ||
| ESCC: 83 cases | 85.5% (71/83) [76.1–92.3] | 67.5% [61.9–73.2] | 68.5% [60.8–76.1] | 65.5% [55.0–75.9] |
| pEP/LPM: 52 cases | 77.4% (41/52) [65.3–88.9] | 57.2% [50.6–65.6] | 57.9% [48.8–69.5] | 55.7% [42.6–68.9] |
| pMM/SM1/SM2: 21 cases | 100% (21/21) [83.9–100] | 89.0% [84.6–93.4] | 89.7% [84.1–95.4] | 87.3% [77.5–97.1] |
| Non-ESCC: 64 cases | 75.0% (48/64) [62.6–85.0] | 71.5% [62.6–80.3] | 71.6% [59.7–83.6] | 71.1% [53.5–88.7] |
| LGIN/atypical: 19 cases | 68.4% (13/19) [43.5–87.4] | 64.8% [55.9–74.3] | 64.0% [51.4–77.3] | 66.7% [52.0–81.3] |
| Esophagitis/papilloma/others: 45 cases | 77.8% (35/45) [62.9–88.9] | 74.3% [66.8–84.8] | 74.9% [64.6–88.6] | 73.0% [55.4–92.8] |
| < 10 mm: 68 cases | 72.1% (49/68) [59.9–82.3] | 68.0% [64.2–73.4] | 67.6% [62.2–75.3] | 68.9% [61.4–76.3] |
| ≥ 10 mm: 79 cases | 88.6% (70/79) [79.5–94.7] | 70.3% [67.2–74.5] | 71.8% [68.2–77.1] | 67.1% [59.8–74.4] |
| ≥ 20 mm: 31 cases | 96.8% (30/31) [83.3–99.9] | 82.7% [78.3–87.4] | 84.1% [78.5–90.2] | 79.6% [70.1–89.0] |
AI artificial intelligence, CI confidence interval, ESCC esophageal squamous cell carcinoma, EP epithelium, LPM lamina propria, MM muscularis mucosa, SM submucosa, LGIN low-grade intraepithelial neoplasia.
aPercentages shown are averages.
Lesions correctly classified by the AI system but by less than 30% of the endoscopists.
| ESCC | Non-ESCC | Total | |
|---|---|---|---|
| < 10 mm | 3 | 0 | 3 (33.3%) |
| 10 mm ≤ | 5 | 1 | 6 (66.7%) |
| Total | 8 (88.9%) | 1 (11.1%) | 9 |
ESCC esophageal squamous cell carcinoma.
Figure 2ESCCs cases correctly classified by the AI system but diagnosed as non-ESCCs by more than 70% of the endoscopists. These lesions showed fainter background coloration and slightly dilatated intrapapillary capillary loops.