| Literature DB >> 35778456 |
Christoph Römmele1, Robert Mendel2,3, Caroline Barrett4, Hans Kiesl5, David Rauber2, Tobias Rückert2, Lisa Kraus1, Jakob Heinkele1, Christine Dhillon6, Bianca Grosser6, Friederike Prinz1, Julia Wanzl1, Carola Fleischmann1, Sandra Nagl1, Elisabeth Schnoy1, Jakob Schlottmann1, Evan S Dellon4, Helmut Messmann1, Christoph Palm2,3, Alanna Ebigbo7.
Abstract
The endoscopic features associated with eosinophilic esophagitis (EoE) may be missed during routine endoscopy. We aimed to develop and evaluate an Artificial Intelligence (AI) algorithm for detecting and quantifying the endoscopic features of EoE in white light images, supplemented by the EoE Endoscopic Reference Score (EREFS). An AI algorithm (AI-EoE) was constructed and trained to differentiate between EoE and normal esophagus using endoscopic white light images extracted from the database of the University Hospital Augsburg. In addition to binary classification, a second algorithm was trained with specific auxiliary branches for each EREFS feature (AI-EoE-EREFS). The AI algorithms were evaluated on an external data set from the University of North Carolina, Chapel Hill (UNC), and compared with the performance of human endoscopists with varying levels of experience. The overall sensitivity, specificity, and accuracy of AI-EoE were 0.93 for all measures, while the AUC was 0.986. With additional auxiliary branches for the EREFS categories, the AI algorithm (AI-EoE-EREFS) performance improved to 0.96, 0.94, 0.95, and 0.992 for sensitivity, specificity, accuracy, and AUC, respectively. AI-EoE and AI-EoE-EREFS performed significantly better than endoscopy beginners and senior fellows on the same set of images. An AI algorithm can be trained to detect and quantify endoscopic features of EoE with excellent performance scores. The addition of the EREFS criteria improved the performance of the AI algorithm, which performed significantly better than endoscopists with a lower or medium experience level.Entities:
Mesh:
Year: 2022 PMID: 35778456 PMCID: PMC9249895 DOI: 10.1038/s41598-022-14605-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Endoscopic white light images of eosinophilic esophagitis showing furrows, exudates, edema, and rings.
Figure 2Endoscopic white light images of a normal esophagus.
Baseline characteristics of patients whose images were included in the study.
| EoE | Control | P | |
|---|---|---|---|
| Age (mean ± standard deviation) | 35.1 ± 19 | 31.9 ± 25 | 0.039 |
| Sex (Male/Female) | 39/22 (64%/36%) | 180/213 (46%/54%) | 0.006 |
| Dysphagia | 37 (61%) | 65 (17%) | 0.000 |
Figure 3ROC curves and AUC values of AI-EoE and AI-EREFS on the internal data set (InD).
Performance of human endoscopists and AI- models in diagnosing eosinophilic esophagitis on endoscopic white light images.
| Group 1 | Group 2 | Overall | |||||
|---|---|---|---|---|---|---|---|
| 1–100 | 101–200 | all data | 1–100 | 101–200 (after EREFS training) | All data | All data and both groups | |
| Sens | 0.46 | 0.66 | 0.56 | 0.40 | 0.58 | 0.49 | 0.53 |
| Spec | 1.00 | 0.94 | 0.97 | 0.46 | 0.96 | 0.71 | 0.84 |
| Accuracy | 0.73 | 0.80 | 0.77 | 0.43 | 0.77 | 0.66 | 0.68 |
| F1 | 0.63 | 0.77 | 0.70 | 0.41 | 0.72 | 0.55 | 0.63 |
| Sens | 0.90 | 0.84 | 0.87 | 0.60 | 0.66 | 0.63 | 0.75 |
| Spec | 0.96 | 0.98 | 0.97 | 0.92 | 0.98 | 0.95 | 0.96 |
| Accuracy | 0.93 | 0.91 | 0.92 | 0.76 | 0.82 | 0.79 | 0.86 |
| F1 | 0.93 | 0.90 | 0.92 | 0.71 | 0.79 | 0.75 | 0.83 |
| Sens | 0.94 | 0.98 | 0.96 | 0.88 | 0.98 | 0.93 | 0.95 |
| Spec | 1.00 | 0.98 | 0.99 | 0.68 | 0.52 | 0.60 | 0.80 |
| Accuracy | 0.97 | 0.98 | 0.97 | 0.78 | 0.75 | 0.77 | 0.87 |
| F1 | 0.97 | 0.98 | 0.97 | 0.80 | 0.80 | 0.80 | 0.89 |
Group 1 endoscopists relied on their clinical experience, while Group 2 was educated on the EREFS criteria for the second batch of images. AI-EoE was trained with binary classification, while AI-EoE-EREFS was trained additionally using auxiliary branches generated from the EREFS scores.
Figure 4ROC curves and AUC values of AI-EoE and AI-EoE-EREFS on the external data set (ExD) compared with human endoscopists who had varying experience levels.
Figure 5Features detected on input images by AI-EoE-EREFS are highlighted using Gradient-based visualization (Grad-CAM)[28]: the top left image shows the original endoscopic image with furrows, exudates, and rings; in the top right image, furrows are highlighted, while in the bottom left and bottom right images, exudates, and rings are highlighted, respectively.