| Literature DB >> 32377939 |
Qi He1, Sophia Bano2, Omer F Ahmad2, Bo Yang3, Xin Chen3, Pietro Valdastri4, Laurence B Lovat2, Danail Stoyanov2, Siyang Zuo5.
Abstract
PURPOSE: Upper gastrointestinal (GI) endoscopic image documentation has provided an efficient, low-cost solution to address quality control for endoscopic reporting. The problem is, however, challenging for computer-assisted techniques, because different sites have similar appearances. Additionally, across different patients, site appearance variation may be large and inconsistent. Therefore, according to the British and modified Japanese guidelines, we propose a set of oesophagogastroduodenoscopy (EGD) images to be routinely captured and evaluate its efficiency for deep learning-based classification methods.Entities:
Keywords: Artificial intelligence; Deep learning; Endoscopy; Gastroenterology
Mesh:
Year: 2020 PMID: 32377939 PMCID: PMC7316667 DOI: 10.1007/s11548-020-02148-5
Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN: 1861-6410 Impact factor: 2.924
Fig. 1Proposed workflow
Recommended images of landmarks in upper GI endoscopy
| Recommendations | Oesophagus | Stomach | Duodenum |
|---|---|---|---|
| British guideline (BSG and AUGIS [ | Proximal oesophagus, Z-line | Cardia and fundus on retroflexed view, body (taken from the upper part of the less curvature), angulus on partial retroflexion, antrum | Duodenal bulb, second part of the duodenum |
| Japanese guideline (Yao [ | Not defined | Four quadrants (L, G, A and P) of the fundus on retroflexed view, three quadrants (L, A and P) of middle-upper body and angulus on retroflexed view, four quadrants (L, G, A and P) of antrum, lower body and middle-upper body on forward view | Not defined |
| Proposed guideline (2019) | Pharynx oesophagus, gastroesophageal junction | Cardia and fundus on retroflexed view, middle-upper body on either forward and retroflexed view, lower body on forward view, angulus on retroflexed view, antrum on forward view | Duodenal bulb, duodenal descending |
L, lesser curvature; G, greater curvature; A, anterior wall; P, posterior wall
Fig. 2Samples of ROI extraction
Fig. 3Proposed anatomical classification guideline
Fig. 4Proportion of NA and 11 anatomical landmarks
Manually annotated (ground truth) labels of four training/test sets
| No. (cite) | NA | PX | ES | SJ | FS | MA | MR | AS | LB | AM | DB | DD |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 (proposed) | – | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| 1 (proposed) | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 |
| 2 ([ | – | – | 0 | 1 | 2 | 3 | – | 4 | – | 5 | 6 | 7 |
| 3 ([ | 0 | – | 1 | 2 | 3 | 4 | – | 5 | – | 6 | 7 | 8 |
–, does not exist; NA, unqualified; PX, pharynx; ES, oesophagus; SJ, squamocolumnar junction; FS, fundus; MA, middle-upper body antegrade view; MR, middle-upper body retroflex view; AS, angulus; LB, lower body; AM, antrum; DB, duodenal bulb; DD, duodenal descending
Overall accuracy (%) of five CNN models for four datasets
| No. (cite) | ResNet-50 | Inception-v3 | VGG-11-bn | VGG-16-bn | DenseNet-121 |
|---|---|---|---|---|---|
| 0 (proposed) | 90.75 | 91.04 | 89.29 | 90.41 | |
| 1 (proposed) | 82.53 | 82.40 | 82.10 | 82.24 | |
| 2 ([ | 93.11 | 93.00 | 93.50 | 93.90 | |
| 3 ([ | 84.51 | 84.62 | 85.23 | 85.23 | |
| Means | 87.72 | 87.97 | 87.43 | 87.81 | |
| STDs | 4.34 | 4.22 | 4.25 | 4.43 | 4.62 |
The bolded values are the best overall accuracy rates under each of the data arrangements
The F1-score (%) of DensetNet-121 on four datasets
| GL | NA | PX | ES | SJ | FS | MA | MR | AS | LB | AM | DB | DD |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | – | 93.54 | 91.90 | 89.40 | 92.76 | |||||||
| 1 | 68.28 | 79.25 | 88.35 | 82.92 | 90.03 | 84.12 | 74.50 | 80.82 | 52.71 | 87.98 | 80.31 | 93.76 |
| 2 | – | – | 94.02 | 88.42 | – | – | 88.63 | 94.22 | ||||
| 3 | – | 89.78 | 83.30 | 92.16 | 87.32 | – | 85.84 | – | 88.84 | 80.76 | 93.24 |
GL, guideline. The bolded values are the best F1-score rates for each of the landmarks
Fig. 5Confusion matrix for the model based on a the proposed guideline without NA and b the proposed guideline with NA. The actual labels are on the left side of the confusion matrix, and the predicted labels are on the bottom of the confusion matrix. a has 11 classes, and b has 12 classes. The values on the diagonal grids showed the recall rates (%) of each class, respectively, for the matrix. Referring to the colour of the colour bar and the corresponding number, the sample density in the confusion matrices is shown by colours
Fig. 6Confusion matrix for the model based on a the British guideline [1, 16] without NA and b the British guideline [1, 16] with NA. The actual labels are on the left side of the confusion matrix, and the predicted labels are on the bottom of the confusion matrix. a has eight classes and b has nine classes. The values on the diagonal grids showed the recall rates (%) of each class, respectively, for the matrix. Referring to the colour of the colour bar and the corresponding number, the sample density in the confusion matrices is shown by colours