| Literature DB >> 32936088 |
Chang Seok Bang1,2,3,4, Jae Jun Lee3,4,5, Gwang Ho Baik1,2.
Abstract
BACKGROUND: Helicobacter pylori plays a central role in the development of gastric cancer, and prediction of H pylori infection by visual inspection of the gastric mucosa is an important function of endoscopy. However, there are currently no established methods of optical diagnosis of H pylori infection using endoscopic images. Definitive diagnosis requires endoscopic biopsy. Artificial intelligence (AI) has been increasingly adopted in clinical practice, especially for image recognition and classification.Entities:
Keywords: Helicobacter pylori; artificial intelligence; convolutional neural network; deep learning; endoscopy; machine learning
Year: 2020 PMID: 32936088 PMCID: PMC7527948 DOI: 10.2196/21983
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Flow diagram of the identification of relevant studies.
Clinical characteristics of the included studies.
| Study, format, nationality | Type of AIa | Type of endoscopy, diagnostic method of | Number of cases in test dataset | Number of controls in test dataset | Age of patients in test dataset; gender in patients in test dataset (M/Fb) | TPc | FPd | FNe | TNf | Unit of analysis |
| Yasuda et al [ | Support vector machine | LCIg; more than 2 different tests in each case (histology, serum antibody, stool antigen, urea breath test) | 42 | 63 controls (46 posteradication patients and 17 uninfected patients) | Median 64 years (range 26-88); (61/44) | 38 | 9 | 4 | 54 | Patient-based |
| — | — | — | 210 | 315 control images (230 posteradication and 85 uninfected images) | — | 161 | 70 | 49 | 245 | Image-based |
| — | — | — | 210 | 85 uninfected images ( | — | 161 | 9 | 49 | 76 | Image-based (infected vs uninfected) |
| — | — | — | 210 | 230 posteradication images | — | 161 | 61 | 49 | 169 | Image-based (infected vs after-eradication) |
| — | — | — | 85 uninfected images | 230 posteradication images | — | 76 | 61 | 9 | 169 | Image-based (uninfected vs after-eradication) |
| Zheng et al [ | CNNh | WLIi; histology with immunohistochemistry (if negative, urea breath test was done) | 2575 | 1180 control images (whether posteradication or uninfected images is unknown) | Mean 48.6 years (SD 12.9); (220/232) | 2359 | 17 | 216 | 1163 | Image-based |
| Shichijo et al [ | CNN | WLI; serum or urine antibody, stool antigen, urea breath test | 70 | 777 controls (284 posteradication and 493 uninfected images) | — | 44 | 47 | 26 | 730 | Patient-based |
| — | — | — | 59 | 477 uninfected images ( | — | 44 | 12 | 15 | 465 | Image-based (infected vs uninfected) |
| — | — | — | 55 | 182 posteradication images | — | 44 | 35 | 11 | 147 | Image-based (infected vs after-eradication) |
| — | — | — | 481 uninfected images | 249 posteradication images | — | 465 | 102 | 16 | 147 | Image-based (uninfected vs after-eradication) |
| Nakashima et al [ | CNN | WLI; serum antibody ( | 30 | 30 controls (uninfected patients; | — | 20 | 12 | 10 | 18 | Patient-based |
| — | — | LCI | — | — | — | 29 | 1 | 1 | 29 | Patient-based |
| — | — | BLIj-bright | — | — | — | 29 | 4 | 1 | 26 | Patient-based |
| Itoh et al [ | CNN | WLI; serum antibody ( | 15 | 15 control images (uninfected patients; | — | 13 | 2 | 2 | 13 | Image-based |
| Shichijo et al [ | CNN | WLI; serum or urine antibody, stool antigen, urea breath test | 72 | 325 controls (uninfected patients; | mean 50.4 (SD 11.2), (168/226) | 64 | 41 | 8 | 284 | Patient-based |
| Huang et al [ | Sequential forward floating selection with SVMk | WLI; histology (3 pairs of samples from the topographic sites, including antrum, body, and cardia were obtained in a uniform way) | 130 | 106 controls (whether posteradication or uninfected patients is unknown) | — | 128 | 21 | 2 | 85 | Patient-based |
| Huang et al [ | Refined feature selection with neural network | WLI; histology (3 pairs of samples from the topographic sites, including antrum, body, and cardia were obtained in a uniform way) | 41 | 33 controls (whether posteradication or uninfected patients is unknown) | — | 35 | 3 | 6 | 30 | Patient-based |
aAI: artificial intelligence.
bM/F: make/female.
cTP: true positive.
dFP: false positive.
eFN: false negative.
fTN: true negative.
gLCI: linked color imaging.
hCNN: convolutional neural network.
iWLI: white-light imaging.
jBLI: blue-laser imaging.
kSVM: support vector machine.
Figure 2Quality Assessment of Diagnostic Accuracy Studies–2 for the assessment of the methodological qualities of all the enrolled studies. (+) denotes low risk of bias, (?) denotes unclear risk of bias, (-) denotes high risk of bias.
Summary of diagnostic test accuracy and subgroup analysis of the included studies with patient-based analysis.
| Subgroup | Number of included studies | Sensitivity (95% CI) | Specificity (95% CI) | PLRa | NLRb | DORc | AUCd | |
| Value of meta-analysis in all included studies | 6 | 0.87 (0.72-0.94) | 0.86 (0.77-0.92) | 6.2 (3.8-10.1) | 0.15 (0.07-0.34) | 40 (15-112) | 0.92 (0.90-0.94) | |
|
|
|
|
|
|
|
|
| |
|
| High quality | 5 | 0.89 (0.75-0.96) | 0.88 (0.83-0.92) | 7.7 (5.6-10.6) | 0.12 (0.05-0.28) | 64 (32-129) | 0.94 (0.91-0.95) |
|
| Low quality | 1 | Null | Null | Null | Null | Null | Null |
|
|
|
|
|
|
|
|
| |
|
| ≤100 | 4 | 0.90 (0.73-0.97) | 0.88 (0.81-0.93) | 7.6 (5.3-10.9) | 0.11 (0.04-0.32) | 68 (29-158) | 0.94 (0.91-0.95) |
|
| <100 | 2 | Null | Null | Null | Null | Null | Null |
|
|
|
|
|
|
|
|
| |
|
| Retrospective | 4 | 0.90 (0.73-0.97) | 0.88 (0.81-0.93) | 7.6 (5.3-10.9) | 0.11 (0.04-0.32) | 68 (29-158) | 0.94 (0.91-0.95) |
|
| Prospective | 2 | Null | Null | Null | Null | Null | Null |
|
|
|
|
|
|
|
|
| |
|
| After 2010 | 4 | 0.80 (0.64-0.90) | 0.86 (0.73-0.93) | 5.6 (2.8-11.3) | 0.24 (0.13-0.45) | 23 (8-72) | 0.90 (0.87-0.92) |
|
| Before 2010 | 2 | Null | Null | Null | Null | Null | Null |
|
|
|
|
|
|
|
|
| |
|
| Neural network–based | 4 | 0.78 (0.64-0.87) | 0.87 (0.74-0.94) | 6.0 (2.7-13.0) | 0.26 (0.15-0.44) | 23 (7-73) | 0.89 (0.86-0.91) |
|
| SVMg-based | 2 | Null | Null | Null | Null | Null | Null |
|
|
|
|
|
|
|
|
| |
|
| WLIh | 5 | 0.86 (0.67-0.95) | 0.86 (0.75-0.92) | 6.1 (3.4-10.9) | 0.16 (0.06-0.42) | 37 (11-124) | 0.92 (0.89-0.94) |
|
| LCIi | 1 | Null | Null | Null | Null | Null | Null |
| Classifying performance between | 2 | 0.82 (0.74-0.89) | 0.85 (0.81-0.89) | 3.5 (0.8-14.3) | 0.27 (0.05-1.41) | 13 (0.8-229) | Null | |
aPLR: positive likelihood ratio.
bNLR: negative likelihood ratio.
cDOR: diagnostic odds ratio.
dAUC: area under the curve.
eThese modifiers were significant in the meta-regression analysis.
AI: artificial intelligence.
SVM: support vector machine.
WLI: white-light imaging.
LCI: linked color imaging.
Figure 3Forest plots of sensitivity and specificity of artificial intelligence algorithm for the prediction of Helicobacter pylori infection in endoscopic images.
Figure 4Summary receiver operating characteristic curve with 95% confidence region and prediction region for the prediction of Helicobacter pylori infection in endoscopic images.
Figure 5Fagan normogram for the prediction of Helicobacter pylori infection in endoscopic images.
Summary of diagnostic test accuracy and subgroup analysis of the included studies with image-based analysis.
| Subgroup | Number of included studies | Sensitivity (95% CI) | Specificity (95% CI) | PLRa | NLRb | DORc | AUCd | ||||||||
| Value of meta-analysis in all the included (bivariate and HSROCe method) | 4 | 0.81 (0.68-0.90) | 0.93 (0.82-0.98) | 12.3 (3.8-39.2) | 0.20 (0.11-0.38) | 61 (11-322) | 0.93 (0.90-0.95) | ||||||||
| Value of meta-analysis in all the included (Moses-Shapiro-Littenberg method) |
| 0.90 (0.89-0.91) | 0.94 (0.93-0.95) | 11.1 (1.6-76.2) | 0.20 (0.08-0.52) | 56 (5-591) | 0.90 (0.71-0.99) | ||||||||
|
|
|
|
|
|
|
|
| ||||||||
|
| High quality | 3 | 0.90 (0.87-0.91) | 0.94 (0.93-0.95) | 13.1 (1.4-124.5) | 0.22 (0.08-0.62) | 61 (4-919) | 0.87 (0.43-0.99) | |||||||
|
| Low quality | 1 | Null | Null | Null | Null | Null | Null | |||||||
|
|
|
|
|
|
|
|
| ||||||||
|
| ≤100 | 3 | 0.90 (0.87-0.91) | 0.94 (0.93-0.95) | 13.1 (1.4-124.5) | 0.22 (0.08-0.62) | 61 (4-919) | 0.87 (0.43-0.99) | |||||||
|
| <100 | 1 | Null | Null | Null | Null | Null | Null | |||||||
|
|
|
|
|
|
|
|
| ||||||||
|
| Retrospective | 3 | 0.90 (0.87-0.91) | 0.94 (0.93-0.95) | 13.1 (1.4-124.5) | 0.22 (0.08-0.62) | 61 (4-919) | 0.87 (0.43-0.99) | |||||||
|
| Prospective | 1 | Null | Null | Null | Null | Null | Null | |||||||
|
|
|
|
|
|
|
|
| ||||||||
|
| After 2010 | 4 | 0.90 (0.89-0.91) | 0.94 (0.93-0.95) | 11.1 (1.6-76.2) | 0.20 (0.08-0.52) | 56 (5-591) | 0.90 (0.71-0.99) | |||||||
|
| Before 2010 | 0 |
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
|
| ||||||||
|
| Neural network–based | 3 | 0.91 (0.90-0.92) | 0.97 (0.96-0.97) | 16.8 (2.0-141.7) | 0.17 (0.05-0.61) | 98 (6-1640) | 0.95 (0.75-0.99) | |||||||
|
| SVMg–based | 1 | Null | Null | Null | Null | Null | Null | |||||||
|
|
|
|
|
|
|
|
| ||||||||
|
| WLIh | 3 | 0.91 (0.90-0.92) | 0.97 (0.96-0.97) | 16.8 (2.0-141.7) | 0.17 (0.05-0.61) | 98 (6-1640) | 0.95 (0.75-0.99) | |||||||
|
| LCIi | 1 | Null | Null | Null | Null | Null | Null | |||||||
| Classifying performance between | 3 | 0.77 (0.71-0.82) | 0.96 (0.94-0.98) | 11.8 (3.7-38.3) | 0.26 (0.21-0.32) | 53 (17-161) | 0.88 (0.79-0.96) | ||||||||
aPLR: positive likelihood ratio.
bNLR: negative likelihood ratio.
cDOR: diagnostic odds ratio.
dAUC: area under the curve.
eHSROC: hierarchical summary receiver operating characteristic.
fAI: artificial intelligence.
gSVM: support vector machine.
hWLI: white-light imaging.
iLCI: linked color imaging.
Figure 6Meta-regression for the reason of heterogeneity in the diagnostic test accuracy meta-analysis. nopt: number of patients.
Figure 7Deek funnel plot for the studies of patient-based analysis.
Figure 8Deek funnel plot for the studies of image-based analysis.