| Literature DB >> 35879649 |
Quchuan Zhao1, Qing Jia2, Tianyu Chi3.
Abstract
BACKGROUND AND AIMS: Chronic atrophic gastritis (CAG) is a precancerous disease that often leads to the development of gastric cancer (GC) and is positively correlated with GC morbidity. However, the sensitivity of the endoscopic diagnosis of CAG is only 42%. Therefore, we developed a real-time video monitoring model for endoscopic diagnosis of CAG based on U-Net deep learning (DL) and conducted a prospective nested case-control study to evaluate the diagnostic evaluation indices of the model and its consistency with pathological diagnosis.Entities:
Keywords: Artificial intelligence; Chronic atrophic gastritis; Deep learning; Gastroscopy; U-Net
Mesh:
Year: 2022 PMID: 35879649 PMCID: PMC9310473 DOI: 10.1186/s12876-022-02427-2
Source DB: PubMed Journal: BMC Gastroenterol ISSN: 1471-230X Impact factor: 2.847
Baseline characteristics before and after propensity score matching
| Characteristic | Before matching | After matching | ||||
|---|---|---|---|---|---|---|
| CAG (n = 338) (%) | CNAG (n = 793) (%) | Standardized difference | CAG (n = 338) (%) | CNAG (n = 338) (%) | Standardized difference | |
| Sex (%) | − 0.0875 | − 0.0266 | ||||
| Male | 70.4 | 61.7 | 70.4 | 67.8 | ||
| Female | 29.6 | 38.3 | 29.6 | 32.2 | ||
| Age | 0.0158 | 0.0030 | ||||
| Distribution (%) | ||||||
| < 40 yrs | 8.9 | 10.6 | 8.9 | 9.8 | ||
| 40–59 yrs | 48.8 | 46.4 | 48.8 | 45.3 | ||
| 60–75 yrs | 33.4 | 34.7 | 33.4 | 38.2 | ||
| > 75 yrs | 8.9 | 8.3 | 8.9 | 6.8 | ||
| Indication (%) | − 0.0370 | 0.0030 | ||||
| Screening | 37.9 | 34.2 | 37.9 | 38.2 | ||
| Diagnosis | 62.1 | 65.8 | 62.1 | 61.8 | ||
| HP (%) | − 0.0149 | 0.0059 | ||||
| Yes | 26.6 | 28.1 | 26.6 | 26.0 | ||
| No | 73.4 | 71.9 | 73.4 | 74.0 | ||
| Smoking (%) | 0.0294 | − 0.0207 | ||||
| Yes | 31.1 | 28.1 | 31.1 | 33.1 | ||
| No | 68.9 | 71.9 | 68.9 | 66.9 | ||
| Drinking (%) | − 0.0480 | − 0.0030 | ||||
| Yes | 21.3 | 26.1 | 21.3 | 21.6 | ||
| No | 78.7 | 73.9 | 78.7 | 78.4 | ||
| HT (%) | − 0.0150 | 0.0059 | ||||
| Yes | 32.5 | 34.0 | 32.5 | 32.0 | ||
| No | 67.5 | 66.0 | 67.5 | 68.0 | ||
| CHD (%) | − 0.0326 | 0.0059 | ||||
| Yes | 25.7 | 29.0 | 25.7 | 25.1 | ||
| No | 74.3 | 71.0 | 74.3 | 74.9 | ||
| Diabetes (%) | 0.0026 | 0.0030 | ||||
| Yes | 24.9 | 24.6 | 24.9 | 24.5 | ||
| No | 75.1 | 75.4 | 75.1 | 75.5 | ||
CAG chronic atrophic gastritis, CNAG chronic nonatrophic gastritis, HP helicobacter pylori, HT hypertension, CHD coronary heart disease
Fig. 1Flow chart of the identification of the study sample
Diagnostic evaluation indices and the evaluation of consistency with pathological diagnosis in the deep learning group and endoscopist group before and after propensity score matching
| CAG versus CNAG | Before matching (338 vs. 793) | After matching (338 vs. 338) | ||
|---|---|---|---|---|
| DL | Endoscopist | DL | Endoscopist | |
| Sensitivity | 84.02% | 62.72% | 84.02% | 62.72% |
| Specificity | 96.34% | 80.45% | 97.04% | 81.95% |
| PV+ | 90.73% | 57.77% | 96.60% | 77.66% |
| PV− | 93.40% | 83.51% | 85.86% | 68.73% |
| Accuracy | 92.66% | 75.15% | 90.53% | 72.34% |
| Youden index | 80.36% | 43.17% | 81.06% | 44.67% |
| Odd product | 91.71 | 6.93 | 172.5 | 7.64 |
| LR + | 22.96 | 3.21 | 28.39 | 3.47 |
| LR− | 0.17 | 0.46 | 0.16 | 0.45 |
| AUC (95% CI) | 0.906 (0.882–0.930) | 0.735 (0.700–0.769) | 0.909 (0.884–0.934) | 0.740 (0.702–0.778) |
| Kappa | 0.842 | 0.492 | 0.852 | 0.558 |
DL deep learning, PV+ positive predictive value, PV− negative predictive value, LR+ positive likelihood ratio, LR− negative likelihood ratio
Fig. 2The diagnostic performance comparison between DL group and endoscopist group when taking pathological diagnosis as the gold standard. A Partial AUC (The black shaded part) at the sensitivity ≥ 0.8 for DL group. B Partial AUC (The dark grey shaded part) at the sensitivity ≥ 0.8 for endoscopist group. C Partial AUC ((The black shaded part) at the specificity ≥ 0.8 for DL group. D Partial AUC (The dark grey shaded part) at the specificity ≥ 0.8 for endoscopist group. E ROC curves for DL group and endoscopist group respectively. After matching and taking pathological diagnosis as the gold standard, the diagnostic evaluation indices and the evaluation of consistency with pathological diagnosis in the DL group were better than those in the endoscopist group
Diagnostic evaluation indices in the deep learning group and endoscopist group after propensity score matching in subgroups for the severity of CAG
| Mild CAG versus CNAG (104 vs. 338) | Moderate CAG versus CNAG (147 vs. 338) | Severe CAG versus CNAG (87 vs. 338) | ||||
|---|---|---|---|---|---|---|
| DL | Endoscopist | DL | Endoscopist | DL | Endoscopist | |
| Sensitivity | 72.12% | 39.42% | 85.71% | 62.59% | 95.40% | 90.80% |
| Specificity | 97.04% | 81.95% | 97.04% | 81.95% | 97.04% | 81.95% |
| PV+ | 88.34% | 40.20% | 92.65% | 60.13% | 89.25% | 56.43% |
| PV− | 91.88% | 81.47% | 93.98% | 83.43% | 98.80% | 97.19% |
| Accuracy | 91.18% | 71.95% | 93.61% | 76.08% | 96.71% | 83.76% |
| Youden index | 69.16% | 21.37% | 82.75% | 44.54% | 92.44% | 72.75% |
| Odd product | 84.83 | 2.96 | 196.8 | 7.6 | 680.6 | 44.84 |
| LR+ | 24.36 | 2.18 | 28.96 | 3.47 | 32.23 | 5.03 |
| LR− | 0.29 | 0.74 | 0.15 | 0.46 | 0.05 | 0.11 |
DL deep learning, PV+ positive predictive value, PV− negative predictive value, LR+ positive likelihood ratio, LR− negative likelihood ratio
Diagnostic evaluation indices in the deep learning group and endoscopist group before propensity score matching in subgroups for the severity of CAG
| Mild CAG versus CNAG (104 vs. 793) | Moderate CAG versus CNAG (147 vs. 793) | Severe CAG versus CNAG (87 vs. 793) | ||||
|---|---|---|---|---|---|---|
| DL | Endoscopist | DL | Endoscopist | DL | Endoscopist | |
| Sensitivity | 72.12% | 39.42% | 85.71% | 62.59% | 95.40% | 90.80% |
| Specificity | 96.34% | 80.45% | 96.34% | 80.45% | 96.34% | 80.45% |
| PV+ | 72.12% | 20.92% | 81.29% | 37.25% | 74.11% | 33.76% |
| PV− | 96.34% | 91.01% | 97.32% | 92.06% | 99.48% | 98.76% |
| Accuracy | 93.53% | 75.70% | 94.68% | 77.66% | 96.25% | 81.48% |
| Youden index | 68.46% | 19.87% | 82.05% | 43.04% | 91.74% | 71.25% |
| Odd product | 68.13 | 2.68 | 158.07 | 6.89 | 546.66 | 40.65 |
| LR+ | 19.70 | 2.02 | 23.42 | 3.2 | 26.07 | 4.64 |
| LR− | 0.29 | 0.75 | 0.15 | 0.47 | 0.05 | 0.11 |
DL deep learning, PV+ positive predictive value, PV− negative predictive value, LR+ positive likelihood ratio, LR− negative likelihood ratio