| Literature DB >> 35372443 |
Guangcong Ruan1, Jing Qi2, Yi Cheng1, Rongbei Liu3, Bingqiang Zhang4, Min Zhi5, Junrong Chen5, Fang Xiao6, Xiaochun Shen1, Ling Fan1, Qin Li1, Ning Li1, Zhujing Qiu1, Zhifeng Xiao1, Fenghua Xu1, Linling Lv1, Minjia Chen1, Senhong Ying1, Lu Chen1, Yuting Tian1, Guanhu Li1, Zhou Zhang3, Mi He2, Liang Qiao2, Zhu Zhang2, Dongfeng Chen1, Qian Cao3, Yongjian Nian2, Yanling Wei1.
Abstract
Background and Aim: The identification of ulcerative colitis (UC) and Crohn's disease (CD) is a key element interfering with therapeutic response, but it is often difficult for less experienced endoscopists to identify UC and CD. Therefore, we aimed to develop and validate a deep learning diagnostic system trained on a large number of colonoscopy images to distinguish UC and CD.Entities:
Keywords: artificial intelligence; colonoscopy; convolutional neural network; deep learning; inflammatory bowel disease
Year: 2022 PMID: 35372443 PMCID: PMC8974241 DOI: 10.3389/fmed.2022.854677
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Graphic abstract of the study.
Figure 2Workflow diagram of the development and evaluation of the deep learning model. UC, ulcerative colitis; CD, Crohn's disease; CNN, convolutional neural network.
Patient and image characteristics in the training and validation datasets.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Males, | 238 (42.81%) | 202 (39.45%) | 192 (37.80%) | |||
| BBPS (mean, median, range) | 2.85, 3, (1-3) | 2.61, 3, (1-3) | 2.50, 3, (1-3) | 1.92, 2, (1-3) | 2.4, 2, (1-3) | 2.0, 2, (1-3) |
| Disease duration, y, mean (SD) |
| 2.58 (1.24) | 2.38 (1.11) | 4.63 (2.35) | 4.11 (1.17) | |
|
| ||||||
| Diarrhea, |
|
| 346 (78.64%) | 61 (86.11%) | 353 (79.50%) | 43 (67.19%) |
| Abdominal pain, |
|
| 322 (73.18%) | 56 (77.78%) | 262 (66.00%) | 49 (76.56%) |
| CDAI, mean (SD) |
|
|
|
| 277.56 (104.00) | 284.06 (88.01) |
| SES-CD, mean (SD) |
|
|
|
| 6.52 (2.52) | 5.75 (2.66) |
|
| ||||||
| 1, |
|
| 94 (21.36%) | 10 (13.89%) |
|
|
| 2, |
|
| 48 (10.91%) | 11 (15.28%) |
|
|
| 3, |
|
| 298 (67.73%) | 51 (70.83%) |
|
|
| Number of images per patient, mean (SD) | 23 (2.4) | 21 (1.7) | 21 (1.2) | |||
UC, ulcerative colitis; CD, Crohn's disease, BBPS, Boston bowel preparation score; y, year; SD, standard deviation.
Figure 3ROC curves achieved by deep model ROC of the three categories of deep learning model in the test set. (A) The test results of three groups of patients and (B) the test results of the lesions (signal picture). AUC represents the area under the ROC curve, and in parentheses is the 95% confidence interval of AUC. AUC, the area under the receiver operating characteristic curve; ROC, receiver operating characteristic.
Comparison of classification performance between deep model and endoscopy doctors (per patient and per lesion).
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| |||||
| Accuracy (95%CI) | 0.991 (0.967–0.997) | 0.780 (0.720–0.830) | <0.001 | 0.922 (0.879–0.951) | <0.001 |
|
| |||||
| Sensitivity (95%CI) | 1.000 (0.955–1.000) | 0.951 (0.881–0.981) | 0.125 | 1.000 (0.955–1.000) | 1.000 |
| Specificity (95%CI) | 1.000 (0.973–1.000) | 0.890 (0.826–0.932) | <0.001 | 0.985 (0.948–0.996) | 0.500 |
| PPV (95%CI) | 1.000 (0.955–1.000) | 0.839 (0.751–0.900) | <0.001 | 0.976 (0.917–0.993) | 0.497 |
| NPV (95%CI) | 1.000 (0.973–1.000) | 0.968 (0.921–0.987) | 0.051 | 1.000 (0.972–1.000) | 1.000 |
|
| |||||
| Sensitivity (95%CI) | 1.000 (0.949–1.000) | 0.694 (0.580–0.789) | <0.001 | 0.931 (0.848–0.970) | 0.219 |
| Specificity (95%CI) | 0.993 (0.962–0.999) | 0.870 (0.806–0.915) | <0.001 | 0.925 (0.870–0.957) | 0.002 |
| PPV (95%CI) | 0.986 (0.926–0.998) | 0.725 (0.610–0.816) | <0.001 | 0.859 (0.765–0.919) | 0.010 |
| NPV (95%CI) | 1.000 (0.974–1.000) | 0.852 (0.787–0.900) | <0.001 | 0.964 (0.919–0.985) | 0.197 |
|
| |||||
| Sensitivity (95%CI) | 0.969 (0.893–0.991) | 0.656 (0.534–0.761) | <0.001 | 0.812 (0.700–0.889) | 0.001 |
| Specificity (95%CI) | 1.000 (0.976–1.000) | 0.909 (0.853–0.945) | 0.001 | 0.974 (0.935–0.990) | 0.375 |
| PPV (95%CI) | 1.000 (0.942–1.000) | 0.750 (0.623–0.845) | <0.001 | 0.929 (0.830–0.972) | 0.285 |
| NPV (95%CI) | 0.987 (0.954–0.996) | 0.864 (0.803–0.909) | <0.001 | 0.926 (0.875–0.957) | 0.006 |
|
| |||||
| Accuracy (95%CI) | 0.904 (0.895–0.912) | 0.597 (0.583–0.610) | <0.001 | 0.699 (0.686–0.712) | <0.001 |
|
| |||||
| Sensitivity (95%CI) | 0.978 (0.971–0.984) | 0.831 (0.814–0.848) | <0.001 | 0.956 (0.946–0.964) | 0.090 |
| Specificity (95%CI) | 0.979 (0.973–0.984) | 0.757 (0.741–0.772) | <0.001 | 0.758 (0.743–0.773) | <0.001 |
| PPV (95%CI) | 0.967 (0.958–0.974) | 0.684 (0.665–0.703) | <0.001 | 0.714 (0.696–0.731) | <0.001 |
| NPV (95%CI) | 0.986 (0.981–0.990) | 0.877 (0.863–0.889) | <0.001 | 0.965 (0.956–0.971) | 0.001 |
|
| |||||
| Sensitivity (95%CI) | 0.928 (0.914–0.939) | 0.675 (0.652–0.698) | <0.001 | 0.687 (0.663–0.710) | <0.001 |
| Specificity (95%CI) | 0.878 (0.866–0.888) | 0.669 (0.653–0.685) | <0.001 | 0.807 (0.793–0.820) | <0.001 |
| PPV (95%CI) | 0.778 (0.759–0.796) | 0.486 (0.464–0.507) | <0.001 | 0.622 (0.599–0.645) | <0.001 |
| NPV (95%CI) | 0.963 (0.956–0.969) | 0.817 (0.802–0.831) | <0.001 | 0.848 (0.835–0.860) | <0.001 |
|
| |||||
| Sensitivity (95%CI) | 0.948 (0.935–0.958) | 0.206 (0.186–0.227) | <0.001 | 0.376 (0.352–0.402) | <0.001 |
| Specificity (95%CI) | 0.883 (0.872–0.893) | 0.960 (0.953–0.966) | <0.001 | 0.971 (0.964–0.976) | <0.001 |
| PPV (95%CI) | 0.773 (0.753–0.792) | 0.683 (0.638–0.725) | <0.001 | 0.844 (0.814–0.870) | 0.901 |
| NPV (95%CI) | 0.976 (0.970–0.980) | 0.742 (0.729–0.754) | <0.001 | 0.787 (0.774–0.799) | <0.001 |
UC, ulcerative colitis; CD, Crohn's disease; PPV, positive predictive value; NPV, negative predictive value; AUC, area under the receiver operating characteristic curve.
Figure 4Diagnostic performance of deep model and endoscopists in the test dataset. ROC curve for CD (A), UC (B) and normal (C) at the patient level, ROC curve for CD (D), UC (E), and normal (F) at lesions level. The blue stars indicate the diagnostic sensitivities and specificities of the trainee endoscopists, the green star indicates the pooled sensitivities and specificities of all trainee endoscopists, the yellow triangles indicate the diagnostic sensitivities and specificities of the competent endoscopists, and the red triangle indicates the pooled sensitivities and specificities of all competent endoscopists. AUC, the area under the receiver operating characteristic curve; ROC, receiver operating characteristic.
Comparison of time consumption for diagnosing the same test dataset (unit: s).
|
|
|
|
|
| |
|---|---|---|---|---|---|
| All lesion images | 12,862 | 11,326 | 6.00 | ||
| All patients | 2,421 | 2,429 | 6.20 |
Figure 5Confusion matrix and ROC curves for multicentre validation (per patient). Confusion matrix (A–C) and ROC curves (D–F) for three hospitals. (A,D) The First Affiliated Hospital of Chongqing Medical University. (B,E) The Sixth Affiliated Hospital of Sun Yat-sen University. (C,F) Tongji Hospital Affiliated with Huazhong University of Science and Technology. AUC, the area under the receiver operating characteristic curve; ROC, receiver operating characteristic.