| Literature DB >> 34565338 |
Sergei Bedrikovetski1,2, Nagendra N Dudi-Venkata3,4, Hidde M Kroon3,4, Warren Seow3, Ryash Vather4, Gustavo Carneiro5, James W Moore3,4, Tarik Sammour3,4.
Abstract
BACKGROUND: Artificial intelligence (AI) is increasingly being used in medical imaging analysis. We aimed to evaluate the diagnostic accuracy of AI models used for detection of lymph node metastasis on pre-operative staging imaging for colorectal cancer.Entities:
Keywords: Artificial intelligence; Colorectal cancer; Deep learning; Lymph node metastasis; Machine learning; Radiomics; meta-analysis
Mesh:
Year: 2021 PMID: 34565338 PMCID: PMC8474828 DOI: 10.1186/s12885-021-08773-w
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1PRISMA flow chart outlining the selection of studies for review
Fig. 2Summary of QUADAS-2 assessments of included studies
Results for deep learning radiomics models and radiologist in accuracy to detect lymph node metastasis
| First author | TP | FP | TN | FN | PPV, % | NPV, % | Sensitivity, % | Specificity, % | Accuracy, % | AUROC | 95%CI | Standard Error c |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | ||||||||||||
| Ding [ | – | – | – | – | – | – | – | – | – | 0.920 | 0.876–0.964 | 0.0224 |
| Wang [ | 40 | 4 | 58 | 5 | 90.9 | 92.1 | 88.9 | 93.5 | 91.6 | 0.912 c | 0.842–0.958 | 0.0296 |
| Glaser [ | – | – | – | – | – | – | – | – | – | 0.860 | – | – |
| | ||||||||||||
| Lu [ | – | – | – | – | – | – | – | – | – | 0.912 | – | – |
| Li [ | – | – | – | – | – | – | – | – | 94.4 | – | – | – |
| | ||||||||||||
| Eresen [ | 29 c | 6 c | 33 c | 10 c | 82.8 c | 76.7 c | 74.36 | 84.62 | 79.49 | 0.825 | 0.778–0.872 | 0.0240 |
| Li [ | 69 c | 44 c | 128 c | 67 c | 61.06 | 65.64 | 50.74 | 74.42 | 63.96 | 0.650 | 0.583–0.713 | 0.0331 |
| Yang [ | 13 c | 5 c | 21 c | 2 c | 73.2 c | 90.5 c | 85.0 | 82.0 | 83.0 | 0.780 | 0.630–0.920 | 0.0740 |
| Nakanishi [ | – | – | – | – | – | – | – | – | – | 0.900 | 0.800–0.990 | 0.0485 |
| Zhou [ | 24 c | 27 c | 74 c | 5 c | 47.1 | 93.7 | 82.8 | 73.3 | 75.4 | 0.818 | 0.731–0.905 | 0.0444 |
| Meng [ | 46 c | 36 c | 47 c | 17 c | 56.1 c | 73.4 c | 73.0 | 56.6 | 63.7 | 0.697 | 0.612–0.781 | 0.0431 |
| Chen [ | – | – | – | – | – | – | – | – | – | 0.857 | 0.726–0.989 | 0.0671 |
| Huang [ | – | – | – | – | – | – | – | – | – | 0.788 | 0.779–0.797 | 0.0046 |
| | ||||||||||||
| Zhu [ | 18 c | 21 c | 32 c | 1 c | 46.2 | 97.0 | 94.7 | 60.4 | 69.4 c | 0.812 | 0.703–0.895 | 0.0490 |
| Cai [ | – | – | – | – | – | – | 89 | 82 | 88 | – | – | – |
| Tse [ | – | – | – | – | – | – | – | – | 91.0 | – | – | – |
| Cui [ | 111 c | 17 c | 78 c | 14 c | 86.7 c | 85.0 c | 89 | 82 | 88 | 0.855 c | 0.801–0.898 c | 0.0247 |
| | ||||||||||||
| Li [ | 94 c | 63 c | 109 c | 42 c | 59.9 | 72.2 | 69.1 | 63.4 | 65.9 | 0.708 | 0.645–0.765 | 0.0306 |
| Eresen [ | 33 c | 23 c | 16 c | 6 c | 58.9 c | 72.7 c | 84.6 | 41.0 | 62.8 | 0.772 | 0.718–0.825 | 0.0273 |
| Yang [ | 41 | 41 | 43 | 14 | 50.0 | 75.4 | 74.6 | 51.2 | 60.4 | 0.629 c | 0.543–0.709 c | 0.0423 |
| Nakanishi [ | 71 | 0 | 147 | 29 | 100.0 | 83.5 | 71.0 | 100.0 | 88.3 c | 0.855 c | 0.805–0.896 c | 0.0232 |
| Zhou [ | 49 | 89 | 215 | 38 | 35.5 | 85.0 | 56.3 | 70.7 | 67.5 | 0.635 | 0.585–0.683 | 0.0250 |
| Meng [ | 88 c | 96 c | 124 c | 37 c | 47.8 c | 77.0 c | 55.9 | 70.4 | 61.3 | 0.632 c | 0.578–0.683 c | 0.0268 |
| Chen [ | – | – | – | – | – | – | – | – | – | 0.671 | 0.511–0.831 | 0.0816 |
| Huang [ | 58 c | 30 c | 69 c | 43 c | 65.9 c | 61.6 c | 57.4 c | 69.7 c | 63.5 c | 0.636 c | 0.565–0.702 c | 0.0349 |
| | ||||||||||||
| Cui [ | 39 c | 101 c | 52 c | 36 c | 27.7 c | 59.1 c | 52 c | 34 c | 39.9 c | 0.430 c | 0.365–0.497 c | 0.0337 |
a Values extracted from training set
b Values extracted from total cohort
c Manually derived/reconstructed values using formulas from Additional file 1: Table S2
d Values extracted from clinical models
FN, false negative; FP, false positive; TN, true negative and TP, true positive; PPV, positive predictive value; NPV, negative predictive value; AUROC, area under the receiver operating characteristic; CI, confidence interval
Pooled results of per-patient and per-node diagnosis from deep learning, radiomics and radiologists
| Variable | Studies analysed | Type of malignancy | No. of studies | Pooled results (95% CI) | Heterogeneity (I2, %) | Heterogeneity |
|---|---|---|---|---|---|---|
| AUROC per-patient | [ | Rectal | 2 | 0.917 (0.882–0.952) | 0.00 | 0.829 |
| Sensitivity per-patient | [ | Rectal | 3 | 0.776 (0.685–0.851) | 0.00 | 0.368 |
| Sensitivity per-node | [ | Rectal | 2 | 0.896 (0.834–0.941) | 0.00 | 0.393 |
| Specificity per-patient | [ | Rectal | 3 | 0.676 (0.608–0.739) | 75.4 | 0.017 |
| Specificity per-node | [ | Rectal | 2 | 0.743 (0.665–0.811) | 87.8 | 0.004 |
| AUROC per patient | [ | Colorectal | 2 | 0.727 (0.633–0.821) | 94.1 | < 0.0001 |
| AUROC per patient | [ | Rectal | 5 | 0.808 (0.739–0.876) | 63.3 | 0.028 |
| AUROC per node | [ | Rectal | 2 | 0.846 (0.803–0.890) | 0.00 | 0.433 |
| Sensitivity per-patient | [ | Colorectal | 2 | 0.641 (0.577–0.702) | 70.9 | 0.064 |
| Specificity per-patient | [ | Colorectal | 2 | 0.657 (0.597–0.713) | 11.1 | 0.289 |
| Sensitivity per-patient | [ | Rectal | 4 | 0.678 (0.628–0.726) | 57.5 | 0.070 |
| Specificity per-patient | [ | Rectal | 4 | 0.701 (0.667–0.733) | 97.8 | < 0.0001 |
| AUROC per-patient | [ | Colorectal | 2 | 0.676 (0.627–0.725) | 58.4 | 0.121 |
| AUROC per-patient | [ | Rectal | 5 | 0.688 (0.603 to 0.772) | 93.4 | < 0.0001 |
AUROC, area under the receiver operating characteristic; CI, confidence interval
Fig. 3Forest plots of per-patient area under the receiver operating characteristic curve (AUROC). (a) Deep learning in rectal cancer, (b) radiomics in rectal cancer, (c) radiomics in colorectal cancer, (d) radiologist in rectal cancer and (e) radiologist in colorectal cancer