| Literature DB >> 36193283 |
Yu Xue1,2, Ying Zhou1,2, Tingrui Wang1,2, Huijuan Chen1,2, Lingling Wu1,2, Huayun Ling1,2, Hong Wang1,2, Lijuan Qiu1,2, Dongqing Ye1,2, Bin Wang1,2.
Abstract
Background: Ultrasonography (US) is the most common method of identifying thyroid nodules, but US images require an experienced surgeon for identification. Many artificial intelligence (AI) techniques such as computer-aided diagnostic systems (CAD), deep learning (DL), and machine learning (ML) have been used to assist in the diagnosis of thyroid nodules, but whether AI techniques can improve the diagnostic accuracy of thyroid nodules still needs to be explored. Objective: To clarify the accuracy of AI-based thyroid nodule US images for differentiating benign and malignant thyroid nodules.Entities:
Year: 2022 PMID: 36193283 PMCID: PMC9525757 DOI: 10.1155/2022/9492056
Source DB: PubMed Journal: Int J Endocrinol ISSN: 1687-8337 Impact factor: 2.803
Figure 1PRISMA diagram for the systematic review.
Basic characteristics of included studies.
| References | Year | Country | Design | Methodology | Sample size | Mean age | B | M | Sen | Spec |
|---|---|---|---|---|---|---|---|---|---|---|
| Gild et al. [ | 2022 | Australia | R | ML | 91 | 60.10 | 55 | 36 | 0.82 | 0.59 |
| Zhu et al. [ | 2021 | China | R | DL | 600 | 55.20 | 300 | 300 | 0.82 | 0.81 |
| Han et al. [ | 2021 | Korea | R | CAD | 454 | 49.50 | 287 | 167 | 0.95 | 0.30 |
| Zhong Liu [ | 2021 | China | R | DL | 175 | 44.34 | 67 | 96 | 0.89 | 0.91 |
| Fengying Ye [ | 2021 | China | P | CAD | 565 | 54.10 | 270 | 295 | 0.76 | 0.60 |
| Chong-Ke Zhao [ | 2021 | China | R | ML | 223 | 48.85 | 136 | 80 | 0.89 | 0.77 |
| G.R. Kim [ | 2021 | Korea | P | DL | 760 | 51.00 | 584 | 176 | 0.82 | 0.86 |
| Xi Wei [ | 2020 | China | R | DL | 7 216 | 45.29 | 2712 | 4504 | 0.89 | 0.86 |
| Yichun Zhang [ | 2020 | China | R | CAD | 365 | 46.40 | 179 | 186 | 0.72 | 0.86 |
| Marcin Barczyński [ | 2020 | Poland | P | CAD | 50 | 47.50 | 40 | 10 | 0.90 | 0.80 |
| Heng Ye [ | 2020 | China | R | DL | 1 601 | 45.16 | 861 | 740 | 0.87 | 0.86 |
| Daniele Fresilli [ | 2020 | Italy | R | CAD | 107 | 55.00 | 80 | 27 | 0.70 | 0.88 |
| Hui Zhou [ | 2020 | China | R | DL | 1097 | 47.30 | 669 | 428 | 0.90 | 0.83 |
| Chao Sun [ | 2020 | China | R | DL | 550 | 43.00 | 128 | 422 | 0.96 | 0.83 |
| Lei Wang [ | 2019 | China | R | DL | 351 | 45.76 | 109 | 242 | 0.91 | 0.90 |
| Hye Lin Kim [ | 2019 | Korea | R | CAD | 218 | 48.00 | 132 | 86 | 0.80 | 0.83 |
| Xia et al. [ | 2019 | China | P | CAD | 180 | 47.20 | 85 | 95 | 0.91 | 0.41 |
| Jeong et al. [ | 2019 | Korea | P | CAD | 100 | 46.00 | 56 | 44 | 0.89 | 0.84 |
| Zhang et al. [ | 2019 | China | R | ML | 1 238 | 45.25 | 788 | 450 | 0.97 | 0.95 |
| Park et al. [ | 2019 | Korea | R | DL | 286 | 47.18 | 130 | 156 | 0.91 | 0.80 |
| Ko et al. [ | 2019 | Korea | R | DL | 439 | 46.70 | 143 | 296 | 0.84 | 0.90 |
| Buda et al. [ | 2019 | USA | R | DL | 99 | 52.20 | 84 | 15 | 0.87 | 0.52 |
| Yoo et al. [ | 2018 | Korea | P | CAD | 117 | 43.20 | 67 | 50 | 0.80 | 0.96 |
| Choi et al. [ | 2017 | Korea | P | CAD | 102 | 45.30 | 59 | 43 | 0.91 | 0.75 |
| Zhu et al. [ | 2013 | China | R | DL | 464 | 47.70 | 187 | 277 | 0.85 | 0.79 |
P, prospective; R, retrospective; B, benign; M, malignant; Sen, sensitivity; Spec, specificity.
Figure 2Methodological quality of the included studies: the summary of risk of bias and applicability concerns for the included studies.
Figure 3Methodological quality of the included studies: the quality of individual studies.
Summary performance estimates.
| Parameter | Estimates | 95% CI |
|---|---|---|
| Sensitivity | 0.88 | 0.85–0.90 |
| Specificity | 0.81 | 0.74–0.86 |
| PLR | 4.5 | 3.4–6.1 |
| NLR | 0.15 | 0.12–0.19 |
| DOR | 30 | 19–46 |
PLR, positive likelihood ratio; NLR, negative likelihood ratio; DOR, diagnostic odds ratio.
Figure 4Summary receiver operating characteristic (SROC) curves of AI-aided diagnostic techniques for the diagnosis of TN.
Figure 5Forest plot of the comprehensive sensitivity and specificity of AI-aided diagnostic techniques for diagnosing TN.
Figure 6Meta-regression analysis of different study designs, methodologies, sample sizes and mean ages.
Meta-regression for heterogeneity within studies.
| Parameter | Number of studies | Sensitivity estimates (95% CI) |
| Specificity estimates (95% CI) |
|
|
|
|---|---|---|---|---|---|---|---|
| Design | |||||||
| P | 7 | 0.85 (0.79–0.92) | <0.001 | 0.78 (0.66–0.90) | 0.02 | 0% (0%–100%) | 0.55 |
| R | 18 | 0.88 (0.85–0.91) | 0.82 (0.75–0.88) | ||||
|
| |||||||
| Methodology | |||||||
| DL | 15 | 0.89 (0.87–0.92) | <0.001 | 0.84 (0.77–0.90) | 0.14 | 60% (11%–100%) | 0.08 |
| CAD | 10 | 0.84 (0.79–0.90) | 0.75 (0.65–0.86) | ||||
|
| |||||||
| Sample size | |||||||
| ≥500 | 8 | 0.89 (0.85–0.93) | <0.001 | 0.84 (0.77–0.92) | 0.09 | 0% (0%–100%) | 0.38 |
| <500 | 17 | 0.87 (0.83–0.90) | 0.78 (0.71–0.86) | ||||
|
| |||||||
| Mean age | |||||||
| ≥50 | 6 | 0.80 (0.73–0.88) | <0.001 | 0.73 (0.60–0.87) | 0.01 | 75% (46%–100%) | 0.02 |
| <50 | 19 | 0.89 (0.87–0.92) | 0.83 (0.77–0.88) | ||||
P, prospective; R, retrospective; DL, deep learning and machine learning; CAD, computer-aided diagnostic systems.
Figure 7Sensitivity analysis of AI-assisted diagnostic technique for TN diagnosis. (a) Graphical depiction of residual-based goodness-of-fit; (b) Bivariate normality; (c) influence; (d) outlier detection.
Figure 8Evaluation of clinical applicability of AI-assisted diagnostic techniques in TN diagnosis: Fagan nomogram.
Figure 9Evaluation of clinical applicability of AI-assisted diagnostic techniques in TN diagnosis: likelihood ratio scattergram.
Figure 10Results of Deeks' funnel plot of asymmetry test for publication bias.