| Literature DB >> 36119492 |
Hao Zhang1, Hanqi Lei1, Jun Pang1.
Abstract
Objectives: (1) To assess the methodological quality and risk of bias of radiomics studies investigating the diagnostic performance in adrenal masses and (2) to determine the potential diagnostic value of radiomics in adrenal tumors by quantitative analysis.Entities:
Keywords: adrenal tumor; diagnostic performance; machine learning; radiomics; radiomics quality score
Year: 2022 PMID: 36119492 PMCID: PMC9478189 DOI: 10.3389/fonc.2022.975183
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1Flow diagram of study selection.
Characteristics of the included studies.
| Study ID | Ref | Study Design | Diagnostic Subject | Sample Size | Image Modality | Segmentation Method (Software/Algorithm) | Feature Extraction | Features Type | Modeling method | Reference Standard | Validation |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Andersen et al. (2021) | ( | Retrospective | Adrenal metastases/Benign lesions | 160 | Contrast-enhanced CT | Semi-automatic (Philips Intellispace Tumor Tracking) | TexRAD | First-order and higher-order | Binary logistic regression | Histopathology | NR |
| Chai et al. (2017) | ( | Retrospective | Aldosterone-producing adenomas/Pheochromocytomas/Cushing adenomas | 218 | Unenhanced | Automatic (Multiscale sparse representations) | NR | Shape-based, first-order and second-order | Support vector machine, radial basis function network (ML) | Histopathology | Internal validation |
| Elmohr et al. (2019) | ( | Retrospective | Adrenocortical carcinomas/Adrenocortical adenomas | 54 | Unenhanced | Manual (Amira Software) | PyRadiomics | Shape-based, first-order and second-order | Logistic regression, boruta random forest | Histopathology | Internal validation |
| Ho et al. (2019) | ( | Retrospective | Adrenal malignancy/Lipid-poor adenoma | 23 | Unenhanced and contrast-enhanced CT, MRI 3T or 1,5T T1 in- and opposed-phase | Manual (Seg3D) | Lesion Tool (software developed by the authors) | Shape-based, first-order and second-order | Logistic regression | Histopathology | NR |
| Kong et al. (2022) | ( | Retrospective | Pheochromocytoma/Other adrenal lesions | 309 | MRI 3T T2w | Semi-automatic (3D Slicer) | 3D Slicer | Shape-based, first-order, second-order and higher-order | Logistic regression | Histopathology | Internal and external validation |
| Koyuncu et al. (2019) | ( | Retrospective | Adrenal malignant/Benign lesions | 114 | Contrast-enhanced CT | Semi-automatic | MATLAB | Shape-based, first-order, second-order and higher-order | Bounded particle swarm optimisation-neural network | NR | Internal validation |
| Li et al. (2018) | ( | Retrospective | Adrenal malignant/Benign lesions | 210 | Unenhanced and contrast-enhanced CT | Manual (NR) | NR | Second-order | Bayesian spatial gaussian process classifier | Histopathology | NR |
| Liu et al. (2021) | ( | Retrospective | Adrenal Adenoma/Pheochromocytoma | 60 | MRI 3T T1 in- and opposed-phase, T2w | Manual (Mazda) | MaZda | First-order | Support vector machine (ML) | Histopathology | Internal validation |
| Nakajo et al. (2017) | ( | Retrospective | Adrenal metastases/Benign lesions | 35 | FDG PET/CT | Semi-automatic (Advantage Windows Workstation) | Python | First-order | NR | Clinical and imaging follow-ups | NR |
| Moawad et al. (2021) | ( | Retrospective | Adrenal malignant/Benign lesions | 40 | Unenhanced and contrast-enhanced CT | Manual (Amira Software) | PyRadiomics | Shape-based, first-order, second-order and higher-order | Random forest (ML) | Histopathology | Internal validation |
| Rocha et al. (2018) | ( | Retrospective | Adrenal adenomas/Malignant lesions | 108 | Unenhanced CT | Manual (OsiriX Software) | OsiriX | First-order | NR | Histopathology or follow-up imaging | NR |
| Romeo et al. (2018) | ( | Retrospective | Lipid-rich/Lipid-poor/Nonadenoma adrenal lesions | 60 | MRI 3T T1w, T2w | Manual (ITK-SNAP) | 3D Slicer | First-order and second-order | J48 classifier, Weka software (ML) | Histopathology | Internal validation |
| Schieda et al. (2017) | ( | Retrospective | Adrenal metastases/Adrenal adenoma | 44 | MRI 1.5T or 3T T1 in- and opposed-phase, T2w, GRE | Manual (Image J) | Image J | First-order | Logistic regression | Histopathology or follow-up imaging | NR |
| Shi et al. (2019) | ( | Retrospective | Adrenal metastases/Benign lesions | 265 | Unenhanced and contrast-enhanced CT | Manual (NR) | TexRAD | First-order and higher-order | Logistic regression, support vector machin (ML) | Histopathology or follow-up imaging | Internal validation |
| Shoemaker et al. (2018) | ( | Retrospective | Adrenal malignant/Benign lesions; Adrenal functioning/Non-functioning lesions | 377 | Unenhanced CT | NR | NR | First-order and second-order | Logistic regression | Histopathology | Internal validation |
| Stanzione et al. (2021) | ( | Retrospective | Adrenal malignant/Benign lesions | 55 | MRI 3T T1w, T2w | Manual (ITK-SNAP) | PyRadiomics | Shape-based, first-order, second-order and higher-order | Extra trees classifier (ML) | Histopathology or follow-up imaging | Internal validation |
| Szász et al. (2020) | ( | Retrospective | Adrenal adenomas/Non-adenomas | 233 | Unenhanced CT | Manual (Advantage Windows workstation) | “Volume Histogram” tool | First-order | NR | Histopathology | NR |
| Torresan et al. (2021) | ( | Retrospective | Adrenocortical carcinomas/Adenoma | 19 | Unenhanced and contrast-enhanced CT | Manual (PMOD) | PMOD | First-order and second-order | K-means clustering technique (ML) | Histopathology or follow-up imaging | NR |
| Tu et al. (2018) | ( | Retrospective | Adrenal metastases/Adenomas | 76 | Contrast-enhanced CT | Manual (ImageJ) | Image J | First-order | Logistic regression | Previously described imaging thresholds or follow-up imaging | NR |
| Tu et al. (2020) | ( | Retrospective | Adrenal Metastases/Lipid-poor adenomas | 63 | MRI 1.5T or 3T T1w, T2w, GRE | Manual (Image J) | Image J | First-order | Logistic regression | Histopathology or follow-up imaging | Internal validation |
| Tüdös et al. (2019) | ( | Retrospective | Adrenal lipid-poor adenomas/Non-adenomas | 163 | Unenhanced CT | Manual (Advantage Windows workstation) | “Volume Histogram” tool | First-order | NR | Histopathology or follow-up imaging | NR |
| Umanodan et al. (2017) | ( | Retrospective | Pheochromocytomas/Adrenal adenomas | 52 | MRI 3T ADC | Manual (Synapse Vincent software) | Synapse Vincent software | First-order | NR | Histopathology or follow-up imaging | NR |
| Wu et al. (2020) | ( | Retrospective | Adrenal adenoma/Nonadenoma | 94 | Unenhanced CT | Manual (PACS software) | PACS software | First-order | NR | Histopathology or follow-up imaging | NR |
| Yi et al. (2018) | ( | Retrospective | Pheochromocytomas/Adrenal lipid-poor adenomas | 110 | Unenhanced CT | Manual (MaZda) | MaZda | First-order, second-order and higher-order | Logistic regression (ML) | Histopathology | NR |
| Yi et al. (2018) ( | ( | Retrospective | Pheochromocytoma/Adrenal lipid-poor adenoma | 265 | Unenhanced and contrast-enhanced CT | Manual (MaZda) | MaZda | First-order, second-order and higher-order | Lasso, logistic regression | Histopathology | Internal validation |
| Yu et al. (2020) | ( | Retrospective | Adrenal malignant/Benign lesions | 125 | Contrast-enhanced CT | Manual (TexRAD) | TexRAD | First-order and higher-order | NR | Histopathology or follow-up imaging | NR |
| Zhang et al. (2017) | ( | Retrospective | Pheochromocytomas/Lipid-poor adrenocortical adenoma | 164 | Unenhanced and contrast-enhanced CT | Manual (TexRAD) | TexRAD | First-order and higher-order | NR | Histopathology | NR |
| Zheng et al. (2020) | ( | Retrospective | Aldosterone-producing/Cortisol-producing functional adrenocortical adenomas | 83 | Unenhanced and contrast-enhanced CT | Manual (ITK-SNAP) | NR | Shape-based, first-order | Lasso, logistic regression (ML) | Histopathology | Internal validation |
Ref, reference; NR, not report; ML, machine learning; PACS, picture archiving and communication system.
Outcomes of the included studies.
| Study ID | P | N | TP | FP | TN | FN | Sensitivity, % | Specificity, % | Accuracy, % | AUC | 95%CI |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Andersen et al. (2021) | 71 | 89 | 41 | 20 | 69 | 30 | 58 | 77 | 68 | 0.730 | – |
| Chai et al. (2017) | – | – | – | – | – | – | – | – | 81.8 ~ 95.4 | – | – |
| Elmohr et al. (2019) | – | – | – | – | – | – | 81 | 83 | 82 | 0.890 | – |
| Ho et al. (2019) | 8 | 15 | – | – | – | – | – | – | 80 | – | – |
| Kong et al. (2022) | – | – | – | – | – | – | 85.7 | 75 | 84 | 0.906 | 0.841-0.971 |
| Koyuncu et al. (2019) | 12 | 45 | 9 | 8 | 37 | 3 | 75 | 82.2 | 80.7 | 0.786 | – |
| Li et al. (2018) | 96 | 114 | 91 | 37 | 77 | 5 | 94.8 | 67.5 | 80 | – | – |
| Liu et al. (2021) | – | – | – | – | – | – | – | – | 85 | 0.917 | – |
| Nakajo et al. (2017) | 22 | 13 | 22 | 2 | 11 | 0 | 100 | 84.6 | 94.3 | 0.970 | 0.840-0.990 |
| Moawad et al. (2021) | 19 | 21 | 16 | 6 | 15 | 3 | 84.2 | 71.4 | 77.5 | 0.850 | – |
| Rocha et al. (2018) | 88 | 20 | 77 | 1 | 19 | 11 | 87.5 | 95 | 88.9 | – | – |
| Romeo et al. (2018) | – | – | – | – | – | – | – | – | 80 | – | – |
| Schieda et al. (2017) | 15 | 29 | 14 | 4 | 25 | 1 | 93.3 | 86.2 | 88.6 | 0.970 | 0.930-1.000 |
| Shi et al. (2019) | 101 | 164 | 78 | 37 | 127 | 23 | 77 | 77 | 77.4 | 0.850 | 0.800-0.890 |
| Shoemaker et al. (2018) | – | – | – | – | – | – | – | – | – | 0.780~1.000 | – |
| Stanzione et al. (2021) | 18 | 37 | – | – | – | – | – | – | 0.91 | 0.970 | 0.870-1.000 |
| Szász et al. (2020) | 123 | 110 | – | – | – | – | – | – | – | 0.919 | – |
| Torresan et al. (2021) | 8 | 10 | 7 | 1 | 9 | 1 | 87.5 | 90 | 88.9 | – | – |
| Tu et al. (2018) | 40 | 36 | 19 | 9 | 27 | 21 | 47.5 | 75 | 60.5 | 0.650 | 0.520-0.770 |
| Tu et al. (2020) | 40 | 23 | 30 | 0 | 23 | 10 | 75 | 100 | 84.1 | 0.950 | 0.910-0.990 |
| Tüdös et al. (2019) | 83 | 80 | 44 | 1 | 79 | 39 | 53 | 98.8 | 75.5 | – | – |
| Umanodan et al. (2017) | 39 | 13 | 37 | 1 | 12 | 2 | 94.9 | 92.3 | 94.2 | 0.920 | – |
| Wu et al. (2020) | 58 | 36 | 51 | 16 | 20 | 7 | 87.9 | 55.6 | 75.5 | 0.740 | – |
| Yi et al. (2018) | 29 | 79 | 25 | 2 | 77 | 4 | 86.2 | 97.5 | 94.4 | 0.952 | 0.897-1.000 |
| Yi et al. (2018) ( | 67 | 145 | 64 | 14 | 131 | 3 | 95.5 | 90.3 | 92 | 0.957 | – |
| Yu et al. (2020) | 81 | 44 | 66 | 0 | 44 | 15 | 81 | 100 | 88 | 0.970 | 0.940-0.990 |
| Zhang et al. (2017) | 98 | 66 | 78 | 11 | 55 | 20 | 79.6 | 83.3 | 81.1 | 0.860 | 0.810-0.910 |
| Zheng et al. (2020) | – | – | – | – | – | – | 91.5 | 92.8 | 92.2 | 0.902 | 0.822-0.982 |
P,condition positive; N, condition negative; TP, true positive; FP, false positive; TN, true negative; FN, false negative; AUC, area under the receiver operating characteristic; CI, confidence interval.
Elements of the RQS and average rating achieved by the studies included in this systematic review.
| RQS scoring item | Interpretation | Mode |
|---|---|---|
| Image Protocol | +1 for well documented protocols, +1 for publicly available protocols | 1 |
| Multiple Segmentations | +1 if segmented multiple times (different physicians, algorithms, or perturbation of regions of interest) | 1 |
| Phantom Study | +1 if texture phantoms were used for feature robustness assessment | 0 |
| Multiple Time Points | +1 multiple time points for feature robustness assessment | 0 |
| Feature Reduction | −3 if nothing, +3 if either feature reduction or correction for multiple testing | 3 |
| Non Radiomics | +1 if multivariable analysis with non-radiomics features | 0 |
| Biological Correlates | +1 if present | 0 |
| Cut-off | +1 if cutoff either pre-defined or at median or continuous risk variable reported | 0 |
| Discrimination and Resampling | +1 for discrimination statistic and statistical significance, +1 if resampling applied | 1 |
| Calibration | +1 for calibration statistic and statistical significance, +1 if resampling applied | 0 |
| Prospective | +7 for prospective validation within a registered study | 0 |
| Validation | −5 if no validation/+2 for internal validation/+3 for external validation/+4 two external validation | -5 |
| Gold Standard | +2 for comparison to gold standard | 2 |
| Clinical Utility | +2 for reporting potential clinical utility | 2 |
| Cost-effectiveness | +1 for cost-effectiveness analysis | 0 |
| Open Science | +1 for open-source scans, +1 for open-source segmentations, +1 for open-source code, +1 open-source representative segmentations and features | 0 |
Figure 2The risk of bias and concerns regarding applicability of 28 included studies.
Inter-rater agreement in RQS assessment.
| RQS scoring item | ICC (95% CI) |
|---|---|
| Image Protocol | 0.52 (0.19-0.75) |
| Multiple Segmentations | 0.93 (0.86-0.97) |
| Phantom Study | 1.00 (1.00–1.00) |
| Multiple Time Points | 1.00 (1.00–1.00) |
| Feature Reduction | 0.86 (0.72-0.93) |
| Non Radiomics | 0.79 (0.59-0.90) |
| Biological Correlates | 1.00 (1.00–1.00) |
| Cut-off | 0.63 (0.34-0.81) |
| Discrimination and Resampling | 0.52 (0.19-0.75) |
| Calibration | 1.00 (1.00–1.00) |
| Prospective | 1.00 (1.00–1.00) |
| Validation | 1.00 (1.00–1.00) |
| Gold Standard | 0.54 (0.22–0.76) |
| Clinical Utility | 0.61 (0.32-0.80) |
| Cost-effectiveness | 1.00 (1.00–1.00) |
| Open Science | 0.79 (0.59-0.90) |
CI: confidence interval, RQS: Radiomics Quality Score.
Inter-rater agreement in QUADAS-2 assessment.
| RQS scoring item | ICC (95% CI) |
|---|---|
| Risk of Bias - Patient Selection | 0.79 (0.60-0.90) |
| Risk of Bias - Index Test | 0.94 (0.87-0.97) |
| Risk of Bias - Reference Standard | 1.00 (1.00–1.00) |
| Risk of Bias - Flow and Timing | 1.00 (1.00–1.00) |
| Applicability Concerns- Patient Selection | 0.52 (0.19-0.75) |
| Applicability Concerns- Index Test | 0.66 (0.39-0.83) |
| Applicability Concerns- Reference Standard | 1.00 (1.00–1.00) |
CI: confidence interval, QUADAS-2: Quality Assessment of Diagnostic Accuracy Studies-2.
The results of subgroup analysis.
| Analysis | No. of study | Sensitivity | Specificity | PLR | NLR | DOR |
|---|---|---|---|---|---|---|
|
| ||||||
| Contrast-enhanced CT | 4 | 0.66(0.47-0.80) | 0.80(0.70-0.88) | 3.15(1.69-5.89) | 0.50(0.35-0.72) | 9.02(2.59-31.43) |
| Unenhanced and contrast-enhanced CT | 4 | 0.87(0.72-0.95) | 0.74(0.66-0.80) | 3.15(2.60-3.82) | 0.17(0.07-0.41) | 18.89(8.96-39.85) |
| Unenhanced CT | 1 | 0.88(0.79-0.93) | 0.95(0.72-0.99) | 17.50(2.59-118.41) | 0.13(0.02-0.89) | 133.00(16.16-1094.59) |
|
| ||||||
| With second-order or higher-order features | 7 | 0.81(0.69-0.89) | 0.77(0.70-0.83) | 3.21(2.55-4.04) | 0.23(0.11-0.47) | 16.97(7.56-38.12) |
| Only first-order | 2 | 0.72(0.25-0.95) | 0.86(0.51-0.97) | 4.77(0.56-40.72) | 0.39(0.08-1.86) | 16.91(0.38-761.14) |
|
| ||||||
| Not use machine learing | 6 | 0.78(0.60-0.89) | 0.79(0.69-0.87) | 3.20(2.12-4.82) | 0.26(0.09-0.76) | 18.80(5.37-65.75) |
| Use machine learing | 3 | 0.79(0.71-0.85) | 0.77(0.71-0.83) | 3.41(2.59-4.50) | 0.28(0.21-0.37) | 12.54(7.28-21.59) |
|
| ||||||
| Histopathology | 3 | 0.83(0.46-0.97) | 0.72(0.65-0.78) | 2.83(2.28-3.52) | 0.21(0.05-0.81) | 12.93(3.08-54.26) |
| Histopathology or follow-up imaging | 4 | 0.82(0.76-0.86) | 0.91(0.72-0.98) | 9.33(2.60-33.52) | 0.28(0.21-0.38) | 59.05(9.39-371.52) |
| Previously described imaging thresholds or follow-up imaging | 1 | 0.48(0.33-0.63) | 0.75(0.59-0.86) | 1.90(0.99-3.65) | 0.70(0.36-1.35) | 2.71(1.02-7.21) |
| Overall | 9 | 0.80(0.68-0.88) | 0.83(0.73-0.90) | 4.70(2.80-8.00) | 0.25(0.15-0.41) | 19.06(7.87-46.19) |
PLR, positive likelihood ratio; NLR, negative likelihood ratio; DOR, diagnostic odds ratio; The 95% confidence intervals are shown in parentheses.
Figure 3Summary receiver operating characteristic (SROC). AUC, area under the curve.
Figure 4Forest plots of the sensitivity and specificity of CT-based radiomics in differentiating malignant from benign adrenal tumors. I 2 >50% indicates substantial heterogeneity among included studies.
Figure 5Deeks funnel plot reveals the possibility of publication bias is low with a p value of 0.77. ESS, effective sample size.