| Literature DB >> 35943620 |
Mingxiang Wei1,2, Yu Zhang3, Genji Bai4, Cong Ding4, Haimin Xu3, Yao Dai5, Shuangqing Chen6,7, Hong Wang8,9.
Abstract
BACKGROUND: Preoperative differentiation between benign and borderline epithelial ovarian tumors (EOTs) is challenging and can significantly impact clinical decision making. The purpose was to investigate whether radiomics based on T2-weighted MRI can discriminate between benign and borderline EOTs preoperatively.Entities:
Keywords: Machine learning; Magnetic resonance imaging; Ovary; Radiomics
Year: 2022 PMID: 35943620 PMCID: PMC9363551 DOI: 10.1186/s13244-022-01264-x
Source DB: PubMed Journal: Insights Imaging ISSN: 1869-4101
Fig. 1Patient recruitment and workflow of radiomics analysis
Fig. 2Two representative examples of volumes of interest (VOIs) segmentation of benign and borderline EOTs in axial fat-suppressed T2-weighted images. a–c A 50-year-old woman with a right benign mucinous cystadenoma; d–f a 60-year-old woman with a left borderline mucinous cystadenoma; a, d the original images; b, e the VOIs of ovarian masses showing in red; c, f 3D renderings of the VOIs
Clinical characteristics of training and validation sets
| Cohort | Training set ( | Internal validation set ( | External validation set ( | |
|---|---|---|---|---|
| Age (years) | 45.49 ± 16.06 | 46.94 ± 17.46 | 44.70 ± 19.17 | 0.740 |
| Menopausal status | 0.581 | |||
| Postmenopausal | 154 (49.8) | 39 (50.0) | 12 (40.0) | |
| Premenopausal | 155 (50.2) | 39 (50.0) | 18 (60.0) | |
| Parity | 0.219 | |||
| Multipara | 256 (82.8) | 64 (82.1) | 21 (70.0) | |
| Nullipara | 53 (17.2) | 14 (17.9) | 9 (30.0) | |
| Abdominal symptoms | 0.362 | |||
| Pain and distention | 135 (43.7) | 37 (47.4) | 17 (56.7) | |
| None | 174 (56.3) | 41 (52.6) | 13 (43.3) | |
| CA125 (U/mL) | 0.118 | |||
| > 35 | 78 (25.2) | 14 (17.9) | 11 (36.7) | |
| ≥ 35 | 231 (74.8) | 64 (82.1) | 19 (63.3) | |
| HE4a (pmol/L) | 1.000 | |||
| Abnormal | 20 (6.5) | 5 (6.4) | 2 (6.7) | |
| Normal | 289 (93.5) | 73 (93.6) | 28 (93.3) | |
| Final diagnosis | 0.549 | |||
| Benign | 224 (72.5) | 57 (73.1) | 19 (63.3) | |
| Borderline | 85 (27.5) | 21 (26.9) | 11 (36.7) | |
| Histopathology | 0.080 | |||
| Serous | 148 (47.9) | 40 (51.3) | 21 (70.0) | |
| Mucinous | 141 (45.6) | 33 (42.3) | 6 (20.0) | |
| Others | 20 (6.5) | 5 (6.4) | 3 (10.0) |
Data are presented as mean ± standard deviation for normally distributed continuous variables, median (interquartile range, IQR) for non-normally distributed continuous variables, or number (%) for categorical variables. HE4, human epididymis protein 4; CA125, carbohydrate antigen 125
aNormal value of HE4: postmenopausal woman < 121 pmol/L or premenopausal woman < 92.1 pmol/L
Clinical and radiological characteristics for benign and borderline EOTs in training set
| Cohort | Benign ( | Borderline ( | |
|---|---|---|---|
| Age (years) | 45.76 ± 16.10 | 44.78 ± 16.01 | 0.634 |
| Menopausal status | 0.266 | ||
| Postmenopausal | 116 (51.8) | 38 (44.7) | |
| Premenopausal | 108 (48.2) | 47 (55.3) | |
| Parity | 0.887 | ||
| Multipara | 186 (83.0) | 70 (84.2) | |
| Nullipara | 38 (17.0) | 15 (17.6) | |
| Abdominal symptoms | 0.321 | ||
| Pain or distention | 94 (42.0) | 41 (48.2) | |
| None | 130 (58.0) | 44 (51.8) | |
| CA125 (U/mL) | 0.000* | ||
| ≥ 35 | 35 (15.6) | 43 (50.6) | |
| < 35 | 189 (84.4) | 42 (49.4) | |
| HE4a (pmol/L) | 0.001* | ||
| Abnormal | 8 (3.6) | 12 (14.1) | |
| Normal | 216 (96.4) | 73 (85.9) | |
| Ascites | 0.000* | ||
| None | 110 (49.1) | 40 (47.1) | |
| Mild | 98 (43.8) | 29 (34.9) | |
| Moderate | 15 (6.7) | 7 (8.2) | |
| Massive | 1 (0.4) | 9 (10.6) | |
| Maximum tumor diameter (cm) | 8.65 (6.22, 11.60) | 10.40 (8.05, 13.65) | 0.001* |
| Tumor margins | 0.021* | ||
| Well-defined | 223 (99.6) | 81 (95.3) | |
| Ill-defined | 1 (0.4) | 4 (4.7) | |
| Number of loculi | 0.107 | ||
| Mild | 136 (60.7) | 43 (50.6) | |
| Multilocular | 88 (39.3) | 42 (49.4) | |
| SI of cystic on FS T2W | 0.000* | ||
| Moderate | 182 (81.3) | 48 (56.5) | |
| Low | 10 (4.5) | 17 (20.0) | |
| High | 5 (2.2) | 6 (7.1) | |
| Mixed | 27 (12.1) | 14 (16.5) | |
| SI of solid on FS T2W | 0.000* | ||
| Low | 41 (18.3) | 42 (49.4) | |
| High | 0 (0.0) | 1 (1.2) | |
| Mixed | 12 (5.4) | 10 (11.8) | |
| None | 171 (76.3) | 32 (37.6) |
Data are presented as mean ± standard deviation for normally distributed continuous variables, median (interquartile range, IQR) for non-normally distributed continuous variables, or number (%) for categorical variables. HE4, human epididymis protein 4; CA125, carbohydrate antigen 125; SI, signal intensity; FS: fat-suppressed; T2W: T2 weighted
*p < 0.05
aNormal value of HE4: postmenopausal woman < 121 pmol/L or premenopausal < 92.1 pmol/L
Fig. 3a–d The learning curves for four different radiomics models. The red and green curves represent the trend of the score with the increase in sample size in training and cross-validation data, respectively. The training and cross-validation scores of the logistic regression (LR) and SVM models were higher than those of the Naive Bayes (NB) model. The gap between the training and cross-validation scores of the LR or SVM models was smaller than that of the Random Forest (RF) model. e–h The receiver operating characteristic curves for four different radiomics models. Each light-colored curve represents each of the tenfold cross-validations (fold 0 to 9), and the dark blue curve represents their mean; the red and green curves represent internal and external validation sets, respectively. The LR model and SVM model had similar AUCs in the training set, but the LR model outperformed the SVM model in both the internal validation set and the external validation set. The RF model had the highest AUC in the training set but had low AUC in the external validation. The NB model had the lowest AUCs in all sets
Fig. 4a Mean receiver operating characteristic (ROC) curves for the radiomics model, clinic-radiological model, and combined model over the tenfold cross-validation iterations. b ROC curves in the internal validation set. c ROC curves in the external validation set
Diagnostic performances of different models after tenfold cross-validation iterations
| Models | Specificity | Sensitivity | PPV | NPV | Accuracy | AUC |
|---|---|---|---|---|---|---|
| Radiomics | 0.71 ± 0.12 | 0.80 ± 0.09 | 0.79 ± 0.08 | 0.75 ± 0.08 | 0.76 ± 0.08 | 0.82 ± 0.07 |
| Clinic-radiological | 0.74 ± 0.08 | 0.71 ± 0.10 | 0.72 ± 0.06 | 0.74 ± 0.06 | 0.72 ± 0.05 | 0.79 ± 0.06 |
| Combined | 0.76 ± 0.11 | 0.82 ± 0.13 | 0.82 ± 0.11 | 0.78 ± 0.08 | 0.79 ± 0.08 | 0.86 ± 0.07 |
Values are mean (± standard deviation) over the cross-validation iterations
PPV positive predictive value, NPV negative predictive value, AUC area under the curve
Comparison of models in internal and external validation sets
| Models | Internal validation set | External validation set | ||||
|---|---|---|---|---|---|---|
| Difference between areas | 95% CI | Difference between areas | 95% CI | |||
| Model1 versus Model2 | 0.088 | − 0.034 to 0.210 | 0.159 | 0.160 | − 0.156 to 0.476 | 0.320 |
| Model1 versus Model3 | 0.018 | − 0.044 to 0.079 | 0.574 | 0.067 | − 0.105 to 0.239 | 0.446 |
| Model2 versus Model3 | 0.070 | − 0.032 to 0.172 | 0.178 | 0.227 | 0.034 to 0.421 | 0.021* |
Model1: radiomics model; Model2: clinic-radiological model; Model3: combined model; CI, confidence interval
*p < 0.05