| Literature DB >> 28874808 |
Marika Vezzoli1, Antonella Ravaggi2, Laura Zanotti3, Rebecca Angelica Miscioscia4, Eliana Bignotti5, Monica Ragnoli5, Angela Gambino4, Giuseppina Ruggeri6, Stefano Calza7, Enrico Sartori4, Franco Odicino4.
Abstract
Some aspects of endometrial cancer (EC) preoperative work-up are still controversial, and debatable are the roles played by lymphadenectomy and radical surgery. Proper preoperative EC staging can help design a tailored surgical treatment, and this study aims to propose a new algorithm able to predict extrauterine disease diffusion. 293 EC patients were consecutively enrolled, and age, BMI, children's number, menopausal status, contraception, hormone replacement therapy, hypertension, histological grading, clinical stage, and serum HE4 and CA125 values were preoperatively evaluated. In order to identify before surgery the most important variables able to classify EC patients based on FIGO stage, we adopted a new statistical approach consisting of two-steps: 1) Random Forest with its relative variable importance; 2) a novel algorithm able to select the most representative Regression Tree (RERT) from an ensemble method. RERT, built on the above mentioned variables, provided a sensitivity, specificity, NPV and PPV of 90%, 76%, 94% and 65% respectively, in predicting FIGO stage > I. Notably, RERT outperformed the prediction ability of HE4, CA125, Logistic Regression and single cross-validated Regression Tree. Such algorithm has great potential, since it better identifies the true early-stage patients, thus providing concrete support in the decisional process about therapeutic options to be performed.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28874808 PMCID: PMC5585365 DOI: 10.1038/s41598-017-11104-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Preoperative sCA125 and sHE4 levels in all 293 EC patients divided according to clinicopathological features
| Variables | n. | HE4 pmol/L | CA125 U/mL | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Median | Range | 95th perc |
| Median | Range | 95th perc |
| ||
|
| |||||||||
| <50 | 22 | 49.70 | 24,70–176 | 148.87 | 20.15 | 5,00–433,00 | 373.48 | ||
| ≥50 | 271 | 78.00 | 6,5–653 | 312.00 |
| 17.25 | 2,40–2922,00 | 120.11 |
|
|
| |||||||||
| <18.5 | 5 | 97.50 | 54,00–649,00 | 543.76 | 18.00 | 12,70–60,00 | 58.60 | ||
| 18.5–25 | 102 | 76.75 | 6,50–639,40 | 410.22 | 19.50 | 3,90–961,00 | 198.00 | ||
| 25–30 | 94 | 69.35 | 24,70–410,00 | 262.66 | 15.20 | 2,40–165,20 | 67.10 | ||
| ≥30 | 78 | 83.95 | 32,20–653,00 | 316.5 |
| 17.95 | 3,30–2922,00 | 197.76 |
|
| Missing values | 14 | ||||||||
|
| |||||||||
| <4 | 275 | 76.60 | 6,50–653,00 | 279.53 | 17.25 | 2,40–2922,00 | 141.89 | ||
| ≥4 | 18 | 87.70 | 45,00–649,00 | 393.06 |
| 18.95 | 3,7–107,50 | 61.17 |
|
|
| |||||||||
| No | 35 | 52.20 | 24,70–541,90 | 165.92 | 16.80 | 5,00–433,00 | 282.30 | ||
| Yes | 257 | 79.40 | 6,50–653,00 | 312.00 |
| 17.75 | 2,40–2922,00 | 120.05 |
|
| Missing values | 1 | ||||||||
|
| |||||||||
| No | 193 | 77.00 | 24,70–653,00 | 312.00 | 18.15 | 2,40–2922,00 | 155.74 | ||
| Yes | 13 | 60.20 | 32,20–176,00 | 159.86 |
| 15.00 | 6,2–433,00 | 316.00 |
|
| Missing values | 87 | ||||||||
|
| |||||||||
| No | 223 | 76.70 | 6,50–653,00 | 311.23 | 18.25 | 3,30–2922,00 | 172.61 | ||
| Yes | 41 | 64.20 | 19,40–410,00 | 175.00 |
| 16.90 | 2,40–118,00 | 71.70 |
|
| Missing values | 29 | ||||||||
|
| |||||||||
| No | 143 | 67.00 | 6,50–653,00 | 223.69 | 17.80 | 3,30–2922,00 | 162.54 | ||
| Yes | 150 | 85.90 | 30,80–639,40 | 312.00 |
| 17.70 | 2,40–497,40 | 96.50 |
|
|
| |||||||||
| Hyperplasia | 6 | 46.90 | 30,80–53,30 | 52.77 | 10.65 | 7,90–15,40 | 14.50 | ||
| G1 | 82 | 67.65 | 24,70–526,00 | 219.18 | 17.25 | 5,20–263,00 | 77.76 | ||
| G2 | 91 | 85.90 | 19,40–649,00 | 311.50 | 20.60 | 3,30–497,40 | 120.11 | ||
| G3 | 87 | 78.00 | 6,50–653,00 | 308.61 |
| 16.80 | 4,60–2922,00 | 186.20 |
|
| Missing values | 27 | ||||||||
|
| |||||||||
| Early (FIGO ≤I) | 246 | 72.95 | 6,50–639,40 | 294.10 | 17.00 | 2,40–263,00 | 73.14 | ||
| Advanced (FIGO > I) | 44 | 103.55 | 41,00–653,00 | 531.10 | < | 36.55 | 5,90–2922,00 | 487.74 | < |
| Missing values | 3 | ||||||||
|
| |||||||||
| I | 194 | 66.10 | 6,50–346,70 | 197.40 | 15.90 | 2,40–238,00 | 58.12 | ||
| >I | 99 | 107.40 | 34,5–653,00 | 526.60 | < | 25.00 | 3,90–2922,00 | 299.90 |
|
|
| |||||||||
| Non endometrioid | 39 | 71.00 | 31,70–653,00 | 424.74 | 26.00 | 4,70–2922,00 | 220.07 | ||
| Endometrioid | 254 | 76.80 | 6,50–649,00 | 301.96 |
| 17.20 | 2,40–961,00 | 106.60 |
|
|
| |||||||||
| G1 | 68 | 61.95 | 24,70–304,30 | 152.70 | 15.90 | 2,40–238,00 | 51.16 | ||
| G2 | 127 | 84.50 | 6,50–649,00 | 321.69 | 18.10 | 3,30–497,40 | 120.14 | ||
| G3 | 98 | 84.25 | 31,70–653,00 | 415.22 | < | 19.00 | 3,90–2922,00 | 197.76 |
|
|
| |||||||||
| M0 | 37 | 48.90 | 6,50–138,90 | 103.12 | 15.20 | 4,70–71,20 | 42.25 | ||
| M1 | 107 | 66.20 | 31,70–653,00 | 215.68 | 15.50 | 2,40–2922,00 | 54.40 | ||
| M2 | 149 | 94.00 | 19,40–649,00 | 412.10 | < | 22.00 | 3,70–961,00 | 188.00 | < |
|
| |||||||||
| No | 208 | 67.60 | 6,50–653,00 | 219.86 | 17.00 | 2,40–2922,00 | 95.20 | ||
| Yes | 85 | 94.70 | 34,50–649,00 | 412.80 | < | 20.50 | 4,60–961,00 | 190.76 |
|
|
| |||||||||
| No | 260 | 72.95 | 6,50–649,00 | 226.61 | 16.90 | 2,40–389,00 | 84.69 | ||
| Yes | 31 | 100.30 | 50,50–653,00 | 536.95 | < | 40.00 | 7,00–2922,00 | 729.20 | < |
| Missing values | 2 | ||||||||
|
| |||||||||
| Negative | 243 | 70.70 | 6,50–649,00 | 223.19 | 15.90 | 2,40–238,00 | 59.91 | ||
| Positive | 40 | 108.40 | 50,50–532,00 | 410.75 | < | 38.25 | 4,60–961,00 | 394.42 | < |
| Missing values | 10 | ||||||||
|
| |||||||||
| Absent | 128 | 62.10 | 6,50–347,90 | 159.37 | 14.30 | 2,40–238,00 | 57.01 | ||
| Present | 156 | 90.05 | 19,40–653,00 | 416.38 | < | 22.90 | 3,90–2922,00 | 201.57 | < |
| Missing values | 9 | ||||||||
|
| |||||||||
| No | 258 | 74.65 | 6,50–649,00 | 275.30 | 17.00 | 2,40–497,40 | 109.60 | ||
| Yes | 29 | 110.00 | 50,50–653,00 | 537.94 | < | 32.00 | 7,00–2922,00 | 692.60 | < |
| Missing values | 6 | ||||||||
p-values were computed using non-parametric Wilcoxon-Mann-Whitney or Kruskal-Wallis test. In bold p-values < 0.05. Missing values are not considered in the test procedure.
BMI = Body Mass Index; HRT = Hormone Replacement Therapy.
Figure 1Relative variable importance measure. Total Decrease in Node Impurity- obtained by Random Forests on all 293 EC patients.
Figure 2REpresentative Regression Tree (RERT) obtained on all 293 EC patients. In detail, ŷ is the relative frequency of patients, clustered within the same final node, having an advanced surgical stage (FIGO stage > I). Low or high values of ŷ can be interpreted as low (paths highlighted in green) or high (paths highlighted in red) probability of having a surgical FIGO stage > I, respectively.
Metrics to assess the performance of the proposed methods evaluated in all 293 EC patients.
| Metrics | Clinical Stage | CA125 | HE4 | RERT with CV | Logistic Regression | RT with CV |
|---|---|---|---|---|---|---|
| ROC-AUC | — | 0.69*** | 0.74** | 0.87 | 0.52*** | 0.75** |
| Threshold (Youden Index) | — | 33.25 | 81.80 | 0.28 | 0.28 | 0.62 |
| Specificity | 0.95 | 0.82 | 0.66 | 0.76 | 0.66 | 0.90 |
| Sensitivity | 0.36 | 0.44 | 0.69 | 0.90 | 0.50 | 0.48 |
| Accuracy | 0.75 | 0.69 | 0.67 | 0.81 | 0.61 | 0.76 |
| PPV | 0.78 | 0.56 | 0.51 | 0.65 | 0.43 | 0.72 |
| NPV | 0.74 | 0.74 | 0.81 | 0.94 | 0.72 | 0.77 |
**p-values < 0.001, ***p-values < 0.0001: these are p-values of the DeLong’s test for the comparison of two AUCs (RERT with CV vs other methods). For major details see Table S4 in Supplementary Tables. RT with CV stands for Regression Tree with Cross-Validation.
Figure 3Relative variable importance measure. Total Decrease in Node Impurity- obtained by Random Forests on the subgroup of 219 endometrioid EC patients preoperatively classified as early stage (clinical stage).
Figure 4RERT obtained on the the subgroup of 219 endometrioid EC patients preoperatively classified as early stage (clinical stage). In detail, ŷ is the relative frequency of patients, clustered within the same final node, having an advanced surgical stage (FIGO stage > I). Low or high values of ŷ can be interpreted as low (paths highlighted in green) or high (paths highlighted in red) probability of having a surgical FIGO stage > I, respectively.
Metrics to assess the performance of the proposed methods evaluated in 219 endometrioid EC patients preoperatively classified as early stage (clinical stage).
| Metrics | CA125 | HE4 | RERT with CV | Logistic Regression | RT with CV |
|---|---|---|---|---|---|
| ROC-AUC | 0.58*** | 0.72* | 0.84 | 0.57 | 0.66** |
| Threshold (Youden Index) | 13.55 | 80.70 | 0.29 | 0.15 | 0.18 |
| Specificity | 0.39 | 0.66 | 0.67 | 0.76 | 0.69 |
| Sensitivity | 0.76 | 0.75 | 0.84 | 0.41 | 0.67 |
| Accuracy | 0.48 | 0.67 | 0.71 | 0.68 | 0.68 |
| PPV | 0.28 | 0.39 | 0.44 | 0.34 | 0.40 |
| NPV | 0.85 | 0.89 | 0.93 | 0.81 | 0.87 |
*p-value = 0.02, **p-value < 0.01, ***p-values < 0.0001: these are p-values of the DeLong’s test or Bootstrap test (underlined) for the comparison of two AUCs (RERT with CV vs other methods). For major details see Table S7 in Supplementary Tables. RT with CV stands for Regression Tree with Cross-Validation.