| Literature DB >> 32556680 |
Kevin M Veen1, Isabel B de Angst2, Mostafa M Mokhles1, Hans M Westgeest3, Malou Kuppen4, Carin A Uyl-de Groot4, Winald R Gerritsen5, Paul J M Kil6, Johanna J M Takkenberg1.
Abstract
PURPOSE: With the increasing interest in treatment decision-making based on risk prediction models, it is essential for clinicians to understand the steps in developing and interpreting such models.Entities:
Keywords: Castration-resistant prostate cancer; Cox proportional hazard model; Decision-making; Prediction modeling
Mesh:
Year: 2020 PMID: 32556680 PMCID: PMC7324416 DOI: 10.1007/s00432-020-03286-8
Source DB: PubMed Journal: J Cancer Res Clin Oncol ISSN: 0171-5216 Impact factor: 4.553
Summary of considering in prediction modelling.
[adapted from original version of Steyerberg et al. (2008)]
| Step | Specific issues | CAPRI-dataset |
|---|---|---|
| General considerations | ||
| Research question | Aim: predictors/prediction | Prediction |
| Intended application | Clinical practice/research | Clinical practice |
| Outcome | Clinically relevant | Mortality |
| Predictors | Reliable measurement Comprehensiveness | Oncological clinical work-up and literature; extensive set of candidate predictors |
| Study design | Retrospective/prospective? Cohort; case–control | Registry study: retrospective cohort |
| Statistical model | Appropriate for research question and outcome | Non-parametric cox proportional hazard |
| Sample size | Sufficient for aim? | 3588 patients; 2335 events |
| 5 modelling steps | ||
| Data inspection | Data distribution Missing values Correlation between predictors | Table Multiple imputation Using Pearson’s R or Spearman’s rho |
| Coding of predictors | Continuous predictors | Extensive checks of transformations for continues predictors |
| Combining categorical predictors | Comorbidity score was collapsed to three categories instead of eight | |
| Combining predictors with similar effects | Pain and opioid use | |
| Model specification | Appropriate selection of main effects | LASSO regression |
| Assessment of assumptions | Additivity checked with interaction terms, interaction with treatment was checked, three interaction terms included Proportional hazard assumption checked—> relaxed by time varying coefficients | |
| Model performance | Appropriate measures | Discrimination |
| Model validation | Internal validation External validation | Bootstrap and k-fold cross-validation No external dataset was available |
Baseline characteristics of patients with CRPC treated with abiraterone, enzalutamide, docetaxel or watchful waiting
| Treatment | Abiraterone | Enzalutamide | Docetaxel | Watchful waiting |
|---|---|---|---|---|
| 249 | 184 | 1006 | 2149 | |
| Anti-androgens before CRPC (%) | 114 (46.0) | 81 (44.0) | 397 (39.5) | 788 (36.8) |
| Comorbidity score (%) | ||||
| 0 | 168 (67.5) | 107 (58.2) | 703 (70.0) | 1227 (57.1) |
| 1 | 43 (17.3) | 38 (20.7) | 185 (18.4) | 496 (23.1) |
| 2 | 24 (9.6) | 23 (12.5) | 80 (8.0) | 252 (11.7) |
| 3 | 6 (2.4) | 6 (3.3) | 22 (2.2) | 86 (4.0) |
| 4 | 5 (2.0) | 4 (2.2) | 8 (0.8) | 46 (2.1) |
| 5 | 0 (0.0) | 2 (1.1) | 3 (0.3) | 13 (0.6) |
| 6 | 3 (1.2) | 2 (1.1) | 4 (0.4) | 17 (0.8) |
| 7 | 0 (0.0) | 1 (0.5) | 0 (0.0) | 5 (0.2) |
| 8 | 0 (0.0) | 1 (0.5) | 0 (0.0) | 5 (0.2) |
| Bone metastases (%) | 142 (87.7) | 103 (87.3) | 703 (91.1) | 929 (81.7) |
| Lymph node metastases (%) | 66 (80.5) | 41 (83.7) | 373 (82.5) | 507 (76.6) |
| Visceral metastases (%) | 8 (16.7) | 8 (24.2) | 57 (21.7) | 52 (16.1) |
| WHO (%) | ||||
| 1 | 37 (40.2) | 26 (43.3) | 222 (42.0) | 360 (47.1) |
| 2 | 41 (44.6) | 21 (35.0) | 245 (46.3) | 317 (41.5) |
| 3 | 14 (15.2) | 13 (21.7) | 62 (11.7) | 87 (11.4) |
| Pain (%) | 47 (42.0) | 28 (37.8) | 317 (49.2) | 323 (31.0) |
| Opioid use (%) | 22 (32.8) | 9 (24.3) | 120 (29.3) | 113 (22.7) |
| Gleason > 7 (%) | 143 (67.8) | 105 (65.2) | 591 (65.9) | 998 (55.5) |
| Time to castration (median [range]) | 11.17 [1.4, 192] | 13.34 [1, 196] | 10.12 [0.2, 172.7] | 20.47 [0.3, 248.4] |
| Age (median [range]) | 76.00 [46, 95] | 77.00 [50, 94] | 70.00 [46, 93] | 78.00 [49, 99] |
| Weight (median [range]) | 83.00 [52, 120] | 86.00 [60, 120] | 84.50 [48, 150] | 81.00 [44, 118] |
| Hemoglobulin (median [range]) | 8.00 [5.1, 9.6] | 8.00 [4.7, 10.3] | 8.00 [4.3, 10.2] | 8.10 [3.9, 10.5] |
| Platelets (median [range]) | 234.00 [37, 569] | 228.50 [54, 473] | 243.00 [0.4, 749] | 233.00 [0.3, 714] |
| Lactate dehydrogenase (median [range]) | 218.00 [72, 3179] | 216.00 [98, 730] | 232.00 [21, 4100] | 218.00 [79, 4329] |
| Alkaline phosphatase(median [range]) | 122.00 [41, 1673] | 109.00 [38, 1263] | 136.00 [34.8, 3457] | 93.00 [21, 4315] |
| PSA (median [range]) | 34.00 [0.1, 8730] | 24.40 [0.1, 4150] | 40.00 [0.0, 8700] | 9.70 [0.1, 4034] |
Fig. 1Example of a continuous outcome (y-axis) and continuous predictor (x-axis). As is shown: with the assumption the relation is linear the model (red line) does not fit the observed data well (black dots)
Performance of a linear model by adding flexibility to assumed linear association with the outcome
| Variable | |
|---|---|
| Predictor linear | 0.00938 |
| Predictor with splines with one knot | 0.9853 |
| Predictor with fractional polynomial | 0.9992 |
*R-squared is measure of how close the model fits the data, 1 indicates the model explains all the variability of the data, whereas with 0 the model does not explain any variability. For other types of models similar measurements are available
Fig. 2Example of relaxation of the linear assumed association (red line) of a continuous outcome and predictor. This can be done either with natural splines (green line) or fractional polynomials (FP) (blue line). Using splines the data is divided in separate sections, and each section has its own estimate of the line. Using fractional polynomials the relationship is described as multiple polynomials, which can produce a very flexible line
Final Cox model for predicting mortality in patients with CRPC
| Characteristic | Hazard ratio (95% CI) | |
|---|---|---|
| Age | 1.07 (1.04–1.09) | < 0.001 |
| Anti-androgens before CRPC | 0.87 (0.8–0.95) | 0.001 |
| Bone metastases | 1.16 (1.03–1.32) | 0.016 |
| AF polynomial 11 | 1.02 (0.9–1.16) | 0.75 |
| AF polynomial 22 | 0.75 (0.57–0.99) | 0.044 |
| Enzalutamide vs abiraterone | 1.17 (0.64–2.15) | 0.60 |
| Docetaxel vs abiraterone | 1.85 (1.23–2.77) | 0.003 |
| Watchful waiting vs abiraterone | 0.45 (0.31–0.67) | < 0.001 |
| Time to start castration spline 1 h for <10 months | 0.2 (0.1–0.39) | < 0.001 |
| Time to start castration spline 2 h for <10 months | 0.19 (0.13–0.26) | < 0.001 |
| Time to start castration spline 1 h for>10 months | 1.45 (0.75–2.8) | 0.27 |
| Time to start castration spline 2 h for>10 months | 0.71 (0.51–1) | 0.048 |
| WHO HR for <10 months | 1.64 (1.44–1.87) | < 0.001 |
| WHO HR for >10 months | 1.07 (0.99–1.15) | 0.11 |
| PSA polynomial 13 h for <10 months | 1.34 (1.15–1.56) | < 0.001 |
| PSA polynomial 13 h for >10 months | 1.02 (0.88–1.17) | 0.82 |
| PSA polynomial 24 h for <10 months | 1.27 (1.16–1.4) | < 0.001 |
| PSA polynomial 24 h for >10 months | 1.11 (1.01–1.21) | 0.023 |
| HB HR for <10 months | 0.82 (0.76–0.89) | < 0.001 |
| HB HR for>10 months | 0.92 (0.87–0.97) | 0.003 |
| Platelets polynomial 15HR for <10 months | 0.97 (0.95–0.99) | 0.001 |
| Platelets polynomial 15HR for >10 months | 1.01 (0.99–1.02) | 0.42 |
| Platelets polynomial 26HR for <10 months | 1 (1–1.01) | 0.001 |
| Platelets polynomial 26HR for <10 months | 1 (1–1) | 0.46 |
| LDH HR for <10 months | 1.66 (1.42–1.94) | < 0.001 |
| LDH HR for>10 months | 1.09 (0.96–1.23) | 0.18 |
| Opioid or pain vs none HR for <10 months | 1.09 (0.97–1.22) | 0.16 |
| Opioid or pain vs none HR for >10 months | 1.02 (0.94–1.09) | 0.67 |
| Age*Enzalutamide vs abiraterone7 | 0.94 (0.9–0.97) | 0.001 |
| Age*Docetaxel vs abiraterone7 | 0.96 (0.93–0.99) | 0.003 |
| Age*Watchful waiting vs abiraterone7 | 0.99 (0.96–1.01) | 0.25 |
| Log(PSA)*Enzalutamide vs abiraterone7 | 1.08 (0.92–1.26) | 0.35 |
| Log(PSA)*Docetaxel vs abiraterone7 | 0.91 (0.83–1) | 0.057 |
| Log(PSA)*Watchful waiting vs abiraterone7 | 1.23 (1.12–1.35) | < 0.001 |
The model contains fractional polynomials and splines to address non-linear associations of a continues variable with the outcome and a stepwise time-varying coefficient function; e.g. some covariates have a hazard ratio for below ten months of follow-up and above ten months of follow-up
1:(AF/100)^−2), 2: (AF/100)^−1, 3: PSA^−1, 4: log(PSA), 5: Platelets*1, 6: Platelets * log(Platelets), 7: interaction term
Fig. 3a Example of a Schoenfeld residuals plot in order to check the proportional hazard assumption. When the hazard of WHO is assumed constant over time (blue line in part a), the assumption is violated, especially in the first ten months the blue line deviates from the red line. In part b we have two coefficients for WHO, one for the first ten months and one for more than ten months. Proportional hazards assumption is not violated anymore