| Literature DB >> 27224515 |
Francisco M Ojeda1, Christian Müller2, Daniela Börnigen2, David-Alexandre Trégouët3, Arne Schillert4, Matthias Heinig5, Tanja Zeller2, Renate B Schnabel2.
Abstract
Prognostic models based on survival data frequently make use of the Cox proportional hazards model. Developing reliable Cox models with few events relative to the number of predictors can be challenging, even in low-dimensional datasets, with a much larger number of observations than variables. In such a setting we examined the performance of methods used to estimate a Cox model, including (i) full model using all available predictors and estimated by standard techniques, (ii) backward elimination (BE), (iii) ridge regression, (iv) least absolute shrinkage and selection operator (lasso), and (v) elastic net. Based on a prospective cohort of patients with manifest coronary artery disease (CAD), we performed a simulation study to compare the predictive accuracy, calibration, and discrimination of these approaches. Candidate predictors for incident cardiovascular events we used included clinical variables, biomarkers, and a selection of genetic variants associated with CAD. The penalized methods, i.e., ridge, lasso, and elastic net, showed a comparable performance, in terms of predictive accuracy, calibration, and discrimination, and outperformed BE and the full model. Excessive shrinkage was observed in some cases for the penalized methods, mostly on the simulation scenarios having the lowest ratio of a number of events to the number of variables. We conclude that in similar settings, these three penalized methods can be used interchangeably. The full model and backward elimination are not recommended in rare event scenarios.Entities:
Keywords: Coronary artery disease; Events per variable; Penalized regression; Proportional hazards regression
Mesh:
Substances:
Year: 2016 PMID: 27224515 PMCID: PMC4996851 DOI: 10.1016/j.gpb.2016.03.006
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Number of simulations used when presenting results for different models out of a maximum of 1000 simulations
| 1 | 2.5 | 928 | 345 | 785 | 681 | 903 | 913 |
| 1 | 5 | 997 | 649 | 945 | 871 | 976 | 983 |
| 1 | 10 | 1000 | 938 | 997 | 979 | 1000 | 1000 |
| 2 | 2.5 | 988 | 383 | 897 | 747 | 957 | 977 |
| 2 | 5 | 1000 | 784 | 992 | 938 | 994 | 997 |
| 2 | 10 | 1000 | 991 | 1000 | 998 | 1000 | 1000 |
Note: Presented in the table are the numbers of simulations where the model computed did not produce constant predictions nor predicted survival probabilities equal to 1. Scenario 1 candidate predictors include clinical variables and biomarkers. Scenario 2 candidate predictors include clinical variables, biomarkers, and genetic variants. BE, backward elimination; EPV, events per variable.
Figure 1Average RMSEs across simulations for both scenarios using different models
Average RMSEs of simulated datasets were calculated using different models in scenario 1 (A) and scenario 2 (B), respectively, with different EPV. The models examined include full model, BE with significance levels α = 0.05 and α = 0.5 (BE 0.05 and BE 0.5), ridge, lasso, and elastic net. Scenario 1 considers patients’ clinical variables relevant to CAD and blood-based biomarkers as predictors. Predicted event probabilities were computed at time points 0.08, 0.17, and 0.25 years, respectively. In scenario 2, information on 55 genetic variants is also considered besides the predictors used in scenario 1, while predicted event probabilities were computed at time points 1, 2.5, and 5 years, respectively. BE, backward elimination; RMSE, root mean square error; EPV, events per variable.
Figure 2Average calibration slopes across simulations using different models
Average calibration slopes of simulated datasets were calculated using different models in scenario 1 (A) and scenario 2 (B), respectively. Dashed line depicts ideal calibration slope of 1. See legend of Figure 1 for more details of the models used and the scenarios examined.
Figure 3Average concordance across simulations using different models
Average concordance of simulated datasets was calculated using different models in scenario 1 (A) and scenario 2 (B), respectively. See legend of Figure 1 for more details of the models used and the scenarios examined.
C-indices and calibration slopes for clinical data example in both scenarios considered using different models
| 1 | C-index | 0.599 | 0.586 | 0.596 | 0.586 | 0.596 | 0.600 | 0.601 | 0.600 |
| 2 | C-index | 0.601 | 0.574 | 0.577 | 0.574 | 0.578 | 0.603 | 0.607 | 0.600 |
| 1 | Calibration slope | 0.868 | 0.927 | 0.884 | 0.974 | 0.960 | 1.012 | 1.065 | 1.050 |
| 2 | Calibration slope | 0.500 | 0.649 | 0.583 | 0.708 | 0.645 | 0.861 | 1.162 | 0.885 |
Note: The C-indices and calibration slopes presented are corrected for over-optimism via the 0.632 bootstrap. BE 0.05 + ridge and BE 0.5 + ridge refer to ridge regression applied to the variables selected by BE 0.05 and BE 0.5, respectively. BE, backward elimination.