| Literature DB >> 29966490 |
Maarten van Smeden1, Karel Gm Moons1, Joris Ah de Groot1, Gary S Collins2, Douglas G Altman2, Marinus Jc Eijkemans1, Johannes B Reitsma1.
Abstract
Binary logistic regression is one of the most frequently applied statistical approaches for developing clinical prediction models. Developers of such models often rely on an Events Per Variable criterion (EPV), notably EPV ≥10, to determine the minimal sample size required and the maximum number of candidate predictors that can be examined. We present an extensive simulation study in which we studied the influence of EPV, events fraction, number of candidate predictors, the correlations and distributions of candidate predictor variables, area under the ROC curve, and predictor effects on out-of-sample predictive performance of prediction models. The out-of-sample performance (calibration, discrimination and probability prediction error) of developed prediction models was studied before and after regression shrinkage and variable selection. The results indicate that EPV does not have a strong relation with metrics of predictive performance, and is not an appropriate criterion for (binary) prediction model development studies. We show that out-of-sample predictive performance can better be approximated by considering the number of predictors, the total sample size and the events fraction. We propose that the development of new sample size criteria for prediction models should be based on these three parameters, and provide suggestions for improving sample size determination.Entities:
Keywords: EPV; Logistic regression; prediction models; predictive performance; sample size; simulations
Year: 2018 PMID: 29966490 PMCID: PMC6710621 DOI: 10.1177/0962280218784726
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Design factorial simulation study ().
| Simulation factors | Factor levels | ||
|---|---|---|---|
| 1. | Events per variable (EPV) | 3, 5, 10, 15, 20, 30, 50 | |
| 2. | Events fraction | 1/2, 1/4, 1/8, 1/16 | |
| 3. | Number of candidate predictors ( | 4, 8, 12 | |
| 4. | Model discrimination (AUC) | .65,.75,.85 | |
| 5. | Distribution of predictor variables | B(0.5): | Independent Bernoulli with success probability.5. |
| MVN(0.0): | Normal (means = 0, variances = 1, covariances = 0.0) | ||
| MVN(0.3): | Normal (means = 0, variances = 1, covariances = 0.3) | ||
| MVN(0.5): | Normal (means = 0, variances = 1, covariances = 0.5) | ||
| 6. | Predictor effects | Equal effect: |
|
| 1 strong: |
| ||
| 1 noise: |
| ||
| 1/2 noise: |
|
Prediction models: parameter shrinkage and variable selection strategies.
| Model | Parameter shrinkage | Variable selection | Abbreviation |
|---|---|---|---|
| Maximum likelihood (full model) | No | No |
|
| Maximum likelihood (backward 1) | No | Yes, |
|
| Maximum likelihood (backward 2) | No | Yes, |
|
| Heuristic shrinkage | Yes | No |
|
| Firth’s penalized likelihood (full model) | Yes | No |
|
| Firth’s penalized likelihood (backward 1) | Yes | Yes, |
|
| Firth’s penalized likelihood (backward 2) | Yes | Yes, |
|
| Ridge penalized likelihood | Yes | No |
|
| Lasso penalized likelihood | Yes | Yes |
|
Simulation estimation errors and consequences.
| No. (%) | Consequences | |
|---|---|---|
| Development datasets generated | 20,160,000 (100%) | |
| Simulation conditions | 4,032 (100%) | |
| Separation detected | 90,846 (0.45%) | The separated cases are left in (to avoid selective missing data). |
| Degenerate distributions | ||
| <3 events or <3 non-events generated | 211 (0.001%) | Data sets are treated as missing data sets. |
| <8 events or <8 non-events generated | 68,048 (0.34%) | Leave-one-out cross-validation is used for estimating Lasso and Ridge tuning parameters. |
| Degenerate predictor variable generated | 0 (0%) | |
| Heuristic shrinkage factor inestimable | 2,470,118 (12.25%) | For HS, results are replaced by ML results. |
| Degenerated linear predictor (no variables selected) | ||
| | 650,133 (3.22%) | |
| | 179,638 (0.89%) | |
| | 718,194 (3.56%) | |
| | 204,617 (1.01%) | |
| | 744,575 (3.69%) |
Figure 1.Marginal out-of-sample predictive performance.
Figure 2.Boxplot distribution of out-of-sample predictive performance outcomes (restricted to conditions with events fraction = 1/2).
Figure 3.Average relative out-of-sample performances of modeling strategies per simulation factor level.
Results of simulation meta models: Outcome: ln(MSPE).
| Natural log transformed | Original scale | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Meta model | Int | EPV |
| Events fraction |
| AUC | Cor | Bin | Noise |
| |
| Full | ML | −0.55 | . | −1.06 | 0.36 | 0.94 | 0.40 | 0.00 | 0.05 | 0.00 | 0.993 |
| Simplified | ML | −0.59 | . | −1.06 | 0.36 | 0.94 | . | . | . | . | 0.992 |
| EPV only | ML | −3.29 | −1.06 | . | . | . | . | . | . | . | 0.432 |
| Full | Firth | −0.84 | . | −1.03 | 0.33 | 0.93 | 0.31 | 0.00 | 0.04 | 0.00 | 0.993 |
| Simplified | Firth | −0.86 | . | −1.03 | 0.33 | 0.93 | . | . | . | . | 0.992 |
| EPV only | Firth | −3.42 | −1.03 | . | . | . | . | . | . | . | 0.438 |
| Full | HS | −0.39 | . | −0.97 | 0.44 | 0.74 | 1.17 | 0.00 | −0.01 | 0.00 | 0.985 |
| Simplified | HS | −0.75 | . | −0.97 | 0.44 | 0.74 | . | . | . | . | 0.977 |
| EPV only | HS | −3.64 | −0.97 | . | . | . | . | . | . | . | 0.385 |
| Full | Lasso | −0.59 | . | −0.93 | 0.46 | 0.68 | 0.97 | −0.48 | 0.04 | 0.03 | 0.983 |
| Simplified | Lasso | −0.86 | . | −0.93 | 0.46 | 0.68 | . | . | . | . | 0.973 |
| EPV only | Lasso | −3.78 | −0.93 | . | . | . | . | . | . | . | 0.371 |
| Full | Ridge | −0.39 | . | −0.88 | 0.50 | 0.49 | 1.33 | −0.85 | 0.03 | −0.02 | 0.979 |
| Simplified | Ridge | −0.93 | . | −0.88 | 0.50 | 0.49 | . | . | . | . | 0.952 |
| EPV only | Ridge | −4.08 | −0.88 | . | . | . | . | . | . | . | 0.337 |
| Full | ML | −0.85 | . | −1.02 | 0.40 | 0.95 | 0.34 | 0.03 | 0.07 | 0.17 | 0.943 |
| Simplified | ML | −0.57 | . | −1.03 | 0.40 | 0.96 | . | . | . | . | 0.939 |
| EPV only | ML | −3.18 | −1.03 | . | . | . | . | . | . | . | 0.393 |
| Full | ML | −0.74 | . | −1.05 | 0.38 | 0.95 | 0.35 | 0.00 | 0.06 | 0.10 | 0.977 |
| Simplified | ML | −0.59 | . | −1.05 | 0.38 | 0.95 | . | . | . | . | 0.975 |
| EPV only | ML | −3.25 | −1.05 | . | . | . | . | . | . | . | 0.417 |
| Full | Firth | −0.94 | . | −1.01 | 0.39 | 0.95 | 0.34 | 0.02 | 0.07 | 0.17 | 0.939 |
| Simplified | Firth | −0.66 | . | −1.01 | 0.39 | 0.95 | . | . | . | . | 0.935 |
| EPV only | Firth | −3.22 | −1.01 | . | . | . | . | . | . | . | 0.392 |
| Full | Firth | −0.90 | . | −1.03 | 0.37 | 0.95 | 0.32 | 0.00 | 0.06 | 0.10 | 0.975 |
| Simplified | Firth | −0.74 | . | −1.03 | 0.37 | 0.95 | . | . | . | . | 0.973 |
| EPV only | Firth | −3.32 | −1.03 | . | . | . | . | . | . | . | 0.418 |
Full: metamodel with all eight meta-model covariates; Simplified: model with covariates N, events fraction and P, EPV only: meta model with EPV as a covariate. Int: Intercept; EPV: Events per variable; N: Sample size; P: number of candidate predictors; AUC: Area under the ROC-curve; Cor: Predictor pairwise correlations.
Results of simulation meta models: Outcome: ln(MAPE).
| Natural log transformed | Original scale | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Meta model | Int | EPV |
| Events fraction |
| AUC | Cor | Bin | Noise |
| |
| Full | ML | −0.60 | . | −0.53 | 0.31 | 0.48 | −0.50 | 0.00 | −0.01 | 0.00 | 0.996 |
| Simplified | ML | −0.48 | . | −0.53 | 0.31 | 0.48 | . | . | . | . | 0.992 |
| EPV only | ML | −2.03 | −0.53 | . | . | . | . | . | . | . | 0.355 |
| Full | Firth | −0.74 | . | −0.51 | 0.29 | 0.47 | −0.51 | 0.00 | −0.01 | 0.00 | 0.996 |
| Simplified | Firth | −0.61 | . | −0.51 | 0.29 | 0.47 | . | . | . | . | 0.991 |
| EPV only | Firth | −2.10 | −0.51 | . | . | . | . | . | . | . | 0.357 |
| Full | HS | −0.55 | . | −0.49 | 0.33 | 0.39 | −0.15 | 0.00 | −0.03 | 0.00 | 0.991 |
| Simplified | HS | −0.56 | . | −0.49 | 0.33 | 0.39 | . | . | . | . | 0.991 |
| EPV only | HS | −2.19 | −0.49 | . | . | . | . | . | . | . | 0.326 |
| Full | Lasso | −0.59 | . | −0.48 | 0.34 | 0.35 | −0.19 | −0.24 | −0.01 | 0.01 | 0.989 |
| Simplified | Lasso | −0.59 | . | −0.48 | 0.34 | 0.35 | . | . | . | . | 0.983 |
| EPV only | Lasso | −2.24 | −0.48 | . | . | . | . | . | . | . | 0.314 |
| Full | Ridge | −0.48 | . | −0.45 | 0.36 | 0.26 | 0.03 | −0.43 | −0.02 | −0.01 | 0.986 |
| Simplified | Ridge | −0.61 | . | −0.45 | 0.36 | 0.26 | . | . | . | . | 0.970 |
| EPV only | Ridge | −2.39 | −0.45 | . | . | . | . | . | . | . | 0.285 |
| Full | ML | −0.75 | . | −0.52 | 0.31 | 0.49 | −0.58 | 0.03 | −0.01 | 0.09 | 0.951 |
| Simplified | ML | −0.45 | . | −0.52 | 0.31 | 0.49 | . | . | . | . | 0.942 |
| EPV only | ML | −1.95 | −0.52 | . | . | . | . | . | . | . | 0.334 |
| Full | ML | −0.70 | . | −0.53 | 0.31 | 0.49 | −0.55 | 0.01 | −0.01 | 0.06 | 0.982 |
| Simplified | ML | −0.48 | . | −0.53 | 0.31 | 0.49 | . | . | . | . | 0.975 |
| EPV only | ML | −2.00 | −0.53 | . | . | . | . | . | . | . | 0.348 |
| Full | Firth | −0.79 | . | −0.52 | 0.30 | 0.50 | −0.56 | 0.02 | −0.01 | 0.09 | 0.947 |
| Simplified | Firth | −0.49 | . | −0.52 | 0.30 | 0.50 | . | . | . | . | 0.938 |
| EPV only | Firth | −1.96 | −0.52 | . | . | . | . | . | . | . | 0.335 |
| Full | Firth | −0.78 | . | −0.52 | 0.30 | 0.49 | −0.55 | 0.01 | −0.01 | 0.06 | 0.979 |
| Simplified | Firth | −0.55 | . | −0.52 | 0.30 | 0.49 | . | . | . | . | 0.973 |
| EPV only | Firth | −2.03 | −0.52 | . | . | . | . | . | . | . | 0.348 |
Full: metamodel with all eight meta-model covariates; Simplified: model with covariates N, events fraction and P, EPV only: meta model with EPV as a covariate. Int: Intercept; EPV: Events per variable; N: Sample size; P: number of candidate predictors; AUC: Area under the ROC-curve; Cor: Predictor pairwise correlations.
Results of simulation meta models: Outcome: ln(Brier).
| Natural log transformed | Original scale | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Meta model | Int | EPV |
| Events fraction |
| AUC | Cor | Bin | Noise |
| |
| Full | ML | −1.23 | . | −0.04 | 0.62 | 0.04 | −1.02 | 0.00 | 0.01 | 0.00 | 0.969 |
| Simplified | ML | −0.91 | . | −0.04 | 0.62 | 0.04 | . | . | . | . | 0.925 |
| EPV only | ML | −2.06 | −0.04 | . | . | . | . | . | . | . | 0.005 |
| Full | Firth | −1.27 | . | −0.03 | 0.62 | 0.03 | −1.02 | 0.00 | 0.01 | 0.00 | 0.969 |
| Simplified | Firth | −0.95 | . | −0.03 | 0.62 | 0.03 | . | . | . | . | 0.923 |
| EPV only | Firth | −2.08 | −0.03 | . | . | . | . | . | . | . | 0.003 |
| Full | HS | −1.23 | . | −0.03 | 0.62 | 0.02 | −0.98 | 0.00 | 0.00 | 0.00 | 0.969 |
| Simplified | HS | −0.93 | . | −0.03 | 0.62 | 0.02 | . | . | . | . | 0.927 |
| EPV only | HS | −2.08 | −0.03 | . | . | . | . | . | . | . | 0.003 |
| Full | Lasso | −1.27 | . | −0.03 | 0.62 | 0.02 | −1.00 | −0.02 | 0.01 | 0.00 | 0.969 |
| Simplified | Lasso | −0.96 | . | −0.03 | 0.62 | 0.02 | . | . | . | . | 0.925 |
| EPV only | Lasso | −2.10 | −0.03 | . | . | . | . | . | . | . | 0.002 |
| Full | Ridge | −1.29 | . | −0.02 | 0.62 | 0.01 | −1.00 | −0.02 | 0.01 | 0.00 | 0.968 |
| Simplified | Ridge | −0.98 | . | −0.02 | 0.62 | 0.01 | . | . | . | . | 0.924 |
| EPV only | Ridge | −2.12 | −0.02 | . | . | . | . | . | . | . | 0.002 |
| Full | ML | −1.19 | . | −0.04 | 0.62 | 0.04 | −0.96 | −0.02 | 0.01 | 0.01 | 0.969 |
| Simplified | ML | −0.89 | . | −0.04 | 0.62 | 0.04 | . | . | . | . | 0.929 |
| EPV only | ML | −2.04 | −0.04 | . | . | . | . | . | . | . | 0.006 |
| Full | ML | −1.22 | . | −0.04 | 0.62 | 0.04 | −0.99 | −0.01 | 0.01 | 0.00 | 0.969 |
| Simplified | ML | −0.91 | . | −0.04 | 0.62 | 0.04 | . | . | . | . | 0.927 |
| EPV only | ML | −2.05 | −0.04 | . | . | . | . | . | . | . | 0.005 |
| Full | Firth | −1.20 | . | −0.04 | 0.62 | 0.04 | −0.96 | −0.02 | 0.01 | 0.01 | 0.968 |
| Simplified | Firth | −0.90 | . | −0.04 | 0.62 | 0.04 | . | . | . | . | 0.929 |
| EPV only | Firth | −2.05 | −0.04 | . | . | . | . | . | . | . | 0.005 |
| Full | Firth | −1.24 | . | −0.04 | 0.62 | 0.04 | −1.00 | −0.01 | 0.01 | 0.00 | 0.969 |
| Simplified | Firth | −0.93 | . | −0.04 | 0.62 | 0.04 | . | . | . | . | 0.926 |
| EPV only | Firth | −2.06 | −0.04 | . | . | . | . | . | . | . | 0.004 |
Full: metamodel with all eight meta-model covariates; Simplified: model with covariates N, events fraction and P, EPV only: meta model with EPV as a covariate. Int: Intercept; EPV: Events per variable; N: Sample size; P: number of candidate predictors; AUC: Area under the ROC-curve; Cor: Predictor pairwise correlations.
Results of simulation meta models: Outcome: ΔAUC × 100.
| Natural log transformed | Original scale | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Meta model | Int | EPV |
| Events fraction |
| AUC | Cor | Bin | Noise |
| |
| Full | ML | −3.63 | . | 1.47 | 0.92 | −1.66 | 5.02 | −0.03 | −0.46 | −0.06 | 0.821 |
| Simplified | ML | −6.01 | . | 1.47 | 0.92 | −1.66 | . | . | . | . | 0.700 |
| EPV only | ML | −5.44 | 1.47 | . | . | . | . | . | . | . | 0.633 |
| Full | Firth | −3.56 | . | 1.46 | 0.92 | −1.66 | 5.05 | −0.03 | −0.47 | −0.06 | 0.822 |
| Simplified | Firth | −5.97 | . | 1.46 | 0.92 | −1.66 | . | . | . | . | 0.698 |
| EPV only | Firth | −5.42 | 1.46 | . | . | . | . | . | . | . | 0.632 |
| Full | HS | −5.60 | . | 1.63 | 1.11 | −1.53 | 4.05 | −0.03 | −0.08 | −0.07 | 0.665 |
| Simplified | HS | −7.05 | . | 1.63 | 1.11 | −1.53 | . | . | . | . | 0.614 |
| EPV only | HS | −5.96 | 1.63 | . | . | . | . | . | . | . | 0.571 |
| Full | Lasso | −6.11 | . | 1.93 | 1.24 | −1.63 | 7.30 | 2.11 | −0.45 | −0.08 | 0.713 |
| Simplified | Lasso | −8.73 | . | 1.93 | 1.24 | −1.63 | . | . | . | . | 0.580 |
| EPV only | Lasso | −6.95 | 1.93 | . | . | . | . | . | . | . | 0.528 |
| Full | Ridge | −3.14 | . | 0.98 | 0.62 | −0.91 | 3.38 | 2.18 | −0.42 | −0.03 | 0.684 |
| Simplified | Ridge | −4.47 | . | 0.98 | 0.62 | −0.91 | . | . | . | . | 0.515 |
| EPV only | Ridge | −3.70 | 0.98 | . | . | . | . | . | . | . | 0.468 |
| Full | ML | −9.03 | . | 2.62 | 1.75 | −2.29 | 8.89 | 2.18 | −0.23 | −0.23 | 0.764 |
| Simplified | ML | −11.94 | . | 2.62 | 1.75 | −2.29 | . | . | . | . | 0.645 |
| EPV only | ML | −9.80 | 2.62 | . | . | . | . | . | . | . | 0.597 |
| Full | ML | −5.79 | . | 1.91 | 1.25 | −1.87 | 6.64 | 0.99 | −0.33 | −0.17 | 0.797 |
| Simplified | ML | −8.37 | . | 1.91 | 1.25 | −1.87 | . | . | . | . | 0.680 |
| EPV only | ML | −7.13 | 1.92 | . | . | . | . | . | . | . | 0.626 |
| Full | Firth | −9.92 | . | 2.75 | 1.81 | −2.33 | 8.72 | 2.36 | −0.22 | −0.22 | 0.751 |
| Simplified | Firth | −12.72 | . | 2.75 | 1.81 | −2.33 | . | . | . | . | 0.646 |
| EPV only | Firth | −10.25 | 2.75 | . | . | . | . | . | . | . | 0.592 |
| Full | Firth | −6.18 | . | 1.98 | 1.28 | −1.89 | 6.61 | 1.10 | −0.33 | −0.17 | 0.785 |
| Simplified | Firth | −8.73 | . | 1.98 | 1.28 | −1.89 | . | . | . | . | 0.677 |
| EPV only | Firth | −7.34 | 1.98 | . | . | . | . | . | . | . | 0.621 |
Full: metamodel with all eight meta-model covariates; Simplified: model with covariates N, events fraction and P, EPV only: meta model with EPV as a covariate. Int: Intercept; EPV: Events per variable; N: Sample size; P: number of candidate predictors; AUC: Area under the ROC-curve; Cor: Predictor pairwise correlations.
Results of simulation meta models: Outcome: CS.
| Natural log transformed | Original scale | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Meta model | Int | EPV |
| Events fraction |
| AUC | Cor | Bin | Noise |
| |
| Full | ML | 0.50 | . | 0.15 | 0.09 | −0.16 | 0.65 | 0.00 | 0.00 | 0.00 | 0.848 |
| Simplified | ML | 0.31 | . | 0.15 | 0.09 | −0.16 | . | . | . | . | 0.689 |
| EPV only | ML | 0.40 | 0.15 | . | . | . | . | . | . | . | 0.616 |
| Full | Firth | 0.73 | . | 0.12 | 0.08 | −0.15 | 0.77 | 0.00 | −0.01 | 0.00 | 0.835 |
| Simplified | Firth | 0.50 | . | 0.12 | 0.08 | −0.15 | . | . | . | . | 0.556 |
| EPV only | Firth | 0.52 | 0.12 | . | . | . | . | . | . | . | 0.505 |
| Full | HS | 0.73 | . | 0.07 | 0.02 | −0.08 | 0.03 | 0.00 | −0.01 | 0.00 | 0.496 |
| Simplified | HS | 0.71 | . | 0.07 | 0.02 | −0.08 | . | . | . | . | 0.495 |
| EPV only | HS | 0.77 | 0.07 | . | . | . | . | . | . | . | 0.368 |
| Full | Lasso | 0.98 | . | 0.04 | 0.03 | −0.05 | 0.43 | 0.12 | 0.00 | −0.01 | 0.513 |
| Simplified | Lasso | 0.85 | . | 0.04 | 0.03 | −0.05 | . | . | . | . | 0.190 |
| EPV only | Lasso | 0.85 | 0.04 | . | . | . | . | . | . | . | 0.180 |
| Full | Ridge | 1.19 | . | −0.05 | −0.03 | 0.03 | −0.25 | 0.14 | 0.01 | 0.00 | 0.823 |
| Simplified | Ridge | 1.31 | . | −0.05 | −0.03 | 0.03 | . | . | . | . | 0.488 |
| EPV only | Ridge | 1.23 | −0.05 | . | . | . | . | . | . | . | 0.418 |
| Full | ML | 0.51 | . | 0.15 | 0.09 | −0.15 | 0.65 | 0.09 | 0.00 | −0.01 | 0.832 |
| Simplified | ML | 0.33 | . | 0.15 | 0.09 | −0.15 | . | . | . | . | 0.652 |
| EPV only | ML | 0.42 | 0.15 | . | . | . | . | . | . | . | 0.588 |
| Full | ML | 0.52 | . | 0.15 | 0.09 | −0.15 | 0.63 | 0.06 | 0.00 | −0.01 | 0.848 |
| Simplified | ML | 0.33 | . | 0.15 | 0.09 | −0.15 | . | . | . | . | 0.682 |
| EPV only | ML | 0.42 | 0.15 | . | . | . | . | . | . | . | 0.611 |
| Full | Firth | 0.67 | . | 0.13 | 0.08 | −0.15 | 0.74 | 0.09 | −0.01 | −0.01 | 0.826 |
| Simplified | Firth | 0.44 | . | 0.13 | 0.08 | −0.15 | . | . | . | . | 0.575 |
| EPV only | Firth | 0.49 | 0.13 | . | . | . | . | . | . | . | 0.522 |
| Full | Firth | 0.69 | . | 0.13 | 0.08 | −0.15 | 0.73 | 0.05 | −0.01 | −0.01 | 0.846 |
| Simplified | Firth | 0.46 | . | 0.13 | 0.08 | −0.15 | . | . | . | . | 0.595 |
| EPV only | Firth | 0.50 | 0.13 | . | . | . | . | . | . | . | 0.537 |
Full: metamodel with all eight meta-model covariates; Simplified: model with covariates N, events fraction and P, EPV only: meta model with EPV as a covariate. Int: Intercept; EPV: Events per variable; N: Sample size; P: number of candidate predictors; AUC: Area under the ROC-curve; Cor: Predictor pairwise correlations.
Figure 4.Relation required sample size and events fraction. Calculations based on metamodels with criterion values that were kept constant. For illustration purposes, the criterion values were chosen such that they would intersect at events fraction = 1/2.