| Literature DB >> 31738461 |
Rik van Eekelen1, Hein Putter2, David J McLernon3, Marinus J Eijkemans4, Nan van Geloven2.
Abstract
We conducted a simulation study to compare two methods that have been recently used in clinical literature for the dynamic prediction of time to pregnancy. The first is landmarking, a semi-parametric method where predictions are updated as time progresses using the patient subset still at risk at that time point. The second is the beta-geometric model that updates predictions over time from a parametric model estimated on all data and is specific to applications with a discrete time to event outcome. The beta-geometric model introduces unobserved heterogeneity by modelling the chance of an event per discrete time unit according to a beta distribution. Due to selection of patients with lower chances as time progresses, the predicted probability of an event decreases over time. Both methods were recently used to develop models predicting the chance to conceive naturally. The advantages, disadvantages and accuracy of these two methods are unknown. We simulated time-to-pregnancy data according to different scenarios. We then compared the two methods by the following out-of-sample metrics: bias and root mean squared error in the average prediction, root mean squared error in individual predictions, Brier score and c statistic. We consider different scenarios including data-generating mechanisms for which the models are misspecified. We applied the two methods on a clinical dataset comprising 4999 couples. Finally, we discuss the pros and cons of the two methods based on our results and present recommendations for use of either of the methods in different settings and (effective) sample sizes.Entities:
Keywords: Cox model; beta-geometric model; dynamic prediction; frailty; heterogeneity; landmarking; time to pregnancy
Mesh:
Year: 2019 PMID: 31738461 PMCID: PMC6973003 DOI: 10.1002/bimj.201900155
Source DB: PubMed Journal: Biom J ISSN: 0323-3847 Impact factor: 2.207
Description of scenarios. Parameters not included in this table did not differ between scenarios
| Scenario number | Intercept logit( | Heterogeneity distribution | Ageing over follow‐up | Sterile fraction | Probability of censoring per cycle |
|---|---|---|---|---|---|
| 1 (main) | −0.60 | Beta | No | 0.3 | 0.1 |
| 2 (no sterile fraction) | −1.05 | Beta | No | 0 | 0.1 |
| 3 (ageing over follow‐up) | −0.60 | Beta | Yes | 0.3 | 0.1 |
| 4 (no censoring) | −0.60 | Beta | No | 0.3 | 0 |
| 5 (no frailty) | −1.20 | None | No | 0.3 | 0.1 |
| 6 (no frailty and no sterile fraction) | −1.77 | None | No | 0 | 0.1 |
| 7 (logit normal frailty) | −1.5 | Logit normal | No | 0.3 | 0.1 |
| 8 (logit normal frailty and ageing) | −1.5 | Logit normal | Yes | 0.3 | 0.1 |
| 9 (compressed beta frailty) | −0.43 | Compressed beta | No | 0.3 | 0.1 |
| 10 (compressed beta frailty and ageing) | −0.43 | Compressed beta | Yes | 0.3 | 0.1 |
RMSE (in percentage points) of average predictions for models A–F or the empirical variance of true probabilities between simulation replications in selected s for scenarios 1 to 10
| Scenario |
|
| Separate Cox (A) | Super ipl (B) | Super ipl* (C) | Beta‐geometric mixture (D) | Beta‐geometric (E) | Kaplan–Meier (F) | Empirical variance |
|---|---|---|---|---|---|---|---|---|---|
| 1 (main) | 0 | 6000 | 0.826 | 0.831 | 7.03 | 0.727 | 0.759 | 0.825 | 0.481 |
| 13 | 1023 | 1.42 | 1.41 | 1.69 | 2.20 | 1.21 | 1.42 | 0.68 | |
| 26 | 228 | 2.43 | 2.43 | 2.11 | 2.30 | 1.55 | 2.46 | 1.03 | |
| 1 (internal validation) | 0 | 0.823 | 0.829 | 7.04 | 0.727 | 0.755 | 0.825 | 0.489 | |
| 13 | 1.41 | 1.41 | 1.68 | 2.22 | 1.22 | 1.42 | 0.699 | ||
| 26 | 2.42 | 2.43 | 2.10 | 2.35 | 1.57 | 2.46 | 0.953 | ||
| 2 (no sterile fraction) | 0 | 0.815 | 0.82 | 7.18 | 0.738 | 0.715 | 0.827 | 0.451 | |
| 13 | 1.58 | 1.58 | 1.90 | 0.925 | 0.846 | 1.59 | 0.666 | ||
| 26 | 3.02 | 2.97 | 2.61 | 0.802 | 0.749 | 2.96 | 1.08 | ||
| 3 (ageing over follow‐up) | 0 | 0.797 | 0.796 | 6.84 | 0.725 | 0.76 | 0.788 | 0.477 | |
| 13 | 1.34 | 1.34 | 1.57 | 2.34 | 1.41 | 1.33 | 0.638 | ||
| 26 | 2.15 | 2.17 | 1.91 | 2.65 | 1.99 | 2.19 | 0.879 | ||
| 4 (no censoring) | 0 | 6000 | 0.603 | 0.619 | 2.09 | 0.548 | 0.903 | 0.614 | 0.468 |
| 13 | 4031 | 0.523 | 0.523 | 0.526 | 2.20 | 0.308 | 0.516 | 0.331 | |
| 26 | 3531 | 0.417 | 0.418 | 0.415 | 2.28 | 0.697 | 0.412 | 0.247 | |
| 5 (no frailty) | 0 | 0.874 | 0.891 | 7.00 | 0.731 | 0.771 | 0.888 | 0.321 | |
| 13 | 1.86 | 1.85 | 2.34 | 2.92 | 1.28 | 1.87 | 0.726 | ||
| 26 | 3.61 | 3.51 | 3.29 | 6.12 | 3.19 | 3.58 | 1.46 | ||
| 6 (no frailty and no sterile fraction) | 0 | 0.843 | 0.844 | 6.23 | 0.68 | 0.68 | 0.859 | 0.166 | |
| 13 | 2.08 | 2.08 | 2.90 | 1.07 | 1.09 | 2.08 | 0.382 | ||
| 26 | 4.61 | 4.55 | 4.66 | 1.77 | 1.78 | 4.62 | 0.904 | ||
| 7 (logit normal frailty) | 0 | 0.793 | 0.795 | 6.35 | 0.739 | 0.732 | 0.788 | 0.43 | |
| 13 | 1.45 | 1.45 | 1.77 | 2.02 | 1.08 | 1.45 | 0.673 | ||
| 26 | 2.36 | 2.32 | 2.02 | 2.43 | 1.59 | 2.34 | 1.02 | ||
| 8 (logit normal frailty and ageing) | 0 | 0.768 | 0.769 | 6.20 | 0.737 | 0.746 | 0.758 | 0.426 | |
| 13 | 1.44 | 1.43 | 1.73 | 1.85 | 1.26 | 1.44 | 0.619 | ||
| 26 | 2.28 | 2.22 | 1.91 | 2.39 | 1.92 | 2.26 | 0.879 | ||
| 9 (compressed beta frailty) | 0 | 0.807 | 0.815 | 6.33 | 0.758 | 0.694 | 0.812 | 0.387 | |
| 13 | 1.58 | 1.57 | 1.92 | 2.62 | 0.935 | 1.60 | 0.644 | ||
| 26 | 2.89 | 2.84 | 2.53 | 3.07 | 0.908 | 2.84 | 1.03 | ||
| 10 (compressed beta frailty and ageing) | 0 | 0.788 | 0.79 | 6.12 | 0.713 | 0.684 | 0.793 | 0.383 | |
| 13 | 1.56 | 1.56 | 1.90 | 2.24 | 0.907 | 1.56 | 0.603 | ||
| 26 | 2.79 | 2.78 | 2.41 | 2.75 | 1.17 | 2.79 | 0.968 |
RMSPE (in percentage points) of individual predictions for models A–F or using true individual probabilities in selected s for scenarios 1–10
| Scenario |
|
| Separate Cox (A) | Super ipl (B) | Super ipl* (C) | Beta‐geometric mixture (D) | Beta‐geometric (E) | Kaplan–Meier (F) | True probabilities |
|---|---|---|---|---|---|---|---|---|---|
| 1 (main) | 0 | 6000 | 35.2 | 35.2 | 36.0 | 35.2 | 35.2 | 37.1 | 0 |
| 13 | 1023 | 21.4 | 21.3 | 21.3 | 21.5 | 21.3 | 21.6 | 0 | |
| 26 | 228 | 15.4 | 14.9 | 14.9 | 15.0 | 14.9 | 15.0 | 0 | |
| 1 (internal validation) | 0 | 35.2 | 35.2 | 35.9 | 35.2 | 35.2 | 37.1 | 0 | |
| 13 | 21.3 | 21.3 | 21.3 | 21.5 | 21.3 | 21.6 | 0 | ||
| 26 | 15.0 | 14.7 | 14.7 | 14.9 | 14.7 | 14.8 | 0 | ||
| 2 (no sterile fraction) | 0 | 31.9 | 31.9 | 32.7 | 31.9 | 31.9 | 34.6 | 0 | |
| 13 | 20.7 | 20.6 | 20.6 | 20.5 | 20.5 | 21.9 | 0 | ||
| 26 | 16.1 | 15.2 | 15.2 | 14.9 | 14.9 | 16.0 | 0 | ||
| 3 (ageing over follow‐up) | 0 | 34.9 | 34.9 | 35.6 | 34.9 | 34.9 | 36.8 | 0 | |
| 13 | 20.3 | 20.2 | 20.2 | 20.4 | 20.2 | 20.5 | 0 | ||
| 26 | 13.9 | 13.3 | 13.3 | 13.5 | 13.4 | 13.4 | 0 | ||
| 4 (no censoring) | 0 | 6000 | 35.2 | 35.3 | 35.3 | 35.2 | 35.2 | 37.0 | 0 |
| 13 | 4031 | 21.2 | 21.2 | 21.2 | 21.5 | 21.2 | 21.6 | 0 | |
| 26 | 3531 | 14.6 | 14.6 | 14.6 | 15.0 | 14.6 | 14.7 | 0 | |
| 5 (no frailty) | 0 | 22.6 | 22.6 | 23.7 | 22.6 | 22.6 | 25.4 | 0 | |
| 13 | 23.3 | 23.3 | 23.4 | 23.6 | 23.4 | 24.0 | 0 | ||
| 26 | 21.7 | 21.6 | 21.5 | 22.9 | 22.0 | 21.1 | 0 | ||
| 6 (no frailty and no sterile fraction) | 0 | 1.5 | 1.6 | 6.8 | 1.3 | 1.3 | 12.7 | 0 | |
| 13 | 3.6 | 2.5 | 3.4 | 1.5 | 1.5 | 12.8 | 0 | ||
| 26 | 8.4 | 5.0 | 5.1 | 1.9 | 1.9 | 13.2 | 0 | ||
| 7 (logit normal frailty) | 0 | 31.5 | 31.5 | 32.2 | 31.5 | 31.5 | 33.2 | 0 | |
| 13 | 21.2 | 21.1 | 21.2 | 21.3 | 21.1 | 21.6 | 0 | ||
| 26 | 15.9 | 15.3 | 15.3 | 15.5 | 15.3 | 15.4 | 0 | ||
| 8 (logit normal frailty and ageing) | 0 | 31.1 | 31.1 | 31.8 | 31.1 | 31.1 | 32.9 | 0 | |
| 13 | 20.0 | 20.0 | 20.0 | 20.0 | 20.0 | 20.5 | 0 | ||
| 26 | 14.4 | 13.8 | 13.8 | 13.9 | 13.8 | 14.0 | 0 | ||
| 9 (compressed beta frailty) | 0 | 28.1 | 28.1 | 28.8 | 28.1 | 28.1 | 29.5 | 0 | |
| 13 | 20.7 | 20.6 | 20.7 | 20.9 | 20.7 | 20.9 | 0 | ||
| 26 | 17.2 | 16.7 | 16.6 | 17.0 | 16.5 | 16.6 | 0 | ||
| 10 (compressed beta frailty and ageing) | 0 | 27.7 | 27.7 | 28.4 | 27.7 | 27.7 | 29.2 | 0 | |
| 13 | 19.4 | 19.3 | 19.4 | 19.5 | 19.3 | 19.7 | 0 | ||
| 26 | 15.4 | 14.7 | 14.6 | 14.9 | 14.5 | 14.7 | 0 |
Estimated parameters for the landmarking‐based models in the data application
| Parameter | Separate Cox (A) | Super ipl (B) | Super ipl* (C) |
|---|---|---|---|
| Coefficient for age < 33 years | Varied from −0.12 to −0.01 | −0.04 | −0.04 |
| Coefficient for age > 33 years | Varied from −0.15 to 0.02 | −0.08 | −0.08 |
| Coefficient for duration | Varied from −0.44 to −0.17 | −0.23 | −0.23 |
| Coefficient for linear term for baseline hazard | – | – | −5.67 |
| Coefficient for squared term for baseline hazard | – | – | 1.50 |
Estimated parameters for the beta‐geometric models in the data application
| Parameter | Beta‐geometric mixture (D) | Beta‐geometric (E) |
|---|---|---|
| Coefficient for age < 33 years | −0.03 | −0.03 |
| Coefficient for age > 33 years | −0.08 | −0.08 |
| Coefficient for duration | −0.29 | −0.29 |
| Intercept (μ0) | 0.17 | 0.17 |
| Heterogeneity (θ) | 0.11 | 0.11 |
| Sterile fraction (π) | 0.002 | – |