| Literature DB >> 30073700 |
Matthew Sperrin1, Glen P Martin1, Alexander Pate1, Tjeerd Van Staa1, Niels Peek1, Iain Buchan1,2.
Abstract
Clinical prediction models (CPMs) can inform decision making about treatment initiation, which requires predicted risks assuming no treatment is given. However, this is challenging since CPMs are usually derived using data sets where patients received treatment, often initiated postbaseline as "treatment drop-ins." This study proposes the use of marginal structural models (MSMs) to adjust for treatment drop-in. We illustrate the use of MSMs in the CPM framework through simulation studies that represent randomized controlled trials and real-world observational data and the example of statin initiation for cardiovascular disease prevention. The simulations include a binary treatment and a covariate, each recorded at two timepoints and having a prognostic effect on a binary outcome. The bias in predicted risk was examined in a model ignoring treatment, a model fitted on treatment-naïve patients (at baseline), a model including baseline treatment, and the MSM. In all simulation scenarios, all models except the MSM underestimated the risk of outcome given absence of treatment. These results were supported in the statin initiation example, which showed that ignoring statin initiation postbaseline resulted in models that significantly underestimated the risk of a cardiovascular disease event occurring within 10 years. Consequently, CPMs that do not acknowledge treatment drop-in can lead to underallocation of treatment. In conclusion, when developing CPMs to predict treatment-naïve risk, researchers should consider using MSMs to adjust for treatment drop-in, and also seek to exploit the ability of MSMs to allow estimation of individual treatment effects.Entities:
Keywords: clinical prediction models; counterfactual causal inference; longitudinal data; marginal structural models; treatment drop-in; validation
Mesh:
Substances:
Year: 2018 PMID: 30073700 PMCID: PMC6282523 DOI: 10.1002/sim.7913
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1Causal diagram for simplified example
Figure 2Causal diagram and parameters of the data‐generating mechanism
Description and parameter formulisation across each simulation scenario
| Simulation Scenario | Description | Parameter values |
|---|---|---|
| RCT: 10% dropout | A randomized controlled |
|
| trial with treatment |
| |
| randomly allocated to 50% |
| |
| of the population at baseline, |
| |
| with 10% treatment dropout. |
| |
|
| ||
|
| ||
| Observational: 50% | An observational study |
|
| treated | where 50% of the population |
|
| have treatment. |
| |
|
| ||
|
| ||
|
| ||
| Observational: 20% | An observational study |
|
| treated | where 20% of the population |
|
| have treatment. |
| |
|
| ||
|
| ||
|
|
: results from across a range of percentage dropouts (values of ) gave similar results as those for the RCT: 10% dropout scenario and so are omitted. They are available on request.
Description of the performance‐settings and corresponding test datasets
| Performance‐setting | Description | Test set data‐generating mechanism |
|---|---|---|
| Mix of Treatment | Model validation on samples | Test set 1 ( |
| (MT) | drawn from a similar | Generated under exactly the same |
| population to the | process as the development cohort. | |
| development set. | ||
| Corresponds to estimating E1. | ||
| No Baseline Treatment | Model validation on samples | Test set 1 ( |
| (NBT) | drawn from a similar |
|
| population to the | Generated under exactly the same | |
| development set, but | process as the development cohort, | |
| restricted to those without | but restricted to examining | |
| treatment at baseline. |
| |
| Corresponds to estimating E2. | ||
| No Treatment | Model validation in a | Test set 2 ( |
| Throughout (NTT) | population where treatment is | generated as |
| withheld from all patients, |
| |
| but where the distribution of |
| |
| covariates is similar to the |
| |
| development cohort. | ||
| Corresponds to estimating E3. |
Figure 3Calibration‐in‐the‐large in each simulation scenario (rows), across all performance‐settings (columns), and values of (the cholesterol lowering effect of statins). In performance settings no baseline treatment (NBT) and no treatment throughout (NTT), the calibration‐in‐the‐large for the treatment‐naïve model and the model treatment is indistinguishable. MSM, marginal structural model; MT, mix of treatment [Colour figure can be viewed at http://wileyonlinelibrary.com]
Figure 4Proportion of patients in the Observational: 50% treated simulation scenario who would initiate treatment at baseline if their predicted risk given no current or future intervention exceeded a given treatment threshold. Note that values of −2.5, −1.5, and −0.5 have been removed for clarity. The treatment‐naïve model and the model including baseline treatment are identical. MSM, marginal structural model [Colour figure can be viewed at http://wileyonlinelibrary.com]
Parameter estimates from the Clinical Practice Research Datalink example estimated from the treatment‐naïve model and the marginal structural model (MSM). All of the variables are those extracted at baseline (ie, the index date). Note that both models include a year‐specific intercept fitted as a restricted cubic spline
| Variable | Treatment Naïve Model (SE) | MSM (SE) |
|---|---|---|
| Statin Initiation | N/A | −0.1002 (0.0103) |
| Female | −0.5566 (0.0066) | −0.5331 (0.0062) |
| Age | 0.0711 (0.0003) | 0.0723 (0.0002) |
| Atrial Fibrillation | 0.4827 (0.0156) | 0.4353 (0.0148) |
| Chronic Kidney Disease (stage 4/5) | 0.3646 (0.0352) | 0.3612 (0.0318) |
| Type I diabetes | 0.7307 (0.0585) | 0.5859 (0.0564) |
| Type II diabetes | 0.5967 (0.0138) | 0.5414 (0.0128) |
| Ethnicity | ||
| White or not stated | Reference | Reference |
| Asian | −0.0840 (0.0700) | −0.1936 (0.0679) |
| Bangladesh | 0.1343 (0.1557) | 0.1191 (0.1504) |
| Black | −0.6454 (0.0568) | −0.7092 (0.0536) |
| Chinese | −0.7970 (0.1629) | −0.7965 (0.1553) |
| Indian | −0.0250 (0.0498) | 0.0787 (0.0440) |
| Mixed | −0.6138 (0.1287) | −0.6547 (0.1229) |
| Other | −0.3416 (0.0758) | −0.2489 (0.0674) |
| Pakistani | 0.3351 (0.0794) | 0.3109 (0.0766) |
| Family history of coronary heart disease | 0.1391 (0.0088) | 0.1172 (0.0082) |
| Hypertension | 0.1390 (0.0078) | 0.2553 (0.0072) |
| Rheumatoid arthritis | 0.4200 (0.0237) | 0.4072 (0.0220) |
| Systolic blood pressure | 0.0109 (0.0002) | 0.0104 (0.0002) |
SE = standard error.