| Literature DB >> 24122911 |
Abstract
Propensity score methods are increasingly being used to estimate causal treatment effects in observational studies. In medical and epidemiological studies, outcomes are frequently time-to-event in nature. Propensity-score methods are often applied incorrectly when estimating the effect of treatment on time-to-event outcomes. This article describes how two different propensity score methods (matching and inverse probability of treatment weighting) can be used to estimate the measures of effect that are frequently reported in randomized controlled trials: (i) marginal survival curves, which describe survival in the population if all subjects were treated or if all subjects were untreated; and (ii) marginal hazard ratios. The use of these propensity score methods allows one to replicate the measures of effect that are commonly reported in randomized controlled trials with time-to-event outcomes: both absolute and relative reductions in the probability of an event occurring can be determined. We also provide guidance on variable selection for the propensity score model, highlight methods for assessing the balance of baseline covariates between treated and untreated subjects, and describe the implementation of a sensitivity analysis to assess the effect of unmeasured confounding variables on the estimated treatment effect when outcomes are time-to-event in nature. The methods in the paper are illustrated by estimating the effect of discharge statin prescribing on the risk of death in a sample of patients hospitalized with acute myocardial infarction. In this tutorial article, we describe and illustrate all the steps necessary to conduct a comprehensive analysis of the effect of treatment on time-to-event outcomes.Entities:
Keywords: confounding; event history analysis; inverse probability of treatment weighting; marginal effects; observational study; propensity score; propensity score matching; survival analysis
Mesh:
Substances:
Year: 2013 PMID: 24122911 PMCID: PMC4285179 DOI: 10.1002/sim.5984
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Comparison of baseline characteristics between treated and untreated subjects in the original sample and in the propensity score matched sample.
| Baseline variable | Original sample | Matched sample (caliper matching) | ||||
|---|---|---|---|---|---|---|
| Statin: No (6058) | Statin: Yes (3049) | Standardized difference | Statin: No (2449) | Statin: Yes | Standardized difference | |
| Demographic characteristics | ||||||
| Age | 68.1 ± 13.8 | 63.4 ± 12.4 | 0.355 | 63.6 ± 12.6 | 63.8 ± 12.6 | 0.013 |
| Female | 2241 (37.0%) | 887 (29.1%) | 0.167 | 738 (30.5%) | 736 (30.4%) | 0.002 |
| Presenting signs and symptoms | ||||||
| Acute congestive | 316 (5.2%) | 122 (4.0%) | 0.057 | 94 (3.9%) | 94 (3.9%) | 0.000 |
| heart failure/ | ||||||
| pulmonary edema | ||||||
| Classic cardiac risk factors | ||||||
| Family history of | 1763 (29.1%) | 1177 (38.6%) | 0.204 | 888 (36.6%) | 896 (37.0%) | 0.007 |
| heart disease | ||||||
| Diabetes | 1562 (25.8%) | 774 (25.4%) | 0.009 | 631 (26.0%) | 612 (25.3%) | 0.018 |
| Hyperlipidemia | 1138 (18.8%) | 1761 (57.8%) | 0.910 | 1115 (46.0%) | 1136 (46.9%) | 0.017 |
| Hypertension | 2683 (44.3%) | 1453 (47.7%) | 0.068 | 1,121 (46.3%) | 1,109 (45.8%) | 0.010 |
| Current smoker | 2004 (33.1%) | 1070 (35.1%) | 0.043 | 894 (36.9%) | 875 (36.1%) | 0.016 |
| Cardiac history and comorbid conditions | ||||||
| CVA/TIA | 610 (10.1%) | 237 (7.8%) | 0.079 | 181 (7.5%) | 196 (8.1%) | 0.023 |
| Angina | 1871 (30.9%) | 1086 (35.6%) | 0.101 | 833 (34.4%) | 822 (33.9%) | 0.010 |
| Cancer | 191 (3.2%) | 73 (2.4%) | 0.045 | 64 (2.6%) | 60 (2.5%) | 0.010 |
| Dementia | 243 (4.0%) | 33 (1.1%) | 0.171 | 26 (1.1%) | 29 (1.2%) | 0.012 |
| Previous AMI | 1254 (20.7%) | 799 (26.2%) | 0.132 | 566 (23.4%) | 562 (23.2%) | 0.004 |
| Asthma | 338 (5.6%) | 166 (5.4%) | 0.006 | 110 (4.5%) | 132 (5.4%) | 0.042 |
| Depression | 441 (7.3%) | 192 (6.3%) | 0.039 | 154 (6.4%) | 159 (6.6%) | 0.008 |
| Hyperthyroidism | 71 (1.2%) | 40 (1.3%) | 0.013 | 37 (1.5%) | 28 (1.2%) | 0.032 |
| Peptic ulcer | 345 (5.7%) | 156 (5.1%) | 0.025 | 127 (5.2%) | 120 (5.0%) | 0.013 |
| disease | ||||||
| Peripheral vascular | 430 (7.1%) | 220 (7.2%) | 0.005 | 187 (7.7%) | 178 (7.3%) | 0.014 |
| disease | ||||||
| Previous coronary | 433 (7.1%) | 411 (13.5%) | 0.220 | 267 (11.0%) | 264 (10.9%) | 0.004 |
| revascularization | ||||||
| Congestive heart | 275 (4.5%) | 91 (3.0%) | 0.079 | 86 (3.5%) | 71 (2.9%) | 0.035 |
| failure (chronic) | ||||||
| Stenosis | 96 (1.6%) | 35 (1.1%) | 0.037 | 21 (0.9%) | 30 (1.2%) | 0.036 |
| Vital signs on admission | ||||||
| Systolic blood pressure | 148.7 ± 31.6 | 149.3 ± 30.1 | 0.021 | 149.8 ± 30.8 | 149.0 ± 30.4 | 0.025 |
| Diastolic blood pressure | 83.6 ± 18.6 | 84.5 ± 18.0 | 0.047 | 84.4 ± 18.6 | 84.5 ± 18.2 | 0.010 |
| Heart rate | 84.6 ± 24.3 | 81.7 ± 23.0 | 0.121 | 81.3 ± 22.5 | 81.7 ± 22.5 | 0.020 |
| Respiratory rate | 21.2 ± 5.7 | 20.3 ± 4.8 | 0.166 | 20.4 ± 4.9 | 20.3 ± 4.9 | 0.005 |
| Results of initial laboratory tests | ||||||
| White blood count | 10.3 ± 4.9 | 10.0 ± 4.4 | 0.065 | 10.1 ± 4.4 | 10.1 ± 4.6 | 0.008 |
| Hemoglobin | 137.5 ± 19.3 | 140.6 ± 16.9 | 0.167 | 140.6 ± 17.4 | 140.5 ± 17.4 | 0.003 |
| Sodium | 138.9 ± 3.9 | 139.2 ± 3.3 | 0.079 | 139.2 ± 3.7 | 139.2 ± 3.3 | 0.011 |
| Glucose | 9.4 ± 5.1 | 9.2 ± 5.3 | 0.037 | 9.3 ± 5.6 | 9.2 ± 5.2 | 0.013 |
| Potassium | 4.1 ± 0.6 | 4.1 ± 0.5 | 0.061 | 4.1 ± 0.5 | 4.1 ± 0.5 | 0.023 |
| Creatinine | 105.7 ± 65.4 | 99.9 ± 50.0 | 0.096 | 100.4 ± 54.3 | 101.1 ± 54.2 | 0.014 |
CVA, cerebral vascular accident; TIA, transient ischemic attack; AMI, acute myocardial infarction. Continuous variables are reported as means ± standard deviations. Dichotomous variables are reported N (%). The propensity score matched sample was constructed greedy nearest neighbor matching on the logit of the propensity score using calipers of width equal to 0.2 of the standard deviation of the logit of the propensity score.
Standardized differences comparing baseline covariates between treated and untreated subjects using different propensity score methods.
| Propensity score method | 31 baseline covariates | 55 pairwise interactions of continuous covariates | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Minimum | 25th percentile | Median | 75th percentile | Maximum | Minimum | 25th percentile | Median | 75th percentile | Maximum | |
| Full sample (unmatched and unweighted) | 0.005 | 0.038 | 0.068 | 0.169 | 0.875 | 0.003 | 0.046 | 0.092 | 0.155 | 0.348 |
| Caliper matching | 0 | 0.008 | 0.013 | 0.020 | 0.042 | 0.001 | 0.005 | 0.011 | 0.016 | 0.033 |
| Optimal matching | 0.001 | 0.013 | 0.026 | 0.046 | 0.425 | 0.003 | 0.012 | 0.026 | 0.047 | 0.061 |
| Weighting using the inverse probability | 0.002 | 0.007 | 0.014 | 0.018 | 0.039 | 0.001 | 0.004 | 0.010 | 0.016 | 0.036 |
| of treatment weights | ||||||||||
Figure 1Kaplan–Meier survival curves obtained using different propensity score methods.
Recommendations for the use of propensity score methods with time-to-event outcomes.
| Objectives | Propensity score matching (pair matching) | Inverse probability of treatment weighting using the propensity score |
|---|---|---|
| Estimand | ATT | ATE or ATT depending on the weights selected |
| Balance assessment | Compare distribution of baseline covariates between treated and untreated subjects in the matched sample. | Compare distribution of baseline covariates between treated and untreated subjects in the sample weighted by the inverse probability of treatment. |
| Estimate and report survival curves | Estimate Kaplan–Meier survival curves in treated and untreated subjects in the matched sample. Use stratified log-rank test to compare survival curves (stratify on matched sets). | Estimate adjusted Kaplan–Meier survival curves in the weighted sample. Use adjusted log-rank test to compare survival curves. |
| Estimate and report absolute reduction in the probability of an event occurring | From the estimated survival curves, estimate the absolute difference in the probability of an event occurring within a specified duration of follow-up. | From the estimated marginal survival curves, estimate the absolute difference in probability of an event occurring within a specified duration of follow-up. |
| Estimate relative change in the hazard of an event occurring | Use Cox proportional hazards model in the matched sample. Regress survival on an indicator variable for treatment selection. Use a robust variance estimator. | Use Cox proportional hazards model in the weighted sample. Regress survival on an indicator variable for treatment selection. Use a robust variance estimator. |
ATT, average treatment effect for the treated; ATE, average treatment effect.