| Literature DB >> 24463885 |
Peter C Austin1, Tibor Schuster2.
Abstract
Observational studies are increasingly being used to estimate the effect of treatments, interventions and exposures on outcomes that can occur over time. Historically, the hazard ratio, which is a relative measure of effect, has been reported. However, medical decision making is best informed when both relative and absolute measures of effect are reported. When outcomes are time-to-event in nature, the effect of treatment can also be quantified as the change in mean or median survival time due to treatment and the absolute reduction in the probability of the occurrence of an event within a specified duration of follow-up. We describe how three different propensity score methods, propensity score matching, stratification on the propensity score and inverse probability of treatment weighting using the propensity score, can be used to estimate absolute measures of treatment effect on survival outcomes. These methods are all based on estimating marginal survival functions under treatment and lack of treatment. We then conducted an extensive series of Monte Carlo simulations to compare the relative performance of these methods for estimating the absolute effects of treatment on survival outcomes. We found that stratification on the propensity score resulted in the greatest bias. Caliper matching on the propensity score and a method based on earlier work by Cole and Hernán tended to have the best performance for estimating absolute effects of treatment on survival outcomes. When the prevalence of treatment was less extreme, then inverse probability of treatment weighting-based methods tended to perform better than matching-based methods.Entities:
Keywords: Monte Carlo simulations; inverse probability of treatment weighting; observational study; propensity score; survival analysis; time-to-event outcomes
Mesh:
Year: 2014 PMID: 24463885 PMCID: PMC5051602 DOI: 10.1177/0962280213519716
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Figure 1.Estimates of changes in mean survival (ATE).
Figure 2.Estimates of changes in mean survival (ATT).
Figure 3.Estimates of changes in median survival (ATE).
Figure 4.Estimates of changes in median survival (ATT).
Figure 5.Bias in estimating S(10th percentile of t).
Figure 6.Bias in estimating S(25th percentile of t).
Figure 7.Bias in estimating S(50th percentile of t).
Figure 8.Bias in estimating S(75th percentile of t).
Figure 9.Bias in estimating S(90th percentile of t).
Empirical type I error rates of different propensity score methods for comparing survival functions between treatment groups.
| Statistical method | Prevalence of treatment | ||
|---|---|---|---|
| 0.05 | 0.10 | 0.25 | |
| Effect in overall population of all subjects | |||
| Stratification (Cox regression stratifying on PS strata) | 0.439 | 0.571 | 0.629 |
| Stratification (stratified log-rank test) | 0.439 | 0.572 | 0.629 |
| IPTW (Cole and Hernán) | 0.070 | 0.072 | 0.051 |
| IPTW (Xie and Liu) | 0.722 | 0.604 | 0.350 |
| Effect in population of treated subjects | |||
| Caliper matching (naïve Cox regression) | 0.013 | 0.006 | 0.010 |
| Caliper matching (Cox regression with robust standard errors) | 0.034 | 0.030 | 0.029 |
| Caliper matching (log-rank test) | 0.013 | 0.006 | 0.010 |
| Caliper matching (stratified log-rank test) | 0.035 | 0.033 | 0.039 |
| Nearest neighbour matching (naïve Cox regression) | 0.012 | 0.008 | 0.073 |
| Nearest neighbour matching (Cox regression with robust standard errors) | 0.033 | 0.030 | 0.144 |
| Nearest neighbour matching (log-rank test) | 0.012 | 0.008 | 0.073 |
| Nearest neighbour matching (stratified log-rank test) | 0.033 | 0.033 | 0.283 |
| IPTW (Cole and Hernán) | 0.009 | 0.006 | 0.006 |
| IPTW (Xie and Liu) | 0.000 | 0.000 | 0.000 |
Note: The cells contain empirical estimates of the type I error rate. These were the proportion of 1000 simulated datasets in which the null hypothesis of no difference in survival functions was rejected at the P < 0.05 level.