| Literature DB >> 21337595 |
Abstract
Propensity-score matching allows one to reduce the effects of treatment-selection bias or confounding when estimating the effects of treatments when using observational data. Some authors have suggested that methods of inference appropriate for independent samples can be used for assessing the statistical significance of treatment effects when using propensity-score matching. Indeed, many authors in the applied medical literature use methods for independent samples when making inferences about treatment effects using propensity-score matched samples. Dichotomous outcomes are common in healthcare research. In this study, we used Monte Carlo simulations to examine the effect on inferences about risk differences (or absolute risk reductions) when statistical methods for independent samples are used compared with when statistical methods for paired samples are used in propensity-score matched samples. We found that compared with using methods for independent samples, the use of methods for paired samples resulted in: (i) empirical type I error rates that were closer to the advertised rate; (ii) empirical coverage rates of 95 per cent confidence intervals that were closer to the advertised rate; (iii) narrower 95 per cent confidence intervals; and (iv) estimated standard errors that more closely reflected the sampling variability of the estimated risk difference. Differences between the empirical and advertised performance of methods for independent samples were greater when the treatment-selection process was stronger compared with when treatment-selection process was weaker. We recommend using statistical methods for paired samples when using propensity-score matched samples for making inferences on the effect of treatment on the reduction in the probability of an event occurring.Mesh:
Year: 2011 PMID: 21337595 PMCID: PMC3110307 DOI: 10.1002/sim.4200
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Empirical type I error rates for statistical methods for paired vs independent samples
| Covariate scenario | Mean percent of treated subjects matched | Pearson chi-squared test (independent sample method) | McNemar's test (paired sample method) |
|---|---|---|---|
| Independent normal | 90.4 | 0.0427 | 0.0614 |
| Correlated normal | 75.2 | 0.0351 | 0.0537 |
| Mixture scenario 1 | 91.4 | 0.0296 | 0.0521 |
| Mixture scenario 2 | 96.1 | 0.0422 | 0.0499 |
| Independent Bernoulli | 99.7 | 0.0416 | 0.0477 |
| Independent normal | 90.4 | 0.0378 | 0.0493 |
| Correlated normal | 75.2 | 0.0345 | 0.0537 |
| Mixture scenario 1 | 91.4 | 0.0405 | 0.0499 |
| Mixture scenario 2 | 96.1 | 0.0427 | 0.0515 |
| Independent Bernoulli | 99.7 | 0.051 | 0.0537 |
| Independent normal | 93.5 | 0.0208 | 0.0405 |
| Correlated normal | 78.6 | 0.0208 | 0.0526 |
| Mixture scenario 1 | 95.2 | 0.029 | 0.0526 |
| Mixture scenario 2 | 97.7 | 0.0334 | 0.051 |
| Independent Bernoulli | 99 | 0.0395 | 0.0532 |
| Independent normal | 93.5 | 0.0258 | 0.0521 |
| Correlated normal | 78.6 | 0.0214 | 0.0466 |
| Mixture scenario 1 | 95.2 | 0.0323 | 0.0477 |
| Mixture scenario 2 | 97.7 | 0.0312 | 0.0433 |
| Independent Bernoulli | 99 | 0.0384 | 0.0504 |
Note: Cells contain results averaged over 1825 Monte Carlo simulations.
Coverage and width of empirical 95 per cent confidence intervals and estimation of sampling variances of treatment effects −0.29 treatment effect, weak treatment-selection model
| Coverage of 95 per cent confidence intervals | Lengths of 95 per cent confidence intervals | Ratio of length of independent CI to paired CI | Ratio of mean estimated variance of treatment effect to variance of empirical sampling distribution | ||||
|---|---|---|---|---|---|---|---|
| True risk difference | Independent | Paired | Independent | Paired | Independent | Paired | |
| 0 | 0.957 | 0.938 | 0.057 | 0.053 | 1.075 | 1.084 | 0.944 |
| −0.02 | 0.963 | 0.947 | 0.057 | 0.053 | 1.075 | 1.076 | 0.938 |
| −0.05 | 0.964 | 0.949 | 0.056 | 0.052 | 1.077 | 1.093 | 0.956 |
| −0.1 | 0.963 | 0.95 | 0.055 | 0.051 | 1.078 | 1.126 | 0.99 |
| −0.15 | 0.961 | 0.944 | 0.053 | 0.05 | 1.06 | 1.138 | 1.008 |
| 0 | 0.965 | 0.946 | 0.063 | 0.057 | 1.105 | 1.255 | 1.02 |
| −0.02 | 0.975 | 0.958 | 0.063 | 0.057 | 1.105 | 1.247 | 1.016 |
| −0.05 | 0.97 | 0.948 | 0.063 | 0.057 | 1.105 | 1.24 | 1.013 |
| −0.1 | 0.972 | 0.951 | 0.062 | 0.056 | 1.107 | 1.262 | 1.037 |
| −0.15 | 0.969 | 0.952 | 0.061 | 0.055 | 1.109 | 1.244 | 1.032 |
| 0 | 0.97 | 0.948 | 0.056 | 0.053 | 1.057 | 1.131 | 0.991 |
| −0.02 | 0.964 | 0.951 | 0.056 | 0.053 | 1.057 | 1.124 | 0.986 |
| −0.05 | 0.961 | 0.95 | 0.056 | 0.052 | 1.077 | 1.111 | 0.977 |
| −0.1 | 0.964 | 0.95 | 0.054 | 0.051 | 1.059 | 1.122 | 0.992 |
| −0.15 | 0.962 | 0.949 | 0.053 | 0.05 | 1.06 | 1.124 | 1.001 |
| 0 | 0.958 | 0.95 | 0.054 | 0.052 | 1.038 | 1.091 | 0.988 |
| −0.02 | 0.958 | 0.95 | 0.054 | 0.051 | 1.059 | 1.093 | 0.991 |
| −0.05 | 0.962 | 0.951 | 0.053 | 0.051 | 1.039 | 1.07 | 0.972 |
| −0.1 | 0.953 | 0.942 | 0.052 | 0.05 | 1.04 | 1.07 | 0.977 |
| −0.15 | 0.955 | 0.945 | 0.05 | 0.048 | 1.042 | 1.085 | 0.998 |
| 0 | 0.958 | 0.952 | 0.052 | 0.051 | 1.02 | 1.07 | 1.014 |
| −0.02 | 0.953 | 0.95 | 0.052 | 0.05 | 1.04 | 1.078 | 1.023 |
| −0.05 | 0.955 | 0.95 | 0.051 | 0.05 | 1.02 | 1.077 | 1.024 |
| −0.1 | 0.95 | 0.946 | 0.05 | 0.048 | 1.042 | 1.072 | 1.022 |
| −0.15 | 0.959 | 0.954 | 0.048 | 0.047 | 1.021 | 1.058 | 1.014 |
Coverage and width of empirical 95 per cent confidence intervals and estimation of sampling variances of treatment effects −0.15 treatment effect, strong treatment-selection model
| Coverage of 95 per cent confidence intervals | Lengths of 95 per cent confidence intervals | Ratio of length of independent CI to paired CI | Ratio of mean estimated variance of treatment effect to variance of empirical sampling distribution | ||||
|---|---|---|---|---|---|---|---|
| True risk difference | Independent | Paired | Independent | Paired | Independent | Paired | |
| 0 | 0.974 | 0.947 | 0.168 | 0.148 | 1.135 | 1.252 | 0.976 |
| −0.02 | 0.974 | 0.952 | 0.167 | 0.147 | 1.136 | 1.286 | 1.006 |
| −0.05 | 0.972 | 0.946 | 0.164 | 0.145 | 1.131 | 1.257 | 0.989 |
| −0.1 | 0.974 | 0.952 | 0.159 | 0.142 | 1.120 | 1.265 | 1.012 |
| −0.15 | 0.968 | 0.948 | 0.152 | 0.138 | 1.101 | 1.199 | 0.984 |
| 0 | 0.978 | 0.953 | 0.188 | 0.161 | 1.168 | 1.360 | 0.997 |
| −0.02 | 0.977 | 0.950 | 0.187 | 0.160 | 1.169 | 1.341 | 0.988 |
| −0.05 | 0.973 | 0.945 | 0.184 | 0.159 | 1.157 | 1.297 | 0.963 |
| −0.1 | 0.970 | 0.945 | 0.179 | 0.156 | 1.147 | 1.262 | 0.954 |
| −0.15 | 0.973 | 0.949 | 0.173 | 0.152 | 1.138 | 1.259 | 0.977 |
| 0 | 0.967 | 0.952 | 0.164 | 0.147 | 1.116 | 1.234 | 0.993 |
| −0.02 | 0.972 | 0.953 | 0.162 | 0.146 | 1.110 | 1.236 | 0.997 |
| −0.05 | 0.973 | 0.951 | 0.159 | 0.144 | 1.104 | 1.296 | 1.055 |
| −0.1 | 0.971 | 0.955 | 0.154 | 0.139 | 1.108 | 1.248 | 1.030 |
| −0.15 | 0.969 | 0.948 | 0.147 | 0.135 | 1.089 | 1.208 | 1.020 |
| 0 | 0.969 | 0.956 | 0.154 | 0.143 | 1.077 | 1.208 | 1.042 |
| −0.02 | 0.973 | 0.957 | 0.152 | 0.141 | 1.078 | 1.228 | 1.061 |
| −0.05 | 0.973 | 0.957 | 0.148 | 0.138 | 1.072 | 1.234 | 1.071 |
| −0.1 | 0.965 | 0.955 | 0.141 | 0.133 | 1.060 | 1.188 | 1.047 |
| −0.15 | 0.961 | 0.950 | 0.133 | 0.126 | 1.056 | 1.151 | 1.039 |
| 0 | 0.960 | 0.950 | 0.146 | 0.139 | 1.050 | 1.117 | 1.008 |
| −0.02 | 0.964 | 0.954 | 0.143 | 0.136 | 1.051 | 1.148 | 1.039 |
| −0.05 | 0.963 | 0.954 | 0.139 | 0.133 | 1.045 | 1.140 | 1.037 |
| −0.1 | 0.957 | 0.950 | 0.131 | 0.126 | 1.040 | 1.079 | 0.996 |
| −0.15 | 0.948 | 0.939 | 0.122 | 0.118 | 1.034 | 1.004 | 0.947 |
Coverage and width of empirical 95 per cent confidence intervals and estimation of sampling variances of treatment effects −0.15 treatment effect, weak treatment-selection model
| Coverage of 95 per cent confidence intervals | Lengths of 95 per cent confidence intervals | Ratio of length of independent CI to paired CI | Ratio of mean estimated variance of treatment effect to variance of empirical sampling distribution | ||||
|---|---|---|---|---|---|---|---|
| True risk difference | Independent | Paired | Independent | Paired | Independent | Paired | |
| 0 | 0.962 | 0.951 | 0.047 | 0.045 | 1.044 | 1.113 | 1.005 |
| −0.02 | 0.953 | 0.942 | 0.047 | 0.044 | 1.068 | 1.097 | 0.993 |
| −0.05 | 0.944 | 0.929 | 0.045 | 0.043 | 1.047 | 1.07 | 0.974 |
| −0.1 | 0.873 | 0.856 | 0.043 | 0.041 | 1.049 | 1.074 | 0.99 |
| −0.15 | 0.685 | 0.665 | 0.04 | 0.039 | 1.026 | 1.073 | 1.009 |
| 0 | 0.965 | 0.946 | 0.053 | 0.049 | 1.082 | 1.193 | 1.019 |
| −0.02 | 0.956 | 0.939 | 0.053 | 0.049 | 1.082 | 1.207 | 1.034 |
| −0.05 | 0.923 | 0.895 | 0.051 | 0.048 | 1.063 | 1.192 | 1.027 |
| −0.1 | 0.722 | 0.676 | 0.049 | 0.046 | 1.065 | 1.153 | 1.005 |
| −0.15 | 0.311 | 0.265 | 0.047 | 0.044 | 1.068 | 1.138 | 1.01 |
| 0 | 0.959 | 0.95 | 0.047 | 0.045 | 1.044 | 1.105 | 1.004 |
| −0.02 | 0.956 | 0.944 | 0.046 | 0.044 | 1.045 | 1.106 | 1.008 |
| −0.05 | 0.953 | 0.944 | 0.045 | 0.043 | 1.047 | 1.126 | 1.03 |
| −0.1 | 0.91 | 0.896 | 0.042 | 0.041 | 1.024 | 1.099 | 1.018 |
| −0.15 | 0.744 | 0.723 | 0.039 | 0.038 | 1.026 | 1.058 | 0.998 |
| 0 | 0.957 | 0.948 | 0.045 | 0.043 | 1.047 | 1.039 | 0.968 |
| −0.02 | 0.952 | 0.944 | 0.044 | 0.042 | 1.048 | 1.027 | 0.959 |
| −0.05 | 0.95 | 0.942 | 0.042 | 0.041 | 1.024 | 0.994 | 0.932 |
| −0.1 | 0.925 | 0.919 | 0.04 | 0.039 | 1.026 | 0.955 | 0.906 |
| −0.15 | 0.883 | 0.877 | 0.036 | 0.036 | 1 | 0.95 | 0.917 |
| 0 | 0.949 | 0.946 | 0.042 | 0.042 | 1 | 1.038 | 1.002 |
| −0.02 | 0.955 | 0.949 | 0.041 | 0.041 | 1 | 1.03 | 0.996 |
| −0.05 | 0.952 | 0.95 | 0.04 | 0.039 | 1.026 | 1.031 | 1.001 |
| −0.1 | 0.953 | 0.951 | 0.036 | 0.036 | 1 | 0.983 | 0.961 |
| −0.15 | 0.946 | 0.945 | 0.032 | 0.032 | 1 | 0.972 | 0.962 |
Coverage and width of empirical 95 per cent confidence intervals and estimation of sampling variances of treatment effects −0.29 treatment effect, strong treatment-selection model
| Coverage of 95 per cent confidence intervals | Lengths of 95 per cent confidence intervals | Ratio of length of independent CI to paired CI | Ratio of mean estimated variance of treatment effect to variance of empirical sampling distribution | ||||
|---|---|---|---|---|---|---|---|
| True risk difference | Independent | Paired | Independent | Paired | Independent | Paired | |
| 0 | 0.979 | 0.958 | 0.181 | 0.157 | 1.153 | 1.391 | 1.044 |
| −0.02 | 0.980 | 0.956 | 0.181 | 0.157 | 1.153 | 1.391 | 1.044 |
| −0.05 | 0.976 | 0.951 | 0.181 | 0.157 | 1.153 | 1.370 | 1.032 |
| −0.1 | 0.975 | 0.950 | 0.180 | 0.157 | 1.146 | 1.356 | 1.026 |
| −0.15 | 0.979 | 0.957 | 0.179 | 0.156 | 1.147 | 1.398 | 1.066 |
| 0 | 0.979 | 0.944 | 0.188 | 0.157 | 1.197 | 1.470 | 1.027 |
| −0.02 | 0.982 | 0.945 | 0.189 | 0.158 | 1.196 | 1.478 | 1.034 |
| −0.05 | 0.981 | 0.952 | 0.191 | 0.160 | 1.194 | 1.508 | 1.057 |
| −0.1 | 0.968 | 0.928 | 0.193 | 0.162 | 1.191 | 1.470 | 1.039 |
| −0.15 | 0.947 | 0.897 | 0.193 | 0.163 | 1.184 | 1.334 | 0.954 |
| 0 | 0.970 | 0.946 | 0.180 | 0.158 | 1.139 | 1.251 | 0.966 |
| −0.02 | 0.976 | 0.947 | 0.180 | 0.158 | 1.139 | 1.288 | 0.995 |
| −0.05 | 0.971 | 0.942 | 0.179 | 0.158 | 1.133 | 1.240 | 0.961 |
| −0.1 | 0.969 | 0.945 | 0.178 | 0.157 | 1.134 | 1.268 | 0.987 |
| −0.15 | 0.964 | 0.940 | 0.176 | 0.156 | 1.128 | 1.218 | 0.956 |
| 0 | 0.966 | 0.949 | 0.176 | 0.161 | 1.093 | 1.205 | 1.003 |
| −0.02 | 0.970 | 0.951 | 0.176 | 0.160 | 1.100 | 1.206 | 1.004 |
| −0.05 | 0.969 | 0.944 | 0.175 | 0.159 | 1.101 | 1.207 | 1.005 |
| −0.1 | 0.968 | 0.951 | 0.172 | 0.157 | 1.096 | 1.223 | 1.021 |
| −0.15 | 0.968 | 0.946 | 0.169 | 0.155 | 1.090 | 1.191 | 1.003 |
| 0 | 0.959 | 0.946 | 0.172 | 0.161 | 1.068 | 1.137 | 0.989 |
| −0.02 | 0.964 | 0.948 | 0.172 | 0.160 | 1.075 | 1.117 | 0.973 |
| −0.05 | 0.966 | 0.948 | 0.170 | 0.159 | 1.069 | 1.150 | 1.004 |
| −0.1 | 0.965 | 0.951 | 0.167 | 0.156 | 1.071 | 1.128 | 0.990 |
| −0.15 | 0.966 | 0.956 | 0.163 | 0.153 | 1.065 | 1.121 | 0.992 |