| Literature DB >> 24114655 |
Abstract
Matching on the propensity score is widely used to estimate the effect of an exposure in observational studies. However, the quality of the matches can be affected by decisions made during the matching process, particularly the order in which subjects are selected for matching and the maximum permitted difference between matched subjects (the "caliper"). This study used simulations to explore the effects of these decisions on both the imbalance of covariates and the closeness of matching, while allowing the numbers of potential matches and strengths of association between the confounding variable and the exposure to vary. It was found that, without a caliper, substantial bias was possible, particularly with a relatively small reservoir of potential matches and strong confounder-exposure association. Use of the recommended caliper reduced the bias considerably, but bias remained if subjects were selected by increasing or decreasing propensity score. A tighter caliper led to greatly reduced bias and closer matches, although some subjects could not be matched. This study suggests that a narrow caliper can improve the performance of propensity score matching. In situations where it is impossible to find appropriate matches for all exposed subjects, it is better to select subjects in order of the best available matches, rather than increasing or decreasing the propensity score.Entities:
Keywords: caliper; covariate balance; matching; propensity score
Mesh:
Year: 2013 PMID: 24114655 PMCID: PMC3873103 DOI: 10.1093/aje/kwt212
Source DB: PubMed Journal: Am J Epidemiol ISSN: 0002-9262 Impact factor: 4.897
Figure 1.Distribution of X in exposed and unexposed subjects when the log of the odds ratio for the effect of X on exposure takes the values A) 1.5, B) 2, C) 5, and D) 10. The solid line represents treated subjects, and the dashed line represents untreated subjects.
Initial Differences Between Exposed and Unexposed Subjects as Measured by the Mean Difference in X and the AUC
| Controls per Case | OR for Effect of | |||
|---|---|---|---|---|
| 1.5 | 2 | 5 | 10 | |
| Mean difference in | ||||
| 2 | 0.397 | 0.662 | 1.330 | 1.681 |
| 5 | 0.400 | 0.668 | 1.383 | 1.761 |
| 10 | 0.399 | 0.681 | 1.435 | 1.853 |
| 20 | 0.405 | 0.688 | 1.489 | 1.948 |
| AUC | ||||
| 2 | 0.611 | 0.680 | 0.828 | 0.887 |
| 5 | 0.611 | 0.682 | 0.836 | 0.896 |
| 10 | 0.611 | 0.685 | 0.845 | 0.907 |
| 20 | 0.613 | 0.687 | 0.853 | 0.916 |
Abbreviations: AUC, area under the receiver operating characteristic curve; OR, odds ratio.
Mean Difference in X Between Exposed and Unexposed Subjects When No Caliper is Applied, Using 5 Different Matching Methods
| Matching Method by Controls per Case | OR for Effect of | |||
|---|---|---|---|---|
| 1.5 | 2 | 5 | 10 | |
| 2 | ||||
| Ascendinga | 0.0103 | 0.0906 | 0.4772 | 0.6794 |
| Descendingb | 0.0233 | 0.0945 | 0.4773 | 0.6794 |
| Random orderc | 0.0157 | 0.0920 | 0.4772 | 0.6794 |
| Best firstd | 0.0156 | 0.0920 | 0.4772 | 0.6794 |
| 5-to-1-digite | 0.0502 | 0.1912 | 0.7718 | 1.0404 |
| 5 | ||||
| Ascending | −0.0007 | 0.0037 | 0.1639 | 0.3290 |
| Descending | 0.0027 | 0.0090 | 0.1644 | 0.3291 |
| Random order | 0.0011 | 0.0062 | 0.1641 | 0.3291 |
| Best first | 0.0011 | 0.0061 | 0.1641 | 0.3290 |
| 5-to-1-digit | 0.0022 | 0.0078 | 0.1596 | 0.3878 |
| 10 | ||||
| Ascending | −0.0002 | 0.0000 | 0.0674 | 0.1902 |
| Descending | 0.0008 | 0.0025 | 0.0682 | 0.1904 |
| Random order | 0.0003 | 0.0014 | 0.0677 | 0.1903 |
| Best first | 0.0003 | 0.0014 | 0.0677 | 0.1903 |
| 5-to-1-digit | 0.0010 | 0.0028 | 0.0266 | 0.0604 |
| 20 | ||||
| Ascending | 0.0000 | 0.0000 | 0.0258 | 0.1083 |
| Descending | 0.0002 | 0.0009 | 0.0271 | 0.1086 |
| Random order | 0.0001 | 0.0005 | 0.0263 | 0.1084 |
| Best first | 0.0001 | 0.0005 | 0.0263 | 0.1084 |
| 5-to-1-digit | 0.0007 | 0.0032 | 0.1068 | 0.2943 |
Abbreviation: OR, odds ratio.
a In the ascending method, each time a match is made, the exposed subject with the lowest propensity score is used.
b In the descending method, each time a match is made, the exposed subject with the highest propensity score is used.
c In the random order method, each time a match is made, the exposed subject is selected at random.
d In the best first method, each time a match is made, the exposed subject with the closest matching unexposed subject is used.
e In the 5-to-1-digit method, initially, matched pairs are selected at random from exposed-unexposed pairs for which propensity score is identical to 5 decimal places (on a log-odds scale). When no such pairs remain, pairs are selected at random from those with identical scores to 4 decimal places, then to 3 decimal places, and so forth.
Figure 2.Scatter plot of X in matched control against X in exposed subject when no caliper is used. A and C show the results when there are 2 controls per case; B and D show 10 controls per case. In A and B, the odds ratio for the effect of X on exposure is 1.5, and in C and D it is 10. Matching methods used are symbolized as follows: blue x, descending; red o, ascending; yellow x, random; green o, best-first; and brown +, 5-to-1-digit. The diagonal line represents perfect matches.
Figure 3.Scatter plot of X in matched control against X in exposed subject by using 0.25-standard deviation caliper. A and C show the results when there are 2 controls per case, and B and D show 10 controls per case. In A and B, the odds ratio for the effect of X on exposure is 1.5, and in C and D it is 10. Matching methods used are symbolized as follows: blue x, descending; red o, ascending; yellow x, random; green o, best-first; and brown +, 5-to-1-digit.
Mean Difference in X Between Exposed and Unexposed Subjects When a 0.25-SD Caliper is Applied, Using 5 Different Matching Methods
| Matching Method by Controls per Case | OR for Effect of | |||
|---|---|---|---|---|
| 1.5 | 2 | 5 | 10 | |
| 2 | ||||
| Ascendinga | −0.0140 | −0.0253 | −0.0411 | −0.0421 |
| Descendingb | 0.0170 | 0.0508 | 0.1066 | 0.1116 |
| Random orderc | 0.0015 | 0.0042 | 0.0126 | 0.0168 |
| Best firstd | 0.0007 | 0.0012 | 0.0021 | 0.0025 |
| 5-to-1-digite | −0.0006 | −0.0010 | −0.0013 | −0.0010 |
| 5 | ||||
| Ascending | −0.0017 | −0.0047 | −0.0204 | −0.0267 |
| Descending | 0.0020 | 0.0058 | 0.0594 | 0.0835 |
| Random order | 0.0003 | 0.0008 | 0.0073 | 0.0125 |
| Best first | 0.0002 | 0.0005 | 0.0016 | 0.0023 |
| 5-to-1-digit | 0.0001 | 0.0000 | −0.0006 | −0.0005 |
| 10 | ||||
| Ascending | −0.0004 | −0.0013 | −0.0117 | −0.0187 |
| Descending | 0.0005 | 0.0016 | 0.0311 | 0.0627 |
| Random order | 0.0001 | 0.0003 | 0.0045 | 0.0097 |
| Best first | 0.0001 | 0.0002 | 0.0013 | 0.0020 |
| 5-to-1-digit | 0.0001 | 0.0001 | −0.0001 | −0.0002 |
| 20 | ||||
| Ascending | −0.0001 | −0.0004 | −0.0063 | −0.0129 |
| Descending | 0.0002 | 0.0005 | 0.0137 | 0.0439 |
| Random order | 0.0000 | 0.0001 | 0.0025 | 0.0072 |
| Best first | 0.0000 | 0.0001 | 0.0009 | 0.0017 |
| 5-to-1-digit | 0.0000 | 0.0001 | 0.0000 | −0.0001 |
Abbreviations: OR, odds ratio; SD, standard deviation.
a In the ascending method, each time a match is made, the exposed subject with the lowest propensity score is used.
b In the descending method, each time a match is made, the exposed subject with the highest propensity score is used.
c In the random order method, each time a match is made, the exposed subject is selected at random.
d In the best first method, each time a match is made, the exposed subject with the closest matching unexposed subject is used.
e In the 5-to-1-digit method, initially, matched pairs are selected at random from exposed-unexposed pairs for which propensity score is identical to 5 decimal places (on a log-odds scale). When no such pairs remain, pairs are selected at random from those with identical scores to 4 decimal places, then to 3 decimal places, and so forth.
Figure 4.Cumulative frequency plot for the magnitude of the difference between the logit of the propensity score for a given exposed subject and the logit of the propensity score for the matched unexposed subject.
Mean Difference in X Between Exposed and Unexposed Subjects When a Caliper Selected by Youden's Indexa is Applied, Using 5 Different Matching Methods
| Matching Method by Controls per Case | OR for Effect of | |||
|---|---|---|---|---|
| 1.5 | 2 | 5 | 10 | |
| 2 | ||||
| Ascendingb | −0.0060 | −0.0107 | −0.0138 | −0.0124 |
| Descendingc | 0.0061 | 0.0119 | 0.0171 | 0.0164 |
| Random orderd | 0.0001 | 0.0004 | 0.0008 | 0.0014 |
| Best firste | 0.0001 | 0.0002 | 0.0003 | 0.0007 |
| 5-to-1-digitf | −0.0004 | −0.0007 | −0.0009 | −0.0005 |
| 5 | ||||
| Ascending | −0.0005 | −0.0012 | −0.0066 | −0.0078 |
| Descending | 0.0005 | 0.0012 | 0.0078 | 0.0096 |
| Random order | 0.0000 | 0.0000 | 0.0003 | 0.0005 |
| Best first | 0.0000 | 0.0000 | 0.0002 | 0.0002 |
| 5-to-1-digit | −0.0000 | −0.0001 | −0.0004 | −0.0004 |
| 10 | ||||
| Ascending | −0.00012 | −0.00026 | −0.00326 | −0.00536 |
| Descending | 0.00012 | 0.00026 | 0.00366 | 0.00640 |
| Random order | 0.00000 | 0.00000 | 0.00016 | 0.00032 |
| Best first | 0.00000 | 0.00001 | 0.00010 | 0.00014 |
| 5-to-1-digit | −0.00001 | −0.00002 | −0.00017 | −0.00030 |
| 20 | ||||
| Ascending | −0.00003 | −0.00006 | −0.00150 | −0.00343 |
| Descending | 0.00003 | 0.00006 | 0.00161 | 0.00397 |
| Random order | 0.00000 | 0.00000 | 0.00006 | 0.00019 |
| Best first | 0.00000 | 0.00000 | 0.00005 | 0.00009 |
| 5-to-1-digit | 0.00000 | 0.00000 | −0.00007 | −0.00019 |
Abbreviation: OR, odds ratio.
a For each point, Youden's index is the sum of the horizontal distance from the y-axis plus the vertical distance from the line y = 1.
b In the ascending method, each time a match is made, the exposed subject with the lowest propensity score is used.
c In the descending method, each time a match is made, the exposed subject with the highest propensity score is used.
d In the random order method, each time a match is made, the exposed subject is selected at random.
e In the best first method, each time a match is made, the exposed subject with the closest matching unexposed subject is used.
f n the 5-to-1-digit method, initially, matched pairs are selected at random from exposed-unexposed pairs for which propensity score is identical to 5 decimal places (on a log-odds scale). When no such pairs remain, pairs are selected at random from those with identical scores to 4 decimal places, then to 3 decimal places, and so forth.
Root Mean Squared Difference in X Between Exposed and Unexposed Subjects
| Matching Method by Caliper | OR for Effect of | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 Controls per Case | 5 Controls per Case | 10 Controls per Case | 20 Controls per Case | |||||||||||||
| 1.5 | 2 | 5 | 10 | 1.5 | 2 | 5 | 10 | 1.5 | 2 | 5 | 10 | 1.5 | 2 | 5 | 10 | |
| None | ||||||||||||||||
| Ascendinga | 0.2005 | 0.4653 | 0.9871 | 1.1479 | 0.0320 | 0.0985 | 0.5179 | 0.7145 | 0.0126 | 0.0345 | 0.3144 | 0.5124 | 0.0058 | 0.0155 | 0.1874 | 0.3696 |
| Descendingb | 0.0603 | 0.1583 | 0.5510 | 0.7395 | 0.0188 | 0.0399 | 0.2573 | 0.4237 | 0.0094 | 0.0196 | 0.1452 | 0.2877 | 0.0051 | 0.0112 | 0.0832 | 0.1976 |
| Random orderc | 0.1033 | 0.3016 | 0.7887 | 0.9661 | 0.0207 | 0.0563 | 0.3938 | 0.5823 | 0.0097 | 0.0226 | 0.2290 | 0.4080 | 0.0051 | 0.0118 | 0.1300 | 0.2879 |
| Best firstd | 0.1264 | 0.3750 | 0.9329 | 1.1149 | 0.0225 | 0.0665 | 0.4730 | 0.6814 | 0.0100 | 0.0246 | 0.2773 | 0.4818 | 0.0052 | 0.0124 | 0.1577 | 0.3417 |
| 5-to-1-digite | 0.3871 | 0.7771 | 1.4478 | 1.6312 | 0.0908 | 0.2541 | 1.1178 | 1.4035 | 0.0487 | 0.0949 | 0.6857 | 1.1963 | 0.0230 | 0.0610 | 0.5942 | 0.9506 |
| 0.25 SD | ||||||||||||||||
| Ascending | 0.0368 | 0.0584 | 0.0790 | 0.0777 | 0.0110 | 0.0206 | 0.0534 | 0.0609 | 0.0059 | 0.0105 | 0.0389 | 0.0501 | 0.0033 | 0.0062 | 0.0269 | 0.0410 |
| Descending | 0.0389 | 0.0850 | 0.1368 | 0.1367 | 0.0112 | 0.0223 | 0.0976 | 0.1168 | 0.0059 | 0.0110 | 0.0668 | 0.0999 | 0.0033 | 0.0063 | 0.0408 | 0.0820 |
| Random order | 0.0247 | 0.0333 | 0.0452 | 0.0488 | 0.0097 | 0.0162 | 0.0349 | 0.0424 | 0.0054 | 0.0093 | 0.0283 | 0.0376 | 0.0032 | 0.0058 | 0.0217 | 0.0330 |
| Best first | 0.0200 | 0.0203 | 0.0178 | 0.0177 | 0.0090 | 0.0137 | 0.0160 | 0.0173 | 0.0053 | 0.0085 | 0.0152 | 0.0164 | 0.0032 | 0.0055 | 0.0140 | 0.0155 |
| 5-to-1-digit | 0.0266 | 0.0265 | 0.0215 | 0.0203 | 0.0135 | 0.0168 | 0.0188 | 0.0180 | 0.0085 | 0.0116 | 0.0171 | 0.0169 | 0.0056 | 0.0080 | 0.0154 | 0.0155 |
| Youden indexf | ||||||||||||||||
| Ascending | 0.0116 | 0.0195 | 0.0217 | 0.0199 | 0.0021 | 0.0038 | 0.0133 | 0.0138 | 0.0009 | 0.0014 | 0.0083 | 0.0109 | 0.0004 | 0.0006 | 0.0050 | 0.0082 |
| Descending | 0.0118 | 0.0207 | 0.0246 | 0.0225 | 0.0021 | 0.0039 | 0.0146 | 0.0156 | 0.0009 | 0.0014 | 0.0088 | 0.0121 | 0.0004 | 0.0006 | 0.0051 | 0.0088 |
| Random order | 0.0085 | 0.0119 | 0.0113 | 0.0109 | 0.0019 | 0.0034 | 0.0080 | 0.0077 | 0.0008 | 0.0014 | 0.0058 | 0.0065 | 0.0004 | 0.0006 | 0.0040 | 0.0053 |
| Best first | 0.0079 | 0.0099 | 0.0074 | 0.0069 | 0.0019 | 0.0032 | 0.0060 | 0.0052 | 0.0008 | 0.0013 | 0.0049 | 0.0047 | 0.0004 | 0.0006 | 0.0036 | 0.0041 |
| 5-to-1-digit | 0.0095 | 0.0127 | 0.0093 | 0.0080 | 0.0026 | 0.0039 | 0.0069 | 0.0060 | 0.0011 | 0.0019 | 0.0052 | 0.0053 | 0.0005 | 0.0008 | 0.0037 | 0.0044 |
Abbreviations: OR, odds ratio; SD, standard deviation.
a In the ascending method, each time a match is made, the exposed subject with the lowest propensity score is used.
b In the descending method, each time a match is made, the exposed subject with the highest propensity score is used.
c In the random order method, each time a match is made, the exposed subject is selected at random.
d In the best first method, each time a match is made, the exposed subject with the closest matching unexposed subject is used.
e In the 5-to-1-digit method, initially, matched pairs are selected at random from exposed-unexposed pairs for which propensity score is identical to 5 decimal places (on a log-odds scale). When no such pairs remain, pairs are selected at random from those with identical scores to 4 decimal places, then to 3 decimal places, and so forth.
f For each point, Youden's index is the sum of the horizontal distance from the y-axis plus the vertical distance from the line y = 1.