Kazuki Yoshida1, Sonia Hernández-Díaz, Daniel H Solomon, John W Jackson, Joshua J Gagne, Robert J Glynn, Jessica M Franklin. 1. From the aDepartment of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA; bDepartment of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA; cDivision of Rheumatology, Immunology and Allergy, Brigham and Women's Hospital, Boston, MA; dDivision of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA; and eDepartment of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD.
Abstract
BACKGROUND: Propensity score matching is a commonly used tool. However, its use in settings with more than two treatment groups has been less frequent. We examined the performance of a recently developed propensity score weighting method in the three-treatment group setting. METHODS: The matching weight method is an extension of inverse probability of treatment weighting (IPTW) that reweights both exposed and unexposed groups to emulate a propensity score matched population. Matching weights can generalize to multiple treatment groups. The performance of matching weights in the three-group setting was compared via simulation to three-way 1:1:1 propensity score matching and IPTW. We also applied these methods to an empirical example that compared the safety of three analgesics. RESULTS: Matching weights had similar bias, but better mean squared error (MSE) compared with three-way matching in all scenarios. The benefits were more pronounced in scenarios with a rare outcome, unequally sized treatment groups, or poor covariate overlap. IPTW's performance was highly dependent on covariate overlap. In the empirical example, matching weights achieved the best balance for 24 out of 35 covariates. Hazard ratios were numerically similar to matching. However, the confidence intervals were narrower for matching weights. CONCLUSIONS: Matching weights demonstrated improved performance over three-way matching in terms of MSE, particularly in simulation scenarios where finding matched subjects was difficult. Given its natural extension to settings with even more than three groups, we recommend matching weights for comparing outcomes across multiple treatment groups, particularly in settings with rare outcomes or unequal exposure distributions. See video abstract at, http://links.lww.com/EDE/B188.
BACKGROUND: Propensity score matching is a commonly used tool. However, its use in settings with more than two treatment groups has been less frequent. We examined the performance of a recently developed propensity score weighting method in the three-treatment group setting. METHODS: The matching weight method is an extension of inverse probability of treatment weighting (IPTW) that reweights both exposed and unexposed groups to emulate a propensity score matched population. Matching weights can generalize to multiple treatment groups. The performance of matching weights in the three-group setting was compared via simulation to three-way 1:1:1 propensity score matching and IPTW. We also applied these methods to an empirical example that compared the safety of three analgesics. RESULTS: Matching weights had similar bias, but better mean squared error (MSE) compared with three-way matching in all scenarios. The benefits were more pronounced in scenarios with a rare outcome, unequally sized treatment groups, or poor covariate overlap. IPTW's performance was highly dependent on covariate overlap. In the empirical example, matching weights achieved the best balance for 24 out of 35 covariates. Hazard ratios were numerically similar to matching. However, the confidence intervals were narrower for matching weights. CONCLUSIONS: Matching weights demonstrated improved performance over three-way matching in terms of MSE, particularly in simulation scenarios where finding matched subjects was difficult. Given its natural extension to settings with even more than three groups, we recommend matching weights for comparing outcomes across multiple treatment groups, particularly in settings with rare outcomes or unequal exposure distributions. See video abstract at, http://links.lww.com/EDE/B188.
Authors: Maya L Petersen; Kristin E Porter; Susan Gruber; Yue Wang; Mark J van der Laan Journal: Stat Methods Med Res Date: 2010-10-28 Impact factor: 3.021
Authors: Jessica M Franklin; Jeremy A Rassen; Diana Ackermann; Dorothee B Bartels; Sebastian Schneeweiss Journal: Stat Med Date: 2013-12-09 Impact factor: 2.373
Authors: Kazuki Yoshida; Daniel H Solomon; Sebastien Haneuse; Seoyoung C Kim; Elisabetta Patorno; Sara K Tedeschi; Houchen Lyu; Sonia Hernández-Díaz; Robert J Glynn Journal: Pharmacoepidemiol Drug Saf Date: 2019-05-27 Impact factor: 2.890
Authors: Houchen Lyu; Sizheng S Zhao; Kazuki Yoshida; Sara K Tedeschi; Chang Xu; Sagar U Nigwekar; Benjamin Z Leder; Daniel H Solomon Journal: J Clin Endocrinol Metab Date: 2019-11-01 Impact factor: 5.958
Authors: Kazuki Yoshida; Daniel H Solomon; Sebastien Haneuse; Seoyoung C Kim; Elisabetta Patorno; Sara K Tedeschi; Houchen Lyu; Jessica M Franklin; Til Stürmer; Sonia Hernández-Díaz; Robert J Glynn Journal: Am J Epidemiol Date: 2019-03-01 Impact factor: 4.897
Authors: Til Stürmer; Tiansheng Wang; Yvonne M Golightly; Alex Keil; Jennifer L Lund; Michele Jonsson Funk Journal: Rheumatology (Oxford) Date: 2020-01-01 Impact factor: 7.580
Authors: Kazuki Yoshida; Zhi Yu; Gail A Greendale; Kristine Ruppert; Yinjuan Lian; Sara K Tedeschi; Tzu-Chieh Lin; Sebastien Haneuse; Robert J Glynn; Sonia Hernández-Díaz; Daniel H Solomon Journal: Pharmacoepidemiol Drug Saf Date: 2017-12-12 Impact factor: 2.890
Authors: Xiaojuan Li; Bruce H Fireman; Jeffrey R Curtis; David E Arterburn; David P Fisher; Érick Moyneur; Mia Gallagher; Marsha A Raebel; W Benjamin Nowell; Lindsay Lagreid; Sengwee Toh Journal: Am J Epidemiol Date: 2019-04-01 Impact factor: 4.897
Authors: Monique E Cho; Jared L Hansen; Brian C Sauer; Alfred K Cheung; Adhish Agarwal; Tom Greene Journal: Clin J Am Soc Nephrol Date: 2021-03-29 Impact factor: 8.237