Literature DB >> 28371206

Using classification tree analysis to generate propensity score weights.

Ariel Linden1,2, Paul R Yarnold3.   

Abstract

RATIONALE, AIMS AND
OBJECTIVES: In evaluating non-randomized interventions, propensity scores (PS) estimate the probability of assignment to the treatment group given observed characteristics. Machine learning algorithms have been proposed as an alternative to conventional logistic regression for modelling PS in order to avoid limitations of linear methods. We introduce classification tree analysis (CTA) to generate PS which is a "decision-tree"-like classification model that provides accurate, parsimonious decision rules that are easy to display and interpret, reports P values derived via permutation tests, and evaluates cross-generalizability.
METHOD: Using empirical data, we identify all statistically valid CTA PS models and then use them to compute strata-specific, observation-level PS weights that are subsequently applied in outcomes analyses. We compare findings obtained using this framework to logistic regression and boosted regression, by evaluating covariate balance using standardized differences, model predictive accuracy, and treatment effect estimates obtained using median regression and a weighted CTA outcomes model.
RESULTS: While all models had some imbalanced covariates, main-effects logistic regression yielded the lowest average standardized difference, whereas CTA yielded the greatest predictive accuracy. Nevertheless, treatment effect estimates were generally consistent across all models.
CONCLUSIONS: Assessing standardized differences in means as a test of covariate balance is inappropriate for machine learning algorithms that segment the sample into two or more strata. Because the CTA algorithm identifies all statistically valid PS models for a sample, it is most likely to identify a correctly specified PS model, and should be considered as an alternative approach to modeling the PS.
© 2017 John Wiley & Sons, Ltd.

Entities:  

Keywords:  causal inference; classification tree analysis; machine learning; propensity score

Mesh:

Year:  2017        PMID: 28371206     DOI: 10.1111/jep.12744

Source DB:  PubMed          Journal:  J Eval Clin Pract        ISSN: 1356-1294            Impact factor:   2.431


  4 in total

Review 1.  Machine Learning in Causal Inference: Application in Pharmacovigilance.

Authors:  Yiqing Zhao; Yue Yu; Hanyin Wang; Yikuan Li; Yu Deng; Guoqian Jiang; Yuan Luo
Journal:  Drug Saf       Date:  2022-05-17       Impact factor: 5.228

2.  Characterizing Risk Factors for Clostridioides difficile Infection among Hospitalized Patients with Community-Acquired Pneumonia.

Authors:  Caroline C Jozefczyk; W Justin Moore; Paul R Yarnold; Nathaniel J Rhodes; Karolina Harkabuz; Robert Maxwell; Sarah H Sutton; Christina Silkaitis; Chao Qi; Richard G Wunderink; Teresa R Zembower
Journal:  Antimicrob Agents Chemother       Date:  2021-06-17       Impact factor: 5.191

3.  Propensity score adjustment using machine learning classification algorithms to control selection bias in online surveys.

Authors:  Ramón Ferri-García; María Del Mar Rueda
Journal:  PLoS One       Date:  2020-04-22       Impact factor: 3.240

Review 4.  Automated data-adaptive analytics for electronic healthcare data to study causal treatment effects.

Authors:  Sebastian Schneeweiss
Journal:  Clin Epidemiol       Date:  2018-07-06       Impact factor: 4.790

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.