Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Using classification tree analysis to generate propensity score weights.

Literature DB >> 28371206

Using classification tree analysis to generate propensity score weights.

Abstract

RATIONALE, AIMS AND
OBJECTIVES: In evaluating non-randomized interventions, propensity scores (PS) estimate the probability of assignment to the treatment group given observed characteristics. Machine learning algorithms have been proposed as an alternative to conventional logistic regression for modelling PS in order to avoid limitations of linear methods. We introduce classification tree analysis (CTA) to generate PS which is a "decision-tree"-like classification model that provides accurate, parsimonious decision rules that are easy to display and interpret, reports P values derived via permutation tests, and evaluates cross-generalizability.
METHOD: Using empirical data, we identify all statistically valid CTA PS models and then use them to compute strata-specific, observation-level PS weights that are subsequently applied in outcomes analyses. We compare findings obtained using this framework to logistic regression and boosted regression, by evaluating covariate balance using standardized differences, model predictive accuracy, and treatment effect estimates obtained using median regression and a weighted CTA outcomes model.
RESULTS: While all models had some imbalanced covariates, main-effects logistic regression yielded the lowest average standardized difference, whereas CTA yielded the greatest predictive accuracy. Nevertheless, treatment effect estimates were generally consistent across all models.
CONCLUSIONS: Assessing standardized differences in means as a test of covariate balance is inappropriate for machine learning algorithms that segment the sample into two or more strata. Because the CTA algorithm identifies all statistically valid PS models for a sample, it is most likely to identify a correctly specified PS model, and should be considered as an alternative approach to modeling the PS.

Entities: Disease

Keywords: causal inference; classification tree analysis; machine learning; propensity score

Mesh：

Year: 2017 PMID： 28371206 DOI： 10.1111/jep.12744

Source DB: PubMed Journal: J Eval Clin Pract ISSN： 1356-1294 Impact factor: 2.431

Keyword Cloud
Cited

4 in total

Review 1. Machine Learning in Causal Inference: Application in Pharmacovigilance.

Authors: Yiqing Zhao; Yue Yu; Hanyin Wang; Yikuan Li; Yu Deng; Guoqian Jiang; Yuan Luo
Journal: Drug Saf Date: 2022-05-17 Impact factor: 5.228

2. Characterizing Risk Factors for Clostridioides difficile Infection among Hospitalized Patients with Community-Acquired Pneumonia.

Authors: Caroline C Jozefczyk; W Justin Moore; Paul R Yarnold; Nathaniel J Rhodes; Karolina Harkabuz; Robert Maxwell; Sarah H Sutton; Christina Silkaitis; Chao Qi; Richard G Wunderink; Teresa R Zembower
Journal: Antimicrob Agents Chemother Date: 2021-06-17 Impact factor: 5.191

3. Propensity score adjustment using machine learning classification algorithms to control selection bias in online surveys.

Authors: Ramón Ferri-García; María Del Mar Rueda
Journal: PLoS One Date: 2020-04-22 Impact factor: 3.240

Review 4. Automated data-adaptive analytics for electronic healthcare data to study causal treatment effects.

Authors: Sebastian Schneeweiss
Journal: Clin Epidemiol Date: 2018-07-06 Impact factor: 4.790

4 in total