Literature DB >> 28991001

Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation.

Richard Wyss, Sebastian Schneeweiss, Mark van der Laan, Samuel D Lendle, Cheng Ju, Jessica M Franklin.   

Abstract

The high-dimensional propensity score is a semiautomated variable selection algorithm that can supplement expert knowledge to improve confounding control in nonexperimental medical studies utilizing electronic healthcare databases. Although the algorithm can be used to generate hundreds of patient-level variables and rank them by their potential confounding impact, it remains unclear how to select the optimal number of variables for adjustment. We used plasmode simulations based on empirical data to discuss and evaluate data-adaptive approaches for variable selection and prediction modeling that can be combined with the high-dimensional propensity score to improve confounding control in large healthcare databases. We considered approaches that combine the high-dimensional propensity score with Super Learner prediction modeling, a scalable version of collaborative targeted maximum-likelihood estimation, and penalized regression. We evaluated performance using bias and mean squared error (MSE) in effect estimates. Results showed that the high-dimensional propensity score can be sensitive to the number of variables included for adjustment and that severe overfitting of the propensity score model can negatively impact the properties of effect estimates. Combining the high-dimensional propensity score with Super Learner was the most consistent strategy, in terms of reducing bias and MSE in the effect estimates, and may be promising for semiautomated data-adaptive propensity score estimation in high-dimensional covariate datasets.

Entities:  

Mesh:

Year:  2018        PMID: 28991001     DOI: 10.1097/EDE.0000000000000762

Source DB:  PubMed          Journal:  Epidemiology        ISSN: 1044-3983            Impact factor:   4.822


  17 in total

1.  Theory meets practice: a commentary on VanderWeele's 'principles of confounder selection'.

Authors:  Sebastian Schneeweiss
Journal:  Eur J Epidemiol       Date:  2019-03-06       Impact factor: 8.082

2.  Identifying Drugs Inducing Prematurity by Mining Claims Data with High-Dimensional Confounder Score Strategies.

Authors:  Romain Demailly; Sylvie Escolano; Françoise Haramburu; Pascale Tubert-Bitter; Ismaïl Ahmed
Journal:  Drug Saf       Date:  2020-06       Impact factor: 5.606

3.  Quantifying the bias due to observed individual confounders in causal treatment effect estimates.

Authors:  Layla Parast; Beth Ann Griffin
Journal:  Stat Med       Date:  2020-05-10       Impact factor: 2.373

4.  Synthetic Negative Controls: Using Simulation to Screen Large-scale Propensity Score Analyses.

Authors:  Richard Wyss; Sebastian Schneeweiss; Kueiyu Joshua Lin; David P Miller; Linda Kalilani; Jessica M Franklin
Journal:  Epidemiology       Date:  2022-04-12       Impact factor: 4.860

Review 5.  When Can Nonrandomized Studies Support Valid Inference Regarding Effectiveness or Safety of New Medical Treatments?

Authors:  Jessica M Franklin; Richard Platt; Nancy A Dreyer; Alex John London; Gregory E Simon; Jonathan H Watanabe; Michael Horberg; Adrian Hernandez; Robert M Califf
Journal:  Clin Pharmacol Ther       Date:  2021-05-09       Impact factor: 6.903

6.  Can Hyperparameter Tuning Improve the Performance of a Super Learner?: A Case Study.

Authors:  Jenna Wong; Travis Manderson; Michal Abrahamowicz; David L Buckeridge; Robyn Tamblyn
Journal:  Epidemiology       Date:  2019-07       Impact factor: 4.822

7.  G-computation and machine learning for estimating the causal effects of binary exposure statuses on binary outcomes.

Authors:  Florent Le Borgne; Arthur Chatton; Maxime Léger; Rémi Lenain; Yohann Foucher
Journal:  Sci Rep       Date:  2021-01-14       Impact factor: 4.379

Review 8.  Propensity score matching with R: conventional methods and new features.

Authors:  Qin-Yu Zhao; Jing-Chao Luo; Ying Su; Yi-Jie Zhang; Guo-Wei Tu; Zhe Luo
Journal:  Ann Transl Med       Date:  2021-05

Review 9.  Automated data-adaptive analytics for electronic healthcare data to study causal treatment effects.

Authors:  Sebastian Schneeweiss
Journal:  Clin Epidemiol       Date:  2018-07-06       Impact factor: 4.790

10.  Intersections of machine learning and epidemiological methods for health services research.

Authors:  Sherri Rose
Journal:  Int J Epidemiol       Date:  2021-01-23       Impact factor: 7.196

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.