Literature DB >> 26226129

Variable Selection for Confounder Control, Flexible Modeling and Collaborative Targeted Minimum Loss-Based Estimation in Causal Inference.

Mireille E Schnitzer, Judith J Lok, Susan Gruber.   

Abstract

This paper investigates the appropriateness of the integration of flexible propensity score modeling (nonparametric or machine learning approaches) in semiparametric models for the estimation of a causal quantity, such as the mean outcome under treatment. We begin with an overview of some of the issues involved in knowledge-based and statistical variable selection in causal inference and the potential pitfalls of automated selection based on the fit of the propensity score. Using a simple example, we directly show the consequences of adjusting for pure causes of the exposure when using inverse probability of treatment weighting (IPTW). Such variables are likely to be selected when using a naive approach to model selection for the propensity score. We describe how the method of Collaborative Targeted minimum loss-based estimation (C-TMLE; van der Laan and Gruber, 2010 [27]) capitalizes on the collaborative double robustness property of semiparametric efficient estimators to select covariates for the propensity score based on the error in the conditional outcome model. Finally, we compare several approaches to automated variable selection in low- and high-dimensional settings through a simulation study. From this simulation study, we conclude that using IPTW with flexible prediction for the propensity score can result in inferior estimation, while Targeted minimum loss-based estimation and C-TMLE may benefit from flexible prediction and remain robust to the presence of variables that are highly correlated with treatment. However, in our study, standard influence function-based methods for the variance underestimated the standard errors, resulting in poor coverage under certain data-generating scenarios.

Entities:  

Mesh:

Year:  2016        PMID: 26226129      PMCID: PMC4733443          DOI: 10.1515/ijb-2015-0017

Source DB:  PubMed          Journal:  Int J Biostat        ISSN: 1557-4679            Impact factor:   0.968


  27 in total

1.  Data, design, and background knowledge in etiologic inference.

Authors:  J M Robins
Journal:  Epidemiology       Date:  2001-05       Impact factor: 4.822

2.  Collaborative double robust targeted maximum likelihood estimation.

Authors:  Mark J van der Laan; Susan Gruber
Journal:  Int J Biostat       Date:  2010-05-17       Impact factor: 0.968

3.  A Systematic Review of Propensity Score Methods in the Social Sciences.

Authors:  Felix J Thoemmes; Eun Sook Kim
Journal:  Multivariate Behav Res       Date:  2011-02-07       Impact factor: 5.923

4.  Re: "Variable selection for propensity score models".

Authors:  Ian Shrier; Robert W Platt; Russell J Steele
Journal:  Am J Epidemiol       Date:  2007-05-25       Impact factor: 4.897

5.  Evaluating uses of data mining techniques in propensity score estimation: a simulation study.

Authors:  Soko Setoguchi; Sebastian Schneeweiss; M Alan Brookhart; Robert J Glynn; E Francis Cook
Journal:  Pharmacoepidemiol Drug Saf       Date:  2008-06       Impact factor: 2.890

6.  Invited commentary: positivity in practice.

Authors:  Daniel Westreich; Stephen R Cole
Journal:  Am J Epidemiol       Date:  2010-02-05       Impact factor: 4.897

7.  A targeted maximum likelihood estimator of a causal effect on a bounded continuous outcome.

Authors:  Susan Gruber; Mark J van der Laan
Journal:  Int J Biostat       Date:  2010-08-01       Impact factor: 0.968

8.  Applying propensity score methods in medical research: pitfalls and prospects.

Authors:  Zhehui Luo; Joseph C Gardiner; Cathy J Bradley
Journal:  Med Care Res Rev       Date:  2010-05-04       Impact factor: 3.929

9.  EFFECT OF BREASTFEEDING ON GASTROINTESTINAL INFECTION IN INFANTS: A TARGETED MAXIMUM LIKELIHOOD APPROACH FOR CLUSTERED LONGITUDINAL DATA.

Authors:  Mireille E Schnitzer; Mark J van der Laan; Erica E M Moodie; Robert W Platt
Journal:  Ann Appl Stat       Date:  2014-06       Impact factor: 2.083

10.  Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes.

Authors:  Peter C Austin; Jack V Tu; Jennifer E Ho; Daniel Levy; Douglas S Lee
Journal:  J Clin Epidemiol       Date:  2013-02-04       Impact factor: 6.437

View more
  7 in total

1.  Effect Estimation in Point-Exposure Studies with Binary Outcomes and High-Dimensional Covariate Data - A Comparison of Targeted Maximum Likelihood Estimation and Inverse Probability of Treatment Weighting.

Authors:  Menglan Pang; Tibor Schuster; Kristian B Filion; Mireille E Schnitzer; Maria Eberg; Robert W Platt
Journal:  Int J Biostat       Date:  2016-11-01       Impact factor: 0.968

2.  Discussion of "Data-driven confounder selection via Markov and Bayesian networks" by Häggström.

Authors:  Thomas S Richardson; James M Robins; Linbo Wang
Journal:  Biometrics       Date:  2017-11-02       Impact factor: 2.571

3.  The Appalachia Mind Health Initiative (AMHI): a pragmatic randomized clinical trial of adjunctive internet-based cognitive behavior therapy for treating major depressive disorder among primary care patients.

Authors:  Robert M Bossarte; Ronald C Kessler; Andrew A Nierenberg; Ambarish Chattopadhyay; Pim Cuijpers; Angel Enrique; Phyllis M Foxworth; Sarah M Gildea; Bea Herbeck Belnap; Marc W Haut; Kari B Law; William D Lewis; Howard Liu; Alexander R Luedtke; Wilfred R Pigeon; Larry A Rhodes; Derek Richards; Bruce L Rollman; Nancy A Sampson; Cara M Stokes; John Torous; Tyler D Webb; Jose R Zubizarreta
Journal:  Trials       Date:  2022-06-20       Impact factor: 2.728

4.  Study protocol for pragmatic trials of Internet-delivered guided and unguided cognitive behavior therapy for treating depression and anxiety in university students of two Latin American countries: the Yo Puedo Sentirme Bien study.

Authors:  Corina Benjet; Ronald C Kessler; Alan E Kazdin; Pim Cuijpers; Yesica Albor; Nayib Carrasco Tapias; Carlos C Contreras-Ibáñez; Ma Socorro Durán González; Sarah M Gildea; Noé González; José Benjamín Guerrero López; Alex Luedtke; Maria Elena Medina-Mora; Jorge Palacios; Derek Richards; Alicia Salamanca-Sanabria; Nancy A Sampson
Journal:  Trials       Date:  2022-06-02       Impact factor: 2.728

5.  Perspective: Big Data and Machine Learning Could Help Advance Nutritional Epidemiology.

Authors:  Jason D Morgenstern; Laura C Rosella; Andrew P Costa; Russell J de Souza; Laura N Anderson
Journal:  Adv Nutr       Date:  2021-06-01       Impact factor: 8.701

6.  Propensity Scores in Pharmacoepidemiology: Beyond the Horizon.

Authors:  John W Jackson; Ian Schmid; Elizabeth A Stuart
Journal:  Curr Epidemiol Rep       Date:  2017-11-06

7.  G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study.

Authors:  Arthur Chatton; Florent Le Borgne; Clémence Leyrat; Florence Gillaizeau; Chloé Rousseau; Laetitia Barbin; David Laplaud; Maxime Léger; Bruno Giraudeau; Yohann Foucher
Journal:  Sci Rep       Date:  2020-06-08       Impact factor: 4.379

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.