Literature DB >> 33446866

G-computation and machine learning for estimating the causal effects of binary exposure statuses on binary outcomes.

Florent Le Borgne1,2, Arthur Chatton1,2, Maxime Léger1,3, Rémi Lenain1,4, Yohann Foucher5,6.   

Abstract

In clinical research, there is a growing interest in the use of propensity score-based methods to estimate causal effects. G-computation is an alternative because of its high statistical power. Machine learning is also increasingly used because of its possible robustness to model misspecification. In this paper, we aimed to propose an approach that combines machine learning and G-computation when both the outcome and the exposure status are binary and is able to deal with small samples. We evaluated the performances of several methods, including penalized logistic regressions, a neural network, a support vector machine, boosted classification and regression trees, and a super learner through simulations. We proposed six different scenarios characterised by various sample sizes, numbers of covariates and relationships between covariates, exposure statuses, and outcomes. We have also illustrated the application of these methods, in which they were used to estimate the efficacy of barbiturates prescribed during the first 24 h of an episode of intracranial hypertension. In the context of GC, for estimating the individual outcome probabilities in two counterfactual worlds, we reported that the super learner tended to outperform the other approaches in terms of both bias and variance, especially for small sample sizes. The support vector machine performed well, but its mean bias was slightly higher than that of the super learner. In the investigated scenarios, G-computation associated with the super learner was a performant method for drawing causal inferences, even from small sample sizes.

Entities:  

Year:  2021        PMID: 33446866      PMCID: PMC7809122          DOI: 10.1038/s41598-021-81110-0

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  35 in total

1.  Effects of adjusting for instrumental variables on bias and precision of effect estimates.

Authors:  Jessica A Myers; Jeremy A Rassen; Joshua J Gagne; Krista F Huybrechts; Sebastian Schneeweiss; Kenneth J Rothman; Marshall M Joffe; Robert J Glynn
Journal:  Am J Epidemiol       Date:  2011-10-24       Impact factor: 4.897

2.  Super learner.

Authors:  Mark J van der Laan; Eric C Polley; Alan E Hubbard
Journal:  Stat Appl Genet Mol Biol       Date:  2007-09-16

3.  Assessment of survival prediction models based on microarray data.

Authors:  Martin Schumacher; Harald Binder; Thomas Gerds
Journal:  Bioinformatics       Date:  2007-05-07       Impact factor: 6.937

4.  Impact of mis-specification of the treatment model on estimates from a marginal structural model.

Authors:  Geneviève Lefebvre; Joseph A C Delaney; Robert W Platt
Journal:  Stat Med       Date:  2008-08-15       Impact factor: 2.373

5.  Does obesity shorten life? The importance of well-defined interventions to answer causal questions.

Authors:  M A Hernán; S L Taubman
Journal:  Int J Obes (Lond)       Date:  2008-08       Impact factor: 5.095

6.  Implementation of G-computation on a simulated data set: demonstration of a causal inference technique.

Authors:  Jonathan M Snowden; Sherri Rose; Kathleen M Mortimer
Journal:  Am J Epidemiol       Date:  2011-03-16       Impact factor: 4.897

7.  Using Super Learner Prediction Modeling to Improve High-dimensional Propensity Score Estimation.

Authors:  Richard Wyss; Sebastian Schneeweiss; Mark van der Laan; Samuel D Lendle; Cheng Ju; Jessica M Franklin
Journal:  Epidemiology       Date:  2018-01       Impact factor: 4.822

8.  Machine learning in the estimation of causal effects: targeted minimum loss-based estimation and double/debiased machine learning.

Authors:  Iván Díaz
Journal:  Biostatistics       Date:  2020-04-01       Impact factor: 5.899

9.  Propensity score estimation: neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression.

Authors:  Daniel Westreich; Justin Lessler; Michele Jonsson Funk
Journal:  J Clin Epidemiol       Date:  2010-08       Impact factor: 6.437

10.  G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study.

Authors:  Arthur Chatton; Florent Le Borgne; Clémence Leyrat; Florence Gillaizeau; Chloé Rousseau; Laetitia Barbin; David Laplaud; Maxime Léger; Bruno Giraudeau; Yohann Foucher
Journal:  Sci Rep       Date:  2020-06-08       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.