Literature DB >> 34268558

Challenges in Obtaining Valid Causal Effect Estimates with Machine Learning Algorithms.

Ashley I Naimi1, Alan E Mishler2, Edward H Kennedy2.   

Abstract

Unlike parametric regression, machine learning (ML) methods do not generally require precise knowledge of the true data generating mechanisms. As such, numerous authors have advocated for ML methods to estimate causal effects. Unfortunately, ML algorithmscan perform worse than parametric regression. We demonstrate the performance of ML-based single- and double-robust estimators. We use 100 Monte Carlo samples with sample sizes of 200, 1200, and 5000 to investigate bias and confidence interval coverage under several scenarios. In a simple confounding scenario, confounders were related to the treatment and the outcome via parametric models. In a complex confounding scenario, the simple confounders were transformed to induce complicated nonlinear relationships. In the simple scenario, when ML algorithms were used, double-robust estimators were superior to single-robust estimators. In the complex scenario, single-robust estimators with ML algorithms were at least as biased as estimators using misspecified parametric models. Double-robust estimators were less biased, but coverage was well below nominal. The use of sample splitting, inclusion of confounder interactions, reliance on a richly specified ML algorithm, and use of doubly robust estimators was the only explored approach that yielded negligible bias and nominal coverage. Our results suggest that ML based singly robust methods should be avoided.
© The Author(s) 2021. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  causal inference; doubly-robust estimation; epidemiologic methods; machine learning; nonparametric methods; semiparametric theory

Year:  2021        PMID: 34268558     DOI: 10.1093/aje/kwab201

Source DB:  PubMed          Journal:  Am J Epidemiol        ISSN: 0002-9262            Impact factor:   4.897


  9 in total

1.  Machine learning can improve the development of evidence-based dietary guidelines.

Authors:  Lisa M Bodnar; Sharon I Kirkpatrick; Ashley I Naimi
Journal:  Public Health Nutr       Date:  2022-06-27       Impact factor: 4.539

2.  Performance Evaluation of Parametric and Nonparametric Methods When Assessing Effect Measure Modification.

Authors:  Gabriel Conzuelo Rodriguez; Lisa M Bodnar; Maria M Brooks; Abdus Wahed; Edward H Kennedy; Enrique Schisterman; Ashley I Naimi
Journal:  Am J Epidemiol       Date:  2022-01-01       Impact factor: 5.363

3.  The Role of the Natural Course in Causal Analysis.

Authors:  Jacqueline E Rudolph; Abigail Cartus; Lisa M Bodnar; Enrique F Schisterman; Ashley I Naimi
Journal:  Am J Epidemiol       Date:  2022-01-24       Impact factor: 5.363

4.  Thirteen Questions About Using Machine Learning in Causal Research (You Won't Believe the Answer to Number 10!).

Authors:  Stephen J Mooney; Alexander P Keil; Daniel J Westreich
Journal:  Am J Epidemiol       Date:  2021-08-01       Impact factor: 4.897

5.  AIPW: An R Package for Augmented Inverse Probability-Weighted Estimation of Average Causal Effects.

Authors:  Yongqi Zhong; Edward H Kennedy; Lisa M Bodnar; Ashley I Naimi
Journal:  Am J Epidemiol       Date:  2021-12-01       Impact factor: 5.363

6.  Incremental Propensity Score Effects for Time-fixed Exposures.

Authors:  Ashley I Naimi; Jacqueline E Rudolph; Edward H Kennedy; Abigail Cartus; Sharon I Kirkpatrick; David M Haas; Hyagriv Simhan; Lisa M Bodnar
Journal:  Epidemiology       Date:  2021-03-01       Impact factor: 4.860

7.  Analyses of child cardiometabolic phenotype following assisted reproductive technologies using a pragmatic trial emulation approach.

Authors:  Jonathan Yinhao Huang; Shirong Cai; Zhongwei Huang; Mya Thway Tint; Wen Lun Yuan; Izzuddin M Aris; Keith M Godfrey; Neerja Karnani; Yung Seng Lee; Jerry Kok Yen Chan; Yap Seng Chong; Johan Gunnar Eriksson; Shiao-Yng Chan
Journal:  Nat Commun       Date:  2021-09-23       Impact factor: 14.919

8.  Use of Machine Learning to Estimate the Per-Protocol Effect of Low-Dose Aspirin on Pregnancy Outcomes: A Secondary Analysis of a Randomized Clinical Trial.

Authors:  Yongqi Zhong; Maria M Brooks; Edward H Kennedy; Lisa M Bodnar; Ashley I Naimi
Journal:  JAMA Netw Open       Date:  2022-03-01

Review 9.  Machine learning for improving high-dimensional proxy confounder adjustment in healthcare database studies: An overview of the current literature.

Authors:  Richard Wyss; Chen Yanover; Tal El-Hay; Dimitri Bennett; Robert W Platt; Andrew R Zullo; Grammati Sari; Xuerong Wen; Yizhou Ye; Hongbo Yuan; Mugdha Gokhale; Elisabetta Patorno; Kueiyu Joshua Lin
Journal:  Pharmacoepidemiol Drug Saf       Date:  2022-07-05       Impact factor: 2.732

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.