Literature DB >> 33548542

Bagged random causal networks for interventional queries on observational biomedical datasets.

Mattia Prosperi1, Yi Guo2, Jiang Bian3.   

Abstract

Learning causal effects from observational data, e.g. estimating the effect of a treatment on survival by data-mining electronic health records (EHRs), can be biased due to unmeasured confounders, mediators, and colliders. When the causal dependencies among features/covariates are expressed in the form of a directed acyclic graph, using do-calculus it is possible to identify one or more adjustment sets for eliminating the bias on a given causal query under certain assumptions. However, prior knowledge of the causal structure might be only partial; algorithms for causal structure discovery often provide ambiguous solutions, and their computational complexity becomes practically intractable when the feature sets grow large. We hypothesize that the estimation of the true causal effect of a causal query on to an outcome can be approximated as an ensemble of lower complexity estimators, namely bagged random causal networks. A bagged random causal network is an ensemble of subnetworks constructed by sampling the feature subspaces (with the query, the outcome, and a random number of other features), drawing conditional dependencies among the features, and inferring the corresponding adjustment sets. The causal effect can be then estimated by any regression function of the outcome by the query paired with the adjustment sets. Through simulations and a real-world clinical dataset (class III malocclusion data), we show that the bagged estimator is -in most cases- consistent with the true causal effect if the structure is known, has a good variance/bias trade-off when the structure is unknown (estimated using heuristics), has lower computational complexity than learning a full network, and outperforms boosted regression. In conclusion, the bagged random causal network is well-suited to estimate query-target causal effects from observational studies on EHR and other high-dimensional biomedical databases.
Copyright © 2021 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Bagging; Causal inference; Directed acyclic graph; Electronic health record; Machine learning; Treatment effect

Mesh:

Year:  2021        PMID: 33548542      PMCID: PMC8085906          DOI: 10.1016/j.jbi.2021.103689

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  15 in total

1.  An introduction to instrumental variables for epidemiologists.

Authors:  S Greenland
Journal:  Int J Epidemiol       Date:  2000-08       Impact factor: 7.196

2.  A structural approach to selection bias.

Authors:  Miguel A Hernán; Sonia Hernández-Díaz; James M Robins
Journal:  Epidemiology       Date:  2004-09       Impact factor: 4.822

Review 3.  An introduction to causal inference.

Authors:  Judea Pearl
Journal:  Int J Biostat       Date:  2010-02-26       Impact factor: 0.968

4.  Markov Blanket Feature Selection Using Representative Sets.

Authors:  Kui Yu; Xindong Wu; Wei Ding; Yang Mu; Hao Wang
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2017-11       Impact factor: 10.451

5.  Controlling for Informed Presence Bias Due to the Number of Health Encounters in an Electronic Health Record.

Authors:  Benjamin A Goldstein; Nrupen A Bhavsar; Matthew Phelan; Michael J Pencina
Journal:  Am J Epidemiol       Date:  2016-11-16       Impact factor: 4.897

6.  Evaluating large-scale propensity score performance through real-world and synthetic data experiments.

Authors:  Yuxi Tian; Martijn J Schuemie; Marc A Suchard
Journal:  Int J Epidemiol       Date:  2018-12-01       Impact factor: 7.196

7.  Estimating Individual Treatment Effect in Observational Data Using Random Forest Methods.

Authors:  Min Lu; Saad Sadiq; Daniel J Feaster; Hemant Ishwaran
Journal:  J Comput Graph Stat       Date:  2018-02-01       Impact factor: 2.302

8.  Control of Confounding and Reporting of Results in Causal Inference Studies. Guidance for Authors from Editors of Respiratory, Sleep, and Critical Care Journals.

Authors:  David J Lederer; Scott C Bell; Richard D Branson; James D Chalmers; Rachel Marshall; David M Maslove; David E Ost; Naresh M Punjabi; Michael Schatz; Alan R Smyth; Paul W Stewart; Samy Suissa; Alex A Adjei; Cezmi A Akdis; Élie Azoulay; Jan Bakker; Zuhair K Ballas; Philip G Bardin; Esther Barreiro; Rinaldo Bellomo; Jonathan A Bernstein; Vito Brusasco; Timothy G Buchman; Sudhansu Chokroverty; Nancy A Collop; James D Crapo; Dominic A Fitzgerald; Lauren Hale; Nicholas Hart; Felix J Herth; Theodore J Iwashyna; Gisli Jenkins; Martin Kolb; Guy B Marks; Peter Mazzone; J Randall Moorman; Thomas M Murphy; Terry L Noah; Paul Reynolds; Dieter Riemann; Richard E Russell; Aziz Sheikh; Giovanni Sotgiu; Erik R Swenson; Rhonda Szczesniak; Ronald Szymusiak; Jean-Louis Teboul; Jean-Louis Vincent
Journal:  Ann Am Thorac Soc       Date:  2019-01

9.  Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations.

Authors:  Peter W G Tennant; Eleanor J Murray; Kellyn F Arnold; Laurie Berrie; Matthew P Fox; Sarah C Gadd; Wendy J Harrison; Claire Keeble; Lynsie R Ranker; Johannes Textor; Georgia D Tomova; Mark S Gilthorpe; George T H Ellison
Journal:  Int J Epidemiol       Date:  2021-05-17       Impact factor: 7.196

10.  Reducing bias through directed acyclic graphs.

Authors:  Ian Shrier; Robert W Platt
Journal:  BMC Med Res Methodol       Date:  2008-10-30       Impact factor: 4.615

View more
  1 in total

Review 1.  Artificial intelligence-enabled decision support in nephrology.

Authors:  Tyler J Loftus; Benjamin Shickel; Tezcan Ozrazgat-Baslanti; Yuanfang Ren; Benjamin S Glicksberg; Jie Cao; Karandeep Singh; Lili Chan; Girish N Nadkarni; Azra Bihorac
Journal:  Nat Rev Nephrol       Date:  2022-04-22       Impact factor: 42.439

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.