Literature DB >> 27382148

Causal inference and the data-fusion problem.

Elias Bareinboim1, Judea Pearl2.   

Abstract

We review concepts, principles, and tools that unify current approaches to causal analysis and attend to new challenges presented by big data. In particular, we address the problem of data fusion-piecing together multiple datasets collected under heterogeneous conditions (i.e., different populations, regimes, and sampling methods) to obtain valid answers to queries of interest. The availability of multiple heterogeneous datasets presents new opportunities to big data analysts, because the knowledge that can be acquired from combined data would not be possible from any individual source alone. However, the biases that emerge in heterogeneous environments require new analytical tools. Some of these biases, including confounding, sampling selection, and cross-population biases, have been addressed in isolation, largely in restricted parametric models. We here present a general, nonparametric framework for handling these biases and, ultimately, a theoretical solution to the problem of data fusion in causal inference tasks.

Keywords:  causal inference; counterfactuals; external validity; selection bias; transportability

Year:  2016        PMID: 27382148      PMCID: PMC4941504          DOI: 10.1073/pnas.1510507113

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  6 in total

1.  Data, design, and background knowledge in etiologic inference.

Authors:  J M Robins
Journal:  Epidemiology       Date:  2001-05       Impact factor: 4.822

2.  Causal effects in psychotherapy: counterfactuals counteract overgeneralization.

Authors:  Michael Hofler; Andrew T Gloster; Jurgen Hoyer
Journal:  Psychother Res       Date:  2010-11

3.  Causal diagrams for epidemiologic research.

Authors:  S Greenland; J Pearl; J M Robins
Journal:  Epidemiology       Date:  1999-01       Impact factor: 4.822

4.  Including known covariates can reduce power to detect genetic effects in case-control studies.

Authors:  Matti Pirinen; Peter Donnelly; Chris C A Spencer
Journal:  Nat Genet       Date:  2012-07-22       Impact factor: 38.330

5.  Generalizing evidence from randomized clinical trials to target populations: The ACTG 320 trial.

Authors:  Stephen R Cole; Elizabeth A Stuart
Journal:  Am J Epidemiol       Date:  2010-06-14       Impact factor: 4.897

6.  The Covariate's Dilemma.

Authors:  Joel Mefford; John S Witte
Journal:  PLoS Genet       Date:  2012-11-08       Impact factor: 5.917

  6 in total
  53 in total

1.  Is the Smog Lifting?: Causal Inference in Environmental Epidemiology.

Authors:  W Dana Flanders; Michael D Garber
Journal:  Epidemiology       Date:  2019-05       Impact factor: 4.822

2.  Population Intervention Measures to Connect Research Findings to Policy.

Authors:  Jennifer Ahern
Journal:  Am J Public Health       Date:  2016-12       Impact factor: 9.308

Review 3.  Single cell RNA-sequencing: replicability of cell types.

Authors:  Megan Crow; Jesse Gillis
Journal:  Curr Opin Neurobiol       Date:  2019-01-09       Impact factor: 6.627

4.  Epidemiology at a time for unity.

Authors:  Bryan Lau; Priya Duggal; Stephan Ehrhardt
Journal:  Int J Epidemiol       Date:  2018-10-01       Impact factor: 7.196

5.  Drawing causal inference from Big Data.

Authors:  Richard M Shiffrin
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-05       Impact factor: 11.205

6.  Invited Commentary: Causal Inference Across Space and Time-Quixotic Quest, Worthy Goal, or Both?

Authors:  Jessie K Edwards; Catherine R Lesko; Alexander P Keil
Journal:  Am J Epidemiol       Date:  2017-07-15       Impact factor: 4.897

7.  Detecting Anomalies Among Practice Sites Within Multicenter Trials.

Authors:  Seth A Berkowitz; Kara E Rudolph; Sanjay Basu
Journal:  Circ Cardiovasc Qual Outcomes       Date:  2019-03

8.  Collaborative, pooled and harmonized study designs for epidemiologic research: challenges and opportunities.

Authors:  Catherine R Lesko; Lisa P Jacobson; Keri N Althoff; Alison G Abraham; Stephen J Gange; Richard D Moore; Sharada Modur; Bryan Lau
Journal:  Int J Epidemiol       Date:  2018-04-01       Impact factor: 7.196

9.  Learning stable and predictive structures in kinetic systems.

Authors:  Niklas Pfister; Stefan Bauer; Jonas Peters
Journal:  Proc Natl Acad Sci U S A       Date:  2019-11-27       Impact factor: 11.205

10.  Combining Multiple Observational Data Sources to Estimate Causal Effects.

Authors:  Shu Yang; Peng Ding
Journal:  J Am Stat Assoc       Date:  2019-06-11       Impact factor: 5.033

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.