| Literature DB >> 33285821 |
Maximilian Kurthen1, Torsten Enßlin1.
Abstract
We address the problem of two-variable causal inference without intervention. This task is to infer an existing causal relation between two random variables, i.e., X → Y or Y → X , from purely observational data. As the option to modify a potential cause is not given in many situations, only structural properties of the data can be used to solve this ill-posed problem. We briefly review a number of state-of-the-art methods for this, including very recent ones. A novel inference method is introduced, Bayesian Causal Inference (BCI) which assumes a generative Bayesian hierarchical model to pursue the strategy of Bayesian model selection. In the adopted model, the distribution of the cause variable is given by a Poisson lognormal distribution, which allows to explicitly regard the discrete nature of datasets, correlations in the parameter spaces, as well as the variance of probability densities on logarithmic scales. We assume Fourier diagonal Field covariance operators. The model itself is restricted to use cases where a direct causal relation X → Y has to be decided against a relation Y → X , therefore we compare it other methods for this exact problem setting. The generative model assumed provides synthetic causal data for benchmarking our model in comparison to existing state-of-the-art models, namely LiNGAM, ANM-HSIC, ANM-MML, IGCI, and CGNN. We explore how well the above methods perform in case of high noise settings, strongly discretized data, and very sparse data. BCI performs generally reliably with synthetic data as well as with the real world TCEP benchmark set, with an accuracy comparable to state-of-the-art algorithms. We discuss directions for the future development of BCI.Entities:
Keywords: additive noise; bayesian model selection; causal inference; cause–effect pairs; information field theory
Year: 2019 PMID: 33285821 PMCID: PMC7516476 DOI: 10.3390/e22010046
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Overview over the used Bayesian hierarchical model, for the case .
Figure 2Different field samples from the distribution (on the left) with the power spectrum (top), (middle), (bottom). On the left, the field values themselves are plotted, on the right an exponential function is applied to those and the fields are normalized, i.e., as in our formulation (Same colors/line styles on the right and the left indicate the same underlying functions (colors itself chosen just for distinguishability).
Figure 3Illustration of a Bayesian Causal Inference run on synthetic data generated for causality . Here, the method clearly favours this causality with an odds ratio of . (a) Synthetic data, with causality ; (b) Count histogram () and inferred for the model in the direction ; (c) Count histogram () and inferred for the model in the direction ; (d) Values of terms in and . Smaller values increase the probability of the respective direction.
Accuracy for the synthetic data benchmark. All parameters for the forward model besides the mentioned one are kept to default values, namely , , .
| Model | Default | 30 Samples | 10 Samples | ||||
|---|---|---|---|---|---|---|---|
| BCI | 0.98 | 0.94 | 0.90 | 0.93 | 0.97 | 0.92 | 0.75 |
| LiNGAM | 0.30 | 0.31 | 0.40 | 0.23 | 0.21 | 0.44 | 0.45 |
| ANM-HSIC | 1.00 | 0.98 | 0.94 | 0.99 | 1.00 | 0.91 | 0.71 |
| ANM-MML | 1.00 | 0.99 | 0.99 | 1.00 | 1.00 | 0.98 | 0.69 |
| IGCI | 0.65 | 0.60 | 0.58 | 0.24 | 0.09 | 0.48 | 0.40 |
| CGNN | 0.72 | 0.75 | 0.77 | 0.57 | 0.22 | 0.46 | 0.39 |
Accuracy for TCEP Benchmark.
| Model | TCEP | TCEP with 75 Samples |
|---|---|---|
| BCI | 0.64 | 0.60 |
| ANM-HSIC | 0.63 | 0.54 |
| ANM-MML | 0.58 | 0.56 |
| IGCI | 0.66 | 0.62 |
| CGNN | 0.70 | 0.69 |