| Literature DB >> 35327862 |
Shubhadeep Chakraborty1, Ali Shojaie1.
Abstract
The PC and FCI algorithms are popular constraint-based methods for learning the structure of directed acyclic graphs (DAGs) in the absence and presence of latent and selection variables, respectively. These algorithms (and their order-independent variants, PC-stable and FCI-stable) have been shown to be consistent for learning sparse high-dimensional DAGs based on partial correlations. However, inferring conditional independences from partial correlations is valid if the data are jointly Gaussian or generated from a linear structural equation model-an assumption that may be violated in many applications. To broaden the scope of high-dimensional causal structure learning, we propose nonparametric variants of the PC-stable and FCI-stable algorithms that employ the conditional distance covariance (CdCov) to test for conditional independence relationships. As the key theoretical contribution, we prove that the high-dimensional consistency of the PC-stable and FCI-stable algorithms carry over to general distributions over DAGs when we implement CdCov-based nonparametric tests for conditional independence. Numerical studies demonstrate that our proposed algorithms perform nearly as good as the PC-stable and FCI-stable for Gaussian distributions, and offer advantages in non-Gaussian graphical models.Entities:
Keywords: FCI algorithm; PC algorithm; causal structure learning; consistency; high dimensionality; nonparametric testing
Year: 2022 PMID: 35327862 PMCID: PMC8947566 DOI: 10.3390/e24030351
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Comparison of the average structural Hamming distances (SHD) of nonPC and PC-stable algorithms across simulation studies.
| Normal | Copula | |||||
|
|
|
| nonPC | PC-stable | nonPC | PC-stable |
| 50 | 9 | 1.4 | 3.35 | 3.05 | 5.55 | 5.75 |
| 100 | 27 | 2.0 | 14.55 | 11.00 | 25.6 | 28.6 |
| 150 | 81 | 2.4 | 53.70 | 43.45 | 97.3 | 121.3 |
| 200 | 243 | 2.8 | 186.2 | 183.4 | 331.00 | 471.45 |
| Mixture | Nonlinear SEM | |||||
|
|
|
| nonPC | PC-stable | nonPC | PC-stable |
| 50 | 9 | 1.4 | 3.8 | 3.5 | 2.9 | 3.7 |
| 100 | 27 | 2.0 | 17.75 | 18.00 | 15.05 | 20.05 |
| 150 | 81 | 2.4 | 69.05 | 77.75 | 62.583 | 95.083 |
| 200 | 243 | 2.8 | 250.3 | 336.1 | 213.70 | 375.45 |
Comparison of the average structural Hamming distances (SHD) of nonFCI and FCI-stable algorithms across simulation studies.
| Normal | Copula | Mixture | Nonlinear SEM | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
| nonFCI | FCI-Stable | nonFCI | FCI-Stable | nonFCI | FCI-Stable | nonFCI | FCI-Stable |
| 10 | 2.0 | 7.15 | 7.60 | 1.3 | 1.8 | 5.65 | 6.80 | 7.15 | 8.20 |
| 20 | 2.0 | 14.55 | 17.60 | 4.55 | 6.85 | 13.65 | 18.55 | 19.0 | 20.8 |
| 30 | 2.0 | 27.65 | 33.95 | 5.25 | 10.15 | 19.3 | 27.8 | 33.40 | 37.85 |
| 100 | 3.0 | 109.30 | 150.35 | 26.95 | 60.05 | 62.25 | 111.10 | 115.2 | 149.0 |
| 200 | 3.0 | 287.75 | 371.40 | 76.733 | 157.267 | 136.05 | 255.10 | 289.6 | 354.1 |
Figure 1CPDAGs estimated by the nonPC and PC-stable algorithms for the Montana poll dataset.
Comparison of the SHD between the skeletons estimated from the original and the categorized protein expression data by the nonPC and PC-stable algorithms.
| nonPC | PC-Stable |
|---|---|
| 22 | 79 |