| Literature DB >> 35322096 |
Jian Sun1, Chenye Wu2,3, Weihua Peng4, Jiayan Huang5, Cuiyun Han5, Yong Zhu6, Yajuan Lyu6.
Abstract
Spurred by causal structure learning (CSL) ability to reveal the cause-effect connection, significant research efforts have been made to enhance the scalability of CSL algorithms in various artificial intelligence applications. However, less effort has been made regarding the stability and the interpretability of CSL algorithms. Thus, this work proposes a self-correction mechanism that embeds domain knowledge for CSL, improving the stability and accuracy even in low-dimensional but high-noise environments by guaranteeing a meaningful output. The suggested algorithm is challenged against multiple classic and influential CSL algorithms in synthesized and field datasets. Our algorithm achieves a superior accuracy on the synthesized dataset, while on the field dataset, our method interprets the learned causal structure as a human preference for investment, coinciding with domain expert analysis.Entities:
Mesh:
Year: 2022 PMID: 35322096 PMCID: PMC8942159 DOI: 10.1038/s41598-022-08879-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Acronym table.
| Acronym | Meaning |
|---|---|
| Covid-19 | Coronavirus disease 2019 |
| HP | Human preference |
| CSL | Causal structure learning |
| DAG(s) | Directed acyclic graph(s) |
| CI(s) | conditional independence(s) |
| CPDAG(s) | Completed partially directed acyclic graph(s) |
| DK | Domain knowledge |
| RCoT | Randomized conditional correlation test |
| (MA)SHD | (Minimal average) structural hamming distance |
| MAKL-d | Minimal average Kullback–Leibler divergence |
| pc.stable | Peter Spirtes & Clark Glymour algorithm (stable version) |
| RPC* | Robust pc.stable algorithm |
| iamb(.fdr) | Incremental Association Markov Blanket (with False Discovery Rate Correction) algorithm |
| fast/inter.iamb | Fast/Interleaved Incremental Association Markov Blanket algorithm |
| gs | Grow-Shrink algorithm |
| mmhc | Max-Min Hill Climbing algorithm |
| h2pc | Hybrid Hybrid Parents and Children algorithm |
| hc | Hill Climbing search algorithm |
| tabu | Tabu search algorithm |
Figure 1We challenge ten influential CSL algorithms. Six of them are constraint-based: pc.stable[10], gs[11], iamb/iamb.fdr[12], fast.iamb[13] and inter.iamb[14]. Two are score-based, employing different searching algorithms: hc[15] and tabu[16]. The remaining two are hybrid algorithms: mmhc[17] and h2pc[18]. All of them are implemented by employing the bnlearn R package[19]. Five algorithms learn poor structures, which conflict with DK or contain self-conflicts, illustrated in (a–e). In the structures learned by pc.stable and mmhc, the Covid has no influence on the financial market. Besides, h2pc, tabu and hc suggest financial market can influence the daily confirmed diagnosis. In addition to the real dataset, all constraint-based algorithms are tested on 9 synthetic dataset and the number of self-conflict are counted in (f).
Figure 3The gray part broadly highlights why constraint-based algorithm can produce contradictory results. The orange part shows how the CI testing results influence each step of constraint-based algorithms. The blue part broadly introduces the processes of constraint-based algorithms. The green part is how we build a self-correction mechanism to revise the CI testing results and get consistent causal structures.
Figure 2Traditional constraint-based algorithm analysis. We take the state-of-the-art algorithm, Randomized conditional Correlation Test (RCoT)[28], as an example. The tested dataset is generated from the structure shown in (a). The generation process is similar to that elaborated in Synthetic Data part. Several CI hypotheses are selected, and RCoT is conducted 100 times on each CI hypothesis, with the returned p values shown in (c). The y-axis shows CI hypotheses in the form of . The x-axis shows the corresponding 100 p values. The red forks are the p values of the CI hypotheses that should be accepted, and the brown dots should be rejected. However, it is hard to statistically distinguish whether a CI hypothesis should be rejected or not, making the decisions solely based on CI testings not adequately convincing. (b) shows an example of CI testing error propagating, which may ultimately mislead the entire algorithm.
Position of this work in the literature.
| pc.stable | gs | iamb(.fdr) | fast/inter.iamb | hc/tabu | h2pc/mmhc | RPC* | |
|---|---|---|---|---|---|---|---|
| Fully oriented graph | |||||||
| DK embedded | |||||||
| Constraint embedded | |||||||
| Self-correction mechanism |
Synthesized dataset generation.
| (a) Structures | (b) Six adopted noise types | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Name | Cancer | Survey | Asia | Sachs | Child | Type\ | 0.5 | 1 | 2 |
| Node | 5 | 6 | 8 | 11 | 20 | Gaussian | N(0,0.25) | N(0,1) | N(0,4) |
| Edges | 4 | 6 | 8 | 17 | 25 | Uniform | U( | U( | U( |
Figure 4Performance on synthesized datasets.
Figure 5Changes on stock, securities and gold before and after the outbreak of Covid-19.
Figure 6Influence of Covid-19 on causal structures between financial products.
Figure 7Cases violating the consistency conditions. Suppose that the accepted CI hypotheses set is . The introduced skeleton is shown in (a), where forms a v-structure based on the first CI hypothesis, and forms another v-structure based on the second CI hypothesis. Edge is oriented in the opposite directions. (b) A new v-structure, where is . In its skeleton, and form two v-structures. Thus, there should be an additional v-structure . However, this v-structure cannot be produced in the v-structure orientation.
Figure 8Early termination: considering this figure as an example, in the V-structure orientation process, the CI hypothesis will form a v-structure . Thus we record for and for . Following the same idea, we record for since . Then the rules process orients , otherwise a new v-structure will be produced. Then we record for since the orientation of is the reason to orient . Now a cycle is detected. are reported since they are the evidence path of the cycle’s edges. The most recorded pair is , and the related CI hypotheses are and . These will not form v-structures by decorating the former one as and the latter one as .
MASHD of CPDAGs.
| Structure | Noise | Algorithms | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RPC* | pc.stable | gs | iamb | fast.iamb | inter. iamb | iamb. fdr | hc | tabu | mmhc | h2pc | ||
| Cancer | N(0,0.25) | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.40 | 0.80 | 0.40 | 0.40 | |
| N(0,1) | 1.00 | 0.80 | 0.80 | 0.80 | 0.80 | 0.80 | 0.60 | 1.00 | 0.60 | 0.60 | ||
| N(0,4) | 1.20 | 1.20 | 1.20 | 1.20 | 1.20 | 1.00 | 1.20 | 1.20 | 1.20 | 1.20 | ||
| U( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.40 | 0.40 | 0.40 | 0.40 | ||
| U( | 1.00 | 1.20 | 1.20 | 1.20 | 1.20 | 0.80 | 0.80 | 1.20 | 0.80 | 0.80 | ||
| U( | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.40 | 0.40 | 0.40 | 0.40 | ||
| Survey | N(0,0.25) | 1.17 | 1.50 | 1.50 | 1.50 | 1.50 | 1.33 | 1.33 | 1.00 | 1.17 | 1.17 | |
| N(0,1) | 1.67 | 1.67 | 1.67 | 1.67 | 1.67 | 1.67 | 0.67 | 0.67 | 0.67 | 0.67 | ||
| N(0,4) | 1.33 | 1.33 | 1.33 | 1.33 | 1.33 | 1.17 | 1.00 | 1.00 | 1.00 | 1.00 | ||
| U( | 1.17 | 1.17 | 1.17 | 1.17 | 1.17 | 1.17 | 1.00 | 1.00 | 0.83 | 0.83 | ||
| U( | 1.00 | 1.17 | 1.17 | 1.17 | 1.17 | 1.17 | 0.67 | 0.67 | 0.67 | 0.67 | ||
| U( | 1.67 | 1.67 | 1.67 | 1.67 | 1.67 | 1.67 | 1.67 | 1.67 | 1.67 | 1.67 | ||
| Asia | N(0,0.25) | 1.13 | 0.88 | 0.88 | 0.88 | 0.88 | 0.75 | 1.00 | 1.00 | 0.63 | 0.70 | |
| N(0,1) | 0.63 | 0.38 | 0.38 | 0.38 | 0.38 | 0.50 | 0.63 | 0.63 | 0.63 | 0.63 | ||
| N(0,4) | 0.75 | 0.88 | 0.88 | 0.88 | 0.88 | 1.00 | 0.50 | 0.88 | 0.50 | 0.50 | ||
| U( | 1.13 | 0.75 | 0.75 | 0.75 | 0.75 | 0.75 | 0.75 | 0.75 | 0.75 | 0.75 | ||
| U( | 0.75 | 0.88 | 0.88 | 0.88 | 0.88 | 0.88 | 0.63 | 0.75 | 0.63 | 0.63 | ||
| U( | 0.75 | 1.13 | 1.13 | 1.13 | 1.13 | 1.13 | 0.88 | 1.00 | 0.88 | 0.88 | ||
| Sachs | N(0,0.25) | 1.82 | 1.91 | 1.91 | 1.91 | 1.91 | 2.00 | 2.45 | 2.45 | 2.00 | 2.00 | |
| N(0,1) | 1.82 | 1.64 | 1.64 | 1.64 | 1.64 | 1.64 | 2.18 | 2.00 | 1.73 | 1.73 | ||
| N(0,4) | 1.82 | 1.73 | 1.73 | 1.73 | 1.73 | 1.73 | 2.00 | 1.91 | 1.82 | 1.91 | ||
| U( | 1.91 | 2.00 | 2.00 | 2.00 | 2.00 | 1.82 | 2.18 | 2.18 | 1.91 | 2.00 | ||
| U( | 1.82 | 1.91 | 1.91 | 2.00 | 1.91 | 1.91 | 2.00 | 2.00 | 1.91 | 1.91 | ||
| U( | 1.73 | 1.64 | 1.64 | 1.64 | 1.64 | 1.64 | 1.91 | 1.91 | 1.91 | 1.91 | ||
| Child | N(0,0.25) | 1.50 | 2.00 | 1.90 | 2.00 | 1.90 | 1.95 | 3.15 | 3.15 | 1.55 | 1.60 | |
| N(0,1) | 1.70 | 1.65 | 1.65 | 1.70 | 1.65 | 1.70 | 2.45 | 2.50 | 1.55 | 1.55 | ||
| N(0,4) | 1.60 | 1.45 | 1.45 | 1.65 | 1.45 | 1.70 | 1.60 | 1.65 | 1.45 | 1.45 | ||
| U( | 1.20 | 0.90 | 0.90 | 0.90 | 0.90 | 1.05 | 1.45 | 1.40 | 1.25 | 1.20 | ||
| U( | 1.60 | 1.50 | 1.55 | 1.55 | 1.55 | 1.35 | 2.05 | 2.05 | 1.45 | 1.45 | ||
| U( | 1.65 | 1.55 | 1.55 | 1.55 | 1.55 | 1.60 | 1.75 | 1.90 | 1.55 | 1.55 | ||
Significant values are in bold.
Figure 9MASHD of DAGs.
MASHD of DAGs and MAKL-d.
| Structure | Noise | Algorithms | ||||
|---|---|---|---|---|---|---|
| RPC* | hc | tabu | mmhc | h2pc | ||
| cancer | N(0,0.25) | 0.40/0.90 | 0.80/0.90 | 0.40/0.90 | 0.40/0.90 | |
| N(0,1) | 0.60/ | 1.00/ | 0.60/ | 0.60/ | ||
| N(0,4) | 1.20/ | 1.20/ | 1.20/ | 1.20/ | ||
| U( | 0.40/ | 0.40/ | 0.40/ | 0.40/ | ||
| U( | 0.80/ | 1.20/0.92 | 0.80/ | 0.80/ | ||
| U( | 0.40/ | 0.40/ | 0.40/ | 0.40/ | ||
| survey | N(0,0.25) | 1.33/1.10 | 1.00/1.00 | 1.17/0.91 | 1.17/0.91 | |
| N(0,1) | 0.67/0.61 | 0.67/0.60 | 0.67/0.61 | 0.67/0.61 | ||
| N(0,4) | 1.00/0.30 | 1.00/0.30 | 1.00/0.30 | 1.00/0.30 | ||
| U( | 1.00/0.74 | 1.00/0.77 | 0.83/0.55 | 0.83/0.55 | ||
| U( | 0.67/0.72 | 0.67/0.72 | 0.67/0.72 | 0.67/0.72 | ||
| U( | 1.17/0.23 | 1.17/0.23 | 1.17/0.23 | 1.17/0.23 | ||
| asia | N(0,0.25) | 0.88/0.87 | 0.88/0.87 | 0.50/0.89 | 0.63/0.88 | |
| N(0,1) | 0.50/ | 0.50/ | 0.50/ | 0.50/ | ||
| N(0,4) | 0.38/0.77 | 0.88/0.74 | 0.38/0.77 | 0.38/0.77 | ||
| U( | 0.63/1.00 | 0.63/1.00 | 0.63/1.00 | 0.63/1.00 | ||
| U( | 0.50/0.92 | 0.75/0.96 | 0.50/0.92 | 0.50/0.92 | ||
| U( | 0.75/0.90 | 0.88/0.96 | 0.75/0.90 | 0.75/0.90 | ||
| sachs | N(0,0.25) | 1.45/1.13 | 1.45/1.13 | 1.23/1.00 | 1.36/0.93 | |
| N(0,1) | 1.27/1.01 | 1.45/0.84 | 1.00/0.77 | 1.00/0.79 | ||
| N(0,4) | 1.18/0.98 | 1.27/0.97 | 1.00/0.84 | 1.09/0.91 | ||
| U( | 1.27/1.06 | 1.36/1.06 | 1.09/1.00 | 1.18/1.07 | ||
| U( | 1.18/1.06 | 1.27/1.06 | 1.18/1.07 | 0.91/1.12 | ||
| U( | 1.36/0.82 | 1.36/0.82 | 1.36/0.87 | 1.36/0.87 | ||
| child | N(0,0.25) | 2.75/ | 2.80/1.36 | 1.25/1.16 | 1.30/1.21 | |
| N(0,1) | 2.00/ | 2.20/1.07 | 1.10/ | 1.10/1.07 | ||
| N(0,4) | 1.50/0.83 | 1.55/0.83 | 1.20/0.87 | 1.20/0.87 | ||
| U( | 1.05/ | 1.00/ | 0.80/1.14 | 0.85/1.10 | ||
| U( | 1.60/1.13 | 1.60/1.13 | 0.95/1.13 | 0.95/1.08 | ||
| U( | 1.45/1.04 | 1.65/1.09 | 1.25/0.96 | 1.25/0.96 | ||
Significant values are in bold.
The numbers before (after) forward slashes are MASHD (MAKL-d).