| Literature DB >> 35205507 |
Xiangxiang Zhang1,2,3, Wenkai Hu1,2,3, Fan Yang4.
Abstract
Causality inference is a process to infer Cause-Effect relations between variables in, typically, complex systems, and it is commonly used for root cause analysis in large-scale process industries. Transfer entropy (TE), as a non-parametric causality inference method, is an effective method to detect Cause-Effect relations in both linear and nonlinear processes. However, a major drawback of transfer entropy lies in the high computational complexity, which hinders its real application, especially in systems that have high requirements for real-time estimation. Motivated by such a problem, this study proposes an improved method for causality inference based on transfer entropy and information granulation. The calculation of transfer entropy is improved with a new framework that integrates the information granulation as a critical preceding step; moreover, a window-length determination method is proposed based on delay estimation, so as to conduct appropriate data compression using information granulation. The effectiveness of the proposed method is demonstrated by both a numerical example and an industrial case, with a two-tank simulation model. As shown by the results, the proposed method can reduce the computational complexity significantly while holding a strong capability for accurate casuality detection.Entities:
Keywords: causality; information granulation; oscillation; root cause; transfer entropy
Year: 2022 PMID: 35205507 PMCID: PMC8871421 DOI: 10.3390/e24020212
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1The framework diagram of the proposed method.
Figure 2The schematic of the information granulation process. (a) Original time series; (b) Discretization; (c) Information granulation for each subsequence.
IG-based TEs and thresholds (in the brackets).
| X | Y | Z | |
|---|---|---|---|
| X | 0.72 (0.34) | 0.99 (0.30) | |
| Y | 0 | 0.22 (0.41) | |
| Z | 0 | 2.56 (0.51) |
Figure 4The information flow pathways for the numerical example.
Figure 5The trends of IG-based TEs under three different dimensions of the granular time series versus the window length. Subplots (a), (b), and (c) correspond to the results based on the lower support , the core , and the upper support , respectively.
TEs and thresholds (in the brackets).
| X | Y | Z | |
|---|---|---|---|
| X | 1.57 (0.31) | 1.44 (0.15) | |
| Y | 0.07 (0.20) | 0.05 (0.16) | |
| Z | 0.12 (0.25) | 2.60 (0.36) |
Calculation time using the traditional TE and the IG-based TE.
| Traditional TE | IG-Based TE | |
|---|---|---|
| Calculation time |
Figure 6Diagram of the two-tank process.
Figure 7The time series of the original variables.
Figure 8An example of information granulation for .
IG-based TEs and thresholds (in the brackets).
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| 1.80 (0.24) | 0.82 (0.20) | 1.95 (0.21) | 1.47 (0.24) | |
|
| 0.03 (0.16) | 1.10 (0.21) | 1.16 (0.26) | 1.82 (0.25) | |
|
| 0.11 (0.21) | 0 | 0.53 (0.15) | 0.61 (0.20) | |
|
| 0 | 0.08 (0.24) | 0.85 (0.15) | 1.93 (0.20) | |
|
| 0 | 0.04 (0.21) | 0.14 (0.26) | 1.04 (0.16) |
Figure 9The information flow pathways for the two-tank system.
TEs and thresholds (in the brackets).
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| 1.05 (0.24) | 0.88 (0.36) | 1.08 (0.18) | 0.90 (0.25) | |
|
| 0.03 (0.16) | 0.80 (0.43) | 1.04 (0.39) | 0.72 (0.30) | |
|
| 0.09 (0.20) | 0 | 0.98 (0.48) | 0.78 (0.29) | |
|
| 0 | 0.08 (0.16) | 0.88 (0.35) | 0.94 (0.24) | |
|
| 0 | 0.04 (0.18) | 0.14 (0.26) | 1.05 (0.40) |
IG-based DTEs and thresholds.
| From | Intermediate Variables | IG-Based DTE | Thresholds |
|---|---|---|---|
|
|
| 0.08 | 0.20 |
|
|
| 0.11 | 0.21 |
|
|
| 0.05 | 0.24 |
|
|
| 0.05 | 0.24 |
|
|
| 0.13 | 0.25 |
|
|
| 0.06 | 0.15 |
|
|
| 0.05 | 0.15 |
|
|
| 0.04 | 0.20 |
|
|
| 0 |
Figure 10The direct information flow pathways for the two-tank system.
Calculation time using the traditional TE and the IG-based TE.
| Traditional TE | IG-Based TE | |
|---|---|---|
| Calculation time |
Detailed calculation time using the IG-based TE.
| Delay Estimation | Calculation of IG | Calculation of TE | |
|---|---|---|---|
| Calculation time |
The value of TE versus the orders.
| IG-Based TE ( | |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|