| Literature DB >> 33808377 |
Martina Chvosteková1, Jozef Jakubík1, Anna Krakovská1.
Abstract
In this study, the information flow time arrow is investigated for stochastic data defined by vector autoregressive models. The time series are analyzed forward and backward by different Granger causality detection methods. Besides the normal distribution, which is usually required for the validity of Granger causality analysis, several other distributions of predictive errors are considered. A clear effect of a change in the order of cause and effect on the time-reversed series of unidirectionally connected variables was detected with standard Granger causality test (GC), when the product of the connection strength and the ratio of the predictive errors of the driver and the recipient was below a certain level, otherwise bidirectional causal connection was detected. On the other hand, opposite causal link was detected unconditionally by the methods based on the time reversal testing, but they were not able to detect correct bidirectional connection. The usefulness of the backward analysis is manifested in cases where falsely detected unidirectional connections can be rejected by applying the result obtained after the time reversal, and in cases of uncorrelated causally independent variables, where the absence of a causal link detected by GC on the original series should be confirmed on the time-reversed series.Entities:
Keywords: Granger causality; endogeneity; predictive error; time reversal
Year: 2021 PMID: 33808377 PMCID: PMC8066447 DOI: 10.3390/e23040409
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
False positive rates (in %) for causally independent variables. The results for eight discussed testing procedures (inv—results in the time-reversed series) are presented, with the worst (more than ) FPR highlighted in bold.
| Condition | Sample | GC | inv | PEGC | inv | mTRGC | inv | mTRGC* | GC | |
|---|---|---|---|---|---|---|---|---|---|---|
| A | 300 | FPR | 2.6 | 2.6 | 0.6 | 0.6 | 0 | 0 |
| 2.5 |
| 3000 | FPR | 2.5 | 2.5 | 0.6 | 0.6 | 0 | 0 |
| 2.5 | |
| B | 300 | FPR | 2.7 | 2.6 | 1.1 | 0.7 | 0 | 0 |
| 2.6 |
| 3000 | FPR | 2.5 | 2.5 | 1.1 | 0.6 | 0 | 0 |
| 2.5 | |
| C | 300 | FPR | 2.6 | 2.6 | 0.9 | 0.6 | 0 | 0 |
| 2.5 |
| 3000 | FPR | 2.5 | 2.5 | 0.9 | 0.6 | 0 | 0 |
| 2.4 | |
| D | 300 | FPR | 2.7 | 2.6 | 0.6 | 0.7 | 0 | 0 |
| 2.6 |
| 3000 | FPR | 2.5 | 2.5 | 0.6 | 0.6 | 0 | 0 |
| 2.4 | |
| E | 300 | FPR | 2.7 | 2.7 | 2.4 | 2.3 | 0 | 0 |
| 2.6 |
| 3000 | FPR | 2.5 | 2.5 |
|
| 0 | 0 |
| 2.5 | |
| F | 300 | FPR |
|
| 1.7 | 1 | 0 | 0 |
|
|
| 3000 | FPR |
|
|
| 1.8 | 0 | 0 |
|
| |
| G | 300 | FPR | 2.6 |
| 0.7 |
| 0 | 2.6 |
| 2.2 |
| 3000 | FPR | 2.6 |
| 0.6 |
| 0 |
|
| 1.1 |
False positive rates and false negatives rates (in %) for unidirectionally causally connected variables. The results for eight discussed testing procedures (inv—results in the time-reversed series) are presented, with the worst (more than ) FPR highlighted in bold.
| Condition | Sample | GC | inv | PEGC | inv | mTRGC | inv | mTRGC* | GC | |
|---|---|---|---|---|---|---|---|---|---|---|
| A | 300 | FPR | 2.2 |
| 0.7 | 1.8 | 0 | 0 |
| 0.3 |
| FNR | 10.3 | 10.3 | 22.2 | 22.3 | 14 | 14.2 | 3.5 | 10.3 | ||
| 3000 | FPR | 2.3 |
| 0.9 |
| 0 | 0 | 0.7 | 0.1 | |
| FNR | 2.5 | 2.5 | 6.4 | 6.5 | 4 | 4 | 0.7 | 2.5 | ||
| B | 300 | FPR | 2.2 |
| 1.3 | 2.2 | 0 | 0 |
| 0.3 |
| FNR | 10.7 | 10.7 | 31.6 | 22 | 14.8 | 15.4 | 3.6 | 10.7 | ||
| 3000 | FPR | 2.3 |
| 1.6 |
| 0 | 0 | 0.8 | 0.1 | |
| FNR | 2.7 | 2.7 | 9.7 | 6.3 | 4 | 4 | 0.8 | 2.7 | ||
| C | 300 | FPR | 2.3 |
| 1.1 | 2 | 0 | 0 |
| 0.3 |
| FNR | 10.7 | 10.7 | 28.2 | 22.5 | 15.4 | 15.4 | 3.7 | 10.8 | ||
| 3000 | FPR | 2.4 |
| 1.3 |
| 0 | 0 | 0.8 | 0.1 | |
| FNR | 2.6 | 2.6 | 8.7 | 6.5 | 4 | 4 | 0.8 | 2.7 | ||
| D | 300 | FPR | 2.3 |
| 0.6 | 1.4 | 0 | 0 |
| 0.4 |
| FNR | 12 | 12 | 26.5 | 25.4 | 18 | 17.2 | 4.1 | 12 | ||
| 3000 | FPR | 2.4 |
| 0.7 |
| 0 | 0 | 0.9 | 0.1 | |
| FNR | 3 | 3 | 7.9 | 7.4 | 4.8 | 4.8 | 0.9 | 3 | ||
| E | 300 | FPR | 2.4 |
| 2.3 |
| 0 | 0 |
| 0.4 |
| FNR | 12.5 | 12.4 | 21 | 21.1 | 17.4 | 17.4 | 4.3 | 12.5 | ||
| 3000 | FPR | 2.6 |
|
|
| 0 | 0 | 1 | 0.1 | |
| FNR | 3.4 | 3.4 | 5.9 | 5.8 | 5.2 | 5.2 | 1 | 3.4 | ||
| F | 300 | FPR |
|
| 1.9 | 2 | 0 | 0 |
| 0.7 |
| FNR | 15.9 | 15.9 | 17.7 | 39.9 | 24.2 | 24.4 | 5.9 | 16 | ||
| 3000 | FPR |
|
|
|
| 0 | 0 | 1.5 | 0.2 | |
| FNR | 4.4 | 4.4 | 5.3 | 12.5 | 6.2 | 6.2 | 1.5 | 4.5 | ||
| G | 300 | FPR | 2.4 |
| 0.8 |
| 0 | 0 |
| 0.3 |
| FNR | 10.7 | 10 | 23.6 | 21.7 | 15.4 | 14.2 | 3.7 | 10.8 | ||
| 3000 | FPR | 2.4 |
| 1 |
| 0 | 0 | 0.7 | 0.1 | |
| FNR | 2.6 | 2.4 | 6.7 | 6.3 | 4.4 | 3.8 | 0.8 | 2.6 |
False negatives rates (in %) for bidirectionally causally connected variables. The results for eight discussed testing procedures (inv—results in the time-reversed series).
| Condition | Sample | GC | inv | PEGC | inv | mTRGC | inv | mTRGC* | GC | |
|---|---|---|---|---|---|---|---|---|---|---|
| A | 300 | FNR | 9.7 | 13.7 | 24.7 | 28.6 | 62.4 | 62 | 50 | 50 |
| 3000 | FNR | 1.8 | 5.1 | 5.8 | 9.1 | 53 | 53 | 50 | 50 | |
| B | 300 | FNR | 12.7 | 21.5 | 42.4 | 34.5 | 57.9 | 57.9 | 50 | 50.1 |
| 3000 | FNR | 2 | 8.2 | 11 | 14 | 52.3 | 52.6 | 50 | 50 | |
| C | 300 | FNR | 12.8 | 21.6 | 38.3 | 35.5 | 57.9 | 57.5 | 50 | 50.1 |
| 3000 | FNR | 2 | 8.3 | 9.3 | 14.2 | 51.9 | 51.9 | 50 | 50 | |
| D | 300 | FNR | 11.2 | 15.9 | 25.5 | 27.9 | 58.4 | 58.4 | 50 | 50 |
| 3000 | FNR | 2.3 | 6.6 | 7.2 | 10.9 | 52.1 | 52.1 | 50 | 50 | |
| E | 300 | FNR | 13 | 16.4 | 25.2 | 28.7 | 62 | 62.8 | 50 | 50 |
| 3000 | FNR | 3.3 | 6.2 | 5.8 | 8.7 | 53 | 53 | 50 | 50 | |
| F | 300 | FNR | 33.7 | 34.4 | 34.7 | 61.7 | 66.2 | 65.8 | 50 | 50.6 |
| 3000 | FNR | 2.1 | 9 | 5.6 | 21.9 | 54.5 | 54.9 | 50 | 50 | |
| G | 300 | FNR | 10.3 | 19.9 | 27.6 | 34.1 | 61.3 | 60.9 | 50 | 50 |
| 3000 | FNR | 2 | 7.3 | 6 | 13.8 | 53 | 53 | 50 | 50 |
Figure 1Rates of false detections obtained by GC on time series of length generated with normally distributed errors (condition A) for unidirectionally causally connected variables : (a) false positive rates observed on original time series, (b) false negative rates observed on original time series, (c) false positive rates observed on time-reversed series, (d) false negative rates observed on time-reversed series.
Figure 2Correlation for time series of length generated with independent normally distributed errors (condition A) for unidirectionally connected variables : (a) correlation of variables, (b) correlation of predictive error elements fitted by VAR on original time series, (c) correlation (multiplied by −1) of predictive error elements fitted by VAR on time-reversed series.