Literature DB >> 29167736

On the interpretability and computational reliability of frequency-domain Granger causality.

Luca Faes¹, Sebastiano Stramaglia^2,3, Daniele Marinazzo⁴.

Abstract

This Correspondence article is a comment which directly relates to the paper "A study of problems encountered in Granger causality analysis from a neuroscience perspective" ( Stokes and Purdon, 2017). We agree that interpretation issues of Granger causality (GC) in neuroscience exist, partially due to the historically unfortunate use of the name "causality", as described in previous literature. On the other hand, we think that Stokes and Purdon use a formulation of GC which is outdated (albeit still used) and do not fully account for the potential of the different frequency-domain versions of GC; in doing so, their paper dismisses GC measures based on a suboptimal use of them. Furthermore, since data from simulated systems are used, the pitfalls that are found with the used formulation are intended to be general, and not limited to neuroscience. It would be a pity if this paper, even if written in good faith, became a wildcard against all possible applications of GC, regardless of the large body of work recently published which aims to address faults in methodology and interpretation. In order to provide a balanced view, we replicate the simulations of Stokes and Purdon, using an updated GC implementation and exploiting the combination of spectral and causal information, showing that in this way the pitfalls are mitigated or directly solved.

Entities: Chemical

Keywords: Granger-Geweke causality; brain connectivity; directed coherence; frequency-domain connectivity; physiological oscillations; spectral decomposition; time series analysis; vector autoregressive models

Year: 2017 PMID： 29167736 PMCID： PMC5676195 DOI： 10.12688/f1000research.12694.1

Source DB: PubMed Journal: F1000Res ISSN： 2046-1402

Correspondence

Granger causality (GC) [1] is an extremely popular statistical tool used to analyze directed interactions from multivariate time series measured from coupled dynamical systems. A particularly appealing aspect of the notion of GC is that it can be formulated in the frequency domain, and is thus eligible for the analysis of signals that are rich of oscillatory content such as those commonly encountered in neuroscience and physiology [2, 3]. The spectral formulation of GC is obtained by elaborating in the frequency domain the parameters of the linear vector autoregressive (VAR) model that fit the observed multivariate time series. A main approach to do this was developed by Geweke [4, 5], yielding bivariate and conditional frequency domain measures of the so-called Granger-Geweke causality (GGC). An alternative framework stems from the works of Kaminski et al [6]. and Baccalà et al [7, 8]., who derived measures like the directed coherence (DC) and the partial DC (PDC), quantifying the total and direct directed influence of one time series over another in a fully multivariate setting (see references 9– 11 for comprehensive treatments). In their recent work [12], Stokes and Purdon performed a critical evaluation of frequency-domain GC computed within the Geweke framework, evidencing in two simulation studies some computational and interpretational problems associated with the GGC measures. Specifically, they showed that – even when the systems generating the observed data belong to the finite-order VAR model class – spectral GGC cannot be reliably estimated and cannot recover the functional oscillatory structure underlying the data. These observations led the authors to conclude that the notion of causality quantified by GGC, and by other Granger causality measures in general, often yield counterintuitive and misleading results, thus being incompatible with the objectives of many neuroscience studies. We definitely agree that GC and lag-based data-driven methods in general cannot provide measures of “causality” as intended in other applications (see references 9 and 13 for a thoughtful distinction between data-driven and model-based approaches). We also share the view that the assumptions of linearity and stationarity, as well as the presence of unobserved variables, noise or inappropriate sampling may pose theoretical and practical problems which can severely impair both the formulation and the computation of spectral GC measures – this has been stated by Stokes and Purdon [12] and in previous studies [2, 3, 14]. On the other hand we think that, based on the way simulated data have been analyzed and interpreted by Stokes and Purdon [12], frequency domain GC methods have been dismissed based on a suboptimal (even though frequently applied) formulation of GGC, and based on the lack of direct consideration of the DC/PDC framework. In this contribution, we repeat the simulations of Stokes and Purdon [12], and suggest that the negative conclusions based on the results of such simulations are overstated. We show that spectral GGC estimates can be obtained with a high computational reliability if proper estimation approaches are employed, and the interpretation of frequency domain causality measures can be meaningfully performed if spectral and causal information are properly combined. The codes for running our analyses are based on existing Matlab © toolboxes [11, 15, 16] and are provided as supplementary data alongside this article. The first simulation of Stokes and Purdon [12] shows that, due to the modeling choices required to compute spectral GGC, this measure cannot be reliably estimated even for simple systems. By generating 100 realizations of this simulation with the same parameters and data length we confirm that, by applying the standard method of fitting separate full and reduced VAR models, spectral GGC estimates display a strong bias ( Figure 1A) or a very large variability ( Figure 1B), depending on the choice of the model order. As explained by Stokes and Purdon [12], this tradeoff between bias and variance arises from the incorrect representation of the reduced model as a VAR process of finite order. Exactly for this reason however, the problem can be overcome employing the state-space (SS) approach [16], which allows to compute GGC in closed form from the SS parameters of any observed VAR process. Here we show that this approach yields highly accurate spectral estimates of GGC, which closely follow the expected profiles over the coupled directions and have negligible magnitude over the uncoupled direction ( Figure 1C); the higher reliability of the SS estimator compared with the standard VAR-based method is evident also looking at single process realizations ( Figure 1D).

Figure 1.

Comparison of conditional frequency-domain Granger-Geweke causality (GGC) profiles computed for the three-node system of Stokes and Purdon [12] (Example 1, where nodes 1, 2, and 3 resonate respectively at 40 Hz, 10 Hz, and 50 Hz, and where unidirectional causality is imposed from node 1 to node 2, and from node 2 to node 3).

Comparison of conditional frequency-domain Granger-Geweke causality (GGC) profiles computed for the three-node system of Stokes and Purdon [12] (Example 1, where nodes 1, 2, and 3 resonate respectively at 40 Hz, 10 Hz, and 50 Hz, and where unidirectional causality is imposed from node 1 to node 2, and from node 2 to node 3).

GGC is computed along the two coupled directions (f 1→2, f 2→3) and along a direction with no coupling (f 3→1). Columns report the distribution of GGC estimates (median and 5 th–95 th percentiles) computed using classical vector autoregressive (VAR) estimation of full and reduced models performed with the true model order p=3 ( a) and with an increased order p=20 ( b), and using state space (SS) estimation ( c), as well as estimates obtained for a single simulation run ( d); in each plot, the true causality values computed from the original model parameters are reported in red. Results evidence the lower bias and variability of spectral GGC computed using the SS method compared to the classical VAR approach. The second simulation of Stokes and Purdon [12] shows that, due to the independence of GGC measures from the intrinsic dynamics of the “receiver” process, the spectral GGC profiles linking this process to its putatively causal “transmitter” process are often misleading, because different systems can have identical causality functions but different receiver dynamics. In Figure 2 we confirm this result both in terms of GGC and using the DC, a spectral causality measure taken from the VAR framework [7] that for bivariate processes like the one simulated here is analytically related to the spectral GGC [15]. However, this invariance property is in our view absolutely reasonable, because the DC has a clear-cut interpretation as the relative amount of spectral power that, at each frequency, arrives to the receiver starting from the transmitter [11]. Nevertheless, the DC is also useful to fully recover the functional oscillatory structure of the observed processes, because it shapes the receiver spectrum to reveal the portion of its spectral power that is “causally” due to the transmitter; this is depicted in the spectral decomposition of Figure 2.

Figure 2.

Theoretical profiles of spectral and causality measures computed for the two-node system of Stokes and Purdon [12] (Example 2, where unidirectional causality is imposed from node 1 to node 2).

Theoretical profiles of spectral and causality measures computed for the two-node system of Stokes and Purdon [12] (Example 2, where unidirectional causality is imposed from node 1 to node 2).

The system is studied setting a resonance frequency of 50 Hz for the transmitter and of 10 Hz (top row panels), 30 Hz (mid row panels) and 50 Hz (bottom row panels) for the receiver. The fact that the power spectral density (PSD) of the transmitter (S 11(f), a) is the same for the three cases induces, together with the unaltered coefficients determining the causal effects, the same profile for the directed coherence DC 1→2(f) ( c) and the spectral GGC measure GG 1→2(f) ( d). However, the different causal contribution of the transmitter on the receiver is revealed by the partial spectrum S 2|1(f)=S 22(f)·DC 1→2(f) (orange line in ( b)), which quantifies the portion of the overall PSD of the receiver (S 22(f), purple line in ( b)) that is causally explained by the transmitter; the non-explained part (S 2|2(f), green line in ( b)) reflects the autonomous dynamics of the receiver. In conclusion, while thanking Stokes and Purdon [12] for pointing out some weaknesses of GGC measures, we think that proper formulations can provide meaningful results of directed dynamical influence, whose interpretation still is bound to the knowledge and good faith of those who write and read related scientific literature. The package contains Matlab scripts and functions that allow to reproduce the simulations of Stokes and Purdon, and to compare for these simulations: (i) the standard vector autoregressive estimator and the updated state space estimator of the spectral Granger-Geweke causality measure; (ii) the spectral Granger-Geweke causality measure with the directed coherence measure. Click here for additional data file.

Data availability

The data referenced by this article are under copyright with the following copyright statement: Copyright: © 2017 Faes L et al. Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication). Dataset 1: Codes to compute frequency-domain Granger causality in linear stochastic processes. The package contains Matlab scripts and functions that allow to reproduce the simulations of Stokes and Purdon, and to compare for these simulations: (i) the standard vector autoregressive estimator and the updated state space estimator of the spectral Granger-Geweke causality measure; (ii) the spectral Granger-Geweke causality measure with the directed coherence measure. DOI, 10.5256/f1000research.12694.d178159 [17]. The correspondence is a comment directly related to “A study of problems encountered in Granger causality analysis from a neuroscience perspective [1]. I completely agree with the authors that there are interpretation issues of Granger causality (and by the way also a lot of wrong use/wrong adaptation in the field) and therefore the whole concept has to be applied with care. But – like the authors clearly stated in their comment – Stokes and Purdon uses 1) a version of frequency-domain Granger causality which is a least sub-optimal and 2) the results they were able to show by means of simulated data should be a general problem not only occurring in the case of neuroscience data. Therefore, I explicitly support the comment of Faes, Stramaglia and Marinazzo. The comment is well written. Basic information concerning the concept of Granger causality and in particular the frequency-domain approach are given (including the coverage of related literature). The main idea behind the paper of Stokes and Purdon and the reason why the authors felt the need to comment on this paper are clearly stated. Then, the simulations of Stokes and Purdon are repeated using an updated Granger causality implementation and a combination of spectral and causal information. Technical details are given, results are correctly presented and well described, and conclusions are drawn. I do not have any major or minor concerns. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. I am glad that Faes et al. have not fallen prey to these considerations and have mounted a fair and elegant, albeit brief, critical appraisal of the PNAS Stokes and Purdon (S/P) paper. Fael et al. have restricted themselves to nailing just three obvious crucial facts that may go unnoticed by the casual reader for whom the authoritativeness of a vehicle like PNAS may stand as a certificate of validity. The first point is that the S/P paper fails to mention how their VAR models were computed. My own AsympPDC package (The AsympPDC Package 3.0 is directly downloadable from http://www.lcs.poli.usp.br/~baccala/pdc/asymp_package_v3.zip. A preliminary version is also available through [7]. Visit http://www.lcs.poli.usp.br/~baccala/pdc/ for future version updates.) provides five different methods: the simplest naive and most popular approach, the least squares method, is the worst performer thanks to error propagation alone. It is important to stress that accurate VAR model estimation is crucial to whichever approach to GC is chosen. Faes et. al correctly supply an alternative method of estimation where no theoretically meaningless ‘negative’ GGC value is observed. The Faes et al. paper has the added obvious merit of including the actual value of GGC computed from the actual true model absentmindedly lacking in the original S/P paper. The second point raised almost in passing by Faes et al. is that the S/P paper may possibly induce its readers to completely disregard frequency domain causality descriptions while S/P mostly glosses over the alternative DC/PDC framework. This is a huge oversight since currently only DC/PDC estimators have statistical theoretically rigorous computable confidence intervals and objective null hypothesis threshold tests [1] - [2] which may be applied using the freely downloadable AsympPDC package. This fact alone sets the DC/PDC methodologically apart. In today’s internet era of information no more than a google away, this omission is unforgivable. The third point concerns the two time series case of Example 2. Indeed here is a point that Faes et al correctly argue that S/P fail to grasp. GC was conceived by Granger [3] to decompose pairwise relationships into exposing factors that aid predictability. This was later shown to be equivalent to detecting and characterizing the presence of feedback by Sims [4]. In this case, GGC, DC and PDC address the so called connection ‘detection’ problem by focusing on the connection. If some given influence is present and if an influenced subsystem is affected, how it responds resonantly or otherwise is its own business. Do the valid Faes et al. criticisms imply the S/P paper is worthless? Despite its many additional shortcomings, the S/P paper has the important merit of stripping bare some of the field’s reigning confusion, enough to call for added discussion. This only stresses the relevance of the present criticisms and the urgent need for clarification. Finally I would like to put forward some thoughts that may explain the present state of conceptual disarray regarding causality. The first issue is ignorance about time series estimation — of the ‘a little knowledge is dangerous thing' kind. One cannot expect correct and reasonable results by just downloading some package, pressing some buttons without knowing the precise limitations of each available tool - time series analysis still has some elements of an art. The second problem has to do with the notions of Causality - it is tempting to ask the methodology to provide a glimpse on actual mechanisms. Sometimes, on physical grounds, this desire may be fulfilled, but true (mechanical) causality determination requires observer intervention [5] - [6]; what GC does is to allow the exclusion of some tentative mechanisms. Thirdly, even when it reflects actual mechanisms, but more than two time series are simultaneously examined, different descriptions, as is the case for the complementarity between DC and PDC, they reflect different system properties so that one must break the GC concept into more general ideas: Granger Connectivity and Granger Influentiability [5] - [6]. Last but not least, popular descriptions of large scale connectivity sometimes qualified as ‘effective’ versus ‘functional’ have further added to the present state of confusion due to their shear inconsistent application throughout the literature (se a discussion in [5] - [7]. I think we should thank Faes et al. for pointing out some of these problems. Perhaps this is a good opportunity to start clearing up these issues once and for all. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

14 in total

1. Partial directed coherence: a new concept in neural structure determination.

Authors: L A Baccalá; K Sameshima
Journal: Biol Cybern Date: 2001-06 Impact factor: 2.086

2. Unified asymptotic theory for all partial directed coherence forms.

Authors: L A Baccalá; C S N de Brito; D Y Takahashi; K Sameshima
Journal: Philos Trans A Math Phys Eng Sci Date: 2013-07-15 Impact factor: 4.226

3. Wiener-Granger causality: a well established methodology.

Authors: Steven L Bressler; Anil K Seth
Journal: Neuroimage Date: 2010-03-02 Impact factor: 6.556

4. Granger causality for state-space models.

Authors: Lionel Barnett; Anil K Seth
Journal: Phys Rev E Stat Nonlin Soft Matter Phys Date: 2015-04-23

5. Granger causality analysis in neuroscience and neuroimaging.

Authors: Anil K Seth; Adam B Barrett; Lionel Barnett
Journal: J Neurosci Date: 2015-02-25 Impact factor: 6.167

6. Measuring frequency domain granger causality for multiple blocks of interacting time series.

Authors: Luca Faes; Giandomenico Nollo
Journal: Biol Cybern Date: 2013-01-29 Impact factor: 2.086

7. Directed Transfer Function: Unified Asymptotic Theory and Some of Its Implications.

Authors: Luiz A Baccala; Daniel Y Takahashi; Koichi Sameshima
Journal: IEEE Trans Biomed Eng Date: 2016-04-04 Impact factor: 4.538

8. Effective connectivity: influence, causality and biophysical modeling.

Authors: Pedro A Valdes-Sosa; Alard Roebroeck; Jean Daunizeau; Karl Friston
Journal: Neuroimage Date: 2011-04-06 Impact factor: 6.556

9. Measuring connectivity in linear multivariate processes: definitions, interpretation, and practical analysis.

Authors: Luca Faes; Silvia Erla; Giandomenico Nollo
Journal: Comput Math Methods Med Date: 2012-05-14 Impact factor: 2.238

10. On the interpretability and computational reliability of frequency-domain Granger causality.

Authors: Luca Faes; Sebastiano Stramaglia; Daniele Marinazzo
Journal: F1000Res Date: 2017-09-20

10 in total

1. Estimation of Granger causality through Artificial Neural Networks: applications to physiological systems and chaotic electronic oscillators.

Authors: Yuri Antonacci; Ludovico Minati; Luca Faes; Riccardo Pernice; Giandomenico Nollo; Jlenia Toppi; Antonio Pietrabissa; Laura Astolfi
Journal: PeerJ Comput Sci Date: 2021-05-18

2. Estimation of Vector Autoregressive Parameters and Granger Causality From Noisy Multichannel Data.

Authors: Prashant Rangarajan; Rajesh P N Rao
Journal: IEEE Trans Biomed Eng Date: 2018-12-18 Impact factor: 4.538

3. Modulation of network-to-network connectivity via spike-timing-dependent noninvasive brain stimulation.

Authors: Emiliano Santarnecchi; Davide Momi; Giulia Sprugnoli; Francesco Neri; Alvaro Pascual-Leone; Alessandro Rossi; Simone Rossi
Journal: Hum Brain Mapp Date: 2018-08-16 Impact factor: 5.038

4. Information Transfer in Linear Multivariate Processes Assessed through Penalized Regression Techniques: Validation and Application to Physiological Networks.

Authors: Yuri Antonacci; Laura Astolfi; Giandomenico Nollo; Luca Faes
Journal: Entropy (Basel) Date: 2020-07-01 Impact factor: 2.524