| Literature DB >> 29238300 |
Danilo Benozzo1,2, Emanuele Olivetti1,3, Paolo Avesani1,3.
Abstract
Brain effective connectivity aims to detect causal interactions between distinct brain units and it is typically studied through the analysis of direct measurements of the neural activity, e.g., magneto/electroencephalography (M/EEG) signals. The literature on methods for causal inference is vast. It includes model-based methods in which a generative model of the data is assumed and model-free methods that directly infer causality from the probability distribution of the underlying stochastic process. Here, we firstly focus on the model-based methods developed from the Granger criterion of causality, which assumes the autoregressive model of the data. Secondly, we introduce a new perspective, that looks at the problem in a way that is typical of the machine learning literature. Then, we formulate the problem of causality detection as a supervised learning task, by proposing a classification-based approach. A classifier is trained to identify causal interactions between time series for the chosen model and by means of a proposed feature space. In this paper, we are interested in comparing this classification-based approach with the standard Geweke measure of causality in the time domain, through simulation study. Thus, we customized our approach to the case of a MAR model and designed a feature space which contains causality measures based on the idea of precedence and predictability in time. Two variations of the supervised method are proposed and compared to a standard Granger causal analysis method. The results of the simulations show that the supervised method outperforms the standard approach, in particular it is more robust to noise. As evidence of the efficacy of the proposed method, we report the details of our submission to the causality detection competition of Biomag2014, where the proposed method reached the 2nd place. Moreover, as empirical application, we applied the supervised approach on a dataset of neural recordings of rats obtaining an important reduction in the false positive rate.Entities:
Keywords: Geweke measure in time; Granger causality; brain effective connectivity; causal inference; causal interaction classification; machine learning
Year: 2017 PMID: 29238300 PMCID: PMC5712990 DOI: 10.3389/fninf.2017.00068
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 4.081
Figure 1Given a criterion of causality, the estimation of causality structure can be implemented in three different ways: the standard non-parametric approach (top), the parametric one (mid), and the proposed parametric supervised one (bottom).
For each effect x(t) and M = 3, we report the seven possible causality scenarios.
| 1 | ||
| 2 | ||
| 3 | ||
| 4 | ||
| 5 | ||
| 6 | ||
| 7 |
Description of how the 21 linear regression problems are defined for each trial.
, i = 0, 1, 2 and t ∈ .
Figure 2Example of how the sample associated at the time point t = 30 is built in order to form the input of the last regression problem in Table 2, for the case i = 2 and p = 10.
AUC values of GCA, CBC, and MBC on the two datasets LMAR and L.
| 1 | 1 | 1 | |
| 0.72 | 0.92 | 0.91 |
The standard deviation is lower than 0.0015 in all cases.
AUC values of CBC with the complete and reduced feature spaces, on LMAR and L.
| 1 | 1 | 1 | |
| 0.92 | 0.91 | 0.90 |
The standard deviation is lower than 0.0015 in all cases.
Figure 3ROC curves estimated on the results of the three analyzed causal inference methods: Granger Causality Analysis (GCA), Cell-based Classification (CBC), and Matrix-based Classification (MBC).
Confusion matrices of GCA, CBC, and MBC on the Causal2014 dataset, taking into account for the bias for reducing the false-positives.
| 1 | 0 | 1 | 0 | 1 | 0 | ||||||
| True | 1 | 99.6% | 0.4% | True | 1 | 59.4% | 40.6% | True | 1 | 57.8% | 42.2% |
| 0 | 80.1% | 19.9% | 0 | 2.8% | 97.2% | 0 | 2.2% | 97.8% | |||
The values are conditional probabilities given the true class, i.e., each row sums up to 1.
AUC computed by applying CBC to the empirical dataset with different sampling frequencies and time window widths.
| 600 Hz | 0.80 | 0.82 | 0.82 | 0.83 | 0.82 |
| 800 Hz | 0.82 | 0.82 | 0.82 | 0.73 | 0.62 |
| 1 kHz | 0.82 | 0.82 | 0.75 | 0.61 | 0.64 |
The standard deviation is lower than 0.009 in all cases.
Figure 4ROC curves estimated on the results of CBC when applied on three different feature spaces: the complete one in contrast with the pw and c-pw ones. The ROC curve of GCA is shown as benchmark.