| Literature DB >> 33265171 |
Oliver M Cliff1,2, Mikhail Prokopenko2, Robert Fitch1,3.
Abstract
The Kullback-Leibler (KL) divergence is a fundamental measure of information geometry that is used in a variety of contexts in artificial intelligence. We show that, when system dynamics are given by distributed nonlinear systems, this measure can be decomposed as a function of two information-theoretic measures, transfer entropy and stochastic interaction. More specifically, these measures are applicable when selecting a candidate model for a distributed system, where individual subsystems are coupled via latent variables and observed through a filter. We represent this model as a directed acyclic graph (DAG) that characterises the unidirectional coupling between subsystems. Standard approaches to structure learning are not applicable in this framework due to the hidden variables; however, we can exploit the properties of certain dynamical systems to formulate exact methods based on differential topology. We approach the problem by using reconstruction theorems to derive an analytical expression for the KL divergence of a candidate DAG from the observed dataset. Using this result, we present a scoring function based on transfer entropy to be used as a subroutine in a structure learning algorithm. We then demonstrate its use in recovering the structure of coupled Lorenz and Rössler systems.Entities:
Keywords: Kullback–Leibler divergence; complex networks; information theory; model selection; nonlinear systems; state space reconstruction; stochastic interaction; transfer entropy
Year: 2018 PMID: 33265171 PMCID: PMC7512642 DOI: 10.3390/e20020051
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Trajectory of a pair of coupled Lorenz systems. Top row: original state of the subsystems. Bottom row: time-series measurements of the subsystems. In each figure, the black lines represent an uncoupled simulation (), and teal lines illustrate a simulation where the first (leftmost) subsystem was coupled to the second (). (a) ; (b) .
Figure 2Representation of (a) the synchronous GDS with two vertices ( and ), and (b) the rolled-out DBN of the equivalent structure. Subsystems and are coupled by virtue of the edge .
Figure 3Distributions of the (a) TEA penalty function (28) and the (b) TEE penalty function (28). Both distributions were generated by observing the outcome of 1000 samples from two Gaussian variables with a correlation of . The figures illustrate: the distribution as a set of 100 sampled points (black dots); the area considered independent (grey regions); the measured transfer entropy (black line); and the difference between measurement and penalty term (dark grey region). Both tests use a value of (a p-value of ). The distribution in (a) was estimated by assuming variables were linearly-coupled Gaussians, and the distribution in (b) was computed via a kernal box method (computed by the Java Information Dynamics Toolkit (JIDT), see [52] for details).
Figure 4The network topologies used in this paper. The top row (a–d) are four arbitrary networks with three nodes () and the bottom row (e–h) are four arbitrary networks with four nodes ().
Figure 5Transfer entropy as a function of the parameters of a coupled Lorenz–Rössler system. These components are: coupling strength and embedding dimension in the top row (a–c); coupling strength and observation noise in the middle row (d–f); and observation noise and embedding dimension in the bottom row (g–i).
-scores for three-node () networks. We present the classification summary for the three arbitrary topologies of coupled Lorenz systems represented by Figure 4b–d (network has no edges and thus an undefined -score). The p-value of the TEE score is given in the top row of each table, with ∞ signifying using no significance testing, i.e., score (27).
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 5 K | 0.8 | 0.5 | 0.8 | 0.5 | 0.8 | 0.5 | 0.8 | 0.5 | |
| 25 K | 1 | 0.8 | 1 | 0.5 | 1 | 0.5 | 1 | 0.8 | |
| 100 K | 1 | 0.5 | 1 | 1 | 1 | 1 | 1 | 0.8 | |
| 5 K | 1 | 0.67 | 1 | 1 | 1 | 1 | 1 | 0.67 | |
| 25 K | 1 | 1 | 1 | 0.5 | 1 | 1 | 1 | 1 | |
| 100 K | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 5 K | 0.8 | - | 0.8 | 0.8 | 0.8 | 0.5 | 0.8 | - | |
| 25 K | 1 | 1 | 1 | 1 | 1 | 0.5 | 1 | 1 | |
| 100 K | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
-scores for four-node () networks. We present the classification summary for the three arbitrary topologies of coupled Lorenz systems represented by Figure 4f–h (network has no edges and thus an undefined -score). The p-value of the TEE score is given in the top row of each table, with ∞ signifying using no significance testing, i.e., score (27).
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 5 K | 0.57 | 0.5 | 0.57 | 0.29 | 0.57 | 0.29 | 0.57 | - | |
| 25 K | 0.75 | 0.33 | 0.75 | 0.33 | 0.75 | 0.29 | 0.75 | 0.33 | |
| 100 K | 1 | 0.33 | 1 | 0.57 | 1 | 0.4 | 1 | 0.33 | |
| 5 K | 1 | 0.25 | 1 | 0.29 | 0.75 | 0.25 | 0.75 | 0.57 | |
| 25 K | 1 | 0.5 | 1 | 0.86 | 1 | 0.86 | 1 | 0.5 | |
| 100 K | 1 | 0.86 | 1 | 0.86 | 1 | 0.86 | 1 | 0.86 | |
| 5 K | 1 | 0.25 | 1 | 0.57 | 1 | 0.75 | 1 | 0.25 | |
| 25 K | 1 | 0.86 | 1 | 0.86 | 1 | 0.86 | 1 | 0.86 | |
| 100 K | 1 | 0.86 | 1 | 0.86 | 1 | 0.57 | 1 | 0.86 | |
Classification results for three-node () networks for samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0.33 | 0.22 | 0.33 | 0.22 | 0.22 | 0.33 | 0.33 | 0.22 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | ||
| 0.14 | 0.14 | 0.14 | 0.14 | 0.14 | 0.14 | 0.14 | 0.14 | ||
| 0.67 | 0.5 | 0.67 | 0.5 | 0.67 | 0.5 | 0.67 | 0.5 | ||
| 0.8 | 0.5 | 0.8 | 0.5 | 0.8 | 0.5 | 0.8 | 0.5 | ||
| 1 | 0.5 | 1 | 1 | 1 | 1 | 1 | 0.5 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 1 | 0.67 | 1 | 1 | 1 | 1 | 1 | 0.67 | ||
| 1 | 0 | 1 | 1 | 1 | 0.5 | 1 | 0 | ||
| 0.14 | 0.43 | 0.14 | 0.14 | 0.14 | 0.14 | 0.14 | 0.43 | ||
| 0.67 | 0 | 0.67 | 0.67 | 0.67 | 0.5 | 0.67 | 0 | ||
| 0.8 | - | 0.8 | 0.8 | 0.8 | 0.5 | 0.8 | - | ||
Classification results for four-node () networks for samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0.31 | 0.25 | 0.31 | 0.19 | 0.31 | 0.25 | 0.31 | 0.19 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 0.67 | 0.67 | 0.67 | 0.33 | 0.67 | 0.33 | 0.67 | 0 | ||
| 0.15 | 0.23 | 0.15 | 0.23 | 0.15 | 0.23 | 0.15 | 0.31 | ||
| 0.5 | 0.4 | 0.5 | 0.25 | 0.5 | 0.25 | 0.5 | 0 | ||
| 0.57 | 0.5 | 0.57 | 0.29 | 0.57 | 0.29 | 0.57 | - | ||
| 1 | 0.25 | 1 | 0.25 | 0.75 | 0.25 | 0.75 | 0.5 | ||
| 0 | 0.25 | 0 | 0.17 | 0.083 | 0.25 | 0.083 | 0.083 | ||
| 1 | 0.25 | 1 | 0.33 | 0.75 | 0.25 | 0.75 | 0.67 | ||
| 1 | 0.25 | 1 | 0.29 | 0.75 | 0.25 | 0.75 | 0.57 | ||
| 1 | 0.25 | 1 | 0.5 | 1 | 0.75 | 1 | 0.25 | ||
| 0 | 0.25 | 0 | 0.083 | 0 | 0.083 | 0 | 0.25 | ||
| 1 | 0.25 | 1 | 0.67 | 1 | 0.75 | 1 | 0.25 | ||
| 1 | 0.25 | 1 | 0.57 | 1 | 0.75 | 1 | 0.25 | ||
Classification results for three-node () networks for 10,000 samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0.22 | 0.11 | 0.22 | 0.11 | 0.22 | 0.22 | 0.22 | 0.11 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | ||
| 0 | 0.14 | 0 | 0.14 | 0 | 0.14 | 0 | 0.14 | ||
| 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | ||
| 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | ||
| 1 | 0.5 | 1 | 1 | 1 | 0 | 1 | 0.5 | ||
| 0 | 0.14 | 0 | 0 | 0 | 0.29 | 0 | 0.14 | ||
| 1 | 0.5 | 1 | 1 | 1 | 0 | 1 | 0.5 | ||
| 1 | 0.5 | 1 | 1 | 1 | - | 1 | 0.5 | ||
| 1 | 1 | 1 | 0.5 | 1 | 0.5 | 1 | 1 | ||
| 0.14 | 0.14 | 0 | 0 | 0.14 | 0.14 | 0.14 | 0.14 | ||
| 0.67 | 0.67 | 1 | 1 | 0.67 | 0.5 | 0.67 | 0.67 | ||
| 0.8 | 0.8 | 1 | 0.67 | 0.8 | 0.5 | 0.8 | 0.8 | ||
Classification results for four-node () networks for 10,000 samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0.31 | 0.25 | 0.31 | 0.19 | 0.31 | 0.19 | 0.31 | 0.25 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 0.67 | 0.33 | 0.67 | 0 | 1 | 1 | 0.67 | 0.33 | ||
| 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | ||
| 0.5 | 0.33 | 0.5 | 0 | 0.6 | 0.6 | 0.5 | 0.33 | ||
| 0.57 | 0.33 | 0.57 | - | 0.75 | 0.75 | 0.57 | 0.33 | ||
| 0.75 | 0.5 | 1 | 0.5 | 1 | 0.25 | 0.75 | 0.5 | ||
| 0.083 | 0.083 | 0 | 0.083 | 0 | 0.17 | 0.083 | 0.083 | ||
| 0.75 | 0.67 | 1 | 0.67 | 1 | 0.33 | 0.75 | 0.67 | ||
| 0.75 | 0.57 | 1 | 0.57 | 1 | 0.29 | 0.75 | 0.57 | ||
| 1 | 0.25 | 1 | 0.25 | 1 | 0 | 1 | 0.25 | ||
| 0 | 0.17 | 0 | 0.17 | 0 | 0.25 | 0 | 0.17 | ||
| 1 | 0.33 | 1 | 0.33 | 1 | 0 | 1 | 0.33 | ||
| 1 | 0.29 | 1 | 0.29 | 1 | - | 1 | 0.29 | ||
Classification results for three-node () networks for 25,000 samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0.22 | 0.11 | 0.22 | 0.11 | 0.22 | 0.22 | 0.22 | 0.11 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 1 | 1 | 1 | 0.5 | 1 | 0.5 | 1 | 1 | ||
| 0 | 0.14 | 0 | 0.14 | 0 | 0.14 | 0 | 0.14 | ||
| 1 | 0.67 | 1 | 0.5 | 1 | 0.5 | 1 | 0.67 | ||
| 1 | 0.8 | 1 | 0.5 | 1 | 0.5 | 1 | 0.8 | ||
| 1 | 1 | 1 | 0.5 | 1 | 1 | 1 | 1 | ||
| 0 | 0 | 0 | 0.14 | 0 | 0 | 0 | 0 | ||
| 1 | 1 | 1 | 0.5 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 0.5 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | 0.5 | 1 | 1 | ||
| 0 | 0 | 0 | 0 | 0 | 0.14 | 0 | 0 | ||
| 1 | 1 | 1 | 1 | 1 | 0.5 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | 0.5 | 1 | 1 | ||
Classification results for four-node () networks for 25,000 samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0.31 | 0.19 | 0.31 | 0.19 | 0.31 | 0.19 | 0.31 | 0.19 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 1 | 0.33 | 1 | 0.33 | 1 | 0.33 | 1 | 0.33 | ||
| 0.15 | 0.15 | 0.15 | 0.15 | 0.15 | 0.23 | 0.15 | 0.15 | ||
| 0.6 | 0.33 | 0.6 | 0.33 | 0.6 | 0.25 | 0.6 | 0.33 | ||
| 0.75 | 0.33 | 0.75 | 0.33 | 0.75 | 0.29 | 0.75 | 0.33 | ||
| 1 | 0.5 | 1 | 0.75 | 1 | 0.75 | 1 | 0.5 | ||
| 0 | 0.17 | 0 | 0 | 0 | 0 | 0 | 0.17 | ||
| 1 | 0.5 | 1 | 1 | 1 | 1 | 1 | 0.5 | ||
| 1 | 0.5 | 1 | 0.86 | 1 | 0.86 | 1 | 0.5 | ||
| 1 | 0.75 | 1 | 0.75 | 1 | 0.75 | 1 | 0.75 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 1 | 0.86 | 1 | 0.86 | 1 | 0.86 | 1 | 0.86 | ||
Classification results for three-node () networks with 50,000 samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0 | 0.11 | 0 | 0 | 0 | 0.11 | 0 | 0.22 | ||
| - | 0 | - | - | - | 0 | - | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | ||
| 0 | 0.14 | 0 | 0.14 | 0 | 0.14 | 0 | 0.14 | ||
| 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | ||
| 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | 1 | 0.5 | ||
| 1 | 1 | 1 | 0.5 | 1 | 1 | 1 | 1 | ||
| 0 | 0.14 | 0 | 0.14 | 0 | 0.14 | 0 | 0 | ||
| 1 | 0.67 | 1 | 0.5 | 1 | 0.67 | 1 | 1 | ||
| 1 | 0.8 | 1 | 0.5 | 1 | 0.8 | 1 | 1 | ||
| 1 | 0.5 | 1 | 1 | 1 | 0.5 | 1 | 1 | ||
| 0 | 0.14 | 0 | 0 | 0 | 0.14 | 0 | 0 | ||
| 1 | 0.5 | 1 | 1 | 1 | 0.5 | 1 | 1 | ||
| 1 | 0.5 | 1 | 1 | 1 | 0.5 | 1 | 1 | ||
Classification results for four-node () networks with 50,000 samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0.19 | 0.062 | 0.19 | 0.19 | 0.19 | 0.12 | 0.19 | 0.12 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 1 | 0.33 | 1 | 0 | 1 | 0.33 | 1 | 0.33 | ||
| 0 | 0.15 | 0 | 0 | 0 | 0.23 | 0.15 | 0.15 | ||
| 1 | 0.33 | 1 | - | 1 | 0.25 | 0.6 | 0.33 | ||
| 1 | 0.33 | 1 | - | 1 | 0.29 | 0.75 | 0.33 | ||
| 1 | 0.75 | 1 | 0.5 | 1 | 0.5 | 1 | 0.75 | ||
| 0 | 0 | 0 | 0.17 | 0 | 0.083 | 0 | 0 | ||
| 1 | 1 | 1 | 0.5 | 1 | 0.67 | 1 | 1 | ||
| 1 | 0.86 | 1 | 0.5 | 1 | 0.57 | 1 | 0.86 | ||
| 1 | 0.75 | 1 | 0.75 | 1 | 0.75 | 1 | 0.75 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 1 | 0.86 | 1 | 0.86 | 1 | 0.86 | 1 | 0.86 | ||
Classification results for three-node () networks with 100,000 samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0 | 0.22 | 0 | 0.11 | 0 | 0.22 | 0 | 0.11 | ||
| - | 0 | - | 0 | - | 0 | - | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 1 | 0.5 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 0 | 0.14 | 0 | 0 | 0 | 0 | 0 | 0.14 | ||
| 1 | 0.5 | 1 | 1 | 1 | 1 | 1 | 0.67 | ||
| 1 | 0.5 | 1 | 1 | 1 | 1 | 1 | 0.8 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
Classification results for four-node () networks with 100,000 samples. We present the precision (P), recall (R), fallout (F), and -score for the eight arbitrary topologies of coupled Lorenz systems represented by Figure 4.
| Graph | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 10 | 1 | 10 | 1 | 10 | 1 | 10 | ||
| - | - | - | - | - | - | - | - | ||
| 0.19 | 0.062 | 0.19 | 0.062 | 0.19 | 0.19 | 0.19 | 0.12 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| - | - | - | - | - | - | - | - | ||
| 1 | 0.33 | 1 | 0.67 | 1 | 0.33 | 1 | 0.33 | ||
| 0 | 0.15 | 0 | 0.15 | 0 | 0.077 | 0 | 0.15 | ||
| 1 | 0.33 | 1 | 0.5 | 1 | 0.5 | 1 | 0.33 | ||
| 1 | 0.33 | 1 | 0.57 | 1 | 0.4 | 1 | 0.33 | ||
| 1 | - | 1 | - | 1 | - | 1 | - | ||
| 0 | - | 0 | - | 0 | - | 0 | - | ||
| 1 | - | 1 | - | 1 | - | 1 | - | ||
| 1 | - | 1 | - | 1 | - | 1 | - | ||
| 1 | 0.75 | 1 | 0.75 | 1 | 0.5 | 1 | 0.75 | ||
| 0 | 0 | 0 | 0 | 0 | 0.083 | 0 | 0 | ||
| 1 | 1 | 1 | 1 | 1 | 0.67 | 1 | 1 | ||
| 1 | 0.86 | 1 | 0.86 | 1 | 0.57 | 1 | 0.86 | ||