Literature DB >> 27626089

Metric projection for dynamic multiplex networks.

Abstract

Evolving multiplex networks are a powerful model for representing the dynamics along time of different phenomena, such as social networks, power grids, biological pathways. However, exploring the structure of the multiplex network time series is still an open problem. Here we propose a two-step strategy to tackle this problem based on the concept of distance (metric) between networks. Given a multiplex graph, first a network of networks is built for each time step, and then a real valued time series is obtained by the sequence of (simple) networks by evaluating the distance from the first element of the series. The effectiveness of this approach in detecting the occurring changes along the original time series is shown on a synthetic example first, and then on the Gulf dataset of political events.

Entities: Chemical Disease Gene Species

Keywords: Applied mathematics; Computational mathematics; Computer science; Information science

Year: 2016 PMID： 27626089 PMCID： PMC5011148 DOI： 10.1016/j.heliyon.2016.e00136

Source DB: PubMed Journal: Heliyon ISSN： 2405-8440

Introduction

When the links connecting a set of N nodes arise from k different sources, a possible representation for the corresponding graph is the construction of k networks on the same N nodes, one for each source. The resulting structure is known as a multiplex network, and each of the composing graphs is called a layer. Multiplex networks are quite effective in representing many different real-world situations [1], [2], [3], and their structure helps extracting crucial information about the complex systems under investigation that would instead remain hidden when analyzing individual layers separately [4], [5], [6]; furthermore, their relation with time series analysis techniques has recently gained interest in the literature [7]. A key property to be highlighted is the correlated multiplexity, as stated in [8]: in real-world systems, the relation between layers is not at all random; in fact, in many cases, the layers are mutually correlated. Moreover, the communities induced on different layers tend to overlap across layers, thus generating interesting mesoscale structures. These observations guided the authors of [9] in defining a network having the layers of the original multiplex graph as nodes, and using information theory to define a similarity measure between the layers themselves, so to investigate the mesoscopic modularity of the multiplex network. Here we propose to pursue a similar strategy for defining a network of networks derived from a multiplex graph, although in a different context and with a different aim. In particular, we project a time series of multiplex networks into a series of simple networks to be used in the analysis of the dynamics of the original multiplex series. The projection map defining the similarity measure between layers is induced by the Hamming–Ipsen–Mikhailov (HIM) network distance [10], a glocal metric combining the Hamming and the Ipsen–Mikhailov distances, used in different scientific areas [11], [12], [13], [14], [15], [16]. The main goal in using this representation is the analysis of the dynamics of the original time series through the investigation of the trend of the projected evolving networks, by extracting the corresponding real-valued time series obtained computing the HIM distance between any element in the series and the first one. For instance, we show on a synthetic example that this strategy is more informative than considering statistics of the time series for each layer of the multiplex networks, or than studying the networks derived collapsing all layers into one including all links, as in [17], [18] when the aim is detecting the timesteps where more relevant changes occur and the system is undergoing a state transition (tipping point) or it is approaching it (early warning signals). This is a classical problem in time series analysis, and very diverse solutions have appeared in literature (see [19] for a recent example). Here we use two different evaluating strategies, the former based on the fluctuations of mean and variance [20] (implemented in the R package changepoint https://cran.r-project.org/web/packages/changepoint/index.html), and the latter involving the study of increment entropy indicator [21]. We conclude with the analysis of the well known Gulf Dataset (part of the Penn State Event Data) concerning the 304.401 political events (of 66 different categories) occurring between 202 countries in the 10 years between 15 April 1979 to 31 March 1991, focusing on the situation in the Gulf region and the Arabian peninsula. A major task in the analysis of the Gulf dataset is the assessment of the translation of the geopolitical events into fluctuations of measurable indicators. A similar network-based mining of sociopolitical relations, but with a probabilistic approach, can be found in [22], [23], [24]. Here we show the effectiveness of the newly introduced methodology in associating relevant political events and periods to characteristic behaviors in the dynamics of the time series of the induced networks of networks, together with a simple overview of the corresponding mesoscale modular structure.

Background

The Hamming–Ipsen–Mikhailov (HIM) metric [10], [25] is a distance function quantifying in the real interval the difference between two networks on shared nodes. The HIM metric linearly combines an edit distance, the Hamming (H) [26], [27], [28] and a spectral distance, the Ipsen–Mikhailov (IM) [29]. Edit distances are local metrics, functions of insertion and deletion of matching links, while spectral measures are global distances, functions of the network spectrum. Local functions disregards the overall network structure, while spectral measures cannot distinguish isospectral graphs. As its characterizing feature, HIM is a glocal distance that overcomes the drawbacks of local and global metrics when separately considered. Furthermore, its definition can be naturally extended to directed networks. Hereafter we give a brief description of the H, IM and HIM distances, graphically summarized in Figure 1.

Figure 1

Summary of the definitions of the HIM distance and its Hamming (H) and Ipsen–Mikhailov (IM) components.

Notations. Let and be two simple networks on N nodes, whose adjacency matrices are and , with , where for unweighted graphs and for weighted networks. Let then be the identity matrix , let be the unitary matrix with all entries equal to one and let be the null matrix with all entries equal to zero. Denote then by the empty network with N nodes and no links (with adjacency matrix ) and by the clique (undirected simple full network) with N nodes and all possible links, whose adjacency matrix is . Finally, the Laplacian matrix L of an undirected network is defined as the difference between the degree matrix D and the adjacency matrix A, where D is the diagonal matrix of vertex degrees. L is positive semidefinite and singular, with eigenvalues .

The Hamming distance

The Hamming distance, one of the most common dissimilarity measures in coding and string theory and recently used also for network comparison, evaluates the presence/absence of matching links on the two compared networks. In terms of adjacency matrices, the expression for the normalized Hamming metric H reads as where the normalization factor bounds the range of the function H in the interval . The lower bound 0 is attained only for identical networks , the upper limit 1 for complementary networks . When and are unweighted networks, is just the fraction of different matching links over the total number of possible links between the two graphs.

The Ipsen–Mikhailov distance

The Ipsen–Mikhailov IM metric stems from the realization of an N nodes network as an N molecules system connected by identical elastic springs, according to the adjacency matrix A. The dynamics of the spring-mass system can be described by the set of N differential equations The vibrational frequencies of are given by , while the spectral density for a graph in terms of the sum of Lorentz distributions is defined as where γ is the common width and K is the normalization constant defined by the condition , and thus The scale parameter γ specifies the half-width at half-maximum, which is equal to half the interquartile range. Then the spectral distance between two graphs and on N nodes with densities and can be defined as Since for each N, denoting by the unique solution of , the normalized Ipsen–Mikhailov distance between two undirected networks can be defined as so that IM is bounded between 0 and 1, with upper bound attained only for .

The Hamming–Ipsen–Mikhailov distance

Consider now the cartesian product Z of two metric spaces and , where is the set of all simple undirected networks on N nodes endowed either with the Hamming metric H or with the Ipsen–Mikhailov distance IM. Define then on Z the one-parameter Hamming–Ipsen–Mikhailov distance HIM as the (Euclidean) product metric of H and IM, normalized by the factor , for : where in what follows we will omit the subscript ξ when it is equal to one. Note that, apart from extreme values, the qualitative impact of ξ is minimal in practice, and in what follows will always be assumed. The metric is bounded in the interval , with the lower bound attained for every couple of identical networks, and the upper one attained only on the pair . Moreover, all distances will be nonzero for non-identical isomorphic/isospectral graphs.

A minimal example

In Figure 2 we show, in the space, the graphical representation in circular layout and the mutual HIM distances between four undirected simple networks on six shared nodes, namely the ring network (A), the star network (B), a regular network with degree three (C) and a regular lattice (D). HIM distances range from 0.217 for the pair (C,D), which are the mutually closest networks, to 0.495 for (B,C) which are the farthest graphs. In all cases, the Hamming distance is contributing to the HIM metric more than the Ipsen–Mikhailov component, indicating that the presence or absence of matching links is has a larger impact than the overall structure. Note for instance that networks A and B have the same Hamming distance as A and D, but the spectral structure of the lattice D is closer to the structure of the ring network A than the star network B, as quantitatively shown by the different IM distance; in particular, the spectral structures of A and D are the closest, with IM distance even smaller of the IM distance between C and D. An analogous situation occurs for the pairs B,C and B,D, sharing the same H distance but with a different IM distance.

Figure 2

Graphical representation in circular layout (a), scatterplot (b) and tabular (c) representation of the HIM distance in the Ipsen–Mikhailov (IM axis) and Hamming (H axis) distance space between ring network (A), the star network (B), a regular network with degree three (C) and a 3 × 2 regular lattice (D).

Theory

Let be a sequence (time series) of τ multiplex networks with λ layers sharing ν nodes , as displayed in Figure 3.

Figure 3

Graphical representation of a sequence of τ multiplex networks with λ layers.

The metric projection. Construct now the metric projection of as the full undirected weighted network with λ nodes where the weight of the edge connecting vertices and is defined by the HIM similarity between layers and : thus, if is the adjacency matrix of , then In Figure 4 an example of the construction of is shown for a multiplex network with five layers.

Figure 4

Construction of the metric projection at a given time point t = i for a multiplex network with λ = 5 layers; the metric projection is a new network with one node for each layer of the original net, and the edge weight is given by the complement of the HIM distance between the corresponding layers.

The collapsed projection. Moreover, if is the adjacency matrix of , define the collapsed projection of on nodes as the network where a link exists between and if it exists in at least one layer (for binary layers); in case of weighted layers, the weight of the link is the average of the weights across all layers. Thus, if is the adjacency matrix of , then In Figure 5 we show a graphical sketch of the collapsing of a multiplex network with five layers.

Figure 5

Construction of the collapsed projection at a given time point t = i for a multiplex network with λ = 5 layers; the collapsed projection is a new network sharing the same nodes of the original multiplex net, where a link exists in the projection if the same link appears in at least one of the layers of the multiplex network, as if all the layers were collapsed into a single one.

Caveat: consider a sequence of binary multiplex networks such that, for each of the possible links and for each timestep, there exists at least one layer including this link. Then the collapsed projection, at each time step, is the full graph on ν nodes, and, as such, it has no temporal dynamics, regardless of the evolution of each single layer. The distance series. To investigate the dynamics of for , we construct a suite of associated time series by means of three different procedures, all involving the HIM distance between each network in a given sequence and the first element of the sequence itself. The first group D1 of distance series is obtained by evaluating the dynamics of each layer considered separately: In Figure 6 we show the construction of the distance series D1 for the first layer of the multiplex network in Figure 3.

Figure 6

Construction of the distance series D1 for the first layer of the sequence of multiplex network in Figure 3. The value of the time series at time point t = i is the HIM distance between the layer L1 at time t = i and at time t = 1.

The second series, D2, collects the metric dynamics of the collapsed projection : An example of construction of D2 for the five layers multiplex network of Figure 5 is shown in Figure 7.

Figure 7

Construction of the distance series D2 for the sequence of collapsed networks in Figure 5. The value of the time series at time point t = i is the HIM distance between at time t = i and at time t = 1.

Finally, the last series D3 collects the metric dynamics of the metric projection , and the corresponding example for the multiplex networks in Figure 4 is shown in Figure 8:

Figure 8

Construction of the distance series D3 for the sequence of metric projections in Figure 4. The value of the time series at time point t = i is the HIM distance between at time t = i and at time t = 1.

Dynamics indicators. The dynamics of the time series D⁎ is quantitatively analyzed by means of a set of indicators, assessing the series' information content and detecting occurring tipping points. The first indicator is the Increment Entropy (IncEnt), introduced in [21] as a measure the complexity of time series in terms of its unpredictability [30]. The starting point is increment series of a time series as an informative encoder of the characteristics of dynamic changes hidden in a signal. In practice, the increments are grouped in vectors of size m, and each increment is mapped into a two-letters word, with a sign and its size coded in this word according to a resolution parameter R. Finally, the IncEnt is computed as the Shannon entropy of these words: the larger is the IncEnt value, the less predictable is the series. The second indicator meanvar belongs to the family of the changepoint detection indicators as implemented in R by the changepoint package [20]. In general, changepoint detection algorithms are the solutions to the problem of estimating the points in a time series where the statistical properties change. The subset of the meanvar functions search for changes in both the mean and the variance, and a number of alternative are known in literature [31], [32], [33], [34], [35], [36]. In particular, in what follows we will employ the recent Pruned Exact Linear Time (PELT) algorithm [37], based on the classical segment neighborhood technique minimizing the combination of a cost function (for instance, twice the negative log-likelihood) with a linear penalty function through dynamic programming. Finally, we will use Changepoints for a Range of PenaltieS (CROPS) [38] to obtain optimal changepoint segmentations of data sequences for all penalty values across a continuous range.

Results & discussion

A synthetic example

Consider now a sequence of binary multiplex networks with , and , generated as follows. Define the perturbation function taking as entries a binary simple network N on n nodes, and a couple of real values with , and returning a network obtained from N by swapping the status (present/not present) of links, where g is a random value in the interval . Further, define the default transition as the pair , a small transition as , a medium transition as and, finally, a large transition as . Moreover, let R be an Erdós–Rényi random model and define 4 special timepoints: the initial time step , the first spike , the second spike and the third spike . Then, each layer at a given time step t is defined through the following rule: In Figure 9 we show the evolution along the 30 timepoints of the 5 curves for , its average and , . To assess the information content of each curve we use the Increment Entropy indicator IncEnt, whose value increases with the series' complexity: the IncEnt values are reported in Table 1.

Figure 9

D1, D2, D3 for a synthetic example on 5 layers and 30 timepoints; in the right column, third row, we plot .

Table 1

Increment Entropy values for the distance sequences of the synthetic example, with parameters m = 2, R = 2.

Dist.	IncEnt	Dist.	IncEnt
D₁(L₁)	2.52	D₁(L₂)	2.78
D₁(L₃)	2.59	D₁(L₄)	2.83
D₁(L₅)	2.44	D‾1	2.27
D₂	1.82	D₃	3.04

Among the evolving layers, and have the largest IncEnt, while the other three layers show a lower level of complexity. As expected, the average and the collapsed network distance has very low IncEnt value, yielding that both averaging the distances and collapsing the layers lose information about the overall dynamics. Finally, distance is the metric which better detects the network evolution along time, conserving most of the information. This is also supported by the meanvar indicator with CROPS range with the PELT algorithm: in fact, the meanvar indicator detects correctly in the three points , and as changepoints, while in , other than the , meanvar detects and =28 which are unrelated to the designed dynamics.

The Gulf Dataset

Data description. Part of the Penn State Event Data http://eventdata.psu.edu/ (formerly Kansas Event Data System), available at http://vlado.fmf.uni-lj.si/pub/networks/data/KEDS/, the Gulf Dataset collects, on a monthly bases, political events between pairs of countries focusing on the Gulf region and the Arabian peninsula for the period 15 April 1979 to 31 March 1999, for a total of 240 months. The 304401 political events involve 202 countries and they belong to 66 classes (including for instance “pessimist comment”, “meet”, “formal protest”, “military engagement”, etc.) as coded by the World Event/Interaction Survey (WEIS) Project [39], [40], [41] http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/5211, whose full list is reported in Table 2, 3.

Table 2

Part 1 of the full table of WEIS codes [39], with the 66 layers considered in the Gulf dataset case study; entries with no layer number were not monitored in the Gulf dataset events collection.

Layer#	WEIS code	WEIS cat	Description
1	011	Yield	Surrender, yield or order, submit to arrest, etc.
2	012	Yield	Yield position, retreat; evacuate
3	013	Yield	Admit wrongdoing; retract statement
	014	Yield	Accommodate, Cease-fire
4	015	Yield	Cede Power
5	021	Comment	Explicit decline to comment
6	022	Comment	Comment on situation – pessimistic
7	023	Comment	Comment on situation – neutral
8	024	Comment	Comment on situation – optimistic
9	025	Comment	Explain policy or future position
	026	Comment	Appoint or Elect
	027	Comment	Alter Rules
10	031	Consult	Meet with at neutral site, or send note.
11	032	Consult	Consult & Visit; go to
12	033	Consult	Receive visit; host
	034	Consult	Vote, Elect
13	041	Approve	Praise, hail, applaud, condole
14	042	Approve	Endorse other's policy or position; give verbal support
	043	Approve	Rally
15	051	Promise	Promise own policy support
16	052	Promise	Promise material support
17	053	Promise	Promise other future support action
18	054	Promise	Assure; reassure
	055	Promise	Promise Rights
19	061	Grant	Express regret; apologize
20	062	Grant	Give state invitation
21	063	Grant	Grant asylum
22	064	Grant	Grant privilege, diplomatic recognition
23	065	Grant	Suspend negative sanctions; truce
24	066	Grant	Release and/or return persons or property
	067	Grant	Grant Position
25	070	Reward	Reward
26	071	Reward	Extend economic aid (as gift and/or loan)
27	072	Reward	Extend military assistance
28	073	Reward	Give other assistance
29	081	Agree	Make substantive agreement
30	082	Agree	Agree to future action or procedure; agree to meet, to negotiate
	083	Agree	Ally
	084	Agree	Merge; Integrate
31	091	Request	Ask for information
32	092	Request	Ask for policy assistance
33	093	Request	Ask for material assistance
34	094	Request	Request action; call for
35	095	Request	Entreat; plead; appeal to
	096	Request	Request policy change
	097	Request	Request rights

Table 3

Part 2 of the full table of WEIS codes [39], with the 66 layers considered in the Gulf dataset case study; entries with no layer number were not monitored in the Gulf dataset events collection.

Layer#	WEIS code	WEIS cat	Description
36	101	Propose	Offer proposal
37	102	Propose	Urge or suggest action or policy
38	111	Reject	Turn down proposal; reject protest demand, threat, etc
39	112	Reject	Refuse; oppose; refuse to allow
	113	Reject	Defy law
40	121	Accuse	Charge; criticize; blame; disapprove
41	122	Accuse	Denounce; denigrate; abuse
	123	Accuse	Investigate
42	131	Protest	Make complaint (not formal)
43	132	Protest	Make formal complaint or protest
	133	Protest	Symbolic act
44	141	Deny	Deny an accusation
45	142	Deny	Deny an attributed policy, action role or position
46	150	Demand	Issue order or command; insist; demand compliance; etc
	151	Demand	Issue Command
	152	Demand	Claim Rights
47	160	Warn	Give warning
48	161	Warn	Warn of policies
	162	Warn	Warn of problem
49	171	Threaten	Threat without specific negative sanctions
50	172	Threaten	Threat with specific non-military negative sanctions
51	173	Threaten	Threat with force specified
52	174	Threaten	Ultimatum; threat with negative sanctions and time limit specified
53	181	Demonstrate	Non-military demonstration; to walk out on
54	182	Demonstrate	Armed force mobilization
55	191	Reduce Relations1	Cancel or postpone planned event
56	192	Reduce Relations1	Reduce routine international activity; recall officials; etc
57	193	Reduce Relations1	Reduce or halt aid
58	194	Reduce Relations1	Halt negotiations
59	195	Reduce Relations1	Break diplomatic relations
	196	Reduce Relations1	Strike
	197	Reduce Relations1	Censor
60	201	Expel	Order personnel out of country
61	202	Expel	Expel organization or group
	203	Expel	Ban Organization
62	211	Seize	Seize position or possessions
63	212	Seize	Detain or arrest person(s)
	213	Seize	Hijack; Kidnap
64	221	Force	Non-injury obstructive act
65	222	Force	Non-military injury-destruction
66	223	Force	Military engagement

As negative sanctions.

In the notation of Sec. 3, the Gulf Dataset translates into a time series of multiplex networks with unweighted and undirected layers sharing nodes. The landmark event for the considered zone in the 20 years data range of interest is definitely the First Gulf War (FGW), occurring between August 1990 and March 1999. However, other (smaller) events located in the area had a relevant impact on world politics and diplomatic relations. Among them, the Iraq Disarmament Crisis (IDC) in February 1998 significantly emerges from the data, as shown in what follows. During that month, Iraq President Saddam Hussein negotiated a deal with U.N. Secretary General Kofi Annan, allowing weapons inspectors to return to Baghdad, preventing military action by the United States and Britain.

Network statistics

Consider in this section the set of 304401 edges connecting the 202 nodes independently of their class. In Table 4 we list the top-10 countries/institutions participating in the largest number of edges across different time spans, together with the absolute number of shared edges and the corresponding percentage over the total number of edges for the period. In general, USA, Iraq and Iran are the major players, with different proportions according to the specific period: in particular, Iraq is the main character in both the major events, FGW and IDC. Other key actors are Israel, the United Nations and the Saudi Arabia, with a relevant presence in each key event in the area. Note that, overall, the top 20 institutions (also including, other than those listed in the table, the Arab world, France, Syria, Egypt, Russia, Turkey, Jordan, Libya, Germany and the Kurd world) are responsible for 82.57% of all edges.

Table 4

Top-10 countries/institutions ranked by number of shared links, absolute and in percentage over (twice) the total number of links in the considered period. The Iran–Iraq War started in September 1980 and ended in August 1988. SA: Saudi Arabia; UN: United Nations.

Apr79–Mar99			FGW			IDC
Edges 304401			Edges 41181			Edges 7712
Inst.	Degree	%	Inst.	Degree	%	Inst.	Degree	%
USA	93900	15.42	Iraq	18691	22.69	Iraq	3830	24.83
Iraq	84974	13.96	USA	15584	18.92	USA	2876	18.65
Iran	61782	10.15	Kuwait	5245	6.37	UN	1946	12.62
Israel	32204	5.29	SA	3548	4.31	Russia	896	5.81
UN	30097	4.94	Israel	3420	4.15	UK	715	4.64
SA	20503	3.37	UN	3363	4.08	France	651	4.22
Lebanon	19130	3.14	UK	2997	3.64	Iran	468	3.03
Palestine	18607	3.06	Iran	2104	2.55	Arab world	321	2.08
UK	18415	3.02	France	2076	2.52	China	309	2.00
Kuwait	17405	2.86	Arab world	2053	2.49	Kuwait	306	1.98

Out of all potential unique edges, only 4394 are represented in the Gulf Dataset. In Table 5 we list the top-10 links ranked by occurrence, together with the number of occurrences itself and the corresponding percentage over the total number of edges for the period. As it happens for the nodes, there are a few key links throughout the whole timespan which are consistently present in most of the important events, with different proportions. However, in some of the events, there is an interesting wide gap in the number of occurrences between the very top edges and the remaining ones, e.g., Iraq–USA in FGW (and post) and IDC, and Iran–Iraq during the corresponding war and in the pre-FGW, yielding that these are the links mainly driving the whole network evolution.

Table 5

Apr79–Mar99			FGW			IDC
Edges 304401			Edges 41181			Edges 7712
Edge	Degree	%	Edge	Degree	%	Edge	Degree	%
Iran–Iraq	19121	6.28	Iraq–USA	6061	14.72	Iraq–USA	1021	13.24
Iraq–USA	19002	6.24	Iraq–Kuwait	2306	5.60	Iraq–UN	927	12.02
Iran–USA	14051	4.62	SA–USA	1169	2.84	UN–USA	337	4.37
Iraq–UN	12775	4.20	Iraq–UN	1118	2.71	Iraq–Russia	315	4.08
Israel–Lebanon	6590	2.16	Kuwait–USA	1050	2.55	Iraq–UK	241	3.12
Israel–USA	5803	1.91	Iraq–UK	1012	2.46	France–Iraq	191	2.48
Iraq–Kuwait	5187	1.70	Iran–Iraq	989	2.40	UK–USA	184	2.39
SA–USA	4468	1.47	Israel–USA	935	2.27	France–UN	171	2.22
Israel–Palestina	4466	1.47	Iraq–Israel	851	2.07	Russia–USA	170	2.20
UN–USA	4209	1.38	Iraq–SA	796	1.93	Iraq–Turkey	136	1.76

In Figure 10 we display the dynamics of the occurrence along time of the top edges, showing their different trends during the diverse events. It is interesting to note how two top links, Iran–Iraq and Iran–USA are preponderant from 1979 to 1989, i.e., throughout the whole Iran–Iraq War, while they go decaying quickly afterwards, with a minor spike for FGW. Complementarily, two other major links Iraq–USA and Iraq–United Nations have the opposite trend, remaining almost uninfluential until FGW and growing later on, with a noticeable spike for IDC; moreover, Iraq–United Nations does not show any trend change for FGW, while Iraq–USA does. The Iraq–Kuwait link has a very limited dynamics, with the unique important spike for FGW. Very similar are also the Saudi Arabia–USA and the Israel–USA links, showing an additional lower spike in correspondence of the raise of the terroristic actions between 1995–1996. This last event is crucial in the Israel–Lebanon relations, where it has the largest effect; FGW, instead, has almost no impact here.

Figure 10

Occurrences along time of the top-8 most frequent links. The blue area marks FGW, while the red dashed line indicates IDC in February 98.

D⁎ indicators analysis. The two main events FGW and IDC generate sudden changes in the D1 time series for most of the layers: an example is given in Figure 11 for the layer 37, corresponding to WEIS code 102 (“Urge or suggest action or policy”), where we highlight FGW by a blue background, and IDC by a red dashed line. The complete panel of the curves for all the 66 layers is shown in Figure 12,13,14: most of the layers show a decise change in trend in correspondence of the two main events, although some of the layers display a different behavior (e.g., layer 59, “Break diplomatic relations”), sometimes due to the paucity of data (e.g., “Halt negotiations” or “Reward”). Note that many other spikes exist in many layers, corresponding to different geopolitical events occurring throughout the considered timespan.

Figure 11

D1 time series for the layer 37, corresponding to WEIS code 102 (“Urge or suggest action or policy”). The period corresponding to FGW is marked by the blue background, while the red dashed line indicates IDC in February 1998.

Figure 12

Curves of indicator D1 for the 24 layers L(t), for i = 1,…,24: the blue area marks FGW, while the red dashed line indicates IDC in February 98. For each curve, the corresponding World Event/Interaction Survey category is indicated in the top left corner.

Figure 13

Curves of indicator D1 for the 21 layers L(t), for i = 25,…,45: the blue area marks FGW, while the red dashed line indicates IDC in February 98. For each curve, the corresponding World Event/Interaction Survey category is indicated in the top left corner.

Figure 14

Curves of indicator D1 for the 21 layers L(t), for i = 46,…,66: the blue area marks FGW, while the red dashed line indicates IDC in February 98. For each curve, the corresponding World Event/Interaction Survey category is indicated in the top left corner.

All the information conveyed by the 66 time series can be summarized by using the and indicators displayed in Figure 15. The two curves show a similar trend, with two major spikes corresponding to the FGW and the IDC, neatly emerging in both time series. Furthermore, both indicators are consistent in showing that the two periods pre- and post-FGW are not part of the FGW spike, implying that in these two periods the structure of the occurring binary geopolitical events is closer to the analogous structure for the “no-war” periods.

Figure 15

Time evolution of a global view of the (monthly) Gulf Dataset. (top) D2 dynamics of the collapsed projections and (bottom) D3 dynamics of the metric projections . For each date, the value on y-axis is the HIM distance from the first element of the time series. Different colors mark different time periods. The black line represents the fixed-interval smoothing via a state-space model [48].

However, as expected, the indicator includes a lower level of information than : this is particularly evident (also for the smoothed curves, in black in the plots) in the periods 85–89 and 95–97, where the dynamics of is much flatter than the dynamics of . Note that a nontrivial dynamics in the two periods 85–89 and 95–97 exist in many layers, as shown in Figure 12,13,14, triggered by a number of important events impacting the geopolitical relations: the final part of the Iran–Iraq War (1980–1988), the decline and fall of the Soviet Empire (not directly related to the Middle East area, but reflecting also there), the dramatic change of the situation of the Middle East conflicts induced by the outbreak of the First Intifada in December 1987 [42], and the terrorism excalation (Dhahran, Tel Aviv, and Jerusalem) in Middle East in 95/96 causing a bursting increase in the number of victims just to name the more relevant events. Thus, this case study, too supports the superiority of as a global indicator to summarize the evolution of a series of multiplex networks. We also computed all the HIM distances for (respectively, ) (resp. ), which are then used to project the 240 networks on a plane through a MultiDimensional Scaling (MDS) [43]: the resulting plots are displayed in Figure 16.

Figure 16

Planar multidimensional scaling plot with HIM distance of the collapsed (top) and metric (bottom) projection for the monthly Gulf Dataset. Colors are consistent with those in Figure 15.

Both indicators yield that the months corresponding to FGW (in blue in the plots) are close together and confined in the lower left corner of the plane, showing both a mutual high degree of homogeneity and, at the same time, a relevant difference to the graphs of all other months. Interestingly, this holds also for the months immediately before and after (in green and orange in the figures) the conflict, which are quite distant from the war months' cloud, as previously observed. This confirms that, only at the onset of the conflict the diplomatic relations worldwide changed consistently and their structure remained very similar throughout the whole event. From both the multidimensional scaling plots in Figure 16 it is clear that the both the and networks for the FGW months can be easily discriminated from all other nets. However, from the MDS projections it is not evident whether the months Apr 1979 – Dec 1989 (in grey) could be separated from the Nov 91 – Dec 99 months. By using a Support Vector Machine classifier with the HIM kernel [10], [25] (with for and for ), a 5-fold CV classification gives as best result the accuracy 81.2% for () and 73.3% for (). Thus, in both cases, machine learning provides a good separation between the networks belonging to the two periods. Community structure of . We conclude by analyzing the dynamics of the mesostructure of the layer network as extracted by the Louvain community detection algorithm [44], [45], [46], [47]. For any temporal step, the Louvain algorithm clusters the 66 nodes (WEIS categories) of into two or three communities, whose dimension along time is shown in Figure 17. In Figure 18 we show, for each date, which community each category (on the rows) belongs to; WEIS categories are ranked according to their community distribution, i.e., decreasing number of presences in Comm. #1 and increasing for Comm. #2. Thus in top rows we have the categories lying in Comm. #1 during all the 240 months (layers 7,10,11,28,34,40), while bottom rows are reserved to the categories always belonging to Comm. #2 (3,4,19,25,48,52,58): their description in terms of WEIS categories is shown in Table 6, while the full community distribution is reported in Table 7 and graphically summarized by the triangleplot in Figure 19. Focusing on the categories that are consistently lying in a given community throughout all 240 months, some of them are semantically similar: for instance, consult, assistance, action request in community #1 while two distinct groups emerge in community #2, namely admit wrongdoing, cede power, apologize, reward on one side and warn of policies, sanction threats and halt negotiations characterizing the second group. However, it is interesting the constant presence of the category charge/criticize/blame/disapprove in community #1. Moreover, there is no strong polarization for Community #3. Many layers sharing the same (or similar) WEIS second level category (Yield, Comment, Consult, etc.) are quite close in the community distribution ranked list, with a general escalating trend proceedings from help request (or other more neutral actions) to more severe situations growing together with the community distribution rank.

Figure 17

Dimension of the three communities identified by the Louvain algorithm in along the 240 months.

Figure 18

Community evolution along time for each of the 66 WEIS categories, ranked by community distribution.

Table 6

The 13 layers not swapping community across all 240 timepoints.

Community #1
Layer	WEIS code	WEIS category
7	023	[Comment] Comment on situation – neutral
10	031	[Consult] Meet with at neutral site, or send note
11	032	[Consult] Consult & Visit; go to
28	073	[Reward] Give other assistance
34	094	[Request] Request action; call for
40	121	[Accuse] Charge; criticize; blame; disapprove

Table 7

Community distribution for the 66 WEIS categories: for each layer, we report the number of occurrences in the three detected communities.

Rank	Layer	#1	#2	#3	Rank	Layer	#1	#2	#3
1	7	240	0	0	34	8	86	108	46
2	10	240	0	0	35	17	86	106	48
3	11	240	0	0	36	24	71	130	39
4	28	240	0	0	37	12	58	131	51
5	34	240	0	0	38	47	56	130	54
6	40	240	0	0	39	27	54	140	46
7	66	239	0	1	40	26	51	154	35
8	9	236	0	4	41	18	45	153	42
9	37	234	2	4	42	42	31	170	39
10	38	234	1	5	43	53	27	174	39
11	29	233	2	5	44	61	26	179	35
12	30	232	2	6	45	51	21	182	37
13	6	230	2	8	46	1	17	204	19
14	35	230	0	10	47	60	15	204	21
15	41	222	3	15	48	57	14	214	12
16	39	220	7	13	49	50	9	221	10
17	43	208	12	20	50	59	7	225	8
18	14	196	18	26	51	23	5	219	16
19	62	175	29	36	52	32	5	231	4
20	63	170	32	38	53	15	4	232	4
21	2	164	28	48	54	16	2	236	2
22	56	150	51	39	55	20	2	236	2
23	46	148	49	43	56	33	2	237	1
24	36	146	48	46	57	5	1	235	4
25	55	136	46	58	58	21	1	238	1
26	45	122	64	54	59	44	1	231	8
27	64	121	71	48	60	3	0	240	0
28	65	119	64	57	61	4	0	240	0
29	31	116	64	60	62	19	0	240	0
30	22	110	79	51	63	25	0	240	0
31	54	109	86	45	64	48	0	240	0
32	13	97	91	52	65	52	0	240	0
33	49	97	80	63	66	58	0	240	0

Figure 19

Triangleplot projection of the 66 WEIS categories defined by their community distribution.

Conclusion

We introduced here a novel approach for the longitudinal analysis of a time series of multiplex networks, defined by mean of a metric transformation conveying the information carried by all layers into a single network for each timestamp, with the original layers as nodes. The transformation is induced by the Hamming–Ipsen–Mikhailov distance between graph sharing the same nodes, and it preserves the key events encoded into each instance of the multiplex network time series, making it more efficient than the collapsing of all layers into one collecting all edges for detecting important fluctuations in the original network's dynamics. Moreover, a community detection analysis on the obtained network can help shading light on the relations between the original layers throughout the whole time span.

Declarations

Author contribution statement

Giuseppe Jurman: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Competing interest statement

The authors declare no conflict of interest.

Additional information

Data associated with this study has been deposited at http://vlado.fmf.uni-lj.si/pub/networks/data/KEDS/GulfLDays.zip.

18 in total

Metric projection for dynamic multiplex networks.

Introduction

Background

The Hamming distance

The Ipsen–Mikhailov distance

The Hamming–Ipsen–Mikhailov distance

A minimal example

Theory

Results & discussion

A synthetic example

The Gulf Dataset

Network statistics

Conclusion

Declarations

Author contribution statement

Funding statement

Competing interest statement

Additional information

1. Validation of gene regulatory networks: scientific and inferential.

2. A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data.

3. Structural reducibility of multilayer networks.

4. Networks of military alliances, wars, and international trade.

5. Detecting shifts in gene regulatory networks during time-course experiments at single-time-point temporal resolution.

6. Consensus clustering in complex networks.

7. Metabolic pathways variability and sequence/networks comparisons.

8. A statistical approach for array CGH data analysis.

9. Weighted multiplex networks.

10. Emergence of network features from multiplexity.

Rank	Layer	#1	#2	#3	Rank	Layer	#1	#2	#3
1	7	240	0	0	34	8	86	108	46
2	10	240	0	0	35	17	86	106	48
3	11	240	0	0	36	24	71	130	39
4	28	240	0	0	37	12	58	131	51
5	34	240	0	0	38	47	56	130	54
6	40	240	0	0	39	27	54	140	46
7	66	239	0	1	40	26	51	154	35
8	9	236	0	4	41	18	45	153	42
9	37	234	2	4	42	42	31	170	39
10	38	234	1	5	43	53	27	174	39
11	29	233	2	5	44	61	26	179	35
12	30	232	2	6	45	51	21	182	37
13	6	230	2	8	46	1	17	204	19
14	35	230	0	10	47	60	15	204	21
15	41	222	3	15	48	57	14	214	12
16	39	220	7	13	49	50	9	221	10
17	43	208	12	20	50	59	7	225	8
18	14	196	18	26	51	23	5	219	16
19	62	175	29	36	52	32	5	231	4
20	63	170	32	38	53	15	4	232	4
21	2	164	28	48	54	16	2	236	2
22	56	150	51	39	55	20	2	236	2
23	46	148	49	43	56	33	2	237	1
24	36	146	48	46	57	5	1	235	4
25	55	136	46	58	58	21	1	238	1
26	45	122	64	54	59	44	1	231	8
27	64	121	71	48	60	3	0	240	0
28	65	119	64	57	61	4	0	240	0
29	31	116	64	60	62	19	0	240	0
30	22	110	79	51	63	25	0	240	0
31	54	109	86	45	64	48	0	240	0
32	13	97	91	52	65	52	0	240	0
33	49	97	80	63	66	58	0	240	0

Rank	Layer	#1	#2	#3	Rank	Layer	#1	#2	#3
1	7	240	0	0	34	8	86	108	46
2	10	240	0	0	35	17	86	106	48
3	11	240	0	0	36	24	71	130	39
4	28	240	0	0	37	12	58	131	51
5	34	240	0	0	38	47	56	130	54
6	40	240	0	0	39	27	54	140	46
7	66	239	0	1	40	26	51	154	35
8	9	236	0	4	41	18	45	153	42
9	37	234	2	4	42	42	31	170	39
10	38	234	1	5	43	53	27	174	39
11	29	233	2	5	44	61	26	179	35
12	30	232	2	6	45	51	21	182	37
13	6	230	2	8	46	1	17	204	19
14	35	230	0	10	47	60	15	204	21
15	41	222	3	15	48	57	14	214	12
16	39	220	7	13	49	50	9	221	10
17	43	208	12	20	50	59	7	225	8
18	14	196	18	26	51	23	5	219	16
19	62	175	29	36	52	32	5	231	4
20	63	170	32	38	53	15	4	232	4
21	2	164	28	48	54	16	2	236	2
22	56	150	51	39	55	20	2	236	2
23	46	148	49	43	56	33	2	237	1
24	36	146	48	46	57	5	1	235	4
25	55	136	46	58	58	21	1	238	1
26	45	122	64	54	59	44	1	231	8
27	64	121	71	48	60	3	0	240	0
28	65	119	64	57	61	4	0	240	0
29	31	116	64	60	62	19	0	240	0
30	22	110	79	51	63	25	0	240	0
31	54	109	86	45	64	48	0	240	0
32	13	97	91	52	65	52	0	240	0
33	49	97	80	63	66	58	0	240	0

Rank	Layer	#1	#2	#3	Rank	Layer	#1	#2	#3
1	7	240	0	0	34	8	86	108	46
2	10	240	0	0	35	17	86	106	48
3	11	240	0	0	36	24	71	130	39
4	28	240	0	0	37	12	58	131	51
5	34	240	0	0	38	47	56	130	54
6	40	240	0	0	39	27	54	140	46
7	66	239	0	1	40	26	51	154	35
8	9	236	0	4	41	18	45	153	42
9	37	234	2	4	42	42	31	170	39
10	38	234	1	5	43	53	27	174	39
11	29	233	2	5	44	61	26	179	35
12	30	232	2	6	45	51	21	182	37
13	6	230	2	8	46	1	17	204	19
14	35	230	0	10	47	60	15	204	21
15	41	222	3	15	48	57	14	214	12
16	39	220	7	13	49	50	9	221	10
17	43	208	12	20	50	59	7	225	8
18	14	196	18	26	51	23	5	219	16
19	62	175	29	36	52	32	5	231	4
20	63	170	32	38	53	15	4	232	4
21	2	164	28	48	54	16	2	236	2
22	56	150	51	39	55	20	2	236	2
23	46	148	49	43	56	33	2	237	1
24	36	146	48	46	57	5	1	235	4
25	55	136	46	58	58	21	1	238	1
26	45	122	64	54	59	44	1	231	8
27	64	121	71	48	60	3	0	240	0
28	65	119	64	57	61	4	0	240	0
29	31	116	64	60	62	19	0	240	0
30	22	110	79	51	63	25	0	240	0
31	54	109	86	45	64	48	0	240	0
32	13	97	91	52	65	52	0	240	0
33	49	97	80	63	66	58	0	240	0