Literature DB >> 27626089

Metric projection for dynamic multiplex networks.

Giuseppe Jurman1.   

Abstract

Evolving multiplex networks are a powerful model for representing the dynamics along time of different phenomena, such as social networks, power grids, biological pathways. However, exploring the structure of the multiplex network time series is still an open problem. Here we propose a two-step strategy to tackle this problem based on the concept of distance (metric) between networks. Given a multiplex graph, first a network of networks is built for each time step, and then a real valued time series is obtained by the sequence of (simple) networks by evaluating the distance from the first element of the series. The effectiveness of this approach in detecting the occurring changes along the original time series is shown on a synthetic example first, and then on the Gulf dataset of political events.

Entities:  

Keywords:  Applied mathematics; Computational mathematics; Computer science; Information science

Year:  2016        PMID: 27626089      PMCID: PMC5011148          DOI: 10.1016/j.heliyon.2016.e00136

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

When the links connecting a set of N nodes arise from k different sources, a possible representation for the corresponding graph is the construction of k networks on the same N nodes, one for each source. The resulting structure is known as a multiplex network, and each of the composing graphs is called a layer. Multiplex networks are quite effective in representing many different real-world situations [1], [2], [3], and their structure helps extracting crucial information about the complex systems under investigation that would instead remain hidden when analyzing individual layers separately [4], [5], [6]; furthermore, their relation with time series analysis techniques has recently gained interest in the literature [7]. A key property to be highlighted is the correlated multiplexity, as stated in [8]: in real-world systems, the relation between layers is not at all random; in fact, in many cases, the layers are mutually correlated. Moreover, the communities induced on different layers tend to overlap across layers, thus generating interesting mesoscale structures. These observations guided the authors of [9] in defining a network having the layers of the original multiplex graph as nodes, and using information theory to define a similarity measure between the layers themselves, so to investigate the mesoscopic modularity of the multiplex network. Here we propose to pursue a similar strategy for defining a network of networks derived from a multiplex graph, although in a different context and with a different aim. In particular, we project a time series of multiplex networks into a series of simple networks to be used in the analysis of the dynamics of the original multiplex series. The projection map defining the similarity measure between layers is induced by the Hamming–Ipsen–Mikhailov (HIM) network distance [10], a glocal metric combining the Hamming and the Ipsen–Mikhailov distances, used in different scientific areas [11], [12], [13], [14], [15], [16]. The main goal in using this representation is the analysis of the dynamics of the original time series through the investigation of the trend of the projected evolving networks, by extracting the corresponding real-valued time series obtained computing the HIM distance between any element in the series and the first one. For instance, we show on a synthetic example that this strategy is more informative than considering statistics of the time series for each layer of the multiplex networks, or than studying the networks derived collapsing all layers into one including all links, as in [17], [18] when the aim is detecting the timesteps where more relevant changes occur and the system is undergoing a state transition (tipping point) or it is approaching it (early warning signals). This is a classical problem in time series analysis, and very diverse solutions have appeared in literature (see [19] for a recent example). Here we use two different evaluating strategies, the former based on the fluctuations of mean and variance [20] (implemented in the R package changepoint https://cran.r-project.org/web/packages/changepoint/index.html), and the latter involving the study of increment entropy indicator [21]. We conclude with the analysis of the well known Gulf Dataset (part of the Penn State Event Data) concerning the 304.401 political events (of 66 different categories) occurring between 202 countries in the 10 years between 15 April 1979 to 31 March 1991, focusing on the situation in the Gulf region and the Arabian peninsula. A major task in the analysis of the Gulf dataset is the assessment of the translation of the geopolitical events into fluctuations of measurable indicators. A similar network-based mining of sociopolitical relations, but with a probabilistic approach, can be found in [22], [23], [24]. Here we show the effectiveness of the newly introduced methodology in associating relevant political events and periods to characteristic behaviors in the dynamics of the time series of the induced networks of networks, together with a simple overview of the corresponding mesoscale modular structure.

Background

The Hamming–Ipsen–Mikhailov (HIM) metric [10], [25] is a distance function quantifying in the real interval the difference between two networks on shared nodes. The HIM metric linearly combines an edit distance, the Hamming (H) [26], [27], [28] and a spectral distance, the Ipsen–Mikhailov (IM) [29]. Edit distances are local metrics, functions of insertion and deletion of matching links, while spectral measures are global distances, functions of the network spectrum. Local functions disregards the overall network structure, while spectral measures cannot distinguish isospectral graphs. As its characterizing feature, HIM is a glocal distance that overcomes the drawbacks of local and global metrics when separately considered. Furthermore, its definition can be naturally extended to directed networks. Hereafter we give a brief description of the H, IM and HIM distances, graphically summarized in Figure 1.
Figure 1

Summary of the definitions of the HIM distance and its Hamming (H) and Ipsen–Mikhailov (IM) components.

Notations. Let and be two simple networks on N nodes, whose adjacency matrices are and , with , where for unweighted graphs and for weighted networks. Let then be the identity matrix , let be the unitary matrix with all entries equal to one and let be the null matrix with all entries equal to zero. Denote then by the empty network with N nodes and no links (with adjacency matrix ) and by the clique (undirected simple full network) with N nodes and all possible links, whose adjacency matrix is . Finally, the Laplacian matrix L of an undirected network is defined as the difference between the degree matrix D and the adjacency matrix A, where D is the diagonal matrix of vertex degrees. L is positive semidefinite and singular, with eigenvalues .

The Hamming distance

The Hamming distance, one of the most common dissimilarity measures in coding and string theory and recently used also for network comparison, evaluates the presence/absence of matching links on the two compared networks. In terms of adjacency matrices, the expression for the normalized Hamming metric H reads as where the normalization factor bounds the range of the function H in the interval . The lower bound 0 is attained only for identical networks , the upper limit 1 for complementary networks . When and are unweighted networks, is just the fraction of different matching links over the total number of possible links between the two graphs.

The Ipsen–Mikhailov distance

The Ipsen–Mikhailov IM metric stems from the realization of an N nodes network as an N molecules system connected by identical elastic springs, according to the adjacency matrix A. The dynamics of the spring-mass system can be described by the set of N differential equations The vibrational frequencies of are given by , while the spectral density for a graph in terms of the sum of Lorentz distributions is defined as where γ is the common width and K is the normalization constant defined by the condition , and thus The scale parameter γ specifies the half-width at half-maximum, which is equal to half the interquartile range. Then the spectral distance between two graphs and on N nodes with densities and can be defined as Since for each N, denoting by the unique solution of , the normalized Ipsen–Mikhailov distance between two undirected networks can be defined as so that IM is bounded between 0 and 1, with upper bound attained only for .

The Hamming–Ipsen–Mikhailov distance

Consider now the cartesian product Z of two metric spaces and , where is the set of all simple undirected networks on N nodes endowed either with the Hamming metric H or with the Ipsen–Mikhailov distance IM. Define then on Z the one-parameter Hamming–Ipsen–Mikhailov distance HIM as the (Euclidean) product metric of H and IM, normalized by the factor , for : where in what follows we will omit the subscript ξ when it is equal to one. Note that, apart from extreme values, the qualitative impact of ξ is minimal in practice, and in what follows will always be assumed. The metric is bounded in the interval , with the lower bound attained for every couple of identical networks, and the upper one attained only on the pair . Moreover, all distances will be nonzero for non-identical isomorphic/isospectral graphs.

A minimal example

In Figure 2 we show, in the space, the graphical representation in circular layout and the mutual HIM distances between four undirected simple networks on six shared nodes, namely the ring network (A), the star network (B), a regular network with degree three (C) and a regular lattice (D). HIM distances range from 0.217 for the pair (C,D), which are the mutually closest networks, to 0.495 for (B,C) which are the farthest graphs. In all cases, the Hamming distance is contributing to the HIM metric more than the Ipsen–Mikhailov component, indicating that the presence or absence of matching links is has a larger impact than the overall structure. Note for instance that networks A and B have the same Hamming distance as A and D, but the spectral structure of the lattice D is closer to the structure of the ring network A than the star network B, as quantitatively shown by the different IM distance; in particular, the spectral structures of A and D are the closest, with IM distance even smaller of the IM distance between C and D. An analogous situation occurs for the pairs B,C and B,D, sharing the same H distance but with a different IM distance.
Figure 2

Graphical representation in circular layout (a), scatterplot (b) and tabular (c) representation of the HIM distance in the Ipsen–Mikhailov (IM axis) and Hamming (H axis) distance space between ring network (A), the star network (B), a regular network with degree three (C) and a 3 × 2 regular lattice (D).

Theory

Let be a sequence (time series) of τ multiplex networks with λ layers sharing ν nodes , as displayed in Figure 3.
Figure 3

Graphical representation of a sequence of τ multiplex networks with λ layers.

The metric projection. Construct now the metric projection of as the full undirected weighted network with λ nodes where the weight of the edge connecting vertices and is defined by the HIM similarity between layers and : thus, if is the adjacency matrix of , then In Figure 4 an example of the construction of is shown for a multiplex network with five layers.
Figure 4

Construction of the metric projection at a given time point t = i for a multiplex network with λ = 5 layers; the metric projection is a new network with one node for each layer of the original net, and the edge weight is given by the complement of the HIM distance between the corresponding layers.

The collapsed projection. Moreover, if is the adjacency matrix of , define the collapsed projection of on nodes as the network where a link exists between and if it exists in at least one layer (for binary layers); in case of weighted layers, the weight of the link is the average of the weights across all layers. Thus, if is the adjacency matrix of , then In Figure 5 we show a graphical sketch of the collapsing of a multiplex network with five layers.
Figure 5

Construction of the collapsed projection at a given time point t = i for a multiplex network with λ = 5 layers; the collapsed projection is a new network sharing the same nodes of the original multiplex net, where a link exists in the projection if the same link appears in at least one of the layers of the multiplex network, as if all the layers were collapsed into a single one.

Caveat: consider a sequence of binary multiplex networks such that, for each of the possible links and for each timestep, there exists at least one layer including this link. Then the collapsed projection, at each time step, is the full graph on ν nodes, and, as such, it has no temporal dynamics, regardless of the evolution of each single layer. The distance series. To investigate the dynamics of for , we construct a suite of associated time series by means of three different procedures, all involving the HIM distance between each network in a given sequence and the first element of the sequence itself. The first group D1 of distance series is obtained by evaluating the dynamics of each layer considered separately: In Figure 6 we show the construction of the distance series D1 for the first layer of the multiplex network in Figure 3.
Figure 6

Construction of the distance series D1 for the first layer of the sequence of multiplex network in Figure 3. The value of the time series at time point t = i is the HIM distance between the layer L1 at time t = i and at time t = 1.

The second series, D2, collects the metric dynamics of the collapsed projection : An example of construction of D2 for the five layers multiplex network of Figure 5 is shown in Figure 7.
Figure 7

Construction of the distance series D2 for the sequence of collapsed networks in Figure 5. The value of the time series at time point t = i is the HIM distance between at time t = i and at time t = 1.

Finally, the last series D3 collects the metric dynamics of the metric projection , and the corresponding example for the multiplex networks in Figure 4 is shown in Figure 8:
Figure 8

Construction of the distance series D3 for the sequence of metric projections in Figure 4. The value of the time series at time point t = i is the HIM distance between at time t = i and at time t = 1.

Dynamics indicators. The dynamics of the time series D⁎ is quantitatively analyzed by means of a set of indicators, assessing the series' information content and detecting occurring tipping points. The first indicator is the Increment Entropy (IncEnt), introduced in [21] as a measure the complexity of time series in terms of its unpredictability [30]. The starting point is increment series of a time series as an informative encoder of the characteristics of dynamic changes hidden in a signal. In practice, the increments are grouped in vectors of size m, and each increment is mapped into a two-letters word, with a sign and its size coded in this word according to a resolution parameter R. Finally, the IncEnt is computed as the Shannon entropy of these words: the larger is the IncEnt value, the less predictable is the series. The second indicator meanvar belongs to the family of the changepoint detection indicators as implemented in R by the changepoint package [20]. In general, changepoint detection algorithms are the solutions to the problem of estimating the points in a time series where the statistical properties change. The subset of the meanvar functions search for changes in both the mean and the variance, and a number of alternative are known in literature [31], [32], [33], [34], [35], [36]. In particular, in what follows we will employ the recent Pruned Exact Linear Time (PELT) algorithm [37], based on the classical segment neighborhood technique minimizing the combination of a cost function (for instance, twice the negative log-likelihood) with a linear penalty function through dynamic programming. Finally, we will use Changepoints for a Range of PenaltieS (CROPS) [38] to obtain optimal changepoint segmentations of data sequences for all penalty values across a continuous range.

Results & discussion

A synthetic example

Consider now a sequence of binary multiplex networks with , and , generated as follows. Define the perturbation function taking as entries a binary simple network N on n nodes, and a couple of real values with , and returning a network obtained from N by swapping the status (present/not present) of links, where g is a random value in the interval . Further, define the default transition as the pair , a small transition as , a medium transition as and, finally, a large transition as . Moreover, let R be an Erdós–Rényi random model and define 4 special timepoints: the initial time step , the first spike , the second spike and the third spike . Then, each layer at a given time step t is defined through the following rule: In Figure 9 we show the evolution along the 30 timepoints of the 5 curves for , its average and , . To assess the information content of each curve we use the Increment Entropy indicator IncEnt, whose value increases with the series' complexity: the IncEnt values are reported in Table 1.
Figure 9

D1, D2, D3 for a synthetic example on 5 layers and 30 timepoints; in the right column, third row, we plot .

Table 1

Increment Entropy values for the distance sequences of the synthetic example, with parameters m = 2, R = 2.

Dist.IncEntDist.IncEnt
D1(L1)2.52D1(L2)2.78
D1(L3)2.59D1(L4)2.83
D1(L5)2.44D12.27
D21.82D33.04
Among the evolving layers, and have the largest IncEnt, while the other three layers show a lower level of complexity. As expected, the average and the collapsed network distance has very low IncEnt value, yielding that both averaging the distances and collapsing the layers lose information about the overall dynamics. Finally, distance is the metric which better detects the network evolution along time, conserving most of the information. This is also supported by the meanvar indicator with CROPS range with the PELT algorithm: in fact, the meanvar indicator detects correctly in the three points , and as changepoints, while in , other than the , meanvar detects and =28 which are unrelated to the designed dynamics.

The Gulf Dataset

Data description. Part of the Penn State Event Data http://eventdata.psu.edu/ (formerly Kansas Event Data System), available at http://vlado.fmf.uni-lj.si/pub/networks/data/KEDS/, the Gulf Dataset collects, on a monthly bases, political events between pairs of countries focusing on the Gulf region and the Arabian peninsula for the period 15 April 1979 to 31 March 1999, for a total of 240 months. The 304401 political events involve 202 countries and they belong to 66 classes (including for instance “pessimist comment”, “meet”, “formal protest”, “military engagement”, etc.) as coded by the World Event/Interaction Survey (WEIS) Project [39], [40], [41] http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/5211, whose full list is reported in Table 2, 3.
Table 2

Part 1 of the full table of WEIS codes [39], with the 66 layers considered in the Gulf dataset case study; entries with no layer number were not monitored in the Gulf dataset events collection.

Layer#WEIS codeWEIS catDescription
1011YieldSurrender, yield or order, submit to arrest, etc.
2012YieldYield position, retreat; evacuate
3013YieldAdmit wrongdoing; retract statement
014YieldAccommodate, Cease-fire
4015YieldCede Power
5021CommentExplicit decline to comment
6022CommentComment on situation – pessimistic
7023CommentComment on situation – neutral
8024CommentComment on situation – optimistic
9025CommentExplain policy or future position
026CommentAppoint or Elect
027CommentAlter Rules
10031ConsultMeet with at neutral site, or send note.
11032ConsultConsult & Visit; go to
12033ConsultReceive visit; host
034ConsultVote, Elect
13041ApprovePraise, hail, applaud, condole
14042ApproveEndorse other's policy or position; give verbal support
043ApproveRally
15051PromisePromise own policy support
16052PromisePromise material support
17053PromisePromise other future support action
18054PromiseAssure; reassure
055PromisePromise Rights
19061GrantExpress regret; apologize
20062GrantGive state invitation
21063GrantGrant asylum
22064GrantGrant privilege, diplomatic recognition
23065GrantSuspend negative sanctions; truce
24066GrantRelease and/or return persons or property
067GrantGrant Position
25070RewardReward
26071RewardExtend economic aid (as gift and/or loan)
27072RewardExtend military assistance
28073RewardGive other assistance
29081AgreeMake substantive agreement
30082AgreeAgree to future action or procedure; agree to meet, to negotiate
083AgreeAlly
084AgreeMerge; Integrate
31091RequestAsk for information
32092RequestAsk for policy assistance
33093RequestAsk for material assistance
34094RequestRequest action; call for
35095RequestEntreat; plead; appeal to
096RequestRequest policy change
097RequestRequest rights
Table 3

Part 2 of the full table of WEIS codes [39], with the 66 layers considered in the Gulf dataset case study; entries with no layer number were not monitored in the Gulf dataset events collection.

Layer#WEIS codeWEIS catDescription
36101ProposeOffer proposal
37102ProposeUrge or suggest action or policy
38111RejectTurn down proposal; reject protest demand, threat, etc
39112RejectRefuse; oppose; refuse to allow
113RejectDefy law
40121AccuseCharge; criticize; blame; disapprove
41122AccuseDenounce; denigrate; abuse
123AccuseInvestigate
42131ProtestMake complaint (not formal)
43132ProtestMake formal complaint or protest
133ProtestSymbolic act
44141DenyDeny an accusation
45142DenyDeny an attributed policy, action role or position
46150DemandIssue order or command; insist; demand compliance; etc
151DemandIssue Command
152DemandClaim Rights
47160WarnGive warning
48161WarnWarn of policies
162WarnWarn of problem
49171ThreatenThreat without specific negative sanctions
50172ThreatenThreat with specific non-military negative sanctions
51173ThreatenThreat with force specified
52174ThreatenUltimatum; threat with negative sanctions and time limit specified
53181DemonstrateNon-military demonstration; to walk out on
54182DemonstrateArmed force mobilization
55191Reduce Relations1Cancel or postpone planned event
56192Reduce Relations1Reduce routine international activity; recall officials; etc
57193Reduce Relations1Reduce or halt aid
58194Reduce Relations1Halt negotiations
59195Reduce Relations1Break diplomatic relations
196Reduce Relations1Strike
197Reduce Relations1Censor
60201ExpelOrder personnel out of country
61202ExpelExpel organization or group
203ExpelBan Organization
62211SeizeSeize position or possessions
63212SeizeDetain or arrest person(s)
213SeizeHijack; Kidnap
64221ForceNon-injury obstructive act
65222ForceNon-military injury-destruction
66223ForceMilitary engagement

As negative sanctions.

In the notation of Sec. 3, the Gulf Dataset translates into a time series of multiplex networks with unweighted and undirected layers sharing nodes. The landmark event for the considered zone in the 20 years data range of interest is definitely the First Gulf War (FGW), occurring between August 1990 and March 1999. However, other (smaller) events located in the area had a relevant impact on world politics and diplomatic relations. Among them, the Iraq Disarmament Crisis (IDC) in February 1998 significantly emerges from the data, as shown in what follows. During that month, Iraq President Saddam Hussein negotiated a deal with U.N. Secretary General Kofi Annan, allowing weapons inspectors to return to Baghdad, preventing military action by the United States and Britain.

Network statistics

Consider in this section the set of 304401 edges connecting the 202 nodes independently of their class. In Table 4 we list the top-10 countries/institutions participating in the largest number of edges across different time spans, together with the absolute number of shared edges and the corresponding percentage over the total number of edges for the period. In general, USA, Iraq and Iran are the major players, with different proportions according to the specific period: in particular, Iraq is the main character in both the major events, FGW and IDC. Other key actors are Israel, the United Nations and the Saudi Arabia, with a relevant presence in each key event in the area. Note that, overall, the top 20 institutions (also including, other than those listed in the table, the Arab world, France, Syria, Egypt, Russia, Turkey, Jordan, Libya, Germany and the Kurd world) are responsible for 82.57% of all edges.
Table 4

Top-10 countries/institutions ranked by number of shared links, absolute and in percentage over (twice) the total number of links in the considered period. The Iran–Iraq War started in September 1980 and ended in August 1988. SA: Saudi Arabia; UN: United Nations.

Apr79–Mar99
FGW
IDC
Edges 304401
Edges 41181
Edges 7712
Inst.Degree%Inst.Degree%Inst.Degree%
USA9390015.42Iraq1869122.69Iraq383024.83
Iraq8497413.96USA1558418.92USA287618.65
Iran6178210.15Kuwait52456.37UN194612.62
Israel322045.29SA35484.31Russia8965.81
UN300974.94Israel34204.15UK7154.64
SA205033.37UN33634.08France6514.22
Lebanon191303.14UK29973.64Iran4683.03
Palestine186073.06Iran21042.55Arab world3212.08
UK184153.02France20762.52China3092.00
Kuwait174052.86Arab world20532.49Kuwait3061.98




Out of all potential unique edges, only 4394 are represented in the Gulf Dataset. In Table 5 we list the top-10 links ranked by occurrence, together with the number of occurrences itself and the corresponding percentage over the total number of edges for the period. As it happens for the nodes, there are a few key links throughout the whole timespan which are consistently present in most of the important events, with different proportions. However, in some of the events, there is an interesting wide gap in the number of occurrences between the very top edges and the remaining ones, e.g., Iraq–USA in FGW (and post) and IDC, and Iran–Iraq during the corresponding war and in the pre-FGW, yielding that these are the links mainly driving the whole network evolution.
Table 5

Top-10 countries/institutions ranked by number of shared links, absolute and in percentage over (twice) the total number of links in the considered period. The Iran–Iraq War started in September 1980 and ended in August 1988. SA: Saudi Arabia; UN: United Nations.

Apr79–Mar99
FGW
IDC
Edges 304401
Edges 41181
Edges 7712
EdgeDegree%EdgeDegree%EdgeDegree%
Iran–Iraq191216.28Iraq–USA606114.72Iraq–USA102113.24
Iraq–USA190026.24Iraq–Kuwait23065.60Iraq–UN92712.02
Iran–USA140514.62SA–USA11692.84UN–USA3374.37
Iraq–UN127754.20Iraq–UN11182.71Iraq–Russia3154.08
Israel–Lebanon65902.16Kuwait–USA10502.55Iraq–UK2413.12
Israel–USA58031.91Iraq–UK10122.46France–Iraq1912.48
Iraq–Kuwait51871.70Iran–Iraq9892.40UK–USA1842.39
SA–USA44681.47Israel–USA9352.27France–UN1712.22
Israel–Palestina44661.47Iraq–Israel8512.07Russia–USA1702.20
UN–USA42091.38Iraq–SA7961.93Iraq–Turkey1361.76




In Figure 10 we display the dynamics of the occurrence along time of the top edges, showing their different trends during the diverse events. It is interesting to note how two top links, Iran–Iraq and Iran–USA are preponderant from 1979 to 1989, i.e., throughout the whole Iran–Iraq War, while they go decaying quickly afterwards, with a minor spike for FGW. Complementarily, two other major links Iraq–USA and Iraq–United Nations have the opposite trend, remaining almost uninfluential until FGW and growing later on, with a noticeable spike for IDC; moreover, Iraq–United Nations does not show any trend change for FGW, while Iraq–USA does. The Iraq–Kuwait link has a very limited dynamics, with the unique important spike for FGW. Very similar are also the Saudi Arabia–USA and the Israel–USA links, showing an additional lower spike in correspondence of the raise of the terroristic actions between 1995–1996. This last event is crucial in the Israel–Lebanon relations, where it has the largest effect; FGW, instead, has almost no impact here.
Figure 10

Occurrences along time of the top-8 most frequent links. The blue area marks FGW, while the red dashed line indicates IDC in February 98.

D⁎ indicators analysis. The two main events FGW and IDC generate sudden changes in the D1 time series for most of the layers: an example is given in Figure 11 for the layer 37, corresponding to WEIS code 102 (“Urge or suggest action or policy”), where we highlight FGW by a blue background, and IDC by a red dashed line. The complete panel of the curves for all the 66 layers is shown in Figure 12,13,14: most of the layers show a decise change in trend in correspondence of the two main events, although some of the layers display a different behavior (e.g., layer 59, “Break diplomatic relations”), sometimes due to the paucity of data (e.g., “Halt negotiations” or “Reward”). Note that many other spikes exist in many layers, corresponding to different geopolitical events occurring throughout the considered timespan.
Figure 11

D1 time series for the layer 37, corresponding to WEIS code 102 (“Urge or suggest action or policy”). The period corresponding to FGW is marked by the blue background, while the red dashed line indicates IDC in February 1998.

Figure 12

Curves of indicator D1 for the 24 layers L(t), for i = 1,…,24: the blue area marks FGW, while the red dashed line indicates IDC in February 98. For each curve, the corresponding World Event/Interaction Survey category is indicated in the top left corner.

Figure 13

Curves of indicator D1 for the 21 layers L(t), for i = 25,…,45: the blue area marks FGW, while the red dashed line indicates IDC in February 98. For each curve, the corresponding World Event/Interaction Survey category is indicated in the top left corner.

Figure 14

Curves of indicator D1 for the 21 layers L(t), for i = 46,…,66: the blue area marks FGW, while the red dashed line indicates IDC in February 98. For each curve, the corresponding World Event/Interaction Survey category is indicated in the top left corner.

All the information conveyed by the 66 time series can be summarized by using the and indicators displayed in Figure 15. The two curves show a similar trend, with two major spikes corresponding to the FGW and the IDC, neatly emerging in both time series. Furthermore, both indicators are consistent in showing that the two periods pre- and post-FGW are not part of the FGW spike, implying that in these two periods the structure of the occurring binary geopolitical events is closer to the analogous structure for the “no-war” periods.
Figure 15

Time evolution of a global view of the (monthly) Gulf Dataset. (top) D2 dynamics of the collapsed projections and (bottom) D3 dynamics of the metric projections . For each date, the value on y-axis is the HIM distance from the first element of the time series. Different colors mark different time periods. The black line represents the fixed-interval smoothing via a state-space model [48].

However, as expected, the indicator includes a lower level of information than : this is particularly evident (also for the smoothed curves, in black in the plots) in the periods 85–89 and 95–97, where the dynamics of is much flatter than the dynamics of . Note that a nontrivial dynamics in the two periods 85–89 and 95–97 exist in many layers, as shown in Figure 12,13,14, triggered by a number of important events impacting the geopolitical relations: the final part of the Iran–Iraq War (1980–1988), the decline and fall of the Soviet Empire (not directly related to the Middle East area, but reflecting also there), the dramatic change of the situation of the Middle East conflicts induced by the outbreak of the First Intifada in December 1987 [42], and the terrorism excalation (Dhahran, Tel Aviv, and Jerusalem) in Middle East in 95/96 causing a bursting increase in the number of victims just to name the more relevant events. Thus, this case study, too supports the superiority of as a global indicator to summarize the evolution of a series of multiplex networks. We also computed all the HIM distances for (respectively, ) (resp. ), which are then used to project the 240 networks on a plane through a MultiDimensional Scaling (MDS) [43]: the resulting plots are displayed in Figure 16.
Figure 16

Planar multidimensional scaling plot with HIM distance of the collapsed (top) and metric (bottom) projection for the monthly Gulf Dataset. Colors are consistent with those in Figure 15.

Both indicators yield that the months corresponding to FGW (in blue in the plots) are close together and confined in the lower left corner of the plane, showing both a mutual high degree of homogeneity and, at the same time, a relevant difference to the graphs of all other months. Interestingly, this holds also for the months immediately before and after (in green and orange in the figures) the conflict, which are quite distant from the war months' cloud, as previously observed. This confirms that, only at the onset of the conflict the diplomatic relations worldwide changed consistently and their structure remained very similar throughout the whole event. From both the multidimensional scaling plots in Figure 16 it is clear that the both the and networks for the FGW months can be easily discriminated from all other nets. However, from the MDS projections it is not evident whether the months Apr 1979 – Dec 1989 (in grey) could be separated from the Nov 91 – Dec 99 months. By using a Support Vector Machine classifier with the HIM kernel [10], [25] (with for and for ), a 5-fold CV classification gives as best result the accuracy 81.2% for () and 73.3% for (). Thus, in both cases, machine learning provides a good separation between the networks belonging to the two periods. Community structure of . We conclude by analyzing the dynamics of the mesostructure of the layer network as extracted by the Louvain community detection algorithm [44], [45], [46], [47]. For any temporal step, the Louvain algorithm clusters the 66 nodes (WEIS categories) of into two or three communities, whose dimension along time is shown in Figure 17. In Figure 18 we show, for each date, which community each category (on the rows) belongs to; WEIS categories are ranked according to their community distribution, i.e., decreasing number of presences in Comm. #1 and increasing for Comm. #2. Thus in top rows we have the categories lying in Comm. #1 during all the 240 months (layers 7,10,11,28,34,40), while bottom rows are reserved to the categories always belonging to Comm. #2 (3,4,19,25,48,52,58): their description in terms of WEIS categories is shown in Table 6, while the full community distribution is reported in Table 7 and graphically summarized by the triangleplot in Figure 19. Focusing on the categories that are consistently lying in a given community throughout all 240 months, some of them are semantically similar: for instance, consult, assistance, action request in community #1 while two distinct groups emerge in community #2, namely admit wrongdoing, cede power, apologize, reward on one side and warn of policies, sanction threats and halt negotiations characterizing the second group. However, it is interesting the constant presence of the category charge/criticize/blame/disapprove in community #1. Moreover, there is no strong polarization for Community #3. Many layers sharing the same (or similar) WEIS second level category (Yield, Comment, Consult, etc.) are quite close in the community distribution ranked list, with a general escalating trend proceedings from help request (or other more neutral actions) to more severe situations growing together with the community distribution rank.
Figure 17

Dimension of the three communities identified by the Louvain algorithm in along the 240 months.

Figure 18

Community evolution along time for each of the 66 WEIS categories, ranked by community distribution.

Table 6

The 13 layers not swapping community across all 240 timepoints.

Community #1
LayerWEIS codeWEIS category
7023[Comment] Comment on situation – neutral
10031[Consult] Meet with at neutral site, or send note
11032[Consult] Consult & Visit; go to
28073[Reward] Give other assistance
34094[Request] Request action; call for
40121[Accuse] Charge; criticize; blame; disapprove

Table 7

Community distribution for the 66 WEIS categories: for each layer, we report the number of occurrences in the three detected communities.

RankLayer#1#2#3RankLayer#1#2#3
17240003488610846
2102400035178610648
3112400036247113039
4282400037125813151
5342400038475613054
6402400039275414046
7662390140265115435
892360441184515342
9372342442423117039
10382341543532717439
11292332544612617935
12302322645512118237
136230284611720419
143523001047601520421
154122231548571421412
16392207134950922110
17432081220505972258
181419618265123521916
19621752936523252314
20631703238531542324
2121642848541622362
22561505139552022362
23461484943563322371
2436146484657512354
25551364658582112381
26451226454594412318
2764121714860302400
2865119645761402400
29311166460621902400
30221107951632502400
31541098645644802400
3213979152655202400
3349978063665802400
Figure 19

Triangleplot projection of the 66 WEIS categories defined by their community distribution.

Conclusion

We introduced here a novel approach for the longitudinal analysis of a time series of multiplex networks, defined by mean of a metric transformation conveying the information carried by all layers into a single network for each timestamp, with the original layers as nodes. The transformation is induced by the Hamming–Ipsen–Mikhailov distance between graph sharing the same nodes, and it preserves the key events encoded into each instance of the multiplex network time series, making it more efficient than the collapsing of all layers into one collecting all edges for detecting important fluctuations in the original network's dynamics. Moreover, a community detection analysis on the obtained network can help shading light on the relations between the original layers throughout the whole time span.

Declarations

Author contribution statement

Giuseppe Jurman: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Competing interest statement

The authors declare no conflict of interest.

Additional information

Data associated with this study has been deposited at http://vlado.fmf.uni-lj.si/pub/networks/data/KEDS/GulfLDays.zip.
  18 in total

1.  Validation of gene regulatory networks: scientific and inferential.

Authors:  Edward R Dougherty
Journal:  Brief Bioinform       Date:  2010-12-22       Impact factor: 11.622

2.  A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data.

Authors:  Nancy R Zhang; David O Siegmund
Journal:  Biometrics       Date:  2007-03       Impact factor: 2.571

3.  Structural reducibility of multilayer networks.

Authors:  Manlio De Domenico; Vincenzo Nicosia; Alexandre Arenas; Vito Latora
Journal:  Nat Commun       Date:  2015-04-23       Impact factor: 14.919

4.  Networks of military alliances, wars, and international trade.

Authors:  Matthew O Jackson; Stephen Nei
Journal:  Proc Natl Acad Sci U S A       Date:  2015-12-14       Impact factor: 11.205

5.  Detecting shifts in gene regulatory networks during time-course experiments at single-time-point temporal resolution.

Authors:  Yoichi Takenaka; Shigeto Seno; Hideo Matsuda
Journal:  J Bioinform Comput Biol       Date:  2015-09-30       Impact factor: 1.122

6.  Consensus clustering in complex networks.

Authors:  Andrea Lancichinetti; Santo Fortunato
Journal:  Sci Rep       Date:  2012-03-27       Impact factor: 4.379

7.  Metabolic pathways variability and sequence/networks comparisons.

Authors:  Kyaw Tun; Pawan K Dhar; Maria Concetta Palumbo; Alessandro Giuliani
Journal:  BMC Bioinformatics       Date:  2006-01-18       Impact factor: 3.169

8.  A statistical approach for array CGH data analysis.

Authors:  Franck Picard; Stephane Robin; Marc Lavielle; Christian Vaisse; Jean-Jacques Daudin
Journal:  BMC Bioinformatics       Date:  2005-02-11       Impact factor: 3.169

9.  Weighted multiplex networks.

Authors:  Giulia Menichetti; Daniel Remondini; Pietro Panzarasa; Raúl J Mondragón; Ginestra Bianconi
Journal:  PLoS One       Date:  2014-06-06       Impact factor: 3.240

10.  Emergence of network features from multiplexity.

Authors:  Alessio Cardillo; Jesús Gómez-Gardeñes; Massimiliano Zanin; Miguel Romance; David Papo; Francisco del Pozo; Stefano Boccaletti
Journal:  Sci Rep       Date:  2013       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.