Literature DB >> 23278059

Effects of weak ties on epidemic predictability on community networks.

Panpan Shu¹, Ming Tang, Kai Gong, Ying Liu.

Abstract

Weak ties play a significant role in the structures and the dynamics of community networks. Based on the contact process, we study numerically how weak ties influence the predictability of epidemic dynamics. We first investigate the effects of the degree of bridge nodes on the variabilities of both the arrival time and the prevalence of disease, and find out that the bridge node with a small degree can enhance the predictability of epidemic spreading. Once weak ties are settled, the variability of the prevalence will display a complete opposite trend to that of the arrival time, as the distance from the initial seed to the bridge node or the degree of the initial seed increases. More specifically, the further distance and the larger degree of the initial seed can induce the better predictability of the arrival time and the worse predictability of the prevalence. Moreover, we discuss the effects of the number of weak ties on the epidemic variability. As the community strength becomes very strong, which is caused by the decrease of the number of weak ties, the epidemic variability will change dramatically. Compared with the case of the hub seed and the random seed, the bridge seed can result in the worst predictability of the arrival time and the best predictability of the prevalence.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2012 PMID： 23278059 PMCID： PMC7112478 DOI： 10.1063/1.4767955

Source DB: PubMed Journal: Chaos ISSN： 1054-1500 Impact factor: 3.642

In community networks, the links that connect pairs of nodes belonging to different communities are defined as weak ties. The weak ties hypothesis, which is first proposed by Granovetter, is a central concept in the social network analysis. Weak ties not only play a role in effecting social cohesion but also are helpful for stabilizing complex systems under most conditions. Most recent research results showed that weak ties have significant impacts on spreading dynamics. But until now, no study on the effects of weak ties on the predictability of epidemic dynamics has been given to us. In this study, we investigate how the degree of bridge nodes and the number of weak ties influence the predictability of the epidemic dynamics on a local community. We show numerically that both the degree of bridge nodes and the network modularity are crucial in the predictability of the epidemic spreading on the local community. More importantly, we find out that the variability of the arrival time always displays a complete opposite trend to that of the prevalence, which implies that it is impossible to predict the epidemic patterns in the early stage of outbreaks accurately. This work provides us further understanding and new perspective in the effect of weak ties on epidemic spreading.

INTRODUCTION

Community structures at mesoscale level are ubiquitous in a variety of real-world complex systems, such as Facebook, YouTube, and Xiaonei. In general, there are more connections between members in the same community than that between members from different communities, where the links that connect pairs of nodes belonging to different communities are defined as weak ties. The weak ties hypothesis, which is first proposed by Granovetter, is a central concept in the social network analysis. Weak ties not only play a role in effecting social cohesion but also are helpful for stabilizing complex systems under most conditions. Recently, weak ties have been shown to destabilize ecosystems under specific conditions. Epidemic spreading, a fundamental dynamical process, is one of the most important subjects in the complex network theory. Inspired by the significant effects of weak ties on the dynamics on networks, many recent works have contributed to understanding the interplay between weak ties and spreading dynamics on community networks. Onnela et al. found that weak ties can significantly slow the diffusion process, leading to the dynamic trapping of information on communities. As weak ties are removed gradually, the coverage of information will drop sharply. On adaptive networks, strong communities with weak ties may prevent disease propagation. In order to assess the accuracy and the forecasting capabilities of numerical models, the predictability of outbreaks has been investigated in many studies. Colizza et al. studied the effect of airline transportation network on the predictability of epidemic patterns by means of the normalized entropy function and found that the heterogeneous weight distribution contributes to enhancing the predictability. Crépey et al. found that initial conditions such as the degree heterogeneity of the initial seed can induce a large variability on the prediction of the prevalence. Loecher and Kadtke argued that the random walk centrality (RWC) serves as a better index than degree to predict the prevalence of disease. Comparing the scale-free network (SFN) with community structure with the random SFN, the predictability of its global prevalence was found to be better. Considering the relative independence of a local community, Gong et al. investigated the prevalence and its variability on the local community, and found that the extraordinarily large variability in the early stage of outbreaks made the prediction of epidemic spreading hard. In addition, Zhao et al. studied how the heterogeneous time delay (HETD) associated with geographical distance influences the spreading speed and the variability of the prevalence. Owing to the correlations between time delay and network hierarchy in the HETD, the epidemic spreading is slowed down obviously and the predictability of the prevalence is reduced remarkably. On community networks, as mentioned above, weak ties play a very significant role in epidemic dynamics. But until now, there is no study on the effects of weak ties on the predictability of epidemic dynamics. In this paper, we investigate how weak ties influence the predictability of the epidemic dynamics on community networks. We show numerically that both the degree of bridge nodes and the number of weak ties can remarkably influence the predictability of the epidemic spreading on a local community. More importantly, we find out that the variability of the arrival time always displays a complete opposite trend to that of the prevalence, which implies that it is impossible to predict the epidemic spreading in the early stage of outbreaks accurately. The paper is organized as follows. In Sec. II, we briefly describe the dynamical process on a community network and introduce the quantitative measurements of predictability. In Sec. III, we investigate the effects of the degree of bridge nodes on the predictability of the dynamics. In Sec. IV, the effects of the number of weak ties on the predictability are analyzed. Finally, we draw conclusions in Sec. V.

MODEL INTRODUCTION

The community network with degree heterogeneity

To investigate the effects of weak ties on the predictability of epidemic dynamics, we must first identify which links on community networks are weak ties. Unfortunately, there is not a generally accepted and authenticated community detection algorithm; thus, it is difficult to identify weak ties accurately on real-world networks. We here consider a community network model comprised two confined communities A and B. Except for the community structure, degree heterogeneity is another important feature of real-world community networks. In this study, we focus on the community network with degree heterogeneity. To be specific, two independent Barabási-Albert (BA) scale-free networks with the same size are first produced, and then these two networks are connected by few links. In order to normalize the terms of community network, we define the links between two communities as weak ties, and call the nodes connected by these weak ties bridge nodes. Obviously, the network has a strong community structure because of few weak ties. With the increase of the number of weak ties, the community structure will be weakened. The network modularity Q, a popular evaluating indicator in measuring the community structure, is defined as where and represent the number of intra-links and the sum of degrees of the nodes in community s, respectively; L denotes the total link number in the network; and C is the number of communities. Here, , the larger Q is, the stronger community structure is. However, the Q cannot accurately characterize the community strength of a network with two communities. To address the shortcoming, the normalized is defined as where corresponds to the random network with the same degree sequence, and is the modularity of the network without inter-community links. After the normalization, is in range [0, 1].

Dynamic process

In many real-world spreading processes, each node's potential infection-activity is limited, and is not strictly equal to its degree. For epidemic spreading on human contact networks, although a hub individual has many acquaintances, he/she cannot contact all his/her acquaintances once within one time step. Thus, a contact process (CP) model with identical infectivity is proposed to study the epidemic spreading on complex networks, where each infected node can only contact one of its neighbors per unit time. For simplicity, we only study susceptible-infected (SI) spreading dynamics through numerical simulations, which is a classical model of the CP. In the model, “S” and “I” represent, respectively, the susceptible (or healthy) and the infected states. At the beginning, a node is selected as the initial infected (i.e., seed) and all other nodes are in “S” state. At each time step, each infected node randomly contacts one of its neighbors, and then the contacted neighboring node will be infected with probability if it is in the healthy state, or else it will retain its state. Once an individual is infected, it will keep its state forever. To eliminate the stochastic effect of disease propagation, we set . Although other disease models such as susceptible-infected-recovered (SIR) are practically more relevant and realistic, the inclusion of more processes and parameters such as the recovery rate μ in SIR model complicates the analysis of the problem considered here. For example, in epidemic spreading with unlimited infectivity, the initial seed with a larger degree will result in a higher prevalence due to the existence of recovery rate in SIR model. However, the simple SI model can still be adopted to well describe the early dynamics of epidemic outbreaks, such as the acquired immune deficiency syndrome (AIDS), gonorrhea, and syphilis. Owing to the SI model, the effects of different contact patterns on epidemic spreading can be clearly understood.

Statistical parameter

In view of the relative independence of a local community, we take the dynamics and its variability into account. When a disease emerges on the community network, it is very important for a local community to keep a watchful eye on two statistical parameters: the arrival time and the prevalence of disease, where the arrival time of disease is defined as the moment that infectious individual first occurs on the community in each realization, and the prevalence i(t) is the density of infected individuals at time t. In order to investigate the predictability of epidemic dynamics, the variability of the arrival time (the prevalence) is defined as the relative variation of the arrival time (the prevalence) given by and denotes that all independent dynamic realizations are essentially the same, and the arrival time (the prevalence) on the network is deterministic. The larger means the worse predictability that a particular realization is far from average over all independent realizations.

THE EFFECT OF WEAK TIES

The effect of weak tie with different degrees

Because of the degree heterogeneity of the community network, there may be different degrees of bridge nodes, that is, a pair of bridge nodes (i.e., and ) connected by a weak tie may have different degrees and . In the CP, weak ties with the different combinations of have different effects on the propagation from the first community A to the second community B. As a first step towards this, we consider the case of one weak tie between two communities. Reference 42 showed that once the spreading starts on the second community, the seed is irrelevant. That is to say, different hardly affect the epidemic spreading on the second community B. Therefore, by varying the only, we investigate how different influence the predictability of the epidemic dynamics on the second community B. In simulations, a node with the fixed degree in the first community A is connected with a randomly chosen node in the second community B. Fig. 1 shows the mean arrival time and its variability on the second community B when different degrees of bridge nodes are created. When the bridge node of the first community A is chosen as the seed, that is d = 0 (d is the geodesic distance from the initial infected node to the bridge node ), the mean arrival time is linear with the degree of the bridge node . As the bridge node of the second community is one neighbor of the bridge node , it will be infected with probability at each time step, which is obviously a Poisson process. So the mean arrival time is equal to and its relative variation is . When the disease seed is a node with one step to the bridge node (i.e., d = 1), the mean arrival time for different will increase compared with the case of d = 0. For a small is significantly greater than that of d = 0, e.g., for . But for a large , the relative change of is very little, e.g., for . The reason is that the infection of the bridge node can be divided into two processes: The bridge node is first infected in ( is the time duration of the first process), e.g., for and for , and then the bridge node is infected by the bridge node in ( is the time duration of the second process). With the further increasing of the distance from the seed to the bridge node (such as d = 3 and 4), the mean arrival times are nearly the same for the fixed . It can be understood that owing to the finite size effect of the network with the average shortest path length , the bridge node is infected till an overall outbreak emerges on the first community A.

FIG. 1.

The mean arrival time and its variability as a function of the degree of the bridge node where the “squares,” “circles,” “triangleups,” “triangledowns,” and “diamonds” denote the cases of the seeds with d = 0, 1, 2, 3, and 4, respectively. (a) versus , (b) versus . The parameters are chosen as . We perform the experiments on different networks, each of which are tested in independent realizations.

In Fig. 1(b), the variabilities of the arrival time for different are shown. When d = 0, for different are approximately equal to 1 because these infections are Poisson processes. Interestingly, compared with the case of d = 0, for d = 1 decreases at each , that is to say the further distance to the initial seed can lead to the better predictability of the arrival time. As mentioned above, the infection of the bridge node has two processes, and thus its variability can be written as where and denote the time durations of the first process and the second process, respectively. Substituting it into Eq. (3), we obtain where . Considering the independence of these two processes, Eq. (6) is reduced to where and are the time variances of the first process and the second process, respectively. In the first process, there are two basic spreading pathways through which the bridge node may be infected. As shown in Fig. 2, the bridge node may be infected directly by the initial seed (i.e., a neighboring node of the bridge node ) with probability ; the other route is an indirect transmission of infection from its neighboring nodes except the seed when the overall outbreak occurs on the first community A. Although the first route is a Poisson process, the variability in will be less than 1 due to the approximate determinacy of the second pathway. In the second process, the variability because the infection in is a Poisson process. As , we have Obviously, must be less than 1 when d = 1. As increases with will also increase according to Eq. (8), which is verified by the results in Fig. 1(b). Especially for a very large , the variability is very close to 1. It means that although the large degree of the bridge node can delay the mean arrival time of disease, it causes the worst predictability of the arrival time. When of the first process will be more determined. Thus, for a small will become small, e.g., for . The above results demonstrate that when the degree of the bridge node is small, the further distance of the initial seed to the bridge node can result in a better predictability of the arrival time due to the approximate determinacy of the first process, while the variability is almost not affected for the very large because of the Poisson property of the second process.

FIG. 2.

Two spreading pathways through which the bridge node may be infected. The first one is a direct transmission from the seed to the bridge node, and the second one is an indirect transmission from the seed to node i, j, and then to the bridge node.

Next, we focus on the statistical parameter . Reference 42 showed that the variability of the prevalence on a local community is very large at the beginning of outbreaks due to the uncertain arrival time of disease, which makes the prediction of the prevalence hard. For this reason, we pay attention to the variability of the prevalence in the early stage of outbreaks. In Fig. 3(a), the mean prevalence at T = 20 decreases with and d, which is a complete opposite of the trend of in Fig. 1(a). But Fig. 3(b) shows that its variability increases with and d, which is in accordance with the trend of . On the one hand, as the very large variability of the prevalence in the early stage is originated from the uncertain arrival time of disease, it is difficult for the large (corresponding to the large ) to make sure the arrival of disease within T = 20. It will result in the large variability of the prevalence, e.g., for . On the other hand, the further distance of the initial seed to the bridge node also makes the disease more difficult to arrive at the bridge node , and can thus cause larger , e.g., for . The results at other T values such as 5, 10, and 30 confirm the conclusion. These results imply that both the large degree of bridge node and the further distance of the initial seed can make the prediction of the prevalence very hard.

FIG. 3.

At T = 20, the mean prevalence and its variability as a function of the degree of the bridge node where the “squares,” “circles,” “triangleups,” “triangledowns,” and “diamonds” denote the cases of the seeds with d = 0, 1, 2, 3, and 4, respectively. (a) versus , (b) versus . The parameters are chosen as . We perform the experiments on different networks, each of which are tested in independent realizations.

The effect of different initial seeds when d = 1

From Subsection III A, we know that the degree heterogeneity of the bridge node has a significant impact on the predictability of epidemic dynamics. Here, we investigate the effects of different degrees of the initial seed on the variability of the epidemic dynamics on the second community B when the distance between the initial seed and the bridge node is d = 1. As shown in Figs. 4 and 5, when is small, and are obviously affected by the degree of the seed, while there is almost no effect on the variability when is large. When , the large degree of the initial seed will result in a large (Fig. 4(a)) and a small (Fig. 4(c)). Owing to the finite contact ability of the initial seed with the large degree, it will cost more time to infect the bridge node in the first process, that is the large . In the process, the bridge node may be infected through two basic pathways. For the initial seed with a small degree, the bridge node is infected directly with higher probability. Thus, the first process introduces a larger because of the randomness of the Poisson process, and thus in the whole process increases according to Eq. (8). In addition, Fig. 5(c) shows that the variability of the prevalence increases with the degree of the initial seed, which is consistent with the trend of the mean arrival time in Fig. 4(a). We should note that the degree of the initial seed has an opposite effect on the variabilities of the arrival time and the prevalence, which may bring about a great trouble for the pandemic prevention and control.

FIG. 4.

When the distance between the initial seed and the bridge node d = 1, the mean arrival time and its variability as a function of the degree of the initial seed, where versus k for (a) and (b), versus k for (c) and (d). The results are averaged over independent realizations on networks.

FIG. 5.

When d = 1, the mean prevalence and its variability at T = 20 as a function of the degree of the initial seed, where versus kfor (a) and (b), versus k for (c) and (d). The results are averaged over independent realizations in networks.

When , the whole infection process is dominated by the second Poisson process (i.e., very large in Eq. (8)). Therefore, the different seeds with d = 1 cannot affect the variability of epidemic dynamics visibly (see Figs. 4(b), 4(d), 5(b), and 5(d)). However, the very large and mean that it is difficult to accurately forecast the epidemic spreading when the degree of the bridge node is large. In order to ensure the universality of the above results, other are also used to simulate the infection process. As expected, all simulations reveal the same conclusion.

THE EFFECT OF THE NUMBER OF WEAK TIES

In real-world community networks with the modularity , there are many weak ties between communities. In this section, we would like to understand the effects of the number of weak ties on the predictability of epidemic dynamics. To gain a clear idea of the relation between the modularity Q and the number of weak ties, Eq. (1) is expanded as where represents the number of weak ties between the community A and the community B. When the network is connected randomly, , thus ; When and , Q reaches the maximum value, i.e., . Therefore, Q can only range from 0 to 0.5. Substituting and into Eq. (2), we have After the standardization, can range from 0 to 1. By adding the number of weak ties between two communities randomly, we can obtain the community networks with different . Fig. 6 shows the case of three kinds of the initial seed: bridge node, random node, and hub. In Fig. 6(a), with the increase of , the mean arrival time will increase because fewer weak ties lengthen the distance between two communities. Especially when , the mean arrival time will increase rapidly. Compared with the other two cases, the case of the bridge node chosen as the initial seed has the shortest . The case of the random node includes the cases of the bridge node and the non-bridge node, so for the random seed must be longer than that for the bridge seed. For the case of the hub, it has the longest mean arrival time. As the nodes connected by weak ties are chosen randomly, the nodes with a small degree will be more probably chosen as bridge nodes due to the degree heterogeneity, while it is very difficult for the hubs to be bridge nodes. When the degree of the bridge node is small, the initial seed with a large degree leads to the longer (see Fig. 4(a)).

FIG. 6.

The mean arrival time and its variability as a function of the modularity where the “squares,” “circles,” and “triangles” denote the cases of the bridge seed, the random seed, and the hub seed, respectively. (a) versus , (b) versus . The parameters are chosen as . We perform the experiments on different networks, each of which are tested in independent realizations.

Moreover, the variabilities of the arrival time for these three cases are compared in Fig. 6(b). As mentioned in Fig. 1(b), the further distance of the initial seed to the bridge node can reduce the variability . With the increase of , the distance between two communities is lengthened by fewer weak ties, and thus will decrease. For example, when increases from 0.9 to 1, for the random case decreases from 0.33 to 0.18 rapidly. For the case of the hub, its arrival time has the most accurate predictability. It is because that the initial seed with a large degree can lead to a low variability when the bridge node has a small degree (see Fig. 4(c)). More significant, for the case of the bridge node increases with the network modularity , which is opposite to the other two cases. Even though a bridge node is chosen as the initial seed, the infection to community B is not always through the weak tie of the bridge node because of the existence of many other weak ties. That is, there are two optional spreading pathways towards the community B: the weak tie of and the other weak ties. Thus, the actual path length of epidemic spreading must be greater than 1. In other words, more links will result in the longer distance of the infection process. Therefore, more weak ties can reduce due to the increase of the distance between the community A and the community B. Actually, more weak ties can increase the deterministic of the second optional pathway, and thus enhances the predictability of the arrival time. Furthermore, the effects of the number of weak ties on the predictability of the prevalence in the early stage of outbreaks are also analyzed in Fig. 7. As the mean arrival time increases with in Fig. 6(a), the prevalence at T = 2 will decrease accordingly in Fig. 7(a). is resulted from . Fig. 7(b) shows that the variability of the prevalence increases with . For the case of the bridge node, the change of is very little, which means that the bridge node plays a significant role in enhancing the predictability of the prevalence. For the cases of the randomly chosen node and the hub, increases slowly when , while increases rapidly when (e.g., for ), which is in accordance with the trend of . The results at other T values (e.g., T = 5 and 10) reveal the same conclusion. It implies that the strong community structure can increase the difficulty of the predictability of the prevalence.

FIG. 7.

At T = 2, the mean prevalence and its variability as a function of the modularity where the “squares,” “circles,” and “triangles” denote the cases of the bridge seed, the random seed, and the hub seed, respectively. (a) versus , (b) versus . We perform the experiments on different networks, each of which are tested in independent realizations.

CONCLUSIONS

In conclusion, we have studied the effects of weak ties on the predictability of the epidemic dynamics on a heterogeneous network with local community structure. First, we have shown that the degree of bridge nodes can remarkably influence the variabilities of both the arrival time and the prevalence of disease. With the increase of the degree of the bridge node, the mean arrival time and the outbreak (i.e., the prevalence) of disease will be delayed, but their variabilities will still increase because of the Poisson property of the transmission from the bridge node to . Second, we have shown that the distance between the initial seed and the bridge node, as well as the degree of the initial seed when d = 1, has different impacts on the epidemic predictability under different conditions. When the degree of the bridge node is large, the variability of the arrival time is close to 1, and the variability of the prevalence is very large. When the degree of the bridge node is small, the further distance of the initial seed to the bridge node (or the larger degree of the initial seed when d = 1) will enhance the predictability of the arrival time due to the approximate determinacy of the indirect transmission in infecting the bridge node , while the predictability of the prevalence in the early stage will get worse due to the uncertain arrival time of disease. These results suggest that the bridge node with a small degree may be an important detection station for the epidemic control on community networks. Taking global epidemics on the airline network, for example, the slackened precaution at a small international airport may bring a lot of trouble to the prediction and control of disease. Moreover, we also have analyzed the effects of the number of weak ties on the epidemic predictability where the results of three different initial seeds (i.e., bridge node, random node, and hub) were compared. With the increase of the network modularity, the variabilities of the arrival time for the case of the random node and the hub will decrease slowly when , and then drop sharply when is greater than 0.9. It is important to note that the variability of the prevalence will increase rapidly when the community strength is strong enough, which is in accordance with the trend of the mean arrival time. By contrast, owing to the Poisson property of the transmission on weak ties, the variability of the arrival time for the case of the bridge node increases with the network modularity, which displays the worst predictability of the arrival time. But the best predictability of the prevalence is observed when the bridge node is first infected, which is originated from the shortest arrival time for this case. These results suggest that although the strong community structure can delay the infection from one community to another, it can also cause the unpredictability of the epidemic spreading on these communities. It is very important to avoid the infection of bridge nodes in the early stage of outbreaks for a better prediction of the spreading on community networks. The above results provide us further understanding and new perspective in the effect of weak ties on epidemic spreading. This work only focused on the CP model with limited infectivity, but the other type of epidemic spreading (i.e., epidemic spreading with unlimited infectivity) may change the above conclusions qualitatively. Thus, on a heterogeneous network with local community structure, how weak ties influence the predictability of the unlimited spreading is an interesting question.

38 in total

Effects of weak ties on epidemic predictability on community networks.

INTRODUCTION

MODEL INTRODUCTION

The community network with degree heterogeneity

Dynamic process

Statistical parameter

THE EFFECT OF WEAK TIES

The effect of weak tie with different degrees

The effect of different initial seeds when d = 1

THE EFFECT OF THE NUMBER OF WEAK TIES

CONCLUSIONS

1. Assortative mixing in networks.

2. Critical behavior of the Ising model in annealed scale-free networks.

3. Fluctuation effects in metapopulation models: percolation and pandemic threshold.

4. Behaviors of susceptible-infected epidemics on scale-free networks with identical infectivity.

5. Finite-size scaling in complex networks.

6. Fuzzy communities and the concept of bridgeness in complex networks.

7. Emergence of bursts and communities in evolving weighted networks.

8. Stratus not altocumulus: a new view of the yeast protein interaction network.

9. Epidemic variability in hierarchical geographical networks with human activity patterns.

10. Still stratus not altocumulus: further evidence against the date/party hub distinction.

1. Effective information spreading based on local information in correlated networks.

2. Numerical identification of epidemic thresholds for susceptible-infected-recovered model on finite-size networks.

3. Backbone reconstruction in temporal networks from epidemic data.

4. An efficient immunization strategy for community networks.

5. Asymmetrically interacting spreading dynamics on complex layered networks.

6. Modeling and analysis of epidemic spreading on community networks with heterogeneity.

7. Interplay between the local information based behavioral responses and the epidemic spreading in complex networks.

8. Recovery rate affects the effective epidemic threshold with synchronous updating.