Panpan Shu1, Wei Wang1, Ming Tang1, Younghae Do2. 1. Web Sciences Center, University of Electronic Science and Technology of China, Chengdu 610054, China. 2. Department of Mathematics, Kyungpook National University, Daegu 702-701, South Korea.
Abstract
Epidemic threshold has always been a very hot topic for studying epidemic dynamics on complex networks. The previous studies have provided different theoretical predictions of the epidemic threshold for the susceptible-infected-recovered (SIR) model, but the numerical verification of these theoretical predictions is still lacking. Considering that the large fluctuation of the outbreak size occurs near the epidemic threshold, we propose a novel numerical identification method of SIR epidemic threshold by analyzing the peak of the epidemic variability. Extensive experiments on synthetic and real-world networks demonstrate that the variability measure can successfully give the numerical threshold for the SIR model. The heterogeneous mean-field prediction agrees very well with the numerical threshold, except the case that the networks are disassortative, in which the quenched mean-field prediction is relatively close to the numerical threshold. Moreover, the numerical method presented is also suitable for the susceptible-infected-susceptible model. This work helps to verify the theoretical analysis of epidemic threshold and would promote further studies on the phase transition of epidemic dynamics.
Epidemic threshold has always been a very hot topic for studying epidemic dynamics on complex networks. The previous studies have provided different theoretical predictions of the epidemic threshold for the susceptible-infected-recovered (SIR) model, but the numerical verification of these theoretical predictions is still lacking. Considering that the large fluctuation of the outbreak size occurs near the epidemic threshold, we propose a novel numerical identification method of SIR epidemic threshold by analyzing the peak of the epidemic variability. Extensive experiments on synthetic and real-world networks demonstrate that the variability measure can successfully give the numerical threshold for the SIR model. The heterogeneous mean-field prediction agrees very well with the numerical threshold, except the case that the networks are disassortative, in which the quenched mean-field prediction is relatively close to the numerical threshold. Moreover, the numerical method presented is also suitable for the susceptible-infected-susceptible model. This work helps to verify the theoretical analysis of epidemic threshold and would promote further studies on the phase transition of epidemic dynamics.
Epidemic threshold, which is one of the most important features
of the epidemic dynamics, has attracted much attention recently. The existing studies have
provided different theoretical predictions for epidemic threshold of the
susceptible-infected-recovered (SIR) model on complex networks, while the numerical
verification of these theoretical predictions is still lacking. As a result, it is very
necessary to develop an effective numerical measure for identifying the SIR epidemic
threshold. In this paper, the numerical identification of the SIR epidemic threshold is
systematically studied. We present a numerical method by analyzing the peak of the epidemic
variability to identify the epidemic threshold. To understand the effectiveness of the
variability measure, the distribution of outbreaks sizes is investigated near the epidemic
threshold on random regular networks. Based on the analysis of the cutoff hypothesis of the
outbreak size distribution, we find that the variability measure can provide an excellent
identification of the epidemic threshold. We further use the variability measure to verify the
existing theoretical predictions on scale-free and real networks. The results show that the
heterogeneous mean-field (HMF) prediction agrees very well with the numerical threshold,
except the case that the networks are disassortative, in which the quenched mean-field (QMF)
prediction is relatively close to the numerical threshold. The numerical method presented can
effectively identify SIR epidemic thresholds on various networks, and could be extended to
other dynamical processes such as information diffusion and behavior spreading. This work
provides us a deep understanding of epidemic threshold and would promote further studies on
phase transition of epidemic dynamics.
INTRODUCTION
Models for disease propagation are at the core of our understanding about epidemic dynamics
on complex networks. Two epidemic
models of particular importance are the susceptible-infected-susceptible (SIS) and
susceptible-infected-recovered (SIR) models. At each time step, an infected node can transmit a disease to each of
its susceptible neighbors with probability β. At the same time, the
infected nodes become susceptible again in the SIS model or recover in the SIR model with
probability μ. In the SIS model, a critical value of the effective
transmission rate
separates the absorbing phase with only healthy nodes from the active phase with a
stationary density of infected nodes. Differently, no steady state is allowed in the SIR
model, but a threshold still exists above which the final fraction of recovered nodes is
finite.The traditional theoretical study on the epidemic threshold of the SIS model is based on
the heterogeneous mean-field (HMF) theory, which means that all the nodes within a given
degree are considered to be statistically equivalent. According to the HMF theory, the epidemic threshold of SIS
model is given by ,
where
and
are the first and second moments of degree distribution
P(k),
respectively. As the quenched structure of the network and dynamical correlations between
the state of adjacent nodes is neglected in the HMF theory, researchers have proposed an important improvement over the HMF
theory—quenched mean-field (QMF) theory. The QMF theory fully preserves the actual quenched
structure of the network described as its adjacency matrix, and the epidemic threshold is
predicted to be
where Λ is the maximum
eigenvalue of the adjacency matrix of a given network. Considering that the existing
theories more or less have some limitations (e.g., the HMF theory neglects the quenched
structure of the network; QMF theory ignores dynamical correlations), some numerical methods such as the finite-size scaling
analysis, susceptibility, and lifetime measure have been proposed to check the accuracies of different
theoretical predictions for the SIS model. Among these existing methods, the relatively
common one for a network with finite size N is the susceptibility measure
where ρ denotes the outbreak size. In
Ref. 18, the susceptibility measure has been shown to
be very effective for identifying the SIS epidemic thresholds on various networks.For another paradigmatic epidemic model, the SIR model, there have been a lot of
theoretical studies on its epidemic threshold. The earliest theoretical study on the SIR
epidemic threshold is based on the assumption of homogeneous mixing, showing that the SIR
epidemic threshold is inversely proportional to the average connectivity . At the HMF level, the epidemic threshold of SIR model takes the value
On networks with power-law scaling , where γ is the degree
exponent, the HMF approach predicts a vanishing threshold for scale-free networks with
and the finite threshold for . Mapping the SIR model to a bond percolation
process, the epidemic threshold
coincides with the result of Eq. (3) for a SIR
model with unit infection time (e.g., μ = 1), and when the infection times
vary among infected nodes (e.g., ),
the epidemic threshold is given by
According to the QMF theory, the epidemic threshold of the SIR model has the same
expression as Eq. (1). However, the QMF result
is even qualitatively not correct, as the QMF predicts a vanishing threshold for power-law
distributed networks with
that is in conflict with the visually numerical results.Although the numerical threshold of the SIS model has attracted much attention, the systematic study of the
numerical identification of the SIR epidemic threshold is still insufficient. It is well
known that the outbreak size becomes finite above the threshold
λ.
However, as the value of λ increases, the outbreak size continuously
changes from an infinitesimal fraction to a finite fraction in the SIR model, and thus, it is difficult to determine the
value of λ at which the outbreak size turns to be finite. To our knowledge,
there has no effective numerical method for identifying the SIR epidemic threshold in
previous studies. In this work, we perform extensive numerical simulations of the SIR model
on networks with finite size, and present a numerical identification method by analyzing the
peak of the epidemic variability
(i.e., the maximal value of the epidemic variability) to identify the epidemic threshold.
The effectiveness of the numerical measure is checked on random regular networks (RRN),
where the HMF theory is exact. To get a deep understanding of the validity of the numerical
method, we investigate the distribution of outbreaks sizes near the epidemic threshold. The
robustness of the variability measure is confirmed by the analysis of cutoff hypothesis of
the outbreak size distribution. We further employ the variability measure to verify the
theoretical predictions on scale-free networks and real-world networks, where the results
indicate that the HMF prediction agrees very well with the numerical threshold, except the
case that the networks are disassortative, in which the QMF prediction is relatively close
to the numerical threshold.
AN EFFECTIVE NUMERICAL IDENTIFICATION MEASURE
In this section, we give the detailed description of the simulation of SIR model, propose
the numerical identification measure of the epidemic threshold, and make deep analysis of
the effectiveness of the numerical measure. In the SIR model, each node can be one of three
states which are the susceptible state, infected state, and recovered state, respectively.
At the beginning, one node is randomly selected as the initial infected (i.e., seed), and
all other nodes are susceptible. At each time step t, each susceptible node
i becomes infected with probability
if it has one or more infected neighbors, where n is the number
of its infected neighbors. At the same time, all infected nodes recover (or die) at rate
μ and the recovered nodes acquire permanent immunity. Time increases by ,
and the dynamical process terminates when all infected nodes are recovered. In this paper,
μ is set to 1, unless otherwise specified.
Proposing the numerical identification measure
The susceptibility measure can not only identify an effective SIS epidemic
threshold, but can also be used to
determine the critical point of the percolation process. Since the connection between SIR and bond percolation is made
through the assimilation of the size of the percolating giant component with the final
number of recovered individuals, we
check the effectiveness of susceptibility measure χ for the SIR model on
RRN, where all nodes have exactly the same degree k. On these networks,
the HMF prediction (Ref. 5) is accurate for the SIR model. We compare the HMF
prediction with the numerical threshold identified by the susceptibility measure
in Fig. 1(a), where the result shows that the
numerical threshold of the SIR model identified by χ is significantly
larger than .
FIG. 1.
Comparison of theoretical thresholds with numerical thresholds on RRN. (a) The threshold
λ vs. k, where N is set
to 104. (b) The threshold λ vs.
N, where k is set to 10. “Squares,” “circles,” and
“triangleups” denote , ,
and , respectively.
Inset: Susceptibility χ and variability Δ as a function of
λ. The results are averaged over
independent realizations on 102 networks.
Considering that the fluctuation of the outbreak size is large near the epidemic
threshold, we try to identify the epidemic threshold by the variability measure
which is a standard measure to determine the critical
point in equilibrium phase on magnetic system. The inset of Fig. 1(a)
shows that the variability Δ exhibits a peak over a wide range of λ, so
we estimate the epidemic threshold from the position of the peak of the variability .
On RRN with different values of k, we find that
is always consistent with the HMF prediction. When the degree k is given,
we further consider the relationship between the epidemic threshold and the network size
N in Fig. 1(b), where the
numerical thresholds
and
do not change with N.
is very close to the HMF prediction, while there is an obvious gap between
and HMF prediction. From the above, we know that the variability Δ can identify an
effective SIR epidemic threshold, while the epidemic threshold identified by the
susceptibility χ is overestimated on RRN. Thus, a new problem has arisen:
Why the variability Δ performs well but the susceptibility χ goes awry
for the SIR model?
Analysis of the effectiveness of numerical identification measure
Next, we make an analysis of the effectiveness of the numerical identification measures
above, by investigating the distribution of outbreak sizes which has the strong
heterogeneity near the epidemic threshold. On RRN with k = 10, where the
epidemic threshold ,
Fig. 2(a) shows the distribution of outbreak sizes
near λ. The outbreak sizes follow approximately an
exponential distribution at
that is smaller than λ. At ,
the outbreak sizes follow a power-law distribution
with a cutoff at some values, where . Since the disease may die out
quickly or infect a subset of nodes when ,
the distribution of outbreak sizes is bimodal, with two peaks occurring at
and
for ,
respectively.
FIG. 2.
Analysis of effectiveness and robustness of numerical methods on RRN. (a) Numerical
distribution of outbreak sizes in SIR model for
(circles),
(triangles), and
(squares), where blue solid, red short dash, and black dot lines represent the theoretical
distributions given by Eq. (A4). (b)
χ vs. λ, where only the small outbreak sizes with
are considered to numerically count χ. (c) Δ vs. λ,
where the lump is assumed to be located at r in the
theoretical distribution diagram when .
In (b) and (c), “triangles,” “circles,” and “diamonds” denote cutoff values
r = 0.05, 0.2 and 0.4, respectively. The parameters are
chosen as
and k = 10. The results are averaged over
independent realizations on 102 networks.
Moreover, the theoretical distribution of outbreak sizes (see Appendix for details) is
compared with the results obtained by numerical simulations in Fig. 2(a), where the theoretical probability from Eq. (A4) is consistent with the numerical results
for relatively small outbreak sizes ().
At the epidemic threshold, the theoretical results also show that the outbreak sizes obey
a power-law distribution with the exponent of about −1.5. When ,
some large outbreak sizes constitute a lump in the numerical scattergram, but the
probability of large outbreak sizes cannot be solved from Eq. (A4). We thus speculate that the non-ignorable
lump may affect the numerical identification of SIR epidemic threshold for the
susceptibility measure.To verify the speculation, Fig. 2(b) investigates
the effectiveness of the susceptibility measures under some cutoff hypotheses. We set the
cutoff value of the outbreak size as r, which means that only
the outbreak sizes with
are used to numerically count the susceptibility χ under the cutoff
hypothesis. Three kinds of r are considered, where
corresponds to the maximum value of small outbreak size before the lump appears in the
numerical scattergram,
means that the numerical scattergram consists of a part of the lump, and
means that there is a complete lump in the numerical scattergram. As shown in Fig. 2(b), the susceptibility measure can indeed give a quite
effective estimate of the SIR epidemic threshold when the whole lump is ignored (i.e., ).
With the increase of r, the position of peak value of
susceptibility χ gradually shifts to the right for large outbreak sizes
are considered. This shows that the susceptibility χ loses its
effectiveness in identifying the SIR epidemic threshold due to the existence of the
lump.In Fig. 2(c), the robustness of the variability Δ is
further checked in theory. As the numerical distribution of the large outbreak sizes is
concentrated, we assume that there is a lump located at r
with in the theoretical probability
distribution diagram of outbreak sizes while .
Based on such theoretical distribution, we plot the variability measure as a function of
λ for different values of r in Fig. 2(c). Since the variability Δ measures the heterogeneity
of the outbreak size distribution, which is strongest at the epidemic threshold, the peak position of the
variability measure does not change with the position of the lump.From the above analysis, we can conclude that the variability Δ is effective in
identifying the epidemic threshold of SIR model, while the bimodal distribution of
outbreak sizes for
leads to the overestimation of the SIR epidemic threshold when using the susceptibility
χ.
VERIFICATION OF THE THEORETICAL PREDICTIONS ON SCALE-FREE AND REAL NETWORKS
We further verify the theoretical predictions on scale-free and real networks, by comparing
them with the numerical thresholds from the variability Δ.
Scale-free networks
We build scale-free networks (SFN) with degree distribution
based on the configuration model. The
so-called structural cutoff
and natural cutoff
(Ref. 33) are considered to constrain the maximum
possible degree k on SFN. Differently, the degree-degree
correlations vanish on scale-free networks with structural off, while the disassortative
degree-degree correlations exist when
for scale-free networks with natural cutoff, because high degree vertices connect preferably to low degree ones
in this case. We consider the SIR model on SFN with structural cutoff in Figs. 3(a) and 3(c), where
the SIR epidemic threshold increases monotonically with the degree exponent
γ, and the variation of epidemic threshold with network size
N is approximately linear in logarithmic relationship. The HMF prediction is very
close to the numerical threshold ,
while there is an obvious difference between the QMF prediction and .
FIG. 3.
Comparison of theoretical thresholds with numerical thresholds on SFN. The threshold
λ vs. γ on SFN with structural cutoff
(a) and natural cutoff (b), where N is set to 104.
λ vs. N on SFN with structural cutoff
(c) and natural cutoff (d), where solid and empty symbols denote
and 3.50, respectively. “Squares,” “circles,” and “triangles” denote , and ,
respectively. The results are averaged over
independent realizations on 102 networks.
The SFN with natural cutoff are considered in Figs. 3(b) and 3(d), where the variations of
epidemic threshold with γ and N are similar to the
results on SFN with structural cutoff. When ,
the HMF prediction is close to the numerical threshold, while there is a gap between the
QMF prediction and the numerical threshold. Since the disassortative degree-degree
correlations exist for ,
there is a slight difference between and .
In Fig. 3(d), the distinction between and
becomes large with the increase of N for ,
while in such case the QMF prediction is always close to the numerical threshold since the
principle eigenvector is delocalized when . It can been seen from the above analysis
that the numerical method presented provides quantitive indexes for the observations in
Ref. 22, where the HMF theory is relatively
accurate for the SIR model.
Real-world networks
To check the performance of the variability Δ on real-world networks, Fig. 4 depicts Δ as a function of λ for
Hamsterster full network, which
contains friendships and family links between users of the website hamsterster.com, and
Facebook (NIPS) network, which
contains Facebook user-user friendships. The numerical results intuitively show that the
variability Δ always reaches a maximum value near the value of λ above
which the outbreak size ρ becomes finite. The theoretical predictions of
the HMF theory and of the QMF theory are quite close to the numerical threshold identified
by Δ on Hamsterster full network, but they become poor on Facebook (NIPS) network.
Considering the difference in the behaviors of the theoretical predictions on the two
networks above, the detailed comparisons between the numerical and theoretical thresholds
on other real networks are presented in Table I.
The results indicate that although the HMF prediction and the numerical threshold are nearly the
same for assortative networks, there is an obvious difference between them for the
networks showing significant disassortative mixing. The QMF prediction is relatively worse
than the HMF prediction for assortative networks, but the former is close to for some
disassortative networks (e.g., Router views, CAIDI, and
email contacts). These findings
numerically show the accuracies of existing theoretical predictions on real-world
networks.
FIG. 4.
Variability Δ and outbreak size ρ as a function of λ on
Hamsterster full network (a) and Facebook (NIPS) network (b). “Triangleup” and
“triangledown” denote and ,
respectively. The variability Δ for each λ is normalized by the maximal
variability Δ. The results are averaged over 106
independent realizations on each network.
TABLE I.
Characteristics and epidemic thresholds of real-world networks. N is the
network size, k is the maximum degree, r
is the correlation coefficient of the degrees, and are the HMF
and QMF predictions for SIR, respectively, (SIR)
denotes the numerical threshold of SIR identified by Δ, and (SIS)
and (SIS)
represent the numerical thresholds of SIS identified by Δ and χ,
respectively.
Network
N
kmax
r
λcHMF
λcQMF
λpΔ(SIR)
λpΔ(SIS)
λpχ(SIS)
Hamsterster full35
2000
273
0.023
0.023
0.020
0.023
0.025
0.025
Brightkite36
56739
1134
0.010
0.016
0.010
0.014
0.012
0.012
arXiv astro-ph37
17903
504
0.201
0.015
0.011
0.012
0.012
0.012
Pretty good privacy38
10680
206
0.239
0.056
0.024
0.053
0.033
0.033
US power grid39
4941
19
0.003
0.348
0.134
0.446
0.261
0.264
Euroroad40
1039
10
0.090
0.479
0.249
0.498
0.331
0.331
Facebook (NIPS)41
2888
769
−0.668
0.004
0.036
0.075
0.079
0.077
Route views37
6474
1458
−0.182
0.006
0.022
0.037
0.034
0.036
CAIDA37
26475
2628
−0.195
0.004
0.014
0.019
0.019
0.019
email contacts42
12625
576
−0.387
0.009
0.02
0.027
0.024
0.025
CONCLUSION AND DISCUSSION
In summary, we have studied the numerical identification of SIR epidemic threshold on
complex networks with finite size. We have checked the effectiveness of the susceptibility
measure for SIR on RRN, where the HMF is exact. The results showed an obvious gap between
the numerical threshold identified by the susceptibility χ and the HMF
prediction. Then, we proposed the numerical identification method by analyzing the peak of
the epidemic variability, and found that the numerical threshold identified by the
variability measure Δ agrees very well with the HMF prediction on RRN.In order to get a deep understanding of the effectiveness of the two numerical measures
above, we have analyzed the distribution of outbreak sizes near the epidemic threshold
λ. The outbreak sizes follow approximately an exponential
distribution when .
At the epidemic threshold, the outbreak sizes follow a power-law distribution with the
exponent of about −1.5. When ,
the numerical distribution of outbreak sizes is bimodal with two peaks occurring at
and O(1), respectively. The probability of small outbreak sizes in theory
is consistent with that obtained by numerical simulations, but the probability of large
outbreak sizes that constitute a lump in the numerical scattergram cannot be obtained
theoretically. Based on the analysis of the cutoff hypothesis of the outbreak size
distribution, we found that the susceptibility measure can give a fairly effective SIR
epidemic threshold when the lump is ignored. Since the variability measure reflects the
heterogeneity of the outbreak size distribution, it is always effective in identifying the
epidemic threshold, where the distribution of outbreak sizes has the very strong
heterogeneity.We further employed the variability measure to verify the theoretical predictions on
scale-free and real networks. The HMF prediction is close to the numerical threshold on most
of the networks, but on SFN with natural cutoff and degree exponent ,
it becomes poor due to the existence of disassortative mixing. Similarly, the HMF prediction
agrees well with the numerical method on real networks with assortative mixing, while it
becomes very poor for disassortative networks, where the QMF prediction is relatively close
to the numerical threshold. These findings provide quantitive indexes for the accuracies of
existing theoretical predictions from the perspective of simulation.As part of the discussion, we have considered the epidemic threshold for
in Fig. 5. The results on RRN and SFN all show that the
numerical thresholds for
are a little larger than those for .
As shown in the inset,
leads to an epidemic threshold close to Eq. (4), while
leads to an epidemic threshold close to Eq. (3). It should be pointed out that the numerical threshold of SIR model is
inclined to the theoretical prediction
from the edge-based compartmental theory when .
These findings could be complementary to some existing results.
FIG. 5.
Comparison of theoretical thresholds with numerical thresholds for .
The threshold λ vs. k for
(a) and
(b) on RRN. The threshold λ vs. γ for
(c) and
(d) on SFN, where solid and empty symbols denote SFN with structural cutoff
and natural cutoff ,
respectively. “Circles” and “squares” denote and ,
respectively. Inset: λ as a function of μ on
RRN with
and k = 6. The results are averaged over
independent realizations on 102 networks.
Moreover, we have tried applying the variability measure to the identification of the SIS
epidemic threshold. As shown in Fig. 6 and Table I, the numerical threshold
from the variability measure agrees very well with the threshold
identified by the susceptibility measure, whose validity for the SIS model has been
confirmed in Ref. 18. This shows that the variability
measure can also provide an effective estimate of the SIS epidemic threshold.
FIG. 6.
Comparison of theoretical thresholds with numerical thresholds for SIS model on RRN. (a)
The threshold λ vs. k, where
N is set to 104. (b) The threshold
λ vs. N, where k is set
to 10. “Squares” and “circles” denote
and ,
respectively. The results are averaged over
independent realizations on 102 networks.
We have put forward a numerical method for identifying the epidemic threshold for SIR
model, which is also suitable for the SIS model. This method can effectively identify
epidemic thresholds on various networks, and could be extended to other dynamical processes
such as information diffusion and behavior spreading. Further work should be done to check
the effectiveness of this method on more complicated networks (e.g., temporal networks and multilayer networks). Besides, the accurate analytic
approximation of the epidemic threshold for general networks remains an important problem.
This work helps to verify theoretical analysis of epidemic threshold and would promote
further studies on phase transition of epidemic dynamics.
Authors: S Boccaletti; G Bianconi; R Criado; C I Del Genio; J Gómez-Gardeñes; M Romance; I Sendiña-Nadal; Z Wang; M Zanin Journal: Phys Rep Date: 2014-07-10 Impact factor: 25.600