Changwang Zhang1, Shi Zhou2, Joel C Miller3, Ingemar J Cox4, Benjamin M Chain5. 1. 1] Department of Computer Science, University College London, UK [2] Security Science Doctoral Research Training Centre, University College London, UK [3] School of Computer Science, National University of Defense Technology, Changsha, China. 2. Department of Computer Science, University College London, UK. 3. 1] School of Mathematical Sciences, Monash University, Melbourne, Victoria, Australia [2] School of Biological Sciences, Monash University, Melbourne, Victoria, Australia [3] Monash Academy for Cross &Interdisciplinary Mathematics, Monash University, Melbourne, Victoria, Australia. 4. 1] Department of Computer Science, University of Copenhagen, Denmark [2] Department of Computer Science, University College London, UK. 5. Division of Infection and Immunity, University College London, UK.
Abstract
Epidemic spreading phenomena are ubiquitous in nature and society. Examples include the spreading of diseases, information, and computer viruses. Epidemics can spread by local spreading, where infected nodes can only infect a limited set of direct target nodes and global spreading, where an infected node can infect every other node. In reality, many epidemics spread using a hybrid mixture of both types of spreading. In this study we develop a theoretical framework for studying hybrid epidemics, and examine the optimum balance between spreading mechanisms in terms of achieving the maximum outbreak size. We show the existence of critically hybrid epidemics where neither spreading mechanism alone can cause a noticeable spread but a combination of the two spreading mechanisms would produce an enormous outbreak. Our results provide new strategies for maximising beneficial epidemics and estimating the worst outcome of damaging hybrid epidemics.
Epidemic spreading phenomena are ubiquitous in nature and society. Examples include the spreading of diseases, information, and computer viruses. Epidemics can spread by local spreading, where infected nodes can only infect a limited set of direct target nodes and global spreading, where an infected node can infect every other node. In reality, many epidemics spread using a hybrid mixture of both types of spreading. In this study we develop a theoretical framework for studying hybrid epidemics, and examine the optimum balance between spreading mechanisms in terms of achieving the maximum outbreak size. We show the existence of critically hybrid epidemics where neither spreading mechanism alone can cause a noticeable spread but a combination of the two spreading mechanisms would produce an enormous outbreak. Our results provide new strategies for maximising beneficial epidemics and estimating the worst outcome of damaging hybrid epidemics.
Epidemic spreading phenomena are ubiquitous in nature and society. Examples include the
spreading of infectious diseases within a population, the spreading of computer viruses
on the Internet, and the propagation of information in society. Understanding and
modelling the dynamics of such events can have significant practical impact on health
care, technology and the economy. Various spreading mechanisms have been studied12. The two most common mechanisms are local spreading, where
infected nodes only infect a limited subset of target nodes3; and
global spreading, where nodes are fully-mixed such that an infected node can
infect any other node41. In reality, many epidemics use hybrid
spreading, which involves a combination of two or more spreading mechanisms. For
example the computer worms Conficker5 and Code-Red6 can
send probing packets to targeted computers in the local network or to any randomly
chosen computers on the Internet.Early relevant studies investigated epidemics spreading in populations whose nodes mix at
both local and global levels (“two levels of mixing”)7. These early studies7 did not incorporate the structure of
the local spreading network, assuming both local and global spreading are fully-mixed.
Since the introduction of network based epidemic analysis31, hybrid
epidemics have been studied in structured populations8, in structured
households91011, and by considering networked epidemic spreading
with “two levels of mixing”121314. A number of
studies151617181920 have also considered epidemics in
metapopulations, which consist of a number of weakly connected subpopulations. The
studies of epidemics in clustered networks212223 are also relevant.
Much prior work on hybrid epidemics has focused on the impact of a network’s
structure on spreading.Most previous studies were about what we call the non-critically hybrid epidemics
where a combination of multiple mechanisms is not a necessary condition for an epidemic
outbreak. In this case, using a fixed total spreading effort, a hybrid epidemic will
always be less infectious than an epidemic using only the more infectious one of the two
spreading mechanisms1324. However, many real examples of hybrid
epidemics suggest the existence of critically hybrid epidemics where a mixture of
spreading mechanisms may be more infectious than using only one mechanism.In this paper we investigate whether, and if so when, hybrid epidemics spread more widely
than single-mechanism epidemics. We propose a mathematical framework for studying hybrid
epidemics and focus on exploring the optimum balance between local and global spreading
in order to maximize outbreak size. We demonstrate that hybrid epidemics can cause
larger outbreaks in a metapopulation than a single spreading mechanism.Our results suggest that it is possible to combine two spreading mechanisms, each with a
limited potential to cause an epidemic, to produce a highly effective spreading process.
Furthermore, we can identify an optimal tradeoff between local and global mechanisms
that enables a hybrid epidemic to cause the largest outbreak. Manipulating the balance
between local and global spreading may provide a way to improve strategies for
disseminating information, but also a way to estimate the largest outbreak of a hybrid
epidemic which can pose serious threats to Internet security.
The Hybrid Epidemic (HE) Model
Here we introduce a model for hybrid epidemics in a metapopulation, which
consists of a number of subpopulations. Each subpopulation is a collection of
densely or strongly connected nodes, whereas nodes from different subpopulations are
weakly connected. As illustrated in Figure 1, our model
considers two spreading mechanisms: 1) local spreading where an infected node can
infect nodes in its subpopulation and 2) global spreading, where an infected node
can infect all nodes in the metapopulation. In our model each subpopulation for
local spreading can be either fully-mixed or a network. For mathematical
convenience, we describe each subpopulation as a network and represent a fully-mixed
subpopulation as a fully connected network. Note that our definition of
metapopulation is different from the classical metapopulation defined in ecology
where subpopulations are connected via flows of agents1619.
Figure 1
Hybrid epidemic spreading in a metapopulation.
At each time step, an infected node has a fixed total spreading effort which
must be allocated between local spreading and global spreading. The
proportion of spreading effort spent in local spreading is α and
that in global spreading is 1−α. Local spreading
occurs between infected and susceptible nodes that are connected in
individual subpopulations; global spreading happens between an infected node
and any susceptible node in the metapopulation.
Our model considers hybrid epidemics in which at each time step, an infected node has
a fixed total spreading effort which must be allocated between the two spreading
mechanisms. Let the hybrid tradeoff, α, represent the proportion
of spreading effort spent in local spreading. The proportion of global spreading
effort is 1−α. A tunable α enables us to
investigate the interaction and the joint impact of the two spreading mechanisms on
epidemic dynamics, ranging from a completely local spreading scenario (with
α = 1) to a completely global spreading scenario (with
α = 0). For example computer worms like Conficker5 and Code-Red6 can conduct both local and global probes
but the average total number of probes in a time unit is fixed.We consider the hybrid epidemic spreading in terms of the
Susceptible-Infected-Recovered (SIR) model125, where each node is
in one of three states: susceptible (s), infected (i), and
recovered (r). At each time step, each infected node spreads both locally
and globally; it infects 1) each directly connected nodes in the same subpopulation
with rate b1 = αβ1 and 2)
each susceptible node in the metapopulation with rate b2 =
(1−α)β2.
β1 is the local infection rate when all spreading
effort is local (α = 1), and β2 is
the global infection rate when all spreading effort is global (α =
0). Each infected node recovers at a rate γ, and then remains
permanently in the recovered state. A node can infect other nodes and then recover
in the same time step.
Hybrid Spreading In A Single-Population
Before we analyse hybrid spreading in a metapopulation, we study a relatively simple
case where the epidemic process takes place in a single population. That is,
there is only one population, where local spreading is via direct connections on a
network structure and global spreading can reach any node in the population.Here we extend the edge-based compartmental modelling system26 for the
analysis. The system26 was proposed to analyse single-mechanism based
epidemics for the continuous time case. Here we extend the system to analyse 1)
hybrid epidemics, and 2) for the discrete time case. We calculate the probability
that a random test node u is in each state: susceptible s(t),
infected i(t), and recovered r(t).We denote p(k) as the probability that a node has degree (i.e. number
of neighbours) k. The generating function27 of degree
distribution p(k) is defined as . Let p(k) represent the probability that a
random neighbour of u has k neighbours. We assume the network is
uncorrelated: the degrees of the two end nodes of each link are not
correlated (i.e. independent from each other)1. In an uncorrelated
network , where is the average degree of the network and .Let θ(t) be the probability that a random neighbour v
has not infected u through local spreading. Let ϑ(t)
be the probability that a random node w has not infected u through
global spreading. Suppose u has k neighbours, the probability that it
is susceptible is where n is the
total number of nodes in the population. Then by averaging
s(t) over all degrees, we have,The probability θ can be broken into three parts: v is
susceptible at t, φ; v is infected at
t but has not infected u through local spreading,
φ; v is recovered at t and has
not infected u through local spreading, φ.
Neighbour v can not be infected by u and itself, then . In a time step, neighbour v 1) infects
u with rate b1φ through
local spreading and 2) recovers without infecting u through local spreading
at rate , i.e. after every time step:
(1−θ) increases by
b1φ and
φ increases by
γ(1−b1)φ.
The increase rate of φ here,
γ(1−b1)φ,
is different from that (rφ) in the original
system26. Because the original system was designed for the
continuous time case, and in the discrete time case in this paper, neighbour
v can infect u and recover at the same time step. Given that
φ and 1−θ are
both approximately 0 in the beginning (t = 0), we have . ThenFor global spreading, the probability ϑ can also be broken into
three parts: w is susceptible at t, φ;
w is infected at t but has not infected u through global
spreading, φ; w is recovered at t but
has not infected u through global spreading, φ.
Using a similar derivation process, we have and , and When the epidemic stops spreading,
φ = 0 and φ = 0.
By setting φ = 0 in equation (2)
we get Substituting equation (4) and φ = 0 into equation (3), we have By
setting φ = 0 and substituting equation
(5) in equation (2) we have Then θ -
stationary value of θ is a fixed point of
f(θ).
Threshold Condition
f(θ) has a known fixed point of θ = 1
which represents no epidemic outbreak. We test the stability of this fixed point. By
substituting equation (2) and equation (5)
into dθ/dt =
−b1φ, setting
θ = 1+ε and take the leading order (Taylor
Series), we have dε/dt = εh and where , , and . Then ε = Ce
where c is a constant. When h is negative, gradually decreases and approaches 0 as t
increases; while when h is positive, gradually increases and approaches +∞ with the increase of
t. That is the fixed point θ = 1 turns from stable to
unstable when h changes from negative to positive. A more rigorous analysis
would need to consider the fact that when a small amount of disease is introduced,
the fixed point at θ = 1 is moved slightly. However, the stability
analysis we do here is sufficient to determine whether epidemics are possible for
arbitrarily small initial infections. Further details are in a separate study28. The threshold condition for an epidemic outbreak is then
h>0:This epidemic threshold represents an condition which, when not satisfied,
results in an epidemic that vanishes exponentially fast429. There
are two special cases.For completely local spreading (α = 1, b1 =
β1, b2 = 0), the threshold
reduces to . Here and where
is the average degree of the network and is the average degree square of the network1. In
the methods section, we show that this threshold agrees with previous
threshold results30 for single-mechanism epidemics spreading
on networks for the discrete time case. For infinite scale-free networks, we
have such that the threshold
‘vanishes’ (i.e. ∞>1 is always
satisfied), in agreement with previous observation31.For completely global spreading (α = 0,
b1 = 0, b2 =
β2), the threshold reduces to
β(n−3+γ)/γ>1,
and when n is large it is approximate to
β2n/γ>1.
β2n/γ is the basic
reproduction number, R0, for single-mechanism epidemics
spreading in a fully mixed population4. R0
is the average number of nodes that an infected node can infect before it
recovers. Thus the threshold is equivalent to
R0>1, in agreement with previous work4.
Final Outbreak Size
The final outbreak size, r, is the fraction of nodes
that are recovered when all epidemic activities cease, i.e. when all nodes are
either recovered or susceptible. When t→∞, the
probability that a node is infected i(t)→0. Thus and where the value of θ can be
numerically calculated by conducting the fixed-point iteration of equation (6). Equation (9) can be viewed as a
function of the hybrid epidemic parameters and the network degree distribution. To
be noted here, for completely global spreading (α = 0,
b1 = 0, b2 =
β2), θ can not
be calculated from equation (6) (because b1 =
0). In this case, θ = 1,
g(θ) = 1, and where
ϑ can be obtained by setting
φ = 0,
g0(θ) = 1 and solving the equation (3) in the rage
0<ϑ<1.
Evaluation
Numerical simulations were performed to verify the above theoretical predictions for
hybrid epidemics in a single population. We consider three topologies for local
spreading in the single-population: (1) a fully connected network which represents a
fully mixed population; (2) a random network with Poisson degree distribution, which
is generated by the Erds-Rényi
(ER) model31 with average degree 5; and (3) a scale-free network with
a power-law degree distribution , which is
generated by the configuration model1 with the minimum degree m = 3.
Each of these networks has 1000 nodes. At the beginning, 5 randomly selected nodes
are infected and all others are susceptible.We run simulations for different values of
α∈[0,1]. We set the global
infection rate β2 = 10−4 and
the recovery rate γ = 1 (i.e. an infected node only spreads the epidemic
in one time step). For epidemics on the fully connected network, the local infection
rate β1 = 6×10−3.
And for epidemics on the random and scale-free networks,
β1 = 0.8. Figure 2 shows that
the final outbreak size predicted by equation (9) is in close
agreement with simulation results. The hybrid epidemics on the random network and
the scale-free network exhibit similar outbreak sizes for large values of
α. It is also evident that the hybrid epidemic is
characterised by a phase change, where the threshold is well predicted by equation (8).
Figure 2
Theoretical predictions and simulation results for hybrid epidemics in a
single-population.
The final outbreak size r is shown as a
function of the hybrid tradeoff α. Three network
topologies are considered: (1) a fully connected network (i.e. fully mixed);
(2) a random network with an average degree of 5; (3) a scale-free network
with a power-law degree distribution which is generated by the configuration model1
with the minimum degree m = 3. The population has 1000 nodes. The
global infection rate β2 =
10−4 and recovery rate γ = 1
are the same for epidemics on these three types of networks. The local
infection rate β1 is
6×10−3 for epidemics on the fully
connected network; and it is 0.8 for epidemics on the random and scale-free
networks. Initially 5 random nodes are infected. Simulation results are
shown as points and theoretical predictions of equation
(9) are dashed curves. The simulation results are averaged over
1000 runs with bars showing the standard deviation. The epidemic threshold
values of α are predicted by equation
(8) and marked as vertical lines.
Hybrid Spreading In A Metapopulation
We now extend the above theoretical results for a single-population to analyse hybrid
spreading in a metapopulation which consists of a number of subpopulations. Local
infection happens only between nodes in the same subpopulation whereas global
infection occurs both within and between subpopulations.
Hybrid Spreading At The Population Level
We define a subpopulation as susceptible if it contains only susceptible nodes. A
subpopulation is infected if it has at least one infected node. A subpopulation is
recovered if it has at least one recovered node and all other nodes are susceptible.
Only global spreading enables infection between subpopulations, whereas spreading
within a subpopulation can occur via both local and global spreading.The final outbreak size at the population level R, is
defined as the proportion of subpopulations that are recovered when the epidemic
stops spreading. We define that a subpopulation A directly infects another
subpopulation B if an infected node in A infects a susceptible node in B. We define
the population reproduction number, R, as the average number of
other subpopulations that an infected subpopulation directly infects before it
recovers. Note that our definition of R is similar to proposed by Colizza et al.16
but the definition of a metapopulation16 is different. In the
simulations and theoretical analysis, we approximate
Ras the population reproduction number of the
initially infected subpopulation p0, i.e. the average number of
other subpopulations that p0 directly infects. This approximation
becomes exact when the metapopulation has infinite number of subpopulations each
with the same network structure. A metapopulation includes many subpopulations. In
order for an epidemic to spread in a metapopulation, an infected subpopulation
should infect at least one other subpopulation before it recovers, i.e. the
threshold condition of the hybrid epidemic at the population-level is
R>1.We conduct epidemic simulations on a metapopulation containing 500 subpopulations
each with 100 nodes. Two topologies for local spreading in each subpopulation are
considered: random network and scale-free network. Figure 3
shows simulation results of the final outbreak sizes r
and R and the population reproduction number
R (right y axis) as a function of the hybrid tradeoff
α. Epidemic parameter values are included in Figure 3’s legend. For both the random and scale-free
networks, all three functions show a bell shape curve regarding α.
It is clear that the epidemic will not cause any significant infection if it uses
only local spreading (α = 1) or only global spreading
(α = 0). For the random network, the maximal outbreak at the
node level is obtained around the optimal
hybrid tradeoff . That is, if 50% of the
infection events occur via local spreading (and the rest via global spreading), the
epidemic will ultimately infect 34% of all nodes in the metapopulation. At the
population level, the total percentage of recovered subpopulations
R follows a very similar trend to
r, and the maximum epidemic size in terms of
subpopulations occurs at the same optimal .
The population reproduction number R follows a similar trend to
the final outbreak sizes R and
r. The threshold
R>1 defines the range of α for which
the final outbreak sizes are significantly larger than zero.
Figure 3
Simulation results of hybrid epidemics in a metapopulation where (a) each
subpopulation is a random network and (b) each subpopulation is a scale-free
network.
Three quantities are shown as a function of the hybrid tradeoff
α, including the final outbreak size as the fraction
of recovered nodes r (squares); the final
outbreak size as the fraction of recovered subpopulations
R (circles); and the population
reproduction number, R (triangles, right y-axis). The
metapopulation contains 500 subpopulations each with 100 nodes. In (a) each
subpopulation is a random network with an average degree of 5; and In (b)
each subpopulation is a scale-free network with a power-law degree
distribution which is generated by
the configuration model1 with the minimum degree m =
3. The local infection rate β1 = 0.8, the
global infection rate β2 =
10−6 and the recovery rate γ
= 1. Initially 3 random nodes in a subpopulation are infected. Simulation
results are shown as points and each result is averaged over 1000 runs.
It is important to appreciate that although the maximal is uniquely defined by the optimal , other R values can be obtained by
two different α values, on either side of the optimal
, potentially representing different
epidemic dynamics. As the hybrid epidemic for random and scale-free networks exhibit
similar properties, for simplicity we only show results for the random network in
the following.
Prediction of the Population Reproduction Number
R
The population reproduction number R is a fundamental
characteristic of hybrid epidemics in a metapopulation. We consider a metapopulation
with N+1 subpopulations, which are denoted as p where
i = 0, 1, 2, …, N. Each subpopulation has n nodes
connected to a same structured local spreading network. p is the
subpopulation where the epidemic starts from.We assume the infection inside the initially infected subpopulation
p is all caused by infected nodes inside
p. That is, we neglect the effects of global spreading of other
N subpopulations on p0. This is an acceptable
assumption when the metapopulation has a larger number of subpopulations. Under
these conditions, hybrid spreading within p0 is the same as
spreading in a single-population, which has been analysed in previous sections. To
predict R, we first analyse the expected number of nodes outside
p0 that will be infected by p0. We then
estimate the number of other subpopulations that these infected nodes should belong
to. Let sN(t) represent the probability that a random test
node in other subpopulations are susceptible at time t. Using the same
parameters defined in the analysis about hybrid epidemics in a single population, we
have where n is the number of node
in p0.When p0 recovers at time T, the fraction of nodes in
other subpopulations that have been infected by (infected nodes in)
p0 (via global spreading) is where we have used equation (5).
Then the number of such infected nodes is where nN is the total number of nodes in other
N subpopulations and θ can be numerically
calculated as θ by fixed-point iteration of
equation (6). As the nodes are infected randomly via the
global spreading, the probability that an infected node does not belong to a
particular subpopulation i is 1−1/N; and the probability
that none of these infected nodes belongs to the subpopulation i is . So the probability that at least one
infected node belongs to the subpopulation i is . Thus the population reproduction number
R, which is the number of other subpopulations that these
infected nodes should belong to, is:Figure 4 compares the predicted R against
simulation results as a function of the hybrid tradeoff α.
R is characterised by a bell-shaped curve. It peaks at
the optimal hybrid tradeoff α* where the population reproduction
number achieves its maximal value . This
optimal point is of particular interest as it represents the optimal trade-off
between the two spreading mechanisms, where the hybrid epidemic is most infectious
and therefore has the most extensive outbreak.
Figure 4
The population reproduction number R as a function of the
hybrid tradeoff α.
Theoretical predictions from equation (11) are shown as
a dashed curve. Simulation results are shown as points (average over 1,000
runs) and bars (one standard deviation). The metapopulation and epidemic
parameters are the same as Figure 3a.
The Optimal Hybrid Tradeoff α* and the Maximal
We next investigated the maximum epidemic outbreak in the context of varying
infectivity and recovery rates. For a given set of epidemic variables, we calculate
the theoretical prediction of R as a function of
α using equation (11), and then we obtain
the optimal and the maximal . For ease of analysis, we fix the global
infection rate β2 at a small value of
10−6 and then focus on the local infection rate
β1 and the recovery rate γ.Figure 5a shows the optimal hybrid tradeoff as a function of
β1 and γ. For a given
γ, a larger β1 results in a
smaller . Intuitively this can be
understood as when the efficiency of local spread increases, less effort needs to be
devoted to this spreading mechanism, and more can be allocated to global spreading.
On the other hand, for a given β1, a larger
γ results in an increase in . When the recovery rate is higher, nodes remain
infectious for shorter times. In this case, in order to achieve the maximum epidemic
outbreak, more local infection is favoured, since this will allow an infected
subpopulation to remain infected for longer, and hence increase the probability of
infecting other subpopulations before it recovers. A plot of versus
β1/γ is shown in Figure 5c. The fitting on a log-log scale in the inset indicates the two
quantities have a power-law relationship, i.e. is determined by β1/γ.
This means the optimal hybrid tradeoff can
be predicted when β1/γ is known.
Figure 5
The estimated optimal hybrid tradeoff and the maximal population reproduction number for hybrid epidemics in a metapopulation.
(a) as a function of local
infection rate β1 and recovery rate
γ; (b) as a function of β1 and
γ; (c) as a function of
β1/γ, which is fitted by
a dash line of . (d)
Population reproduction number R as a function of
α and β1 with
γ = 0.1, where the points are the corresponding
optimal for given
β1. We fix β2
= 10−6 and the metapopulation is as in Figure 3a.
Figure 5b shows the maximal as a function of β1 and γ,
where the is obtained when the
corresponding value of in Figure 5a is used. is very
sensitive to the recovery rate γ. As γ approaches zero,
the value of increases dramatically (note
that uses a log-scale colour-map)
regardless of value of β1. This is in agreement with
the intuition that a low recovery rate will favour any type of epidemic spreading.
For a fixed γ, increases
with β1. An increased infection rate of local spreading
will obviously increase the reproductive number, if other parameters are kept
constant, but the effect is much smaller than that of changing the recovery rate,
because global spreading maintains the reproductive number when local spreading
falls to low values.Figure 5a shows a clear phase shift between areas where an
epidemic occurs (the coloured area) and areas where it does not (the white area
towards the top-left corner). Accordingly, the corresponding in Figure 5b in the area where no
epidemic occurs is very small. The boundary between the epidemic and non-epidemic
phase space is defined by the line . This
is the threshold for completely local spreading in a single-population: and
for the network topology used. Since the global infection rate
β2 is fixed at a small value, no major spreading
will occur either within or between subpopulations below this threshold.Figure 5d plots R as a function of
β1 and α on a log-log scale while fixing
γ = 0.1. For given values of β1, the
corresponding optimal are shown as points.
We can see that points always fall in the area of the maximal for the given β1. Each
point represents a local optimum. The global optimum, the largest possible value of
R, is obtained towards the bottom-right corner, where the
local infection rate is high but the epidemic spends most effort on global
spreading. Infection across subpopulations can only be achieved by global spreading.
Since global spreading has a low infection rate, the epidemic should spend most of
its time (or resource) on global spreading. There will be much less time spent on
local spreading but its infection rate is high anyway.
Discussion
Hybrid spreading, the propagation of infectious agents using two or more alternative
mechanisms, is a common feature of many real world epidemics. Widespread epidemics
(e.g. computer worms) typically spread efficiently by local spreading through
connections within a subpopulation, but also use global spreading to probe distant
targets usually with much lower infectivity. In many cases, the amount of resources
(e.g. time, energy or money) which an infectious agent can devote to each mode of
propagation is limited. This study focuses on the tradeoff between local and global
spreading, and the effect of this tradeoff on the outbreak of an epidemic.We develop a theoretical framework for investigating the relationships between
α, the relative weight given to each spreading mechanisms, and
the other epidemic properties. These properties include epidemic infectivity,
subpopulation structure, epidemic threshold, and population reproduction number. The
predictions of the theoretical model agree well with stochastic simulation results,
both in single populations and in metapopulations.Our analysis shows that epidemics spreading in a metapopulation may be critically
hybrid epidemics where a combination of the two spreading mechanisms is essential
for an outbreak and neither completely local spreading nor completely global
spreading can allow epidemics to propagate successfully.Our study reveals that, in metapopulations, there exists an optimal tradeoff between
global and local spreading, and provides a way to calculate this optimum given
information on other epidemic parameters. These results are supported by our recent
study32 on measurement data of the Internet worm Conficker53334.The above results are of practical relevance when the total amount of time or
capacity that is allocated to spreading is limited by some resource constraint. For
example, the total probing frequency of computer worms is often capped at a low rate
to prevent them from being detected by anti-virus software. Furthermore, other
epidemic parameters, such as local or global infection rates are difficult to change
because they derive from inherent properties of the infectious agent. For example it
would be difficult to increase the global infection rate of an Internet worm. The
tradeoff between different types of spreading therefore becomes a key parameter in
terms of design strategy, which can be manipulated to maximise outbreak size.The consideration of hybrid spreading mechanisms also has some interesting
implications for strategies for protecting against the spread of epidemics. It is
clear from both theoretical considerations and simulations that epidemics can spread
with extremely low global infection rates (far below individual recovery rates),
provided there is efficient local infection. Such conditions are common for both
cyber epidemics (as computers within infected local networks tend to be more
vulnerable to infection35) and in infectious disease epidemics, where
contacts between family or community members are often much closer and more frequent
than the overall population. Protection strategies which target local networks
collectively (for example intensive local vaccination around individual disease
incidents, as was used in the final stages of smallpox eradication36)
may therefore be a key element of future strategies to control future mixed
spreading epidemics.In conclusion, our study highlights the importance of the tradeoff between local and
global spreading, and manipulation of this tradeoff may provide a way to improve
strategies for spreading, but also a way to estimate the worst outcome (i.e. largest
outbreak) of hybrid epidemics that can pose serious threats to Internet
security.
Methods
Threshold for local spreading using Newman’s method
Here we use Newman’s method30 to obtain the threshold
condition for the local spreading. Firstly we need to calculate the
“transmissibility” T which is the average
probability that an epidemic is transmitted between two connected nodes, of
which one is infected and the other is susceptible. According to Newman30, for the discrete time case T can be calculated as where τ is the
time steps that an infected node remains infected, p(τ) and
p(β1) respectively are the probability
distribution of τ and β1. For the model
in this paper, β1 is a constant and , in which is the probability that an infected node has not
recovered until τ−1 steps after infection, and
γ is the probability that the node recovers at the
τth step after infection. Also for the model in this paper, each
infected node at least remains infected for 1 time step. So that T for
our model can be obtained as
According to Newman30 the epidemic threshold for completely local
spreading is i.e. . This is the same as the epidemic threshold for
completely local spreading obtained in this paper.Note that treating each edge as having this value of T independently will
lead to the correct epidemic threshold and final size calculation, but there are
further discussions on its correctness in calculating the infection
probabilities37383940.
Simulation settings
Random networks used in all simulations have a Poisson degree distribution and
they are generated by the Erds-Rényi (ER) model31 with the average degree of
5.Scale-free networks used in all simulations have a power-law degree distribution
and they are generated by the
configuration model1 with the minimum degree m = 3.Figure 2 - simulations in a single-population: •
Size of single-population: 1,000 nodes; • Single-population topology:
fully connected network, random network and scale-free network; •
Local infection rate: (except for
fully connected network β1 =
6×10−3); • Global infection
rate: β2 = 10−4;
• Recovery rate: γ = 1; • Initial condition:
all nodes are susceptible except 5 randomly-chosen nodes are infected;
• Number of simulation runs averaged for each data point: 1,000.Figure 3 - simulations in a metapopulation: •
Size of metapopulation: 500 subpopulations each with 100 nodes; •
Subpopulatin topology: random networks and scale-free networks; •
Local infection rate: β1 = 0.8; • Global
infection rate: β2 =
10−6; • Recovery rate: γ
= 1; • Initial condition: all nodes are susceptible except 3
randomly-chosen nodes are infected; • Number of simulation runs
averaged for each data point: 1,000.Figure 4 and 5 - theoretical
predictions about hybrid epidemics: Same as Figure 3
except only the random network topology is considered.
Author Contributions
C.Z., S.Z., I.J.C., and B.M.C. designed the study. C.Z and J.C.M. conducted the
mathematical modelling and derivation. C.Z. performed the computational analysis and
simulations. S.Z. and B.M.C. wrote the manuscript with contributions from C.Z. and
J.C.M. and I.J.C.