| Literature DB >> 25893260 |
Martin Ritchie1, Luc Berthouze2, Istvan Z Kiss3.
Abstract
Clustering is the propensity of nodes that share a common neighbour to be connected. It is ubiquitous in many networks but poses many modelling challenges. Clustering typically manifests itself by a higher than expected frequency of triangles, and this has led to the principle of constructing networks from such building blocks. This approach has been generalised to networks being constructed from a set of more exotic subgraphs. As long as these are fully connected, it is then possible to derive mean-field models that approximate epidemic dynamics well. However, there are virtually no results for non-fully connected subgraphs. In this paper, we provide a general and automated approach to deriving a set of ordinary differential equations, or mean-field model, that describes, to a high degree of accuracy, the expected values of system-level quantities, such as the prevalence of infection. Our approach offers a previously unattainable degree of control over the arrangement of subgraphs and network characteristics such as classical node degree, variance and clustering. The combination of these features makes it possible to generate families of networks with different subgraph compositions while keeping classical network metrics constant. Using our approach, we show that higher-order structure realised either through the introduction of loops of different sizes or by generating networks based on different subgraphs but with identical degree distribution and clustering, leads to non-negligible differences in epidemic dynamics.Entities:
Keywords: Epidemic; High-order structure; Motif; Network; Subgraph
Mesh:
Year: 2015 PMID: 25893260 PMCID: PMC4698307 DOI: 10.1007/s00285-015-0884-1
Source DB: PubMed Journal: J Math Biol ISSN: 0303-6812 Impact factor: 2.259
Fig. 1Subgraph notation and position labelling. Subgraphs are labelled by followed by a symbolic subscript for ease of reference
Fig. 2Graphical representation of . and denote the excess degree of a susceptible node and rate of infection, respectively. We note that newly infected nodes are modelled as previously susceptible nodes so the product is being used to model the expected number of edges infection will be able to spread along upon infecting a susceptible node. This product implicitly considers all possible routes of infection into the node. The left hand side of the figure shows example subgraphs that are the source of infection for the central node. The right hand side of the figure graphically represents the expected excess degree of subgraphs for the central node
Summary of the key system variables and their generation
| Variable | Description | Generation |
|---|---|---|
|
| PGF of the HDD given as a function, not as a series | A symbolic software package can be used to compute the Jacobian and Hessian |
|
| Survivor functions with their evolution equations given by ODEs | These ODEs can be defined within a single for loop, see Eq. ( |
|
| The prevalences of | From Eq. ( |
|
| Total rate of infection experienced by an | For a subgraph with |
|
| Expected prevalence of a subgraph in a given state | The equation for this is computed based on the rate matrix, |
Fig. 3Performance of other models. Lines, circles and squares correspond to simulation average, ODE solution and pairwise ODE solution, respectively. All networks are homogeneous with . The lower peaks correspond to networks generated with each node allocated one of each corner type of a with clustering . Data with higher peak correspond to networks generated with a single and two subgraphs yielding
Fig. 4Clustering and cycles. Solid lines and markers correspond to simulation average and ODE solution, respectively. From darkest to lightest, the solid lines correspond to: , , and , i.e., each network used has an identical degree distribution given by . Clustering is and for the and other networks, respectively. For clarity, ODE solutions for only the two extreme cases, the null and triangle network, have been included. Note that the output from the network composed of is close to that of the null case. Epidemics corresponding to cycles of length six have been computed but omitted due to their similarity to the null case. Only two ODE solutions have been included for upper and lower cases
Fig. 5Clustering via differing subgraphs. Solid lines and markers correspond to simulation average and ODE solution, respectively. From darkest to lightest the solid lines correspond to: ; , ; , and , , where and denote complete pentagon and hexagon subgraphs, respectively. The networks were generated so that , and . The downward trend of peak prevalence corresponds to networks composed of complete subgraphs of increasing size. The larger subgraphs lead to more connections within the group rather than to the rest of the network