Literature DB >> 19535416

Plug-and-play inference for disease dynamics: measles in large and small populations as a case study.

Daihai He¹, Edward L Ionides, Aaron A King.

Abstract

Statistical inference for mechanistic models of partially observed dynamic systems is an active area of research. Most existing inference methods place substantial restrictions upon the form of models that can be fitted and hence upon the nature of the scientific hypotheses that can be entertained and the data that can be used to evaluate them. In contrast, the so-called plug-and-play methods require only simulations from a model and are thus free of such restrictions. We show the utility of the plug-and-play approach in the context of an investigation of measles transmission dynamics. Our novel methodology enables us to ask and answer questions that previous analyses have been unable to address. Specifically, we demonstrate that plug-and-play methods permit the development of a modelling and inference framework applicable to data from both large and small populations. We thereby obtain novel insights into the nature of heterogeneity in mixing and comment on the importance of including extra-demographic stochasticity as a means of dealing with environmental stochasticity and model misspecification. Our approach is readily applicable to many other epidemiological and ecological systems.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2009 PMID： 19535416 PMCID： PMC2842609 DOI： 10.1098/rsif.2009.0151

Source DB: PubMed Journal: J R Soc Interface ISSN： 1742-5662 Impact factor: 4.118

Introduction

The ability to forecast, understand and control the spread of infectious diseases increasingly depends on the capacity to formulate and test mathematical models capturing key mechanisms. Accordingly, there has been a great deal of recent interest in techniques that allow one to fit mechanistic models to time-series data (Gibson & Renshaw 1998; Finkenstädt & Grenfell 2000; Bjørnstad & Grenfell 2001; Gibson & Renshaw 2001; Grenfell et al. 2001, 2002; Roberts & Stramer 2001; Bjørnstad ; Finkenstädt ; Grenfell ; Keeling & Rohani 2002; Clark & Bjørnstad 2004; Koelle & Pascual 2004; Streftaris & Gibson 2004; Beskos ; Cook ; Cauchemez & Ferguson 2008; Cauchemez ; Ferrari ; Keeling & Ross 2008). Recent advances in statistical algorithms and computational hardware have made it possible to tailor statistical methodology directly to questions of scientific interest and have expanded the range of models that can be confronted with data. However, the nonlinear stochastic dynamical models arising in the study of infectious disease dynamics have proved relatively recalcitrant. Even the simplest useful models of disease transmission are highly nonlinear, and stochasticity cannot be ignored even in the most predictable of disease systems. Complicating the picture further is the ubiquitous presence of non-stationarity, measurement error and unobserved (hidden, or latent) variables. Finally, models of interest are typically formulated in continuous time, whereas data are sampled at discrete and sometimes irregular intervals. The large body of methodological literature devoted to the topic of inference for these partially observed stochastic dynamic systems is testament to its position as an important and unresolved challenge at the interface of the fields of ecology, epidemiology, mathematics and statistics. As general approaches, all of the methods mentioned earlier face severe limitations. Some must modify the model, discretizing time to the scale of observations (Finkenstädt & Grenfell 2000; Grenfell et al. 2001, 2002; Bjørnstad ; Finkenstädt ; Clark & Bjørnstad 2004; Koelle & Pascual 2004). They rely upon fortuitous congruence between some time scale of the process under study (e.g. infection generation time) and that of the sampling interval. These methods are computationally cheap, but can lead to severely biased conclusions (Glass ). Others rely on approximations that are valid only in restricted circumstances. For example, Cauchemez & Ferguson (2008) adopt a diffusion approximation to enable successful implementation of a Markov chain Monte Carlo approach; this approximation breaks down in small populations. On the other hand, straightforward Bayesian approaches have been demonstrated for small populations with purely demographic stochasticity (Gibson & Renshaw 1998, 2001); however, these methods break down for large populations (Roberts & Stramer 2001; Beskos ) or when environmental stochasticity is non-negligible (Bretó ). Finally, exact methods based on numerical solution of the so-called master equations (Keeling & Ross 2008) are available, but limited to low-dimensional models and small populations. In summary, each of these approaches makes demands upon the form of the model and/or the nature of the data and thus necessitates scientifically irrelevant or even inappropriate assumptions that can interfere with the hypotheses of interest. In this paper, we use a relatively new approach with none of the aforementioned limitations (Ionides et al. 2006, 2007, 2009; King ; Bretó ). In particular, the method works on continuous time models, in large or small populations, with demographic or environmental stochasticity or both. It has the important advantage of operation based only on simulation from the dynamic model, without a need for explicit expressions for transition probabilities. The latter property has been called plug-and-play (Bretó ). Plug-and-play inference methodology provides a powerful tool to enable carrying out data analysis via flexible, scientifically motivated classes of models. In particular, plug-and-play is extremely useful when one wishes to entertain multiple working hypotheses translated into multiple mechanistic models. Two properties analogous to plug-and-play have been considered in the contexts of optimization and complex system theory. These analogies suggest that plug-and-play is a subject whose importance extends beyond our present topic of inference for partially observed stochastic dynamic systems. The conceptual commonality is that all these methodologies can be applied to models specified as ‘black-box’ computer codes. This enables flexible model development and therefore avoids confounding methodological issues with scientific hypotheses. It also facilitates both the comparison of alternative explanations and the transfer of methodological expertise between different systems. The price of plug-and-play is primarily in terms of computational effort: when properties of a model are available in closed form, computationally more efficient methods typically exist. As computational capabilities continue to grow, the ease with which new scientific questions can be asked and answered via plug-and-play methodology will make such techniques increasingly attractive. In optimization theory, methods requiring only evaluation of the objective function have been called gradient-free (Spall 2003). In such methods, computer code to evaluate the objective function can be plugged into the optimizer. This is particularly useful in situations in which the objective function is a complex, and possibly stochastic, function for which the analytic calculation of derivatives is difficult or impossible (Kocsis & Szepesvári 2006). In the analysis of complex systems, methods that are based solely on simulations from computer codes representing a model have been called equation-free (Kevrekidis ). A typical goal of such investigations is to determine the relationship between macroscopic phenomena (e.g. phase transitions) and microscopic phenomena (e.g. molecular interactions). We illustrate our new approach via a case study of measles transmission dynamics. A focus on this well-studied disease allows us to compare our conclusions with those derived previously by other means. Further, we demonstrate that our analysis leads to fresh insights not readily available via previous analyses. Measles has occupied a central place in mathematical epidemiology owing to its relatively straightforward diagnosis, the lifelong immunity following infection and the availability of high-quality case report data. Models that assume mass-action transmission on the scale of communities do remarkably well at reproducing the disease dynamics, particularly in large populations (Bailey 1955; Bartlett 1960; Fine & Clarkson 1982), although recent work has suggested that deviations from mass action are detectable in time-series data (Bjørnstad ). Moreover, models incorporating purely demographic stochasticity have been deemed adequate to reproduce observed dynamical patterns. We revisit these questions by fitting a family of models that incorporate deviations from the mass-action assumption together with both environmental and demographic stochasticity. We fit these models to time-series data from both large and small populations. A comparison of results across community sizes indicates that there is little evidence in the data for deviations from mass-action transmission, at least as it has been modelled previously. On the other hand, our results do suggest a role for heterogeneity in transmission as a function of population size. With respect to the nature of stochasticity in this system, our results indicate a clear role for extra-demographic variability. Indeed, they suggest that failure to allow for extra-demographic variability may lead to severe biases in estimates of key parameters. These insights are possible because of the flexibility of the inference machinery, which also makes our approach more readily applicable to other, less exhaustively analysed, systems (e.g. King ).

Methods: inference machinery, data and models

Consider a time series y1: ≡ y1, … ,y, consisting of N observations made at times t1, … ,t. A stochastic model for y1: implies a joint density f(y1:|θ) given a vector of unknown parameters θ, where θ is in ℝ. Corresponding conditional densities for y given y1: are written as . Via the factorization , the log-likelihood function is given by . Sequential Monte Carlo (Doucet ; Arulampalam ; Cappé ) provides a standard method to obtain the log likelihood for stochastic dynamic systems. In our terminology, stochastic dynamic system is synonymous with partially observed Markov process (Ionides ; Bretó ); in the statistical literature, these systems are known as state-space models (Shumway & Stoffer 2006). Likelihood evaluation via sequential Monte Carlo has the feature that only simulations of sample paths are required; one need not have explicit forms for transition probabilities. That is, it has the plug-and-play property. Both Bayesian (Liu & West 2001; Toni ) and frequentist (Ionides ; Poyiadjis ) approaches to plug-and-play likelihood-based inference via sequential Monte Carlo have been proposed. We adopt the maximum-likelihood approach of Ionides et al. (2006, 2009), which enables statistically efficient inference for general nonlinear stochastic dynamical systems. In contrast, the approximations developed by Liu & West (2001) are not statistically efficient, and lead to additional bias and variance in the resulting parameter estimates (Storvik 2002; Newman ). The popularity of the artificial parameter evolution method of Liu & West (2001) may be due more to the convenience of its plug-and-play property than to its statistical properties. Various refinements and variations in the vein of the Bayesian methodology of Liu & West (2001) have been studied (Polson and references therein); however, these typically do not posses the plug-and-play property. We used the implementation of iterated filtering contained in the software package Pomp (King ) written for the R statistical computing environment (R Development Core Team 2006). This implementation follows the algorithm described by King , supplementary information therein). Heuristics and diagnostics, together with a discussion of computational issues, can be found in Ionides ; a theoretical treatment is given by Ionides . For complex dynamical systems, not all questions that one might wish to ask will be clearly answered by the available data. Typically, a question posed of the dynamic system is formalized as a hypothesis about a combination of parameters in a model for the system. If the likelihood function does not discriminate well between different values of some parameter combination, this parameter combination is said to be poorly identified. In such a case, one might proceed by making additional assumptions: adding constraints typically improves parameter identifiability at the cost of potential bias in the estimates of other parameters. In §3, we focus on questions the data are able to answer without further assumptions. A basic tool for this is the profile likelihood function, in which one parameter is varied over a range of values, while the likelihood is maximized over all remaining parameters (Barndorff-Nielsen & Cox 1994; Hilborn & Mangel 1997). In practice, we work with a smoothed estimate of the likelihood surface calculated via Monte Carlo (Ionides 2005). The data we analyse consist of weekly measles case reports on 20 representative communities with population sizes ranging from 2000 to 3 000 000. The towns and cities were selected from 954 urban locations for which case reports have been compiled (Grenfell & Bolker 1998); specifically, we chose the largest 10 cities by population and sampled 10 more towns at random. Figure 1 shows the case reports and annual birth rates for two of these: London and Hastings. These data and their limitations have been discussed previously (Fine & Clarkson 1982; Bjørnstad ). In particular, although measles became a notifiable disease in England and Wales in 1940, we follow Fine & Clarkson (1982) by starting our analysis in 1950 to avoid the disruptions caused by the second world war and the transition period following the introduction of the National Health Service in 1948. From 1950 onwards, it is believed that reporting was carried out fairly consistently, with between half and two-thirds of all cases being counted.

Figure 1.

Weekly reported measles cases (solid line) and annual birth rate (dotted line) for (a) London and (b) Hastings.

Weekly reported measles cases (solid line) and annual birth rate (dotted line) for (a) London and (b) Hastings. A basic susceptible–exposed–infectious–recovered (SEIR) model for measles divides the population into those individuals susceptible to infection, exposed (i.e. infected but not yet infectious), infectious and removed (i.e. quarantined or recovered and subsequently immune). The model is specified by describing the rates at which individuals move between compartments (Jacquez 1996; Matis & Kiffe 2000); figure 2 shows the transitions for the SEIR model of measles. A diagram such as figure 2 can be unambiguously interpreted as a deterministic system, a system with demographic stochasticity or a system with both demographic and extra-demographic stochasticity (Bretó ). We give the equations for each of these interpretations in appendix A. Here, we will employ the stochastic interpretation, with both demographic and extra-demographic stochasticity, described in box 1 (in appendix A). The transition rates in a model such as figure 2 may, in general, depend on the state of the system and/or covariates. We specify the force of infection as

Figure 2.

Flow diagram for measles. The population is divided into four compartments: S, susceptible; E, exposed and infected but not yet infectious; I, infectious; R, recovered and immune. Births enter S at the rate μBS(t), and all individuals have a mortality rate μSD = μED = μID = μRD = m. The case reports, C, count infected individuals with probability ρ. Since diagnosed cases are treated with bed-rest and hence removed, infections are counted upon transition to R.

Here β(t) is the transmission rate; ι the mean number of infectives visiting the population at any given time; α a mixing parameter, with α = 1 corresponding to homogeneous mixing (Liu ; Bjørnstad ), and N(t) the population size, treated as known via interpolation from census data. Since transmission rates are closely linked to contact rates among children, which are higher during school terms (Schenzle 1984; Keeling & Grenfell 2002; Bauch & Earn 2003; Conlan & Grenfell 2007), we assume that β(t) reflects the pattern of British school terms and holidays. Specifically, we take where p = 0.759 is the proportion of the year taken up by the school term, β̄ the mean transmission rate and a the relative effect of school holidays on transmission. For ease of interpretation, it is sometimes convenient to reparameterize β̄ in terms of the annual average basic reproductive ratio, R0, defined as the expected number of secondary infections engendered by an infective introduced into a fully susceptible population. When the duration of infection is much shorter than life expectancy, R0 ≈ β̄/μIR; here we employ a modification of this formula, given in appendix A. Flow diagram for measles. The population is divided into four compartments: S, susceptible; E, exposed and infected but not yet infectious; I, infectious; R, recovered and immune. Births enter S at the rate μBS(t), and all individuals have a mortality rate μSD = μED = μID = μRD = m. The case reports, C, count infected individuals with probability ρ. Since diagnosed cases are treated with bed-rest and hence removed, infections are counted upon transition to R. A novel feature of our model is that we add white noise with intensity σSE to the transmission process. Some empirical support for the choice of white (i.e. temporally uncorrelated) noise is given by Lande . This variability in the rates allows the possibility of over-dispersion (McCullagh & Nelder 1989; Bretó ). It is discussed at more length, together with a full specification of all the model equations, in appendix A. The other transition rates are specified as follows: μEI is the rate at which exposed individuals become infectious, thus μEI−1 is the mean latent period; μIR is the recovery rate; μSD = μED = μID = μRD = m denotes a constant per-capita death rate, ignoring the negligible effect of disease-induced mortality; μBS(t) is the per-capita rate of recruitment of susceptibles, depending on the birth rate b(t), which is assumed to be known via interpolation from birth records. Births enter the susceptible class with a delay τ corresponding to the age at which children enter the high-risk school-age population. We also consider a cohort-entry effect that reflects the fact that a large cohort of first-year students—the majority of them serologically naive—enters the schools each autumn (Schenzle 1984; He 2006). Specifically, a fraction c of recruits into the susceptible class enter on the school admission day, and the remaining fraction (1–c) enter the susceptible class continuously. The cohort effect provides a parsimonious and mechanistically plausible alternative to previous suggestions that the transmission rate β(t) may increase following the start of the academic year (Fine & Clarkson 1982; Grenfell ). The reporting process is taken to be on over-dispersed binomial, with reporting rate ρ and overdispersion parameter ψ (appendix A, equation (A 6)). Quantities appearing in the model are summarized in table 1.

Table 1.

List of symbols used in the paper.

symbol	description	units	reference
μ_ij	per-capita rate of transition from compartment i to j	yr⁻¹	figure 2, box 1
σ_ij	white-noise intensity on i → j transition rate	yr^1/2	box 1
LP	latent period	day	equation (A 7)
IP	infectious period	day	equation (A 7)
ι	mean number of visiting infectives	—	equation (2.1)
α	mixing exponent	—	equation (2.1)
β(t)	transmission rate	yr⁻¹	equations (2.1) and (2.2)
N(t)	population size	—	equation 2.1
a	amplitude of seasonality	—	equation 2.2
b(t)	birth rate	yr⁻¹	equation (A 5)
c	cohort entry fraction	—	equation (A 5)
τ	recruitment delay	yr	equation (A 5)
m	mortality rate μ_jD for j ∈ {S, E, I, R}	yr⁻¹	figure 2
ρ	reporting probability	—	equation (A 6)
ψ	reporting overdispersion	—	equation (A 6)
R₀	basic reproductive ratio	—	—

List of symbols used in the paper.

Results

Maximum-likelihood parameter estimates for our sample of 20 communities are presented in table 2. These estimates are for the model in §2 with no constraints placed on the parameters. The variation in the parameter estimates between populations has two sources: it may represent differences in the population dynamics of measles between communities, or it may be due to weak identifiability of some combinations of parameters. We disentangle these two explanations via a detailed investigation of two of them, London and Hastings. However, table 2 does reveal some consistent patterns that are robust across the population samples and are therefore not strongly affected by identifiability issues. These patterns are valuable for indicating model features about which the data under investigation are informative without the incorporation of additional assumptions based on prior analyses.

Table 2.

	N	α	σ_SE	R₀	LP	IP	a	ι	c	ρ	ψ
Halesworth	2	0.95	0.075	33	7.87	2.28	0.38	0.01	0.55	0.75	0.64
Lees	4	0.97	0.060	28	7.28	1.93	0.16	0.03	0.62	0.60	0.69
Mold	6	1.04	0.054	21	5.93	1.78	0.27	0.01	0.44	0.13	2.87
Dalton	11	0.99	0.078	28	5.48	1.98	0.20	0.04	0.42	0.46	0.82
Oswestry	11	1.04	0.070	53	10.29	2.72	0.34	0.03	0.26	0.63	0.48
Northwich	18	0.93	0.069	23	8.07	2.55	0.30	0.05	0.39	0.78	0.40
Bedwellty	29	0.94	0.061	25	6.82	3.03	0.16	0.04	0.35	0.31	0.95
Consett	39	1.01	0.071	36	9.07	2.66	0.20	0.07	0.31	0.65	0.41
Hastings	66	1.00	0.096	34	7.00	5.44	0.30	0.19	0.33	0.70	0.40
Cardiff	245	1.00	0.054	34	9.86	3.09	0.22	0.14	0.27	0.60	0.27
Bradford	294	1.00	0.053	36	10.80	3.22	0.25	0.27	0.30	0.60	0.19
Hull	302	0.97	0.064	39	9.18	5.46	0.22	0.14	0.27	0.58	0.26
Nottingham	307	0.98	0.038	23	5.72	3.69	0.16	0.17	0.34	0.61	0.26
Bristol	443	1.01	0.039	27	6.19	4.94	0.20	0.44	0.34	0.63	0.20
Leeds	510	1.00	0.078	48	9.48	10.92	0.27	1.25	0.59	0.67	0.17
Sheffield	515	1.02	0.043	33	7.23	6.38	0.31	0.85	0.23	0.65	0.17
Manchester	704	0.97	0.055	33	11.11	6.94	0.29	0.59	0.36	0.55	0.16
Liverpool	802	0.98	0.053	48	7.90	9.80	0.30	0.26	0.19	0.49	0.14
Birmingham	1118	1.02	0.056	35	11.23	7.90	0.31	1.09	0.61	0.56	0.19
London	3390	0.98	0.088	57	13.14	12.51	0.55	2.90	0.56	0.49	0.12
r		0.14	−0.27	0.44	0.46*	0.94***	0.26	0.94***	−0.16	−0.16	−0.93***

Point estimates for 20 representative communities. N is 1950 population in thousands; LP and IP are the latent and infectious periods, respectively, in days, calculated from μEI and μIR as described in equation (A 7) of appendix A; other parameters and their units are as specified in table 1; the parameters τ = 4 yr and m = 0.02 yr−1 were not estimated. Maximized log-likelihood values are presented in the electronic supplementary material. The bottom row gives the Spearman rank correlation of the columns with N with * denoting significance at the 5 per cent level, ** the 1 per cent level and *** the 0.1 per cent level. In table 2, the homogeneity parameter α is consistently found to be close to 1. At first glance, this may suggest that there is little evidence for inhomogeneity of transmission in the data. Profile likelihood plots over α for London and Hastings are presented in figure 3. We see that the statistical uncertainty in the estimates for these two places is comparable to the geographic variability in the point estimates in table 2 (mean ± s.d. of 0.99 ± 0.03). This finding confirms the suggestion by Glass that such exponents may be artefacts of the time discretization employed in earlier approaches (Bjørnstad ; Grenfell ). We can conclude that the phenomenon of inhomogeneous mixing—inasmuch as it plays a dynamic role—is best captured in some other way.

Figure 3.

Profile likelihood analysis of the mixing parameter, α, for (a) London and (b) Hastings. The solid black lines show the estimated profile log likelihood, derived from the Monte Carlo point estimates shown as circles. The dashed lines construct approximately 95 per cent CIs of (0.93, 1.03) for London and of (0.85, 1.03) for Hastings. The imported infection parameter ι is well identified in these data. There is a strong log-linear relationship between ι and N, consistent with the earlier findings of Bjørnstad and with the results obtained by Xia , who analysed a much more extensive dataset using a gravity model. The evidence for such a relationship is stronger here than in a previous empirical investigation (Finkenstädt , fig. 6). The extra-demographic stochasticity parameter σSE is consistently found to be around 0.06 (0.063 ± 0.015). The profile likelihood plots in figure 4 also indicate that this parameter is fairly well estimable, and the data strongly argue against the choice σSE = 0. Figure 4 suggests that a role for extra-demographic stochasticity is not only strongly indicated for large cities such as London but also clearly evidenced in smaller populations for which one might expect demographic stochasticity to dominate. One interpretation of this is that models with purely demographic stochasticity (i.e. all discrete-population continuous-time dynamic models that have been previously fitted to disease data) fail to capture an extra-demographic source of variability that has an important dynamic role. A priori exclusion of the possibility of such additional variability can lead to spurious confidence in the conclusions of the analysis (McCullagh & Nelder 1989) and to biased parameter estimates. Figure 4 shows that, in the absence of extra-demographic stochasticity, estimates of the latent and infectious periods are substantially lower when one allows for such stochasticity. This appears to arise because short transition times lead to fewer individuals in compartments E and I on average and therefore to a larger dynamic role for demographic stochasticity. In other words, the model compensates for the lack of extra-demographic stochasticity by attempting to increase the intensity of demographic stochasticity.

Figure 4.

Profile analysis of extrademographic variability, σSE, for (a) London and (b) Hastings. The solid black curve shows the estimated profile log likelihood, derived from the Monte Carlo point estimates shown as circles. The dashed lines construct approximately 95 per cent CIs for σSE. Corresponding profile estimates of the mean IP and mean LP are shown with dotted and dashed curves, respectively, with units given on the right-hand axis. The figure is calculated with the constraint α = 1. Dotted lines, IP; dashed lines, LP. Each parameter of the model has an interpretation in terms of the basic biology of transmission and infection at the level of individual hosts. To the extent that the model is a faithful description of the population-level processes, one expects that parameter estimates should agree closely with estimates based on individual-level observations. When discrepancies occur, they are evidence for model misspecification or oversimplification. Strikingly, table 2 shows a strong log-linear relationship between population size and infectious period and, to a lesser extent, latent period. Since there is no evidence that the natural history of measles varies with population size, this relationship is indicative of such oversimplification. Figure 1 suggests that this simple model fails to account adequately for heterogeneity in transmission. Specifically, the model calls for short infectious periods and high transmissibility β to reproduce the explosive epidemics and interepidemic fadeouts typical of small communities. The data from large populations, in contrast, represent the aggregate of many local epidemics, in each of the many schools and local communities contained in a large conurbation. In these aggregated data, fadeouts are rare and the epidemic curve is generally shallower. The oversimple model can reproduce these features only by extending the infectious period and reducing transmissibility. Moreover, while the infectious period estimated in the small populations (2–4 days) is consistent with evidence from household studies (Hope Simpson 1948, 1952; Bailey 1956), the longer infectious periods estimated in the large populations are not. Previous studies have noted that deterministic versions of the mass-action model analysed here do a remarkably good job of qualitatively reproducing the dynamics of measles in large communities (Earn ). Our analysis shows, however, that when both continuous time dynamics and stochasticity are taken into account, data from small populations afford a less-distorted view of the disease dynamics, at least when viewed through the lens of relatively simple models. The basic reproductive ratio of measles is conventionally held to lie between 14 and 18 (Anderson & May 1991). The values of R0 in table 2 are variable but consistently high relative to previous estimates (mean ± s.d. of 34.7 ± 10.1 over 20 communities). Profile likelihoods over R0 for London and Hastings (see the electronic supplementary material) yield approximately 95 per cent confidence intervals (CIs) of (37, 60) and (28, 74), respectively. Earlier empirical work on transmission dynamics (Bjørnstad ) also led to a relatively high value of R0 = 29.9 for London. Bjørnstad were cautious about the interpretation of this result because of concerns about the effects of the discrete-time methods employed. We can more confidently say that these higher values of R0 are a genuine feature of the continuous-time dynamics, at least under the assumption of homogeneous mixing and exponentially distributed latent and infectious periods. Changing the distribution of the latter to a more general gamma distribution (Keeling & Grenfell 1997; Lloyd 2001; Wearing ) decreases the estimated 95 per cent CI of R0 to (25, 41) for London (see the electronic supplementary material, figure S6). Though this is a substantial quantitative change, it does not alter the qualitative conclusion that homogeneous population models require high values of R0 in order to be consistent with the data.

Discussion and conclusion

A major goal of biomathematical modelling is to seek conceptual simplicity in the face of biological complexity. In general, the questions of interest should dictate the form and complexity of the model used to address them. In the present context in particular, the appropriate degree of aggregation over population inhomogeneities (such as spatial, socio-economic, genetic and age variation) may depend on the goals of the investigation (Levin ; May 2004). For measles, analyses based on models of homogeneous mixing populations have proved useful in the study of seasonality (Fine & Clarkson 1982; Bjørnstad ; Ferrari ) and effects of climate drivers (Lima 2009). Metapopulations of weakly coupled homogeneously mixing populations are central to current understanding of spatio-temporal disease dynamics (Grenfell ; Xia ). In addition, such models can yield insight into the fundamental predictability of disease outbreaks (Stone ) and have been found adequate to predict the effects of variation in birth rate and some dynamical features associated with the institution of vaccination programmes (Earn ). Homogeneous mixing models were also the basis of early work on local extinctions and vaccination strategies (Bartlett 1957), though age structure and infectious period distribution may have substantial roles to play in these questions (Schenzle 1984; Keeling & Grenfell 1997; Lloyd 2001; Conlan et al. submitted). In particular, estimates of the parameter R0 are known to be highly sensitive to assumptions on age structure (Wallinga ). Interpretations of homogeneous mixing models must be made in the context of their limitations. Clinical and household studies of infectious diseases yield information regarding the transmission and progression of infection on the scale of individuals or families (Hope Simpson 1948, 1952; Bailey 1956). Such studies are complemented by data on disease prevalence or incidence on larger spatial scales. The recent development of likelihood-based approaches capable of dealing with the full spectrum of unavoidable complexities (Cauchemez & Ferguson 2008; Cauchemez ; King )—unobserved variables, measurement error, process noise, nonlinearity, non-stationarity and covariates—means that we can now view a disease from the population point of view with something like the same clarity that we have for many years been able to view individual infections. Specifically, although we have long understood how to describe the population dynamics of infectious diseases using mathematical models, we have only recently gained the ability to fit these models to data using statistically sound methods. When biological quantities estimated in this way agree with those estimated from clinical or household studies, it can be interpreted as confirmation of the assumptions embodied in the model. Disagreement between the small- and large-scale points of view, however, raises interesting scientific questions. In the present study, we find broad agreement with previous studies concerning many of the model's parameters. On the other hand, we find significant departures with respect to three important quantities—R0, the infectious period and the intensity of extra-demographic stochasticity. The quantity R0 is central in epidemiological theory because it has interpretations in terms of so many quantities of interest, including mean age of first infection, mean susceptible fraction, exponential-phase epidemic growth rate and vaccination coverage required for eradication. It is important to realize that the conjunction of these interpretations occurs only in the context of very simple models. Simple models necessarily lack flexibility. In reality, these interpretations diverge owing to heterogeneities in age, spatial location, host genetics, etc. It is therefore unrealistic to expect that estimates of R0 (or any other single quantity) derived from fitting a simple model to one sort of data should agree with estimates derived from other data sources. Rather, the key question should be: which biological interpretations are relevant in the context of the data used to inform the model? From a statistical point of view, this corresponds to the question: to what features of the data are the parameter estimates sensitive? Our estimate of the basic reproductive ratio, R0, like those of earlier studies focusing on aggregated case-count time series, is somewhat greater than estimates based on serological surveys and age-stratified incidence data. It is possible that this reflects the sensitivity of the time-series-derived estimates of R0 to peak incidence. Specifically, peak incidence strongly influences the estimates of both R0 and reporting rate ρ, but since ρ is well identified by other features of the data (namely, the long-run cumulative incidence), we suspect that the model requires a high R0 to match the observed peak epidemic case counts. Alternatively, the high R0 estimates may reflect the sensitivity of estimated R0 to early phase epidemic growth. If this is in fact the case, our estimates of R0 may more accurately reflect contact rates among the core group of school-age children than they do those of the population at large. In this case, the model is effectively extrapolating these rates to the adult population, about which these data have little direct information since so few adult cases occur. In contrast, from this point of view, the interpretation of R0 in terms of its definition as mean number of secondary infections in a fully susceptible population is hopelessly extrapolated. Likewise, the interpretation of R0 in terms of mean age of first infection is unjustified, since it necessarily depends on the age structure of transmission, which is not part of the model. Perhaps the most direct information pertaining to mean age at first infection in these data comes from the lag between changes in birth rate and their subsequent effects on incidence. This lag is explicitly captured in our model by the delay, τ, between birth and recruitment into the susceptible pool. Since the comparatively high estimate of R0 does not appear to be a mere artefact of time discretization, the question remains as to why the population-level data suggest a more communicable disease than do the individual-level data. Existing estimates of R0 are sensitive to assumptions about the age structure of transmission that have not yet been fully resolved (Wallinga ). Perhaps transmission in schools is relatively more effectual than within households. Alternatively, it may be that this discrepancy points to a need for a better description of infectious- and/or latent-period distribution, of the disease's age structure or both. In our results, the departure of the estimated latent and infectious periods from plausible values obtained from household studies grows with city size, and so the most likely explanation for this correlation is the spatial heterogeneity of transmission within towns and cities. The latter, we have shown, cannot usefully be captured via the simple device of an exponent in the transmission term. More detailed explorations of disaggregated data and/or models with explicit spatial structure have the potential to shed light on this question. Finally, the strong evidence in favour of extra-demographic stochasticity raises the question of precisely why such stochasticity aids in the explanation of the data. To what extent does this finding indicate the presence of genuine environmental stochasticity? To what extent does it indicate model misspecification? To address these questions, again, analysis based on more detailed models is called for. In the case of measles, there are sufficient data to entertain models featuring spatial inhomogeneity (Grenfell ; Xia ; Bjørnstad & Grenfell 2008), age structure inhomogeneity (Schenzle 1984; Bolker & Grenfell 1996; Keeling & Grenfell 1997) or both (Bolker & Grenfell 1995). Although it has been demonstrated that such models can do a good job of accounting for gross features of the data, there has been less emphasis on requiring these models to account for all features of the data. Progress on inference methodology for parameter estimation from measles time series has emphasized SEIR-type models for homogeneous populations (Ellner ; Bjørnstad ; Cauchemez & Ferguson 2008; Keeling & Ross 2008). The study of age structure and spatial effects for measles has placed less emphasis on inferring parameters from data (one exception is Xia ). This may be partly because of the additional difficulties of inference for such systems and partly because models based on homogeneous mixing do an impressive job of describing key dynamic features (Earn ; Grenfell ). Regardless of one's view of the importance of paying further attention to population inhomogeneities, there is a natural methodological question: how applicable are the techniques presented here for larger, more complex models? There is a computational price to pay for the convenience of plug-and-play statistical methods. For the results in table 2, one application of the plug-and-play-iterated filtering inference procedure (based on 50 iterations, with a Monte Carlo size of 104) implemented via the Pomp package in R (R Development Core Team 2006; King ) took 5 h to run on a desktop machine. Computational effort scales roughly linearly with the number of parameters plus the number of state variables, so one can see that only a modest amount of additional structure could be included without hitting computational limitations. To increase computational efficiency, however, the iterated filtering algorithm can be implemented in a non-plug-and-play mode, in which the filter is either tailored to the particular model or takes advantage of available analytic properties of the transition densities. Such extensions, which would be required for much larger models, are a topic for future research. One can seek inspiration for future possibilities from numerical climate models, for which filtering operations (the computationally intensive step in the iterated filtering algorithm) have been carried out on systems with 5 × 106 state variables (Anderson & Collins 2007). Filtering in such high-dimensional situations requires the development and evaluation of appropriate approximations for the system under investigation. In this article, we have carried out the first scientific investigation based on a new framework for continuous-time, discrete-state population dynamics with both demographic and extra-demographic noise (probabilistic and statistical properties of this model class were investigated by Bretó . Extra-demographic stochasticity (interpreted as noise in the rates of a discrete population Markov population model) is equivalent to the possibility of multiple individuals moving simultaneously between compartments (Bretó ), and, as such, may be due to social events that affect the behaviour of many individuals (e.g. sporting events) or events that change disease transmissibility, such as variations in temperature and humidity. Variability in the rates gives the model additional flexibility that can also describe model misspecification. Analogously, when carrying out linear regression, it is customary to fit a line to data while understanding that the variation of the data around the line corresponds to unknown and unmodelled deterministic effects as well as to random fluctuations. For linear regression, one typically treats both these sources of uncertainty equally, and certainly all the usual standard errors and test statistics do not discriminate between them. We maintain that the same approach can be applied to dynamic models; in other words, the distinction between model misspecification and process stochasticity should be noted, though it will not usually affect the subsequent analysis. The term extra-demographic stochasticity encompasses all sources of variability beyond the intrinsic demographic stochasticity that would be present in a homogeneous population. There are many circumstances in which such variability can be expected to be important. In particular, the variability of rates in our new framework offers an approach to modelling superspreading events (Lloyd-Smith ). These events occur when variability between individuals, environmental effects or an interaction between the two results in a highly skewed distribution for the number of secondary cases caused by an index case. Superspreading has been documented in measles, but is of greater dynamic importance in other diseases such as severe acute respiratory syndrome (Lloyd-Smith ). Conventional population models amenable to non-plug-and-play statistical analyses have been unable to include such effects readily. This is, therefore, one more example in which the flexibility of plug-and-play methodology holds the potential to encourage the development of scientifically appropriate models.

40 in total

1. Realistic distributions of infectious periods in epidemic models: changing patterns of persistence and dynamics.

Authors: A L Lloyd
Journal: Theor Popul Biol Date: 2001-08 Impact factor: 1.570

2. Destabilization of epidemic models with the inclusion of realistic distributions of infectious periods.

Authors: A L Lloyd
Journal: Proc Biol Sci Date: 2001-05-07 Impact factor: 5.349

3. Inference for nonlinear dynamical systems.

Authors: E L Ionides; C Bretó; A A King
Journal: Proc Natl Acad Sci U S A Date: 2006-11-22 Impact factor: 11.205

4. Estimation of multiple transmission rates for epidemics in heterogeneous populations.

Authors: Alex R Cook; Wilfred Otten; Glenn Marion; Gavin J Gibson; Christopher A Gilligan
Journal: Proc Natl Acad Sci U S A Date: 2007-12-11 Impact factor: 11.205

5. The period of transmission in certain epidemic diseases; an observational method for its discovery.

Authors: R E H SIMPSON
Journal: Lancet Date: 1948-11-13 Impact factor: 79.321

6. Nonlinear population dynamics: models, experiments and data.

Authors: J M Cushing; R F Costantino; B Dennis; R A Desharnais; S M Henson
Journal: J Theor Biol Date: 1998-09-07 Impact factor: 2.691

7. Impact of vaccination on the spatial correlation and persistence of measles dynamics.

Authors: B M Bolker; B T Grenfell
Journal: Proc Natl Acad Sci U S A Date: 1996-10-29 Impact factor: 11.205

8. Influence of nonlinear incidence rates upon the behavior of SIRS epidemiological models.

Authors: W M Liu; S A Levin; Y Iwasa
Journal: J Math Biol Date: 1986 Impact factor: 2.259

9. Disease extinction and community size: modeling the persistence of measles.

Authors: M J Keeling; B T Grenfell
Journal: Science Date: 1997-01-03 Impact factor: 47.728

10. Estimating the impact of school closure on influenza transmission from Sentinel data.

Authors: Simon Cauchemez; Alain-Jacques Valleron; Pierre-Yves Boëlle; Antoine Flahault; Neil M Ferguson
Journal: Nature Date: 2008-04-10 Impact factor: 49.962

98 in total

1. Estimation of measles vaccine efficacy and critical vaccination coverage in a highly vaccinated population.

Authors: Michiel van Boven; Mirjam Kretzschmar; Jacco Wallinga; Philip D O'Neill; Ole Wichmann; Susan Hahné
Journal: J R Soc Interface Date: 2010-04-14 Impact factor: 4.118

2. A dynamic dose-response model to account for exposure patterns in risk assessment: a case study in inhalation anthrax.

Authors: Bryan T Mayer; James S Koopman; Edward L Ionides; Josep M Pujol; Joseph N S Eisenberg
Journal: J R Soc Interface Date: 2010-11-10 Impact factor: 4.118

Plug-and-play inference for disease dynamics: measles in large and small populations as a case study.

Introduction

Methods: inference machinery, data and models

Results

Discussion and conclusion

1. Realistic distributions of infectious periods in epidemic models: changing patterns of persistence and dynamics.

2. Destabilization of epidemic models with the inclusion of realistic distributions of infectious periods.

3. Inference for nonlinear dynamical systems.

4. Estimation of multiple transmission rates for epidemics in heterogeneous populations.

5. The period of transmission in certain epidemic diseases; an observational method for its discovery.

6. Nonlinear population dynamics: models, experiments and data.

7. Impact of vaccination on the spatial correlation and persistence of measles dynamics.

8. Influence of nonlinear incidence rates upon the behavior of SIRS epidemiological models.

9. Disease extinction and community size: modeling the persistence of measles.

10. Estimating the impact of school closure on influenza transmission from Sentinel data.

1. Estimation of measles vaccine efficacy and critical vaccination coverage in a highly vaccinated population.

2. A dynamic dose-response model to account for exposure patterns in risk assessment: a case study in inhalation anthrax.

3. Parameterizing state-space models for infectious disease dynamics by generalized profiling: measles in Ontario.

4. Inferring the causes of the three waves of the 1918 influenza pandemic in England and Wales.

5. Resolving the impact of waiting time distributions on the persistence of measles.

6. Decreasing stochasticity through enhanced seasonality in measles epidemics.

7. A second-order iterated smoothing algorithm.

8. Modeling and inference for infectious disease dynamics: a likelihood-based approach.

9. Quantifying the consequences of measles-induced immune modulation for whooping cough epidemiology.

10. Forcing versus feedback: epidemic malaria and monsoon rains in northwest India.