| Literature DB >> 18364313 |
Michael J Tildesley1, Rob Deardon, Nicholas J Savill, Paul R Bessell, Stephen P Brooks, Mark E J Woolhouse, Bryan T Grenfell, Matt J Keeling.
Abstract
Since 2001 models of the spread of foot-and-mouth disease, supported by the data from the UK epidemic, have been expounded as some of the best examples of problem-driven epidemic models. These claims are generally based on a comparison between model results and epidemic data at fairly coarse spatio-temporal resolution. Here, we focus on a comparison between model and data at the individual farm level, assessing the potential of the model to predict the infectious status of farms in both the short and long terms. Although the accuracy with which the model predicts farms reporting infection is between 5 and 15%, these low levels are attributable to the expected level of variation between epidemics, and are comparable to the agreement between two independent model simulations. By contrast, while the accuracy of predicting culls is higher (20-30%), this is lower than expected from the comparison between model epidemics. These results generally support the contention that the type of the model used in 2001 was a reliable representation of the epidemic process, but highlight the difficulties of predicting the complex human response, in terms of control strategies to the perceived epidemic risk.Entities:
Mesh:
Year: 2008 PMID: 18364313 PMCID: PMC2376304 DOI: 10.1098/rspb.2008.0006
Source DB: PubMed Journal: Proc Biol Sci ISSN: 0962-8452 Impact factor: 5.349
Values for the seven epidemiological factors for Cumbria, Devon, the rest of England, Wales and Scotland.
| parameter | Cumbria | Devon | rest of England | Wales | Scotland |
|---|---|---|---|---|---|
| 5.7 | 4.9 | 2.3 | 0.7 | 10.2 | |
| 8.3 | 11.0 | 23.2 | 36.3 | 28.2 | |
| 8.2 | 5.8 | 8.2 | 30.1 | 23.2 | |
| 0.20 | 0.40 | 0.30 | 0.43 | 0.33 | |
| 0.41 | 0.37 | 0.42 | 0.31 | 0.23 | |
| 0.49 | 0.42 | 0.37 | 0.22 | 0.40 | |
| 0.42 | 0.37 | 0.44 | 0.25 | 0.20 |
Figure 1Graph showing the log likelihood of correctly predicting the status of all farms in a one-week interval for varying start dates. Likelihoods are calculated independently for each farm, from the results of multiple stochastic simulations. Farms are defined as being in the correct class if they are infected or culled (or simply remain susceptible) in both the model and the 2001 data in a given one-week prediction interval. The inset shows the log likelihood against the total number of reported and culled farms for each starting point of the simulations—we note that the log likelihood increases linearly with the number of reported and culled farms.
The mean value (and 2.5 and 97.5 percentiles) for the matrix of nine variables that record the number of farms in a particular state in the 2001 data and in the simulated model outbreaks. (Due to the large number of simulations involved the CIs for the mean are very small; therefore percentiles are quotes such that 95% of the simulations lie within the given range. The diagonal elements give the total number of farms whose status is correctly predicted by the model.)
| data | model | ||
|---|---|---|---|
| reported | culled | susceptible | |
| reported | 230 (193–269) | 733 (666–795) | 995 (920–1081) |
| culled | 519 (436–604) | 1962 (1738–2167) | 5703 (5438–5986) |
| susceptible | 1323 (977–1699) | 4977 (3785–6318) | 171 982 (170 293–173 513) |
Figure 2Model and data comparison for the entire country. (a) The daily number of farms that report infection (black) and farms that were culled (grey), together with the timings of national control measures for the 2001 epidemic. (b) Similar results from a single replicate model simulation, starting with the conditions on 23 February 2001. (c–h) Accuracy (solid lines) and associated repeatability (dashed lines) results (together with 95% CIs) for various time intervals and various farm types. If t is the time on the x-axis, the accuracy results are (c) accuracyall(23 February, t), (d) accuracyall(t, t+14), (e) accuracyreported(23 February, t), (f) accuracyreported(t, t+14), (g) accuracyculls(23 February, t), (h) accuracyculls(t, t+14). At least 2500 simulations were used to determine each data point. Regional plots, for Cumbria, Devon, the rest of England, Wales and Scotland, are shown in the electronic supplementary material.
Odds ratios for reported farms over the entire epidemic. (Again, owing to the small CIs about the mean, 2.5 and 97.5 percentiles are quoted such that 95% of the simulations lie within the given range. Results are given for the whole of Great Britain and for the five regions.)
| region | odds ratio (model–data) | odds ratio (model–model) |
|---|---|---|
| whole GB | 13.41 (10.77–16.66) | 15.61 (12.05–20.16) |
| Cumbria | 1.91 (1.51–2.42) | 1.93 (1.51–2.53) |
| Devon | 4.02 (1.41–8.84) | 4.86 (1.62–15.85) |
| rest of England | 9.01 (5.33–13.61) | 17.64 (11.37–28.32) |
| Wales | 5.66 (1.78–20.22) | 11.39 (2.83–62.78) |
| Scotland | 26.06 (15.91–40.79) | 27.42 (16.20–45.43) |
Mean odds ratios for reported farms over a two-week interval averaged over different start dates. (The maximum and minimum odds ratios correspond to average values at specific start dates and therefore capture the variation across the course of the epidemic and not between epidemic simulations. Again results are given for the whole of Great Britain and for the five regions.)
| region | model–data odds ratio | model–model odds ratio | ||
|---|---|---|---|---|
| mean | (min–max) | mean | (min–max) | |
| whole GB | 208.39 | (15.25–998.67) | 322.68 | (18.89–1437.01) |
| Cumbria | 13.79 | (2.01–83.15) | 18.53 | (2.49–119.27) |
| Devon | 13.41 | (0.46–129.72) | 43.95 | (4.72–381.64) |
| rest of England | 188.26 | (5.59–1458.62) | 281.93 | (30.35–1143.65) |
| Wales | 93.54 | (0.48–968.54) | 171.79 | (13.92–709.96) |
| Scotland | 83.48 | (2.54–735.64) | 848.73 | (23.10–1101.47) |
Figure 3Results of multiple simulations of the entire epidemic for the whole of Great Britain. (a) The distribution of farms reporting infection in proportion p of simulations. This distribution is partitioned into those farms reporting infection in 2001 (grey) and those not (white). (b) Comparable distributions for culled farms again partitioned into those farms culled in 2001 (grey) and those not (white). In graphs (c,d), we define a threshold proportion Pc, such that only those farms reporting infection in more than Pc simulations are identified as likely to report infection in the 2001 epidemic. (c) The number of correctly identified reports (grey) and the number of false positives and false negatives (hatched lines). (d) The number of false positives and false negatives (hatched lines) relative to the number of correctly identified reports. (Results are from 250 replicate simulations.)