| Literature DB >> 20640032 |
Xu-Sheng Zhang1, Margo E Chase-Topping, Iain J McKendrick, Nicholas J Savill, Mark E J Woolhouse.
Abstract
Identifying risk factors for the presence of Escherichia coli O157 infection on cattle farms is important for understanding the epidemiology of this zoonotic infection in its main reservoir and for informing the design of interventions to reduce the public health risk. Here, we use data from a large-scale field study carried out in Scotland to fit both "SIS"-type dynamical models and statistical risk factor models. By comparing the fit (assessed using maximum likelihood) of different dynamical models we are able to identify the most parsimonious model (using the AIC statistic) and compare it with the model suggested by risk factor analysis. Both approaches identify 2 key risk factors: the movement of cattle onto the farm and the number of cattle on the farm. There was no evidence for a role of other livestock species or seasonality, or of significant risk of local spread. However, the most parsimonious dynamical model does predict that farms can infect other farms through routes other than cattle movement, and that there is a nonlinear relationship between the force of infection and the number of infected farms. An important prediction from the most parsimonious model is that although only approximately 20% farms may harbour E. coli O157 infection at any given time approximately 80% may harbour infection at some point during the course of a year.Entities:
Mesh:
Year: 2010 PMID: 20640032 PMCID: PMC2890141 DOI: 10.1016/j.epidem.2010.02.001
Source DB: PubMed Journal: Epidemics ISSN: 1878-0067 Impact factor: 4.396
Fig. 1(a) distribution of farm sizes. (b) number of cattle moved between Scottish livestock farms each month from January 2002 to December 2004. (c) distances moved and (d) number of cattle moved per movement event from the same data set as (b).
Fig. 2(a) geographical distribution of the 461 farms sampled in Scotland between February 2002 and February 2004. Red dots indicate positive farms. The prevalence is estimated as 18.9% (exact binomial 95% CI = 15.5 to 22.7). (b) temporal variation in prevalence of E. coli O157 infection. Black is the estimated prevalence for a given month and red dotted lines represent exact binomial 95% confidence intervals.
Fig. 3The relationship between variation in negative log-likelihood l and the number of samples used for model fitting. The basic model is used with a = b = 1.0 and β = 10−8, γ = 0.011 per day per farm. 12 replicates were used and the mean and standard deviation of log-likelihood calculated from these. When the number of samples is small, the values of the estimated log-likelihood fluctuate widely among different replicates. With increasing number of samples, they converge quickly. When the number of samples is greater than or equal to 1000, the values of log-likelihood are distributed across a small range, with a standard deviation of order 0.5.
Fig. 4The ROC curve for the comparison between IPRAVE samples and prediction of the basic stochastic model with best fit parameters (see Table 1). The X- and Y- axes are false positive rate and the true positive rate, respectively.
Comparison between models. The model variants are listed in the ascending order of their AIC values. l is the natural log of likelihood calculated using Eq. (6), Iprev is the prevalence of infection on 461 IPRAVE farms, AIC is the Akaike Information Criterion (Eq. (7)) and w1/w is the evidence ratio (Eq. (8)). OR is the odds ratio of the model prediction (Eq. (9)).
| Model variants | Additional parameter | AIC§ | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Basic model | 3.38e-4 | 3.50e-2 | 0.270 | 0.230 | – | 213.0 | 19.4 | 434.0 | 1.00 | 1.38 |
| Density-independent | 2.40e-3 | 3.54e-2 | 0.249 | 0.0 | – | 214.2 | 19.8 | 434.4 | 1.25 | 1.26 |
| Imperfect detection | 1.27e-4 | 2.32e-2 | 0.437 | 0.188 | – | 213.4 | 19.7 | 434.8 | 1.48 | 1.48 |
| Imperfect sensitivity | 2.91e-4 | 3.35e-2 | 0.317 | 0.212 | – | 213.7 | 19.4 | 435.4 | 2.01 | 1.32 |
| Pig | 1.26e-4 | 2.82e–2 | 0.476 | 0.176 | 212.9 | 18.2 | 435.8 | 2.48 | 1.51 | |
| Seasonality | 2.72e-4 | 2.84e-2 | 0.292 | 0.196 | 213.1 | 19.2 | 436.2 | 3.00 | 1.42 | |
| Sheep | 1.95e-4 | 2.94e-2 | 0.396 | 0.189 | 213.6 | 20.1 | 437.2 | 4.95 | 1.43 | |
| Density-dependent | 1.48e-6 | 5.14e-2 | 0.248 | 1.0 | – | 215.9 | 17.2 | 437.7 | 6.55 | 1.23 |
| No herd size effect | 3.15e-4 | 4.38e-2 | 0.0 | 0.437 | – | 217.0 | 19.5 | 440.0 | 20.7 | 1.25 |
| 2.28e-4 | 0.112 | 0.187 | 0.329 | 215.1 | 18.8 | 440.3 | 23.1 | 1.33 | ||
| No movement | 1.73e-4 | 3.16e-2 | 0.379 | 0.246 | – | 217.2 | 18.2 | 442.4 | 68.0 | 1.20 |
| Local spread | 3.32e-5 | 1.94e-2 | 0.87 | – | α = 0.5 | 262.6 | 22.6 | 531.2 | 1.32e+21 | 1.71 |
| 9.25e-5 | 1.89e-2 | 0.901 | – | α = 1.0 | 272.8 | 19.6 | 551.5 | 3.34e+25 | 1.65 | |
| 2.36e-3 | 1.20e-2 | 0.459 | – | α = 2.0 | 277.1 | 23.8 | 560.2 | 2.61e+27 | 1.42 |
#For model variant Seasonality, the value of β listed in this column is βc for December to April.
§The standard errors in estimates of −l and AIC are approximately 0.15 and 0.30 respectively (Fig. 3).
†The prevalence of infection for the whole system is about 2–3% lower than Iprev.
Fig. 5The distribution of Cohen's κ values between different sampling periods as predicted by the most parsimonious simulation model. The modal value is 0.01. The observed value of κ for the SEERAD and IPRAVE surveys is 0.077 (indicated by the arrow).
Results of the GLMM statistical models of risk factors for the presence of E. coli O157 on the 461 IPRAVE farms. Overall OR gives empirical estimate of odds ratio for the entire model.
| Predictors | Estimate | SE | Overall OR | |
|---|---|---|---|---|
| Movement within 4 weeks | 0.664 | 0.288 | 0.0214 | |
| Number of cattle | 0.785 | 0.322 | 0.0151 | |
| 1.297 (0.379) |
Mean (SD).
Log10 transformed.