| Literature DB >> 28620609 |
Anke Hüls1,2, Cornelia Frömke3, Katja Ickstadt1, Katja Hille3, Johanna Hering3, Christiane von Münchhausen3, Maria Hartmann3, Lothar Kreienbrock3.
Abstract
Antimicrobial resistance in livestock is a matter of general concern. To develop hygiene measures and methods for resistance prevention and control, epidemiological studies on a population level are needed to detect factors associated with antimicrobial resistance in livestock holdings. In general, regression models are used to describe these relationships between environmental factors and resistance outcome. Besides the study design, the correlation structures of the different outcomes of antibiotic resistance and structural zero measurements on the resistance outcome as well as on the exposure side are challenges for the epidemiological model building process. The use of appropriate regression models that acknowledge these complexities is essential to assure valid epidemiological interpretations. The aims of this paper are (i) to explain the model building process comparing several competing models for count data (negative binomial model, quasi-Poisson model, zero-inflated model, and hurdle model) and (ii) to compare these models using data from a cross-sectional study on antibiotic resistance in animal husbandry. These goals are essential to evaluate which model is most suitable to identify potential prevention measures. The dataset used as an example in our analyses was generated initially to study the prevalence and associated factors for the appearance of cefotaxime-resistant Escherichia coli in 48 German fattening pig farms. For each farm, the outcome was the count of samples with resistant bacteria. There was almost no overdispersion and only moderate evidence of excess zeros in the data. Our analyses show that it is essential to evaluate regression models in studies analyzing the relationship between environmental factors and antibiotic resistances in livestock. After model comparison based on evaluation of model predictions, Akaike information criterion, and Pearson residuals, here the hurdle model was judged to be the most appropriate model.Entities:
Keywords: Poisson regression; hurdle model; model selection; overdispersion; underdispersion; veterinary epidemiology; zero inflation
Year: 2017 PMID: 28620609 PMCID: PMC5449455 DOI: 10.3389/fvets.2017.00071
Source DB: PubMed Journal: Front Vet Sci ISSN: 2297-1769
Figure 1The histogram illustrates simulated count data from a Poisson distribution (A), with underdispersion (B), and with overdispersion (C) and count data with zero inflation (D) motivated by the real data example for antibiotic resistances in 48 German fattening pig farms.
Description of variables included in the count models (Poisson, quasi-Poisson, and negative binomial model) and in the count-part of the zero-inflated Poisson and the hurdle model.
| Farms with fattening pigs (%) | |
|---|---|
| 48 | |
| Moving single pigs, | 39 (81.25) |
| Seperate pen for diseased pigs, | 32 (66.67) |
| Use of purchased feed only, | 11 (22.92) |
| Water birds in 1 km radius of farm, | 17 (35.42) |
| Disinfection of livestock trail | |
| Never, | 8 (16.67) |
| After housing out, | 33 (68.75) |
| Less frequent than after housing out, | 7 (14.58) |
| Disinfection with chlorine, | 8 (16.67) |
| Number of fattening pigs | |
| 0 to ≤1,000, | 20 (41.67) |
| >1,000 to ≤1,500, | 14 (29.17) |
| >1,500, | 14 (29.17) |
.
.
Regression estimates*, SEs and .
| Poisson | Quasi-Poisson | Negative binomial | ZIP | Hurdle | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| exp( | SE | exp( | SE | exp( | SE | exp( | SE | exp( | SE | |||||||
| Count model/Count-part | Intercept | 0.82 | 0.34 | 0.548 | 0.82 | 0.42 | 0.632 | 0.46 | 0.45 | 0.095 | 2.18 | 0.73 | 0.290 | 2.61 | 0.41 | |
| Moving single pigs | 1.76 | 0.22 | 1.76 | 0.27 | 1.83 | 0.31 | 0.061 | 1.72 | 0.28 | 0.056 | 1.73 | 0.24 | ||||
| Separate pen for diseased pigs | 1.74 | 0.20 | 1.74 | 0.25 | 1.85 | 0.30 | 1.50 | 0.26 | 0.127 | 1.44 | 0.21 | 0.084 | ||||
| Use of purchased feed only | 0.56 | 0.19 | 0.56 | 0.24 | 0.49 | 0.29 | 0.64 | 0.24 | 0.072 | 0.65 | 0.21 | |||||
| Water birds in 1 km radius of farm | 1.24 | 0.16 | 0.177 | 1.24 | 0.20 | 0.285 | 1.44 | 0.26 | 0.170 | 1.10 | 0.17 | 0.576 | 1.09 | 0.16 | 0.592 | |
| Disinfection of livestock trail after housing out | 2.21 | 0.24 | 2.21 | 0.30 | 3.01 | 0.33 | 1.15 | 0.45 | 0.761 | 0.99 | 0.26 | 0.977 | ||||
| Disinfection of livestock trail less frequent than after housing out | 1.64 | 0.29 | 0.086 | 1.64 | 0.36 | 0.176 | 2.46 | 0.44 | 0.95 | 0.46 | 0.914 | 0.83 | 0.31 | 0.555 | ||
| Disinfection with chlorine | 1.38 | 0.19 | 0.090 | 1.38 | 0.24 | 0.182 | 1.72 | 0.31 | 0.089 | 1.24 | 0.21 | 0.307 | 1.20 | 0.19 | 0.318 | |
| Number of fattening pigs (>1,000 to ≤1,500) | 1.07 | 0.17 | 0.697 | 1.07 | 0.22 | 0.756 | 1.19 | 0.29 | 0.547 | 0.93 | 0.18 | 0.696 | 0.93 | 0.18 | 0.700 | |
| Number of fattening pigs (>1,500) | 2.01 | 0.18 | 2.01 | 0.22 | 2.83 | 0.29 | 1.47 | 0.24 | 0.104 | 1.42 | 0.20 | 0.082 | ||||
| Zero-part | Intercept | 2.81 | 1.28 | 0.419 | 3.32 | 1.18 | 0.309 | |||||||||
| Separate pen for diseased pigs | 0.09 | 1.48 | 0.097 | 0.07 | 1.27 | |||||||||||
| Use of purchased feed only | 6.32 | 2.16 | 0.393 | 5.04 | 1.33 | 0.223 | ||||||||||
| Water birds in 1 km radius of farm | 0.10 | 2.6 | 0.382 | 0.16 | 1.55 | 0.240 | ||||||||||
| Disinfection with chlorine | 0.00 | 7,210.56 | 0.998 | 0.00 | 5,847.27 | 0.998 | ||||||||||
| Number of fattening pigs (>1,000 to ≤1,500) | 0.09 | 2.24 | 0.287 | 0.12 | 1.62 | 0.199 | ||||||||||
| Number of fattening pigs (>1,500) | 0.02 | 3.86 | 0.331 | 0.04 | 1.66 | 0.056 | ||||||||||
| Akaike information criterion (AIC) | 231.6385 | 231.6385 | 265.2818 | 233.2511 | 233.0947 | |||||||||||
The variables in the zero-part were chosen following a backward selection based on the AIC of the hurdle model.
*Interpretation of estimates in the count model/count-part: a one unit change in the predictor variable is associated with a .
.
Bold: p-values that passed the significance threshold (.
Figure 2Flow chart for model building processes for count data.
Figure 3The histogram illustrates the observed count data. The lines show the predictions from the different regression models [Poisson, negative binomial, zero-inflated Poisson, hurdle model (Poisson)].
Figure 4Residual plots (Pearson residuals against the fitted values) of the four models for count data in our study.
Sensitivity analysis: the variables in the zero-part were chosen following a backward selection based on the Akaike information criterion (AIC) of the zero-inflated Poisson (ZIP) model instead of the hurdle model (compare Table .
| ZIP | Hurdle | ||||||
|---|---|---|---|---|---|---|---|
| exp( | SE | exp( | SE | ||||
| Count-part | Intercept | 1.54 | 0.38 | 0.264 | 2.61 | 0.41 | |
| Moving single pigs | 1.82 | 0.23 | 1.73 | 0.24 | |||
| Separate pen for diseased pigs | 1.66 | 0.20 | 1.44 | 0.21 | 0.084 | ||
| Use of purchased feed only | 0.62 | 0.20 | 0.65 | 0.21 | |||
| Water birds in 1 km radius of farm | 1.10 | 0.16 | 0.537 | 1.09 | 0.16 | 0.592 | |
| Disinfection of livestock trail after housing out | 1.39 | 0.27 | 0.229 | 0.99 | 0.26 | 0.977 | |
| Disinfection of livestock trail less frequent than after housing out | 1.13 | 0.32 | 0.703 | 0.83 | 0.31 | 0.555 | |
| Disinfection with chlorine | 1.30 | 0.19 | 0.158 | 1.20 | 0.19 | 0.318 | |
| Number of fattening pigs (>1,000 to ≤1,500) | 0.94 | 0.18 | 0.712 | 0.93 | 0.18 | 0.700 | |
| Number of fattening pigs (>1,500) | 1.59 | 0.19 | 1.42 | 0.20 | 0.082 | ||
| Zero-part | Intercept | 0.38 | 0.68 | 0.151 | 0.40 | 0.58 | 0.114 |
| Use of purchased feed only | >1,000 | >1,000 | 0.993 | 3.63 | 1.14 | 0.259 | |
| Water birds in 1 km radius of farm | 0.00 | >1,000 | 1.000 | 0.19 | 1.26 | 0.185 | |
| Number of fattening pigs (>1,000 to ≤1,500) | 0.00 | >1,000 | 1.000 | 0.22 | 1.20 | 0.214 | |
| Number of fattening pigs (>1,500) | 0.00 | >1,000 | 0.988 | 0.15 | 1.30 | 0.148 | |
| 229.91 | 236.71 | ||||||
Estimates*, SEs, and .
*Interpretation of estimates in the count model/count-part: a one unit change in the predictor variable is associated with a .
.
Bold: p-values that passed the significance threshold (.