| Literature DB >> 26343696 |
Ali Arab1.
Abstract
Epidemiological data often include excess zeros. This is particularly the case for data on rare conditions, diseases that are not common in specific areas or specific time periods, and conditions and diseases that are hard to detect or on the rise. In this paper, we provide a review of methods for modeling data with excess zeros with focus on count data, namely hurdle and zero-inflated models, and discuss extensions of these models to data with spatial and spatio-temporal dependence structures. We consider a Bayesian hierarchical framework to implement spatial and spatio-temporal models for data with excess zeros. We further review current implementation methods and computational tools. Finally, we provide a case study on five-year counts of confirmed cases of Lyme disease in Illinois at the county level.Entities:
Keywords: Bayesian analysis; Integrated Nested Laplace Approximation (INLA); hierarchical modeling; hurdle models; spatial models; spatio-temporal models; zero-inflated models
Mesh:
Year: 2015 PMID: 26343696 PMCID: PMC4586626 DOI: 10.3390/ijerph120910536
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Histogram (a) and map (b) of the total number of confirmed cases of Lyme disease in Illinois by county for the 5–year interval 2007–2011.
Figure 2INLA mesh for the study region, blue line represents the border for the state of Illinois, and red dots represent the coordinates for the county seats.
Model selection results based on DIC values.
| Model | DIC | |
|---|---|---|
| Spatial Poisson Hurdle | 404 | 42.94 |
| Spatial Zero-Inflated Poisson | 360 | 48.54 |
| Spatial Poisson Hurdle with Probability Model | 380 | 44.84 |
| Spatial Negative Binomial Hurdle | 459 | 11.85 |
| Spatial Zero-Inflated Negative Binomial | 420 | 11.53 |
| Spatial Neg. Bin. Hurdle with Probability Model | 435 | 13.84 |
Model results for the spatial Poisson hurdle model (with regression for probability).
| Coefficient | Mean | Standard Deviation | 95% |
|---|---|---|---|
| Intercept | −3.2931 | 1.6830 | (−6.6478, −0.0008) |
| Elevation | 0.0051 | 0.0019 | (0.0014, 0.0089) |
| Population per square mile | −0.0007 | 0.0056 | (−0.0120, 0.0102) |
| Intercept | 7.4643 | 1.8093 | (4.1338, 11.2494) |
| Elevation | −0.0097 | 0.0022 | (−0.0143, −0.0056) |
| Population per square mile | −0.0025 | 0.0086 | (−0.0196, 0.0143) |
Figure 3Spatial fields (a) posterior mean, (b) posterior standard deviation.