| Literature DB >> 36168515 |
Fedelis Mutiso1, John L Pearce2, Sara E Benjamin-Neelon3,4, Noel T Mueller5,6, Hong Li1, Brian Neelon1,7.
Abstract
Overdispersed count data arise commonly in disease mapping and infectious disease studies. Typically, the level of overdispersion is assumed to be constant over time and space. In some applications, however, this assumption is violated, and in such cases, it is necessary to model the dispersion as a function of time and space in order to obtain valid inferences. Motivated by a study examining spatiotemporal patterns in COVID-19 incidence, we develop a Bayesian negative binomial model that accounts for heterogeneity in both the incidence rate and degree of overdispersion. To fully capture the heterogeneity in the data, we introduce region-level covariates, smooth temporal effects, and spatially correlated random effects in both the mean and dispersion components of the model. The random effects are assigned bivariate intrinsic conditionally autoregressive priors that promote spatial smoothing and permit the model components to borrow information, which is appealing when the mean and dispersion are spatially correlated. Through simulation studies, we show that ignoring heterogeneity in the dispersion can lead to biased and imprecise estimates. For estimation, we adopt a Bayesian approach that combines full-conditional Gibbs sampling and Metropolis-Hastings steps. We apply the model to a study of COVID-19 incidence in the state of Georgia, USA from March 15 to December 31, 2020.Entities:
Keywords:
zzm321990
Year: 2022 PMID: 36168515 PMCID: PMC9500097 DOI: 10.1016/j.spasta.2022.100703
Source DB: PubMed Journal: Spat Stat
Fig. 1Descriptive maps for incidence rate and overdispersion from intercept only models for data from March 15, 2020 to December 31, 2020 for the 159 counties in Georgia. Panel (a): Predicted incidence. Panel (b): Overdispersion. Legends correspond to sample quintiles.
Results from simulation study comparing a spatially correlated NB model with three misspecified models: Model 1: Spatially correlated random intercepts in the mean and dispersion components. Model 2: Separate spatial random intercepts in the mean and dispersion components. Model 3: Spatial random intercept in mean component and only covariates in the dispersion component. Model 4: Spatial random intercept in mean component and a constant dispersion parameter. Note that in all models and in models 1 and 2.
| Model | Component | Parameter | Truth | Post. Mean (95% CrI) | Bias |
|---|---|---|---|---|---|
| Model 1 (Simulated Model) | Count | −0.08 | |||
| −0.25 | |||||
| −0.09 | |||||
| Dispersion | −0.50 | −0.04 | |||
| −0.07 | |||||
| Random effects | −0.09 | ||||
| −0.04 | |||||
| −0.05 | |||||
| Model 2 | Count | −0.08 | |||
| −0.25 | |||||
| −0.04 | |||||
| Dispersion | −0.50 | −0.02 | |||
| −0.06 | |||||
| Random effects | −0.22 | ||||
| −0.12 | |||||
| Model 3 | Count | −0.34 | |||
| −0.25 | −0.11 | ||||
| −0.14 | |||||
| Dispersion | −0.50 | ||||
| −0.11 | |||||
| −0.17 | |||||
| Random effects | |||||
| Model 4 | Count | −0.39 | |||
| −0.25 | −0.50 | ||||
| −0.24 | |||||
| Dispersion | – | – | |||
| Random effects | |||||
Model comparison results for Simulation Study .
| WAIC estimate | |||||
|---|---|---|---|---|---|
| Likelihood | Penalty | Total | Successive difference | ||
| Model 1 | −75701.53 | 261.44 | 151925.90 | – | |
| Model 2 | −75714.45 | 259.09 | 151947.10 | 21.20 | |
| Model 3 | −76294.23 | 202.50 | 152993.40 | 1046.30 | |
| Model 4 | −77208.52 | 187.39 | 154791.80 | 1798.42 | |
Fig. 2(a): Average time trend for the mean in simulation study. (b): Average time trend for dispersion in simulation study.
Fig. 3Simulated and predicted random effects for the mean and dispersion components for Model 1: spatial NB model with correlated random intercepts. Panel (a): Simulated random intercept for the mean component. Panel (b): Predicted random intercept for the mean component. Panel (c): Simulated random intercept for dispersion component. Panel (d): Predicted random intercept for dispersion component. Legends correspond to sample quintiles.
Posterior mean rate ratios and 95% posterior intervals for the COVID-19 study from the spatial NB model with correlated intercepts (excluding spline parameters for time) .
| Model component | Variable | Rate ratio (95% CrI) | |||
|---|---|---|---|---|---|
| Mean | SVI | ||||
| % fair or poor health | |||||
| % female | |||||
| % of adult smokers | |||||
| Population density | |||||
| Dispersion | SVI | ||||
| % fair or poor health | |||||
| % female | |||||
| % of adult smokers | |||||
| Population density | |||||
| Random effects | var( | ||||
| cov( | |||||
| var( | |||||
| corr( |
Fig. 4Daily incidence rate and overdispersion trends for the COVID-19 study. Panel (a): Mean incidence rate trend across counties. Panel (b): Mean overdispersion () across counties.
Fig. 5Model-based spatial random effect estimates for the COVID-19 study. Panel (a): Posterior mean predicted random effects for the mean component . Panel (b): Posterior mean predicted random effects for the dispersion component . Legends correspond to sample quintiles.