| Literature DB >> 24648100 |
Duncan Lee1, Richard Mitchell2.
Abstract
Estimating the long-term health impact of air pollution using an ecological spatio-temporal study design is a challenging task, due to the presence of residual spatio-temporal autocorrelation in the health counts after adjusting for the covariate effects. This autocorrelation is commonly modelled by a set of random effects represented by a Gaussian Markov random field (GMRF) prior distribution, as part of a hierarchical Bayesian model. However, GMRF models typically assume the random effects are globally smooth in space and time, and thus are likely to be collinear to any spatially and temporally smooth covariates such as air pollution. Such collinearity leads to poor estimation performance of the estimated fixed effects, and motivated by this epidemiological problem, this paper proposes new GMRF methodology to allow for localised spatio-temporal smoothing. This means random effects that are either geographically or temporally adjacent are allowed to be autocorrelated or conditionally independent, which allows more flexible autocorrelation structures to be represented. This increased flexibility results in improved fixed effects estimation compared with global smoothing models, which is evidenced by our simulation study. The methodology is then applied to the motivating study investigating the long-term effects of air pollution on respiratory ill health in Greater Glasgow, Scotland between 2007 and 2011.Entities:
Keywords: Gaussian Markov random fields; air pollution and health studies; spatio-temporal autocorrelation
Mesh:
Year: 2014 PMID: 24648100 PMCID: PMC4272194 DOI: 10.1177/0962280214527384
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Figure 1.Maps displaying the standardised incidence ratio (SIR) for respiratory disease in 2007 (top left panel) and 2011 (top right panel) and the annual mean concentration for particles less than 10 µm (PM10) in 2006 (bottom left panel) and 2010 (bottom right panel).
Summary of the spatio-temporal variation in the data between year 1 (2007) and year 5 (2011) of the study, presented as the mean (standard deviation) value across the study region.[a]
| Variable | Year 1 | Year 2 | Year 3 | Year 4 | Year 5 |
|---|---|---|---|---|---|
| Respiratory disease | 75.3 (32.8) | 81.0 (37.0) | 78.1 (34.2) | 78.4 (33.0) | 83.2 (35.0) |
| CO (mgm−3) | 0.19 (0.04) | 0.19 (0.02) | 0.20 (0.01) | 0.18 (0.01) | 0.22 (0.01) |
| NO2 (µgm−3) | 17.4 (6.7) | 15.7 (5.6) | 15.2 (5.3) | 15.4 (5.0) | 17.0 (6.3) |
| PM2.5 (µgm−3) | 8.2 (1.2) | 5.8 (0.8) | 6.9 (1.1) | 7.4 (1.1) | 9.0 (1.2) |
| PM10 (µgm−3) | 13.9 (2.0) | 12.3 (1.5) | 10.9 (1.5) | 11.5 (1.5) | 13.0 (1.7) |
| Job seekers allowance (%) | 2.9 (1.8) | 3.2 (1.9) | 4.8 (2.5) | 5.0 (2.7) | 5.1 (2.8) |
| House price (£, 000) | 133.5 (55.0) | 134.8 (53.4) | 121.3 (49.6) | 124.9 (56.4) | 123.9 (58.1) |
| Ethnicity (%) | 9.2 (13.0) | 9.8 (13.3) | 10.3 (13.6) | 10.7 (13.7) | 11.1 (14.1) |
Note, the pollution concentrations are lagged by one year relative to the disease data.
Estimated covariate effects and 95% credible intervals from the Poisson generalised linear model (1), the non-separable global BYM model (4) and the non-separable localised smoothing model (11).[a]
| Spatio-temporal autocorrelation models | |||
|---|---|---|---|
| Covariate | Poisson GLM | BYM | Localised |
| CO | 1.026 (1.011, 1.042) | 1.001 (0.980, 1.023) | 1.008 (0.989, 1.027) |
| NO2 | 1.088 (1.071, 1.104) | 1.015 (0.988, 1.042) | 1.034 (1.011, 1.057) |
| PM2.5 | 1.012 (0.997, 1.027) | 0.991 (0.969, 1.013) | 0.994 (0.975, 1.013) |
| PM10 | 1.091 (1.076, 1.107) | 1.038 (1.013, 1.062) | 1.053 (1.032, 1.075) |
| JSA | 1.169 (1.150, 1.188) | 1.222 (1.199, 1.246) | 1.212 (1.191, 1.235) |
| House price | 0.860 (0.843, 0.877) | 0.904 (0.886, 0.921) | 0.899 (0.882, 0.916) |
| Ethnicity | 0.986 (0.972, 1.001) | 0.993 (0.972, 1.015) | 0.993 (0.974, 1.012) |
The results are presented as relative risks, for a one standard deviation increase in each covariates value, which are: CO – 0.024, NO2 – 5.87, PM2.5 – 1.56, PM10 – 1.96, job seekers allowance (JSA) – 2.57%, house price – £ 54,800 and ethnicity – 13.5%.
Figure 2.Maps displaying the residuals from the Poisson generalised linear model for 2007 and 2011.
Figure 3.A map showing the piecewise constant mean function (with possible values {−1, 0, 1}) for the random effects that generate localised spatial autocorrelation in the first year of the simulation study.
Figure 4.Root mean square error (RMSE) for the estimated regression parameter β for different values of M and E in the presence of either separable or non-separable residual spatio-temporal autocorrelation. In each case, the dot represents the RMSE while the vertical line is a bootstrapped 95% uncertainty interval. The models are: (a) Poisson generalised linear model (GLM), (b) Besag-York-Mollié (BYM) separable, (c) localised separable, (d) BYM non-separable and (e) localised non-separable.
Coverage probabilities for the estimated regression parameter β for PM10 from each of the following models: (a) Poisson GLM, (b) BYM separable, (c) localised separable, (d) BYM non-separable, (e) localised non-separable.[a]
| Coverage probability (%) | ||||||
|---|---|---|---|---|---|---|
| (a) | (b) | (c) | (d) | (e) | ||
| Separable | ||||||
| 0.5 | [10, 25] | 84.0 | 95.6 | 95.6 | 94.2 | 94.8 |
| 0.5 | [50, 100] | 82.6 | 94.0 | 97.2 | 97.4 | 95.2 |
| 0.5 | [150, 200] | 78.4 | 89.4 | 91.4 | 99.6 | 87.2 |
| 1 | [10, 25] | 90.6 | 94.2 | 94.8 | 92.8 | 92.6 |
| 1 | [50, 100] | 92.4 | 94.2 | 97.4 | 97.8 | 96.2 |
| 1 | [150, 200] | 91.4 | 94.6 | 96.6 | 99.6 | 94.8 |
| Non-separable | ||||||
| 0.5 | [10, 25] | 88.0 | 94.4 | 93.8 | 95.4 | 96.6 |
| 0.5 | [50, 100] | 81.4 | 73.6 | 75.2 | 95.0 | 96.8 |
| 0.5 | [150, 200] | 82.0 | 56.0 | 58.0 | 94.2 | 90.4 |
| 1 | [10, 25] | 87.0 | 92.8 | 94.0 | 94.6 | 96.6 |
| 1 | [50, 100] | 87.6 | 72.8 | 73.4 | 93.6 | 95.2 |
| 1 | [150, 200] | 86.0 | 55.4 | 57.4 | 95.6 | 93.2 |
The top half of the table corresponds to data containing separable spatio-temporal autocorrelation, while the bottom half has non-separable structure. In both cases, the results are presented for different levels of localised structure (via M) and different disease prevalences (via E).