| Literature DB >> 24219031 |
Barbara K Butland, Ben Armstrong1, Richard W Atkinson, Paul Wilkinson, Mathew R Heal, Ruth M Doherty, Massimo Vieno.
Abstract
BACKGROUND: Assessing health effects from background exposure to air pollution is often hampered by the sparseness of pollution monitoring networks. However, regional atmospheric chemistry-transport models (CTMs) can provide pollution data with national coverage at fine geographical and temporal resolution. We used statistical simulation to compare the impact on epidemiological time-series analysis of additive measurement error in sparse monitor data as opposed to geographically and temporally complete model data.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24219031 PMCID: PMC3871053 DOI: 10.1186/1471-2288-13-136
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Comparison of observed model and chemistry-transport model (CTM) data
| Daily maximum running 8-hour mean§ O3 | Rural (N=21) | 26,995 | Monitor | 72.17 (5.10) | 21.65 (3.88) | 334.927 | 0.730 | −0.424 | 4.740 (4.375) |
| | | CTM | 76.91 (1.81) | 20.50 (2.98) | | | 0.304 | | |
| Urban (N=35) | 40,938 | Monitor | 61.73 (7.38) | 25.28 (2.81) | 455.779 | 0.757 | −0.442 | 10.250 (5.166) | |
| | | | CTM | 71.98 (3.93) | 23.41 (2.92) | | | 0.246 | |
| loge(Daily maximum 1-hour¶ NO2) | Rural (N=14) | 16,080 | Monitor | 2.696 (0.495) | 0.705 (0.110) | 0.422 | 0.667 | −0.194 | −0.236 (0.210) |
| | | CTM | 2.460 (0.588) | 0.866 (0.146) | | | 0.595 | | |
| Urban (N=43) | 51,596 | Monitor | 3.857 (0.309) | 0.473 (0.094) | 0.195 | 0.612 | −0.158 | −0.538 (0.268) | |
| CTM | 3.319 (0.438) | 0.645 (0.138) | 0.674 |
¶ Values on 35 days set to missing as daily monitor maximum 1-hour NO2=0. §The 8-hour rolling mean ozone concentration assigned to hour h is the average of hourly concentrations at h-7, h-6, h-5, h-4, h-3, h-2, h-1, and h. In calculating pollution metrics we employ the 75% rule such that a valid 8-hour mean must be based on at least 6 values, a daily maximum 8-hour mean must be based on at least 18 valid 8-hour means and a daily maximum 1-hour concentration on at least 18 hourly concentrations.
Figure 1Simple linear regression analysis of Pearson correlation by distance. The figure presents results for (a) urban background ozone, (b) rural ozone, (c) urban background loge(NO2) and (d) rural loge(NO2). Each point on graphs represents the Pearson correlation (P) between daily standardised pollution concentrations measured at two distinct monitoring sites, plotted against the distance in km (D) between those sites. R-sq: estimate of the proportion of variance in Pearson correlation (P) explained by the fitted linear relationship with distance in km (D).
Summarising the analysis of 1000 simulated data sets: urban background pollution concentrations with additive error
| | | ||||
|---|---|---|---|---|---|
| 1 | 0.00375 (0.00209) | 94%; 43% | 0.0297 (0.0104) | 78%; 81% | |
| | 2 | 0.00388 (0.00214) | 95%; 44% | 0.0348 (0.0113) | 91%; 86% |
| | 3 | 0.00393 (0.00215) | 95%; 45% | 0.0369 (0.0117) | 93%; 89% |
| | 5 | 0.00396 (0.00216) | 96%; 45% | 0.0387 (0.0120) | 94%; 90% |
| | 10 | 0.00400 (0.00217) | 95%; 45% | 0.0403 (0.0122) | 95%; 90% |
| 25 | |||||
| - | 0.00325 (0.00226) | 94%; 29% | 0.0193 (0.0076) | 15%; 72% | |
The table presents estimated regression coefficients , standard errors , coverage probabilities and power, each based on the analysis of 1000 sets of simulated time-series data. The “true” value of the regression coefficient β for ozone (i.e. β × 10 = 0.00399) equates to a 0.4% increase in mortality per 10 μg/m3 increase in ozone and the “true” value of the regression coefficient for loge(NO2) (i.e. β = 0.0419) equates to a 0.4% increase in mortality per 10% increase in NO2.
Summarising the analysis of 1000 simulated data sets: rural pollution concentrations with additive error
| | | ||||
|---|---|---|---|---|---|
| 1 | 0.00346 (0.00244) | 95%; 30% | 0.0258 (0.0072) | 39%; 96% | |
| | 2 | 0.00371 (0.00254) | 95%; 31% | 0.0319 (0.0080) | 76%; 98% |
| | 3 | 0.00381 (0.00258) | 95%; 31% | 0.0347 (0.0083) | 86%; 98% |
| | 5 | 0.00389 (0.00261) | 95%; 32% | 0.0372 (0.0087) | 93%; 99% |
| | 10 | 0.00397 (0.00263) | 96%; 31% | 0.0395 (0.0089) | 95%; 99% |
| 25 | |||||
| - | 0.00310 (0.00257) | 94%; 22% | 0.0233 (0.0058) | 11%; 99% | |
The table presents estimated regression coefficients , standard errors , coverage probabilities and power, each based on the analysis of 1000 sets of simulated time-series data. The “true” value of the regression coefficient β for ozone (i.e. β × 10 = 0.00399) equates to a 0.4% increase in mortality per 10 μg/m3 increase in ozone and the “true” value of the regression coefficient for loge(NO2) (i.e. β = 0.0419) equates to a 0.4% increase in mortality per 10% increase in NO2.
Summarising the analysis of 1000 simulated data sets: nitrogen dioxide concentrations with proportional error
| | | ||||
|---|---|---|---|---|---|
| 1 | 0.00262 (0.00191) | 86%; 28% | 0.00201 (0.00319) | 86%; 10% | |
| | 2 | 0.00313 (0.00207) | 91%; 32% | 0.00252 (0.00350) | 92%; 11% |
| | 3 | 0.00333 (0.00213) | 93%; 35% | 0.00279 (0.00365) | 93%; 12% |
| | 5 | 0.00353 (0.00219) | 94%; 37% | 0.00306 (0.00381) | 93%; 13% |
| | 10 | 0.00372 (0.00224) | 95%; 38% | 0.00335 (0.00396) | 94%; 14% |
| 25 | |||||
| - | 0.00231 (0.00182) | 85%; 25% | 0.00185 (0.00222) | 83%; 14% | |
The table presents estimated regression coefficients , standard errors , coverage probabilities and power, each based on the analysis of 1000 sets of simulated time-series data. The “true” value of the regression coefficient β for NO2 (i.e. β × 10 = 0.00399) equates to a 0.4% increase in mortality per 10 μg/m3 increase in NO2.
Estimated attenuation in the health effect estimate: comparing simulation and theory
| | |||||
|---|---|---|---|---|---|
| | | ||||
| Urban background | 0.00375 | 0.00367 | 0.0297 | 0.0293 | |
| | Rural | 0.00346 | 0.00336 | 0.0258 | 0.0255 |
| Urban background | 0.00325 | 0.00332 | 0.0193 | 0.0196 | |
| Rural | 0.00310 | 0.00318 | 0.0233 | 0.0236 | |
For model data we base our predictions on the average observed within-site covariance rather than the average observed within-site correlation.