| Literature DB >> 23757600 |
Silas Bergen1, Lianne Sheppard, Paul D Sampson, Sun-Young Kim, Mark Richards, Sverre Vedal, Joel D Kaufman, Adam A Szpiro.
Abstract
BACKGROUND: Studies estimating health effects of long-term air pollution exposure often use a two-stage approach: building exposure models to assign individual-level exposures, which are then used in regression analyses. This requires accurate exposure modeling and careful treatment of exposure measurement error.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23757600 PMCID: PMC3764074 DOI: 10.1289/ehp.1206010
Source DB: PubMed Journal: Environ Health Perspect ISSN: 0091-6765 Impact factor: 9.031
Figure 1Locations of IMPROVE and CSN monitors and predicted national average PM2.5 component concentrations from final predictions models. (A) EC, (B) OC, (C) Si, and (D) S. Insets show predictions for St. Paul, MN.
Summary data for observed pollution concentrations (mean ± SD) at monitoring networks; predicted concentrations (mean ± SD) for the MESA cohort at exam 1 and summaries of selected LUR covariates.
| Covariates | IMPROVE | CSN | All monitors | MESA Air |
|---|---|---|---|---|
| Sites ( | 190 | 98 | 288 | 5501 |
| EC (μg/m3) | 0.19±0.18 | 0.66±0.24 | 0.37±0.30 | 0.74±0.18 |
| OC (μg/m3) | 0.93±0.55 | 2.23±0.71 | 1.43±0.88 | 2.17±0.36 |
| Si (ng/m3) | 0.16±0.12 | 0.10±0.09 | 0.14±0.11 | 0.09±0.03 |
| S (μg/m3) | 0.41±0.27 | 0.69±0.25 | 0.51±0.29 | 0.78±0.15 |
| Sites <150m to an A1 road[ | 4 (2) | 3 (3) | 7 (2) | 249 (6) |
| Sites <150m to an A3 road[ | 36 (19) | 43 (44) | 79 (27) | 2,763 (50) |
| Median distance to comm (m) | 4,696 | 127 | 1,235 | 302 |
| Median pop dens | 3 | 805 | 20 | 3,496 |
| NDVI | 150 | 140 | 146 | 137 |
| Abbreviations: comm, commercial or service centers; pop dens, population density. | ||||
LUR covariates (Figure 2 abbreviations) and (where applicable) covariate buffer sizes that made it through preprocessing and were considered by PLS.
| Abbreviation | Variable description | Buffer sizes |
|---|---|---|
| Distance to features | A1 road | NA |
| Nearest road | NA | |
| Airport | NA | |
| Large airport | NA | |
| Port | NA | |
| Coastline | NA | |
| Commercial or service center | NA | |
| Railroad | NA | |
| Rail yard | NA | |
| SO2 | SO2 Emissions | 30km |
| PM2.5 | PM2.5 | 30km |
| PM10 | PM10 | 30km |
| NOx | NOx | 30km |
| Population | Population density | 500m, 1km, 1.5km, 2km, 2.5km, 3km, 5km, 10km, 15km |
| NDVI–winter | Median winter | 250m, 500m, 1km, 2.5km, 5km, 7.5km, 10km |
| NDVI–summer | Median summer | 250m, 500m, 1km, 2.5km, 5km, 7.5km, 10km |
| NDVI–Q75 | 75th percentile | 250m, 500m, 1km, 2.5km, 5km, 7.5km, 10km |
| NDVI–Q50 | 50th percentile | 250m, 500m, 1km, 2.5km, 5km, 7.5km, 10km |
| NDVI–Q25 | 25th percentile | 250m, 500m, 1km, 2.5km, 5km, 7.5km, 10km |
| Transport | Transportation, communities, and utilities | 750m, 3km, 5km, 10km, 15km |
| Transition | Transitional areas | 15km |
| Stream | Streams and canals | 3km |
| Shrub | Shrub and brush rangeland | 1.5km, 3km, 5km, 10km, 15km |
| Residential | Residential | 400m, 500m, 750m, 1km, 1.5km, 3km, 5km, 10km, 15km |
| Other urban | Other urban or built-up | 400m |
| Mixed range | Mixed rangeland | 3km, 5km, 10km, 15km |
| Mixed forest | Mixed forest land | 750m, 1km, 1.5km, 3km, 5km, 10km, 15km |
| Lakes | Lakes | 10 km |
| Industrial | Industrial | 1km |
| Indust/comm | Industrial and commercial complexes | 15km |
| Herb range | Herbaceous rangeland | 3km |
| Green | Evergreen forest land | 400m, 500m, 750m, 1km, 1.5km, 3km, 5km, 10km, 15km |
| Forest | Deciduous forest land | 750m, 1km, 1.5km, 3km, 5km, 10km, 15km |
| Crop | Cropland and pasture | 400m, 500m, 750m, 1km, 1.5km, 3km, 5km, 10km, 15km |
| Comm | Commercial and services | 500m, 750m, 1km, 1.5km, 3km, 5km, 10km, 15km |
| A23 | Total distance of A2 and A3 roads within buffer | 100m, 150m, 300m, 400m, 500m, 750m, 1km, 1.5km, 3km, 5km |
| A1 | Total distance of A1 roads within buffer | 1km, 1.5km, 3km, 5km |
| Most variables were used in each of the four PM2.5 component models; however, the pre-processing procedure selected some variables for EC and OC that were not selected for Si and S, and vice versa because EC and OC monitoring locations were not identical to Si and S locations. | ||
Subject-specific covariates for the MESA cohort used in health modeling.
| Variable | Mean±SD or % | |
|---|---|---|
| CIMT | 5,501 | 0.68±0.19 |
| Age (years) | 5,501 | 61.9±10.1 |
| Weight (lb) | 5,501 | 173.0±37.5 |
| Height (cm) | 5,501 | 166.6±10.0 |
| Waist (cm) | 5,500 | 97.8±14.1 |
| Body surface area (m2) | 5,501 | 1.9±0.2 |
| BMI (kg/m2) | 5,501 | 28.2±5.3 |
| DBP | 5,499 | 71.8±10.3 |
| Sex | ||
| Female | 2,872 | 52.2 |
| Male | 2,629 | 47.8 |
| Race | ||
| White (Caucasian) | 2,168 | 39.4 |
| Chinese American | 675 | 12.3 |
| Black (African American) | 1,459 | 26.5 |
| Hispanic | 1,199 | 21.8 |
| Site | ||
| Winston-Salem | 878 | 16.0 |
| New York | 867 | 15.8 |
| Baltimore | 776 | 14.1 |
| St. Paul and Minneapolis | 899 | 16.3 |
| Chicago | 998 | 18.1 |
| Los Angeles | 1,083 | 19.7 |
| Education | ||
| Incomplete high school | 916 | 16.7 |
| Completed high school | 991 | 18.0 |
| Some college | 1,571 | 28.6 |
| Completed college | 2,010 | 36.5 |
| Missing | 13 | 0.2 |
| Income per year | ||
| <$12,000 | 566 | 10.3 |
| $12,000–24,999 | 1,022 | 18.6 |
| $25,000–49,999 | 1,543 | 28 |
| $50,000–74,999 | 901 | 16.4 |
| >$75,000 | 1,271 | 23.1 |
| Missing | 198 | 3.6 |
| Hypertension | ||
| No | 3,106 | 56.5 |
| Yes | 2,395 | 43.5 |
| Statin use | ||
| No | 4,681 | 85.1 |
| Yes | 817 | 14.9 |
| Missing | 3 | 0.1 |
Cross-validated R2 and RMSEP for each component of PM2.5, for both primary models and comparison PLS-only models, and the estimated kriging parameters from the likelihood fit on the entire data set for each pollutant.
| Correction | Model | EC | OC | Si | S |
|---|---|---|---|---|---|
| 3 PLS scores | 2 PLS scores | 2 PLS scores | 2 PLS scores | ||
| PLS only | 0.79 | 0.60 | 0.36 | 0.63 | |
| PLS+UK | 0.82 | 0.69 | 0.62 | 0.95 | |
| RMSEP | PLS only | 0.11 | 0.22 | 0.10 | 0.13 |
| PLS+UK | 0.10 | 0.20 | 0.08 | 0.05 | |
| Estimated UK parameters | (τ2) | 0.0074 | 0.0251 | 0.0043 | 0.0007 |
| (σ2) | 0.0025 | 0.0199 | 0.0086 | 0.0251 | |
| (φ) | 413 | 304 | 2,789 | 2,145 | |
| (τ2/σ2) | 2.96 | 1.26 | 0.5 | 0.03 | |
| UK, universal kriging. | |||||
Figure 2Coefficients of the PLS fit, where the coefficients describe the associations of each geographic covariate with exposure for (A) EC, (B) OC, (C) Si, and (D) S. The size of each circle represents covariate buffer size, with larger circles indicating larger buffers. Each closed circle for “distance to feature” represents a different feature (listed in Table 2): A1 road, nearest road, airport, large airport, port, coastline, commercial or service center, railroad, and rail yard. Variable abbreviations and buffer sizes are indicated in Table 2. Most of the variables shown here were used for modeling all four pollutants, but not all. Variables used for modeling Si and S but not EC and OC were PM2.5 and PM10 emissions, streams and canals within a 3-km buffer, other urban or built-up land use within a 400-m buffer, lakes within a 10-km buffer, industrial and commercial complexes within a 15-km buffer, and herbaceous rangeland within a 3-km buffer. The variables used for modeling EC and OC but not Si and S were industrial land use within 1- and 1.5-km buffers.
Point estimates ± SEs and 95% CIs for the different pollutants, using naïve analysis and with bootstrap correction for measurement error in covariate of interest.
| PM2.5 component | Analysis/correction | β̂x | 95% CI |
|---|---|---|---|
| EC (μg/m3) | Naïve | 0.001±0.014 | –0.03, 0.03 |
| PB, | 0.001±0.015 | –0.03, 0.03 | |
| PB, λ=1 | 0.001±0.015 | –0.03, 0.03 | |
| OC (μg/m3) | Naïve | 0.025±0.008 | 0.01, 0.04 |
| PB, λ=0 | 0.025±0.008 | 0.01, 0.04 | |
| PB, λ=1 | 0.025±0.008 | 0.01, 0.04 | |
| Si (ng/m3) | Naïve | 0.408±0.081 | 0.25, 0.57 |
| PB, λ=0 | 0.408±0.126 | 0.16, 0.66 | |
| PB, λ=1 | 0.408±0.127 | 0.16, 0.66 | |
| S (μg/m3) | Naïve | 0.055±0.017 | 0.022, 0.088 |
| PB, λ=0 | 0.055±0.025 | 0.006, 0.104 | |
| PB, λ=1 | 0.055±0.025 | 0.006, 0.104 | |
| Point estimates are estimates of the increase in CIMT for a 1-unit increase in each pollutant. | |||