| Literature DB >> 30538298 |
Keith R Spangler1,2,3, Kate R Weinberger4,5, Gregory A Wellenius4,5.
Abstract
Epidemiologic analyses of the health effects of meteorological exposures typically rely on observations from the nearest weather station to assess exposure for geographically diverse populations. Gridded climate datasets (GCD) provide spatially resolved weather data that may offer improved exposure estimates, but have not been systematically validated for use in epidemiologic evaluations. As a validation, we linearly regressed daily weather estimates from two GCDs, PRISM and Daymet, to observations from a sample of weather stations across the conterminous United States and compared spatially resolved, population-weighted county average temperatures and heat indices from PRISM to single-pixel PRISM values at the weather stations to identify differences. We found that both Daymet and PRISM accurately estimate ambient temperature and mean heat index at sampled weather stations, but PRISM outperforms Daymet for assessments of humidity and maximum daily heat index. Moreover, spatially-resolved exposure estimates differ from point-based assessments, but with substantial inter-county heterogeneity. We conclude that GCDs offer a potentially useful approach to exposure assessment of meteorological variables that may, in some locations, reduce exposure measurement error, as well as permit assessment of populations distributed far from weather stations.Entities:
Keywords: Gridded data; Heat; Meteorology; Mortality; Spatial analysis
Mesh:
Year: 2018 PMID: 30538298 PMCID: PMC6559872 DOI: 10.1038/s41370-018-0105-2
Source DB: PubMed Journal: J Expo Sci Environ Epidemiol ISSN: 1559-0631 Impact factor: 5.563
Overview of the Meteorological Datasets Used in the Analysis
| First-Order Stations (ISD-Lite) | USCRN Stations | PRISM | Daymet | |
|---|---|---|---|---|
| Spatial Coverage | 274 stations in the US, located primarily at airports | 137 stations in the US, located at sites minimally affected by anthropogenic land uses | Conterminous United States (CONUS) at horizontal spatial resolution of 4 km or 800 m | North America at horizontal spatial resolution of 1 km |
| Temporal Coverage | Hourly or sub-hourly observations over the past several decades | Sub-hourly observations since as early as 2000 | Daily from 1981 to six months prior to present | Daily since 1980; updated annually |
| Day Definition | 00:00 – 23:59 UTC | 00:00 – 23:59 UTC | 12:01 UTC on day | 00:00 – 23:59 UTC |
| Meteorological Variables Reported | Temperature | Temperature | ||
| User-Derived Variables | Heat index | Heat index |
Table 1: Abbreviations: minimum, maximum, and mean temperature (T, T, and T); mean vapor pressure (VP); minimum and maximum vapor-pressure deficits (VPD and VPD); minimum, maximum, and mean relative humidity (RH, RH, and RH); mean absolute humidity (AH); and minimum, maximum, and mean heat index (HI, HI, and HI).
Figure 1:Map of climates and locations of weather observatories in climatically representative sample. First-order weather stations in the ISD-Lite database (represented as stars) are located primarily at airports, while stations in the US Climate Reference Network (represented as circles), are located primarily in land-conservation areas. The stations in both networks are representative of the climate zones in CONUS, based on the Köppen-Geiger classification system [31], indicated by various colors (shapefile of data provided by ORNL DAAC [32, 33]). The nomenclature for the climate zones begins with the first letter for the broad climate type (“equatorial,” “arid,” “warm temperate,” “snow,” and “polar” for A, B, C, D, and E, respectively), then denotes the intra-annual precipitation and temperature characteristics (if applicable) within those zones in the second and third letters, respectively [31]. Background mapping provided by ArcWorld and ArcWorld Supplement from Esri®.
Univariate Linear Regression of First-Order Weather Station Observations on GCD Estimates
| Variable | Days | PRISM | Daymet | ||||||
|---|---|---|---|---|---|---|---|---|---|
| r2 | Slope | Y-Int | MAE | r2 | Slope | Y-Int | MAE | ||
| Tmax | All | 0.99 | 0.98 | 1.52 | 1.20 | 0.98 | 0.98 | 1.62 | 2.20 |
| Warm | 0.96 | 0.97 | 2.46 | 1.03 | 0.90 | 0.98 | 1.90 | 1.93 | |
| Tmim | All | 0.99 | 0.98 | 0.29 | 1.58 | 0.98 | 0.98 | −1.38 | 2.71 |
| Warm | 0.96 | 0.96 | 1.42 | 1.56 | 0.94 | 0.97 | −0.26 | 2.68 | |
| Tmean | All | 0.99 | 1.00 | −0.05 | 1.49 | 0.99 | 0.98 | 0.65 | 1.66 |
| Warm | 0.95 | 0.98 | 0.81 | 1.41 | 0.95 | 0.98 | 1.22 | 1.57 | |
| RHmax | All | 0.94 | 1.02 | −2.64 | 2.59 | - | - | - | - |
| Warm | 0.96 | 1.02 | −2.62 | 2.47 | - | - | - | - | |
| RHmin | All | 0.96 | 0.99 | 2.11 | 2.88 | 0.52 | 0.50 | 23.97 | 10.68 |
| Warm | 0.97 | 0.98 | 1.83 | 2.36 | 0.64 | 0.54 | 22.99 | 9.92 | |
| RHmean | All | 0.91 | 0.95 | −0.49 | 4.96 | 0.47 | 0.73 | 7.02 | 14.12 |
| Warm | 0.95 | 0.93 | 0.30 | 4.68 | 0.58 | 0.79 | 1.88 | 13.34 | |
| AHmean | All | >0.99 | 0.97 | −0.04 | 0.32 | 0.87 | 0.95 | −0.12 | 1.36 |
| Warm | >0.99 | 0.97 | −0.02 | 0.44 | 0.79 | 0.98 | −0.55 | 1.87 | |
| HImax | All | - | - | - | - | - | - | - | - |
| Warm | 0.93 | 0.96 | 3.57 | 1.56 | 0.85 | 1.13 | −9.81 | 3.20 | |
| HImin | All | - | - | - | - | - | - | - | - |
| Warm | 0.96 | 0.96 | 1.48 | 1.65 | - | - | - | - | |
| HImean | All | - | - | - | - | - | - | - | - |
| Warm | 0.96 | 1.01 | −1.24 | 1.50 | 0.94 | 1.02 | 0.78 | 2.74 | |
Table 2: Results of univariate linear regression of observed meteorological conditions at first-order weather stations in the ISD-Lite sample on modeled data from PRISM and Daymet. “Warm” refers to days with observed maximum temperature ≥ 70°F. “MAE” is the mean absolute error and measures the average difference between modeled and observed data; units are the same as for the variable measured. Higher r, lower MAE, slopes closer to one, and y-intercepts closer to zero indicate greater agreement between modeled and observed meteorological conditions.
Univariate Linear Regression of USCRN Station Observations on GCD Estimates
| Variable | Days | PRISM | Daymet | ||||||
|---|---|---|---|---|---|---|---|---|---|
| r2 | Slope | Y-Int | MAE | r2 | Slope | Y-Int | MAE | ||
| Tmax | All | 0.99 | 0.99 | 2.12 | 1.87 | 0.97 | 0.99 | 1.70 | 2.56 |
| Warm | 0.96 | 0.98 | 3.10 | 1.72 | 0.88 | 0.98 | 2.13 | 2.21 | |
| Tmim | All | 0.98 | 0.98 | 0.87 | 2.05 | 0.97 | 0.98 | −0.90 | 2.74 |
| Warm | 0.93 | 0.94 | 3.56 | 1.98 | 0.91 | 0.94 | 2.33 | 2.54 | |
| Tmean | All | 0.99 | 1.00 | 0.50 | 1.75 | 0.98 | 0.99 | 0.93 | 1.94 |
| Warm | 0.94 | 0.98 | 1.76 | 1.68 | 0.94 | 0.98 | 2.13 | 1.76 | |
| RHmax | All | 0.85 | 0.94 | 6.31 | 4.98 | - | - | - | - |
| Warm | 0.89 | 0.96 | 5.39 | 5.03 | - | - | - | - | |
| RHmin | All | 0.91 | 0.94 | 3.23 | 4.43 | 0.51 | 0.42 | 26.72 | 12.25 |
| Warm | 0.94 | 0.93 | 3.06 | 3.52 | 0.63 | 0.44 | 26.33 | 11.65 | |
| RHmean | All | 0.89 | 0.93 | 1.27 | 5.83 | 0.48 | 0.65 | 11.71 | 14.76 |
| Warm | 0.93 | 0.89 | 2.15 | 5.56 | 0.65 | 0.74 | 4.90 | 13.50 | |
| AHmean | All | 0.99 | 0.97 | 0.08 | 0.35 | 0.87 | 0.93 | 0.21 | 1.24 |
| Warm | 0.98 | 0.96 | 0.26 | 0.49 | 0.82 | 0.93 | 0.31 | 1.64 | |
| HImax | All | - | - | - | - | - | - | - | - |
| Warm | 0.93 | 1.02 | −0.24 | 2.06 | 0.84 | 1.18 | −12.76 | 3.60 | |
| HImin | All | - | - | - | - | - | - | - | - |
| Warm | 0.94 | 0.95 | 2.91 | 2.03 | - | - | - | - | |
| HImean | All | - | - | - | - | - | - | - | - |
| Warm | 0.95 | 1.03 | −1.62 | 1.69 | 0.94 | 1.03 | 1.64 | 3.61 | |
Table 3: Results of univariate linear regression of observed meteorological conditions at USCRN weather stations on modeled data from PRISM and Daymet. “Warm” refers to days on which the observed maximum temperature was at least 70°F. “MAE” is the mean absolute error and measures the average difference between modeled and observed data; units are the same as for the variable measured. Higher r, lower MAE, slopes closer to one, and y-intercepts closer to zero indicate greater agreement between modeled and observed meteorological conditions.
Figure 2:Comparison of epidemiologic analyses of the association between mean daily temperature and mortality rates using either observations from the nearest first-order weather station or population-weighted county means from PRISM. The distribution of daily mean temperature (A. and D.), 21-day cumulative exposure-response function between daily mean temperature and mortality on an absolute scale (B. and E.), and 21-day cumulative exposure-response function between daily mean temperature and mortality on a percentile-based scale (C. and F.) for Marion County, Indiana (left panels) and Los Angeles County, California (right panels), 1988–2003. Results based on observed station data are shown in red, results based on PRISM estimates are shown in blue, and the overlap between the two is shown in purple.
Figure 3:Spatial heterogeneity of ambient temperatures across Los Angeles County, California on an extreme-heat day. This geographic schematic shows the range and distribution of ambient maximum temperatures on the PRISM date of 28 September 2010 across Los Angeles County, California. The star symbol indicates the location of Los Angeles International Airport (LAX), which was included in the climatically representative sample in this paper; the diamond symbols indicate other first-order NWS stations. None of these stations are located in the hottest part of LA County on this day, and they cannot capture the large temperature differential experienced across the area (ranging from approximately 83.6˚F to 115.8˚F on this day). Note that adjacent islands, including those that are part of LA County, are not displayed. Background mapping provided by ArcWorld and ArcWorld Supplement from Esri®.