| Literature DB >> 31505818 |
Niru Senthilkumar1, Mark Gilfether1, Francesca Metcalf1, Armistead G Russell1, James A Mulholland1, Howard H Chang2.
Abstract
Accurate spatiotemporal air quality data are critical for use in assessment of regulatory effectiveness and for exposure assessment in health studies. A number of data fusion methods have been developed to combine observational data and chemical transport model (CTM) results. Our approach focuses on preserving the temporal variation provided by observational data while deriving the spatial variation from the community multiscale air quality (CMAQ) simulations, a type of CTM. Here we show the results of fusing regulatory monitoring observational data with 12 km resolution CTM simulation results for 12 pollutants (CO, NOx, NO2, SO2, O3, PM2.5, PM10, NO3-, NH4+, EC, OC, SO42-) over the contiguous United States on a daily basis for a period of ten years (2005-2014). An annual mean regression between the CTM simulations and observational data is used to estimate the average spatial fields, and spatial interpolation of observations normalized by predicted annual average is used to provide the daily variation. Results match the temporal variation well (R2 values ranging from 0.84-0.98 across pollutants) and the spatial variation less well (R2 values 0.42-0.94). Ten-fold cross validation shows normalized root mean square error values of 60% or less and spatiotemporal R2 values of 0.4 or more for all pollutants except SO2.Entities:
Keywords: CMAQ; air pollution; data fusion; gas species; particulate species; spatiotemporal pollutant fields
Mesh:
Substances:
Year: 2019 PMID: 31505818 PMCID: PMC6765984 DOI: 10.3390/ijerph16183314
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Comparison of monitor coverage between particulate matter (PM2.5) and elemental carbon (EC) for the year 2011. The different colored dots represent the different temporal sampling frequencies present for particulate species.
Summary statistics for the observational data from 2005–2014. A range is presented to show the changes in observational data over the ten-year span. Completeness is defined as the percentage of days during the year with measurement data available. Monitor total represents the total number of active monitors during the year, while daily represents the number of active monitors that take daily measurements during the year. Total observations (OBS) represents the total measurements taken during the ten-year span.
| Pollutant | Monitor Total | Daily | Total OBS | Completeness (%) | |
|---|---|---|---|---|---|
| Particulate Species | PM10 | 691–999 | 248–320 | 1,416,226 | 37–55 |
| PM2.5 | 768–1071 | 114–183 | 1,231,795 | 35–39 | |
| EC | 95–172 | 0 | 87,776 | 13–22 | |
| OC | 95–172 | 0 | 85,734 | 13–22 | |
| SO42− | 103–172 | 0 | 102,311 | 20–23 | |
| NO3− | 103–172 | 0 | 90,176 | 20–23 | |
| NH4+ | 102–172 | 0 | 94,332 | 19–22 | |
| Gases | NOx | 308–419 | 268–371 | 1,164,912 | 85–90 |
| NO2 | 300–400 | 270–389 | 1,373,569 | 87–90 | |
| CO | 303–418 | 278–393 | 1,257,734 | 87–90 | |
| O3 | 1182–1265 | 556–740 | 3,439,169 | 75–80 | |
| SO2 | 429–507 | 396–481 | 1,572,601 | 90–93 |
The calculated parameter range for the and parameters and the range of the R2 for the fit between the community multiscale air quality (CMAQ) and observations. Each range encompasses the values used during the ten-year period.
|
|
| R2 | ||
|---|---|---|---|---|
| Particulate Species | PM10 | 11.67–15.64 | 0.10–0.23 | 0.03–0.14 |
| PM2.5 | 3.68–4.62 | 0.32–0.50 | 0.28–0.50 | |
| EC | 0.58–0.84 | 0.34–0.54 | 0.26–0.49 | |
| OC | 1.45–1.80 | 0.18–0.45 | 0.10–0.35 | |
| SO42− | 1.11–1.31 | 0.82–1.01 | 0.77–0.93 | |
| NO3− | 0.88–1.29 | 0.53–0.89 | 0.43–0.60 | |
| NH4+ | 0.74–1.45 | 0.46–0.76 | 0.24–0.78 | |
| Gases | NOx | 2.04–4.20 | 0.62–0.94 | 0.48–0.65 |
| NO2 | 1.66–2.21 | 0.68–0.76 | 0.63–0.71 | |
| CO | 0.82–1.09 | 0.43–0.67 | 0.16–0.36 | |
| O3 | 0.32–0.73 | 0.64–0.91 | 0.50–0.65 | |
| SO2 | 1.36–3.55 | 0.69–0.95 | 0.37–0.54 |
Figure 2Normalized concentrations for the 12 species averaged over the ten-year time span. The concentrations are normalized against the maximum value over the contiguous United States (CONUS), excluding Mexico and Canada. The values plotted are dimensionless.
Figure 3Comparison of fused field metrics (Pearson R2) to the original CMAQ model. The blue bar represents the fused field metric, while the black line for each species represents the CMAQ metric.
Comparison of the R2 and normalized root mean squared error (NRMSE) between C* and observations on the different sampling days. Speciated PM monitors, EC and SO42− have two sampling frequencies, and PM2.5 has three sampling frequencies. Day A represents 1 in 3 days, Day B represents 1 in 6 days, and Day C represents daily measurements (only for PM2.5). The metrics shown are the averages of all days included in each of the three categories.
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
|
| ||
| EC | 1 in 3 | 0.57 | 0.52 | 0.61 | 0.44 | 0.61 | 0.48 | 0.64 | 0.45 | 0.76 | 0.31 |
| 1 in 6 | 0.59 | 0.49 | 0.62 | 0.42 | 0.64 | 0.44 | 0.68 | 0.38 | 0.76 | 0.31 | |
| SO42− | 1 in 3 | 0.90 | 0.24 | 0.91 | 0.52 | 0.92 | 0.18 | 0.87 | 0.23 | 0.92 | 0.20 |
| 1 in 6 | 0.90 | 0.25 | 0.90 | 0.54 | 0.93 | 0.18 | 0.89 | 0.23 | 0.91 | 0.21 | |
| PM2.5 | 1 in 3 | 0.81 | 0.23 | 0.80 | 0.23 | 0.82 | 0.23 | 0.81 | 0.22 | 0.81 | 0.22 |
| 1 in 6 | 0.81 | 0.24 | 0.80 | 0.24 | 0.81 | 0.23 | 0.81 | 0.22 | 0.80 | 0.22 | |
| daily | 0.77 | 0.26 | 0.76 | 0.25 | 0.80 | 0.25 | 0.80 | 0.27 | 0.80 | 0.30 | |
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
|
| ||
| EC | 1 in 3 | 0.70 | 0.38 | 0.71 | 0.34 | 0.69 | 0.37 | 0.76 | 0.34 | 0.69 | 0.36 |
| 1 in 6 | 0.68 | 0.41 | 0.72 | 0.32 | 0.72 | 0.31 | 0.76 | 0.33 | 0.71 | 0.33 | |
| SO42− | 1 in 3 | 0.85 | 0.31 | 0.91 | 0.19 | 0.90 | 0.21 | 0.89 | 0.20 | 0.87 | 0.24 |
| 1 in 6 | 0.84 | 0.30 | 0.91 | 0.18 | 0.89 | 0.21 | 0.90 | 0.20 | 0.88 | 0.24 | |
| PM2.5 | 1 in 3 | 0.79 | 0.23 | 0.81 | 0.22 | 0.74 | 0.25 | 0.77 | 0.26 | 0.73 | 0.27 |
| 1 in 6 | 0.78 | 0.23 | 0.81 | 0.22 | 0.74 | 0.26 | 0.76 | 0.26 | 0.74 | 0.28 | |
| daily | 0.78 | 0.24 | 0.78 | 0.24 | 0.74 | 0.26 | 0.75 | 0.26 | 0.72 | 0.27 | |
Figure 4R2 and NRMSE used to evaluate model performance on the withheld dataset. The plots represent the average over each of the 10 withholding runs that were performed. The black line for each species represents the metric for the raw CMAQ model, while the blue bar represents the calculated withheld metric of the fused field.
Model evaluation metrics, R2 and NRMSE divided spatially into the eastern and western contiguous United States for PM2.5, SO42−, and EC. The metrics are shown for each year and are the average of the monitors in the domain.
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
|
| ||
| EC | Eastern | 0.51 | 0.42 | 0.56 | 0.40 | 0.38 | 0.52 | 0.45 | 0.47 | 0.67 | 0.36 |
| Western | 0.47 | 0.54 | 0.48 | 0.51 | 0.43 | 0.59 | 0.37 | 0.58 | 0.62 | 0.43 | |
| SO42− | Eastern | 0.81 | 0.32 | 0.76 | 0.32 | 0.79 | 0.31 | 0.74 | 0.33 | 0.71 | 0.31 |
| Western | 0.52 | 0.45 | 0.57 | 0.41 | 0.54 | 0.44 | 0.48 | 0.43 | 0.56 | 0.41 | |
| PM2.5 | Eastern | 0.75 | 0.24 | 0.76 | 0.23 | 0.82 | 0.23 | 0.80 | 0.23 | 0.79 | 0.23 |
| Western | 0.77 | 0.36 | 0.56 | 0.48 | 0.65 | 0.37 | 0.64 | 0.36 | 0.63 | 0.36 | |
|
|
|
|
|
| |||||||
|
|
|
|
|
|
|
|
|
|
| ||
| EC | Eastern | 0.64 | 0.40 | 0.59 | 0.37 | 0.57 | 0.39 | 0.59 | 0.39 | 0.53 | 0.40 |
| Western | 0.59 | 0.45 | 0.56 | 0.48 | 0.58 | 0.45 | 0.63 | 0.44 | 0.61 | 0.44 | |
| SO42− | Eastern | 0.68 | 0.35 | 0.73 | 0.33 | 0.65 | 0.32 | 0.72 | 0.32 | 0.65 | 0.34 |
| Western | 0.42 | 0.51 | 0.58 | 0.40 | 0.58 | 0.37 | 0.61 | 0.39 | 0.54 | 0.46 | |
| PM2.5 | Eastern | 0.80 | 0.23 | 0.78 | 0.24 | 0.75 | 0.23 | 0.79 | 0.23 | 0.76 | 0.24 |
| Western | 0.60 | 0.38 | 0.65 | 0.37 | 0.63 | 0.37 | 0.64 | 0.38 | 0.60 | 0.41 | |
Figure 5Nationwide averaged population weighted ambient concentrations plotted over the ten-year domain of 2005–2014 for the twelve pollutants. The y-axis shows the population weighted concentration with the units shown in the title, and the x-axis sows the dates ranging from 1 January 2005–31 December 2014.