| Literature DB >> 29317739 |
Jeanette M Reyes1, Heidi F Hubbard2, Matthew A Stiegel3, Joachim D Pleil4,5, Marc L Serre6.
Abstract
Currently in the United States there are no regulatory standards for ambient concentrations of polycyclic aromatic hydrocarbons (PAHs), a class of organic compounds with known carcinogenic species. As such, monitoring data are not routinely collected resulting in limited exposure mapping and epidemiologic studies. This work develops the log-mass fraction (LMF) Bayesian maximum entropy (BME) geostatistical prediction method used to predict the concentration of nine particle-bound PAHs across the US state of North Carolina. The LMF method develops a relationship between a relatively small number of collocated PAH and fine Particulate Matter (PM2.5) samples collected in 2005 and applies that relationship to a larger number of locations where PM2.5 is routinely monitored to more broadly estimate PAH concentrations across the state. Cross validation and mapping results indicate that by incorporating both PAH and PM2.5 data, the LMF BME method reduces mean squared error by 28.4% and produces more realistic spatial gradients compared to the traditional kriging approach based solely on observed PAH data. The LMF BME method efficiently creates PAH predictions in a PAH data sparse and PM2.5 data rich setting, opening the door for more expansive epidemiologic exposure assessments of ambient PAH.Entities:
Keywords: Ambient exposures; Bayesian maximum entropy; Geostatistics; Mass fraction; PAHs
Mesh:
Substances:
Year: 2018 PMID: 29317739 PMCID: PMC6013350 DOI: 10.1038/s41370-017-0009-6
Source DB: PubMed Journal: J Expo Sci Environ Epidemiol ISSN: 1559-0631 Impact factor: 5.563
Neighborhood optimization for each PAH estimate. Optimized n closest observed data locations (as determined by the space/time metric) corresponding to the minimized mean squared error validation statistic calculated through the linear regression and mass fractions methods across the 9 PAHs, with Total PAH being the summation. Bolded numbers indicate the lowest MSE for each PAH across the neighborhood optimization methods.
| Linear Regression | log-Mass Fraction | |||||
|---|---|---|---|---|---|---|
| PAH | S/T Metric (km/days) | MSE ( | S/T Metric (km/days) | MSE ( | ||
| benz(a)anthracene | 14 | 0.891 | 1.128 | 5 | 0.839 | |
| chrysene | 7 | 0.600 | 0.979 | 5 | 0.839 | |
| benzo(b)fluoranthrene | 7 | 0.863 | 1.358 | 5 | 0.899 | |
| benzo(k)fluoranthrene | 14 | 0.895 | 1.375 | 5 | 0.842 | |
| benzo(e)pyrene | 14 | 0.895 | 1.006 | 2 | 0.868 | |
| benzo(a)pyrene | 14 | 0.895 | 5 | 0.899 | 1.417 | |
| indeno(1,2,3-c,d)pyrene | 14 | 0.891 | 0.892 | 2 | 0.868 | |
| benzo(g,h,i)perylene | 14 | 0.895 | 0.757 | 2 | 0.777 | |
| dibenzo(a,h)anthracene | 14 | 0.772 | 1.532 | 3 | 0.820 | |
| Total PAH | 14 | 0.895 | 0.890 | 3 | 0.820 | |
Figure 1Map of benzo(b)fluoranthrene (ng/m3). Maps of mean benzo(b)fluoranthrene concentration for North Carolina on April 16, 2005 across the 4 prediction methods: (a) kriging, (b) cokriging, (c) Linear Regression BME, (d) log-Mass Fraction BME. Square markers indicate observed data, circle markers indicate PAH estimates, X's mark known fires for that day with a 100 km buffer.
Cross validation statistics. Leave-One-Out Cross Validation statistics for Total PAH (summation of the 9 PAHs) comparing observed and predicted concentrations across the 4 prediction methods for North Carolina in 2005. ME is Mean Error, VE is Variance of Error, RMSE is Root Mean Squared Error, MSE is Mean Squared Error and r2 is the Pearson correlation coefficient squared.
| Statistic | Kriging | Cokriging | Linear Regression BME | log-Mass Fraction BME |
|---|---|---|---|---|
| ME ( | -0.145 | -0.137 | -0.102 | -0.042 |
| VE ( | 0.806 | 0.782 | 0.764 | 0.591 |
| RMSE ( | 0.904 | 0.890 | 0.875 | 0.765 |
| MSE ( | 0.818 | 0.792 | 0.766 | 0.586 |
| 0.747 | 0.752 | 0.744 | 0.821 |
Figure 2Probability of exceedance. Probability of annual benzo(a)pyrene exceeding 0.25 ng/m3 across North Carolina in 2005 as predicted by (a) kriging, (b) cokriging, (c) Linear Regression BME and (d) log-Mass Fraction BME.
Area (in km2) covered corresponding to increasing probabilities of exceeding the average annual benzo(a)pyrene standard of 0.25 ng/m3 among the prediction locations in and around North Carolina in 2005.
| Probability of Exceedance | ≥0.10 | ≥0.15 | ≥0.20 | ≥0.25 | ≥0.30 |
|---|---|---|---|---|---|
| Kriging | 162,000 | 119,232 | 5,184 | 0 | 0 |
| Cokriging | 143,856 | 54,432 | 0 | 0 | 0 |
| Linear Regression BME | 164,592 | 139,968 | 36,288 | 10,368 | 2,592 |
| log-Mass Fraction BME | 156,816 | 116,640 | 28,512 | 15,552 | 6,480 |
Mean difference in PAH near versus far from fires. 95% confidence intervals comparing the mean difference in predicted PAH near (within 100 km) versus far (> 100 km) from fires for each of the 9 PAHs and Total PAH across the 4 prediction methods. Units are in ng/m3.
| PAH | Kriging | Cokriging | Linear Regression BME | log-Mass Fraction BME |
|---|---|---|---|---|
| benz(a)anthracene | (-4.94E-03,-2.17E-03) | (-2.61E-03,-1.07E-05) | (-1.26E-03,1.07E-03) | (1.57E-03,4.38E-03) |
| chrysene | (-6.67E-03,-3.53E-03) | (-3.74E-03,-8.28E-04) | (-9.40E-04,1.67E-03) | (2.07E-03,5.39E-03) |
| benzo(b)fluoranthrene | (3.98E-03,1.11E-02) | (3.78E-03,1.10E-02) | (1.09E-02,2.23E-02) | (2.36E-02,3.02E-02) |
| benzo(k)fluoranthrene | (3.14E-03,6.47E-03) | (2.32E-03,5.01E-03) | (5.27E-03,7.94E-03) | (7.80E-03,1.08E-02) |
| benzo(e)pyrene | (-2.92E-03,2.23E-03) | (-3.17E-03,1.71E-03) | (5.22E-03,9.80E-03) | (1.83E-02,2.49E-02) |
| benzo(a)pyrene | (-3.83E-03,1.84E-03) | (-6.24E-03,-8.42E-04) | (2.23E-03,1.37E-02) | (5.14E-03,1.02E-02) |
| indeno(1,2,3-c,d)pyrene | (1.87E-02,3.05E-02) | (1.73E-02,2.87E-02) | (2.04E-02,3.24E-02) | (4.79E-02,6.11E-02) |
| benzo(g,h,i)perylene | (3.04E-02,4.27E-02) | (2.54E-02,3.66E-02) | (2.72E-02,4.06E-02) | (3.13E-02,4.06E-02) |
| dibenzo(a,h)anthracene | (-2.07E-02,-1.36E-02) | (-1.71E-02,-1.07E-02) | (-4.80E-03,2.16E-03) | (1.90E-03,8.77E-03) |
| Total PAH | (2.28E-02,6.75E-02) | (2.13E-02,6.57E-02) | (6.29E-02,1.03E-01) | (1.72E-01,2.30E-01) |
mean difference is statistically significant (p-value≤ 0.05),
mean difference > 0.
Figure 3PAH ratios. Ratio of indeno(1,2,3-c,d)pyrene/(indeno(1,2,3-c,d)pyrene+benzo(g,h,i)perylene) on March 5, 2005 in North Carolina across the 4 prediction methods: (a) kriging, (b) cokriging, (c) Linear Regression BME, (d) log-Mass Fraction BME. Square markers indicate the ratio of observed data, circle markers indicate the ratio of PAH estimates, X's mark known fires for that day with a 100 km buffer.