| Literature DB >> 31372305 |
Bryan N Vu1, Odón Sánchez2, Jianzhao Bi1, Qingyang Xiao1, Nadia N Hansel3, William Checkley3, Gustavo F Gonzales4,5,6, Kyle Steenland1, Yang Liu1.
Abstract
It is well recognized that exposure to fine particulate matter (PM2.5) affects health adversely, yet few studies from South America have documented such associations due to the sparsity of PM2.5 measurements. Lima's topography and aging vehicular fleet results in severe air pollution with limited amounts of monitors to effectively quantify PM2.5 levels for epidemiologic studies. We developed an advanced machine learning model to estimate daily PM2.5 concentrations at a 1 km2 spatial resolution in Lima, Peru from 2010 to 2016. We combined aerosol optical depth (AOD), meteorological fields from the European Centre for Medium-Range Weather Forecasts (ECMWF), parameters from the Weather Research and Forecasting model coupled with Chemistry (WRF-Chem), and land use variables to fit a random forest model against ground measurements from 16 monitoring stations. Overall cross-validation R2 (and root mean square prediction error, RMSE) for the random forest model was 0.70 (5.97 μg/m3). Mean PM2.5 for ground measurements was 24.7 μg/m3 while mean estimated PM2.5 was 24.9 μg/m3 in the cross-validation dataset. The mean difference between ground and predicted measurements was -0.09 μg/m3 (Std.Dev. = 5.97 μg/m3), with 94.5% of observations falling within 2 standard deviations of the difference indicating good agreement between ground measurements and predicted estimates. Surface downwards solar radiation, temperature, relative humidity, and AOD were the most important predictors, while percent urbanization, albedo, and cloud fraction were the least important predictors. Comparison of monthly mean measurements between ground and predicted PM2.5 shows good precision and accuracy from our model. Furthermore, mean annual maps of PM2.5 show consistent lower concentrations in the coast and higher concentrations in the mountains, resulting from prevailing coastal winds blown from the Pacific Ocean in the west. Our model allows for construction of long-term historical daily PM2.5 measurements at 1 km2 spatial resolution to support future epidemiological studies.Entities:
Keywords: Lima; MAIAC AOD; PM2.5; Peru; WRF-chem; air pollution; machine learning; random forest; remote sensing
Year: 2019 PMID: 31372305 PMCID: PMC6671674 DOI: 10.3390/rs11060641
Source DB: PubMed Journal: Remote Sens (Basel) ISSN: 2072-4292 Impact factor: 5.349
Figure 1.Study domain and location of air monitors. The yellow line details the Lima political border while the gray line details the 10 km buffer. The; magenta circles denote the location, distribution, and overall mean PM2.5 concenttations in μg/m3 of the Servicio Nacional de Meteorología e Hidtología del Perú (SENAMHI) monitor network, while the purple circles denote the same information for the Johns Hopkins University (JHU) monitor network:.
PM2.5 ground monitor information, elevation, and total number of observations at each monitor and their respective network.
| Network | Station | Elevation (m.) | # of Measurements |
|---|---|---|---|
| JHU | Station 02 | 94.6 | 339 |
| JHU | Station 07 | 123.6 | 417 |
| JHU | Station 08 | 74.2 | 288 |
| JHU | Station 09 | 186.0 | 443 |
| JHU | Station 10 | 192.1 | 287 |
| JHU | Station 11 | 109.2 | 307 |
| SENAMHI | ATE | 372.7 | 528 |
| SENAMHI | CDM | 124.5 | 544 |
| SENAMHI | CRB | 219.5 | 737 |
| SENAMHI | HCH | 301.2 | 696 |
| SENAMHI | PPD | 186.0 | 778 |
| SENAMHI | SBJ | 131.3 | 581 |
| SENAMHI | SJL | 237.5 | 757 |
| SENAMHI | SMP | 58.5 | 775 |
| SENAMHI | STA | 254.3 | 598 |
| SENAMHI | VMT | 328.3 | 395 |
Note: SENAMHI Station is abbreviated from the name of the location. JHU stations collected measurements from November 2011 to March 2013 and SENAMHI stations collected measurements from April 2014 to December 2016.
Figure 2.Time series of monthly mean ground PM2.5 measurements in μg/m3 at each monitor station for both the SENAMHI and JHU networks from November 2011 through December 201. 6. SENAMHI station names Eire abbreviated from the name of the location.
Figure 3.(A) Density plot of ground and predicted PM2.5 measurements in μg/m3 based on the cross-validation of the Random Forest model. (B) Bland-Altman plot of differences between ground and predicted PM2.5 in μg/ m3 against theme means of each pair. This p lot shows good agreement as 94.5% of observation pairs fall within 2 standard deviations o° the mean difference.
Figure 4.Importance of each variable in the random forest model by percent increase mean square prediction error (MSE).
Figure 5.Time series of monthly mean ground measurements and predicted PM2.5 in μg/m3 based on random forest model at each monitor station. SENAMHI station names are abbreviated from the name of the location.
Figure 6.Annual mean prediction maps of PM2.5 in μg/m3 from the random forest model in Lima, Peru from 2010 to 2016.