| Literature DB >> 31851499 |
Qian Di1,2, Heresh Amini2, Liuhua Shi2,3, Itai Kloog4, Rachel Silvern5, James Kelly6, M Benjamin Sabath7, Christine Choirat7, Petros Koutrakis2, Alexei Lyapustin8, Yujie Wang9, Loretta J Mickley10, Joel Schwartz2.
Abstract
NO2 is a combustion byproduct that has been associated with multiple adverse health outcomes. To assess NO2 levels with high accuracy, we propose the use of an ensemble model to integrate multiple machine learning algorithms, including neural network, random forest, and gradient boosting, with a variety of predictor variables, including chemical transport models. This NO2 model covers the entire contiguous U.S. with daily predictions on 1-km-level grid cells from 2000 to 2016. The ensemble produced a cross-validated R2 of 0.788 overall, a spatial R2 of 0.844, and a temporal R2 of 0.729. The relationship between daily monitored and predicted NO2 is almost linear. We also estimated the associated monthly uncertainty level for the predictions and address-specific NO2 levels. This NO2 estimation has a very high spatiotemporal resolution and allows the examination of the health effects of NO2 in unmonitored areas. We found the highest NO2 levels along highways and in cities. We also observed that nationwide NO2 levels declined in early years and stagnated after 2007, in contrast to the trend at monitoring sites in urban areas, where the decline continued. Our research indicates that the integration of different predictor variables and fitting algorithms can achieve an improved air pollution modeling framework.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31851499 PMCID: PMC7065654 DOI: 10.1021/acs.est.9b03358
Source DB: PubMed Journal: Environ Sci Technol ISSN: 0013-936X Impact factor: 9.028