| Literature DB >> 35742435 |
Sichen Wang1, Xi Mu2, Peng Jiang1,2,3, Yanfeng Huo4, Li Zhu1, Zhiqiang Zhu1, Yanlan Wu1,2.
Abstract
Ozone (O3), whose concentrations have been increasing in eastern China recently, plays a key role in human health, biodiversity, and climate change. Accurate information about the spatiotemporal distribution of O3 is crucial for human exposure studies. We developed a deep learning model based on a long short-term memory (LSTM) network to estimate the daily maximum 8 h average (MDA8) O3 across eastern China in 2020. The proposed model combines LSTM with an attentional mechanism and residual connection structure. The model employed total O3 column product from the Tropospheric Monitoring Instrument, meteorological data, and other covariates as inputs. Then, the estimates from our model were compared with real observations of the China air quality monitoring network. The results indicated that our model performed better than other traditional models, such as the random forest model and deep neural network. The sample-based cross-validation R2 and RMSE of our model were 0.94 and 10.64 μg m-3, respectively. Based on the O3 distribution over eastern China derived from the model, we found that people in this region suffered from excessive O3 exposure. Approximately 81% of the population in eastern China was exposed to MDA8 O3 > 100 μg m-3 for more than 150 days in 2020.Entities:
Keywords: data-driven model; deep learning; eastern China; human exposure; long short-term memory network; ozone pollution; tropospheric monitoring instrument
Mesh:
Substances:
Year: 2022 PMID: 35742435 PMCID: PMC9223487 DOI: 10.3390/ijerph19127186
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Study area and the distribution of ground O3 monitoring sites.
Figure 2One layer AR-LSTM.
Figure 3Comparisons of the five imputed methods for in situ missing observations.
Figure 4Imputed TROPOMI-O3 by using OK method on 11 June (a) and original data (b).
Figure 5AR-LSTM model performance with different parameters. (a–e) are the R2 curves of 16, 32, 64, 128, and 256 neurons with different layers and time steps, respectively. (f–h) are mean values of different hidden layers, time steps and different numbers of neurons, respectively.
Figure 6CV results.
Figure 7Scatter plot of the generated data and in situ observations.
Figure 8Quarterly averages of MDA8 ozone of TAP, ARLSTM and in situ observations. The line plot below is a comparison of RMSE between the two datasets and in situ observations.
Performance of various models in sample-based CV and city-based CV. The bold is used to emphasize the best results.
| Model | Sample-Based CV | City-Based CV | ||||||
|---|---|---|---|---|---|---|---|---|
| R2 | RMSE | MAE | MAPE | R2 | RMSE | MAE | MAPE | |
| RF | 0.89 | 14.33 | 10.41 | 13.17 | 0.79 | 19.17 | 14.76 | 19.45 |
| DNN | 0.88 | 15.28 | 11.48 | 14.52 | 0.79 | 19.64 | 14.83 | 19.80 |
| GRU | 0.91 | 13.28 | 9.64 | 11.80 | 0.80 | 19.36 | 14.45 | 18.61 |
| LSTM | 0.92 | 12.80 | 9.34 | 11.45 | 0.82 | 18.65 | 14.10 | 18.14 |
| CNN | 0.90 | 13.72 | 10.26 | 12.96 | 0.80 | 19.70 | 14.92 | 19.93 |
| AR-LSTM | 0.94 | 10.64 | 7.52 | 8.82 | 0.85 | 17.25 | 13.07 | 16.90 |
Seasonal and annual averages ± deviations of population-weighted MDA8 O3 for five provinces and the all of eastern China in 2020 (μg m−3).
| Region | Spring | Summer | Fall | Winter | Annual |
|---|---|---|---|---|---|
| Shandong | 124.23 ± 33.14 | 142.31 ± 36.48 | 101.99 ± 41.02 | 66.73 ± 19.14 | 110.03 ± 43.46 |
| Anhui | 116.58 ± 30.77 | 109.03 ± 23.31 | 102.24 ± 36.32 | 67.12 ± 19.44 | 99.69 ± 33.70 |
| Jiangsu | 122.91 ± 32.43 | 118.40 ± 29.46 | 102.05 ± 36.33 | 67.83 ± 19.55 | 103.87 ± 36.72 |
| Shanghai | 119.45 ± 32.50 | 107.32 ± 39.28 | 97.71 ± 34.38 | 70.80 ± 21.41 | 99.81 ± 36.94 |
| Zhejiang | 113.29 ± 33.06 | 95.53 ± 20.64 | 100.30 ± 35.72 | 66.53 ± 23.12 | 94.76 ± 33.32 |
| Eastern China | 120.06 ± 27.98 | 118.55 ± 19.10 | 101.33 ± 33.93 | 67.31 ± 18.36 | 102.89 ± 32.86 |
Figure 9Cumulative distributions of daily mean exposure to O3. The two white dotted line are air quality guideline (AQG: 100 μg m−3) and interim target 1 (IT1: 160 μg m−3), respectively.