| Literature DB >> 35383223 |
Vigneshkumar Balamurugan1, Vinothkumar Balamurugan2, Jia Chen3.
Abstract
Surface ozone (O[Formula: see text]) is primarily formed through complex photo-chemical reactions in the atmosphere, which are non-linearly dependent on precursors. Even though, there have been many recent studies exploring the potential of machine learning (ML) in modeling surface ozone, the inclusion of limited available ozone precursors information has received little attention. The ML algorithm with in-situ NO information and meteorology explains 87% (R[Formula: see text] = 0.87) of the ozone variability over Munich, a German metropolitan area, which is 15% higher than a ML algorithm that considers only meteorology. The ML algorithm trained for the urban measurement station in Munich can also explain the ozone variability of the other three stations in the same city, with R[Formula: see text] = 0.88, 0.91, 0.63. While the same model robustly explains the ozone variability of two other German cities' (Berlin and Hamburg) measurement stations, with R[Formula: see text] ranges from 0.72 to 0.84, giving confidence to use the ML algorithm trained for one location to other locations with sparse ozone measurements. The inclusion of satellite O[Formula: see text] precursors information has little effect on the ML model's performance.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35383223 PMCID: PMC8983660 DOI: 10.1038/s41598-022-09619-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Different ML simulation type and associated training data (marked as X).
| ML simulation name | Predictor variables | Result | ||||||
|---|---|---|---|---|---|---|---|---|
| Meteorology | In-situ ozone precursors measurement | Satellite ozone precursors retrieval | CTMs simulation | |||||
| T, RH, BLH, WS, WD | DW, S | Surface NO, NO | Tropospheric column NO | CAMS surface O | R | RMSE | Mean R | |
| ML_met (1) | X | 0.74 | 18.1 | 0.72 | ||||
| ML_met_ds (2) | X | X | 0.76 | 17.5 | 0.74 | |||
| ML_cams (3) | X | 0.64 | 21.6 | 0.63 | ||||
| ML_insitu (4) | X | 0.47 | 26.1 | 0.49 | ||||
| ML_met_ds_insitu (5) | X | X | X | 0.80 | 15.8 | 0.81 | ||
| ML_met_ds_insitu_cams (6) | X | X | X | X | 0.83 | 14.9 | 0.84 | |
| ML_satellite (7) | X | − 0.35 | 41.7 | -0.31 | ||||
| ML_met_ds_satellite (8) | X | X | X | 0.77 | 17.1 | 0.74 | ||
| ML_met_ds_satellite_cams (9) | X | X | X | X | 0.81 | 15.6 | 0.80 | |
T-Temperature, RH-Relative Humidity, BLH-Boundary Layer Height, WS-Wind Speed, WD-Wind Direction, DW-Day of Week, S-Season, NO-Nitric oxide, NO-Nitrogen Dioxide, CO-Carbon Monoxide, O-Ozone and HCHO-Formaldehyde. The index of different ML simulation types is given in brackets in the first column, to which we refer in Fig. 2. The performance of each ML simulation with fewer days case (689 days) at lothstrasee station is shown in the last three columns.
Figure 2Performance comparison of different ML simulation types with 5375 days (blue) and 689 days (red) for training and testing. X axis indexes refer to the index of different ML simulation type (Table 1).
Figure 1Density scatter plots of predicted ozone by different ML simulation type vs ground-truth ozone at Lothstrasse station at Munich. In a total of 5375 days (between 2001 to 2017), first 3800 days used for training and remaining 1575 days used for testing. Mean R of k(10)-fold cross validation is also given at bottom of figure panels at each case. Red solid line represents the linear fit and red dotted line represents 1:1 line.
Figure 3Density scatter plots of predicted ozone by “ML_s_rh_t_wd_blh_no” trained for Lothstrasse station at Munich vs ground-truth ozone measurements for different locations. First row shows the stations for Munich, second row for Berlin and third row for Hamburg stations. U represents urban station and SU represents suburban station. Red solid line represents the linear fit and red dotted line represents 1:1 line.