| Literature DB >> 34068705 |
Changshun Li1,2, Ziyang Xie3,4, Bo Chen3,4, Kaijin Kuang3,4, Daowei Xu3,4, Jinfu Liu3,4, Zhongsheng He3,4.
Abstract
The concentration of negative air ions (NAIs) is an important indicator of air quality. Here, we analyzed the distribution patterns of negative air ion (NAI) concentrations at different time scales using statistical methods; then described the contribution of meteorological factors of the different season to the concentration of NAIs using correlation analysis and regression analysis; and finally made the outlook for the trends of NAI concentrations in the prospective using the auto regressive integrated moving average (ARIMA) models. The dataset of NAI concentrations and meteorological factors measured at the fixed stations in the Mountain Wuyi National Park were obtained from the Fujian Provincial Meteorological Bureau. The study showed that NAI concentrations were correlated with relative humidity spanning all seasons. Water was an important factor affecting the distribution of NAI concentrations in different time series. Compared with other ARIMA models, the outlook value of the ARIMA (0,1, 1) model was closer to the original data and the errors were smaller. This article provided a unique perspective on the study of the distribution of negative air oxygen ions over time series.Entities:
Keywords: ARIMA model; health; negative air ions (NAIs); time series
Year: 2021 PMID: 34068705 PMCID: PMC8126208 DOI: 10.3390/ijerph18095037
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1The types of NAIs generated through different ways through the oxygen − based (a) NAI compositions. (b) The evolution formula of NAIs. The blue arrows indicate the NAI transformation processes.
Figure 2Meteorological observation system, including; (a) FR500 negative oxygen ion monitor, and (b) DZZ4 automatic meteorological station.
Figure 3ARIMA model flow chart.
Pearson correlation between NAI concentrations and meteorological factors based on raw hourly data.
| Season | Correlation and Significance | PRE | TEM | PRS | RHU | WIN | VIS |
|---|---|---|---|---|---|---|---|
| Spring | Pearson correlation | −0.052 * | −0.034 | 0.136 ** | −0.070 ** | 0.004 | 0.082 ** |
| Significance | 0.031 | 0.155 | 0.000 | 0.004 | 0.876 | 0.001 | |
|
| 1699 | 1699 | 1699 | 1699 | 1699 | 1699 | |
| Summer | Pearson correlation | 0.002 | −0.117 ** | −0.008 | 0.140 ** | −0.096 ** | −0.026 |
| Significance | 0.931 | 0.000 | 0.735 | 0.000 | 0.000 | 0.303 | |
|
| 1608 | 1608 | 1608 | 1608 | 1608 | 1608 | |
| Autumn | Pearson correlation | 0.023 | 0.020 | −0.112 ** | 0.150 ** | −0.099 ** | 0.030 |
| Significance | 0.302 | 0.357 | 0.000 | 0.000 | 0.000 | 0.181 | |
|
| 2051 | 2051 | 2051 | 2051 | 2051 | 2051 | |
| Winter | Pearson correlation | −0.017 | −0.116 ** | −0.179 ** | 0.306 ** | −0.061 ** | 0.127 ** |
| Significance | 0.448 | 0.000 | 0.000 | 0.000 | 0.006 | 0.000 | |
|
| 2028 | 2028 | 2028 | 2028 | 2028 | 2028 |
Note: ** p < 0.01; * p < 0.05. The parameters were as follows: precipitation per hour (PRE, mm); average air temperatures (TEM, °C); average atmospheric pressure (PRS, hpa); average relative humidity (RHU, %); average wind speed of 10 min (WIN, m/s) and visibility (VIS, m).
Figure 4Monthly average NAI concentrations from October 2018 to February 2020.
Multiple linear regression analysis of meteorological factors and NAIs in different seasons.
| Season | R | R2 | Durbin-Watson | Meteorological Factors |
|---|---|---|---|---|
| Spring | 0.160 | 0.026 | 0.340 | PRS; RHU; VIS |
| Summer | 0.144 | 0.021 | 0.251 | TEM; RHU; WIN |
| Autumn | 0.209 | 0.044 | 0.495 | PRS; RHU; WIN |
| Winter | 0.473 | 0.224 | 0.124 | TEM; PRS; RHU; WIN; VIS |
Note: ** p < 0.01; * p < 0.05. The parameters were as follows: precipitation per hour (PRE, mm); average air temperatures (TEM, °C); average atmospheric pressure (PRS, hpa); average relative humidity (RHU, %); average wind speed of 10 min (WIN, m/s) and visibility (VIS, m).
Figure 5Daily distribution of hour-by-hour value of NAI concentrations. (a) Average hourly value of NAI concentration distribution a day. (b) Boxplot of NAI concentrations of raw hourly data. The value of the boxplot includes outliers, maximum, third quartile, median, first quartile and minimum.
Figure 6Frequency histogram of hourly data of NAI concentrations.
Comparison of fitted values and evaluation parameters for different ARIMA models.
| ( | Fitted Value | AIC | ME | RMSE | MAE | MPE | MAPE | MASE | ACF1 |
|---|---|---|---|---|---|---|---|---|---|
| ( | 44,412.54 | 545.99 | 432.3713 | 12,361.73 | 8842.708 | −8.91302 | 105.5697 | 1.428684 | −0.53091 |
| ( | 33,384.6 | 527.72 | 483.9527 | 7722.098 | 5739.791 | −22.8288 | 60.53931 | 0.927357 | −0.31949 |
|
|
|
|
|
|
|
|
|
|
|
| ( | 40,049.89 | 539.91 | 496.785 | 10,450.91 | 7624.494 | −19.8323 | 89.65436 | 1.231862 | −0.28397 |
| ( | 27,482.05 | 533.29 | 252.3898 | 8654.897 | 6388.379 | −27.5835 | 70.08172 | 1.032147 | −0.14205 |
| ( | 28,093.83 | 526.45 | 700.1022 | 6448.297 | 4708.604 | −27.836 | 56.00602 | 0.760752 | −0.057 |
| ( | 26,988.48 | 525.95 | 472.1389 | 6666.584 | 5086.942 | −27.9409 | 59.45506 | 0.821879 | −0.07016 |
| ( | 29,899.82 | 527.44 | 502.4846 | 7288.674 | 5579.637 | −27.4644 | 63.05462 | 0.901482 | −0.15394 |
| ( | 28,553.56 | 524.83 | 740.2056 | 6524.263 | 4830.089 | −28.8038 | 58.05218 | 0.78038 | −0.04547 |
Note: The mathematical indicators were as follows: Akaike information criterion (AIC), mean error (ME), root mean squared error (RMSE), mean absolute error (MAE), maximum permissible error (MPE), mean absolute percentage error (MAPE, %), the mean absolute scaled error (MASE) and ACF1 (auto correlation of errors at lag 1). The bold format of the values represents the type of ARIMA selected in the study, due to its lower AIC, RMSE, etc.
Figure 7The normal distribution QQ plot of the residuals of the ARIMA (0, 2, 2) model.
Comparison of evaluation parameters for different models.
| ( | AIC | ME | RMSE | MAE | MPE | MAPE | MASE | ACF1 |
|---|---|---|---|---|---|---|---|---|
| ( | 1127.64 | 68.91637 | 8058.546 | 5822.154 | −16.5185 | 50.52383 | 0.981823 | −0.40433 |
| ( | 1117.11 | 198.4567 | 7014.965 | 5393.174 | −19.694 | 51.15323 | 0.909482 | −0.01324 |
|
|
|
|
|
|
|
|
|
|
| ( | 1120.18 | 97.27034 | 7371.012 | 5222.038 | −19.3682 | 49.95879 | 0.880622 | −0.10025 |
| ( | 1117.14 | 192.8843 | 7017.028 | 5385.995 | −19.7073 | 51.08738 | 0.908271 | −0.0086 |
| ( | 1119.26 | 121.0036 | 7167.071 | 5176.703 | −19.6148 | 49.82251 | 0.872977 | −0.03197 |
| ( | 1119.07 | 210.9474 | 7011.803 | 5392.778 | −19.7556 | 51.31665 | 0.909415 | −0.01143 |
| ( | 1121.10 | 202.9406 | 7013.89 | 5395.582 | −19.7244 | 51.23666 | 0.909888 | −0.01233 |
Note: The mathematical indicators were as follows: Akaike information criterion (AIC), mean error (ME), root mean squared error (RMSE), mean absolute error (MAE), maximum permissible error (MPE), mean absolute percentage error (MAPE, %), the mean absolute scaled error (MASE) and ACF1 (auto correlation of errors at lag 1). The bold format of the values represents the type of ARIMA selected in the study, due to its lower AIC, RMSE, etc.
Figure 8Time series of weekly data of NAI concentration. Week 1 to 55 was used as a time series of training data. We can see that the time series was highly fluctuating and uneven, and combined with topic 3.2, where the data did not pass the smoothness test, we made 1 difference to the series to make the series smooth, which was the red line. In the ARIMA (0, 1, 1) model time series plot, the black line represented the raw data of NAI concentrations, and the green line represented the predicted values of NAI concentrations, which were closer to the raw values. The blue line represents weekly precipitation (accumulate hourly precipitation into weekly precipitation).
Correlation analysis of weekly data of NAI concentrations with meteorological factors.
| Period | Correlation and Significance | PRE_W | TEM_W | PRS_W | RHU_W | WIN_W | VIS_W |
|---|---|---|---|---|---|---|---|
| Week 1–Week 68 | Pearson correlation | 0.472 ** | 0.154 | −0.384 ** | 0.471 ** | −0.270 | 0.093 |
| Significance | 0.000 | 0.209 | 0.001 | 0.000 | 0.026 | 0.452 | |
|
| 68 | 68 | 68 | 68 | 68 | 68 |
Note: ** p < 0.01. The PRE_W in this table is the hourly precipitation summed to a week, TEM_W stands for average weekly temperature, PRS_W represents the weekly average atmospheric pressure, RHU_W represents the weekly average relative humidity, WIN_W represents the weekly average wind speed, VIS_W represents weekly average visibility. TEM_W, PRS_W, RHU_W, WIN_W and VIS_W are calculated by averaging the raw hourly data over a week.
Multiple linear regression analysis of meteorological factors and NAIs from 69 weeks.
| Period | R | R2 | Durbin-Watson | Meteorological Factors |
|---|---|---|---|---|
| Week 1–Week 68 | 0.591 | 0.349 | 1.574 | PRE_W; PRS_W; RHU_W |
Figure 9The normal distribution QQ plot of the residuals of the ARIMA (0, 1, 1) model.