| Literature DB >> 35564780 |
Yun-Peng Chen1, Le-Fan Liu2, Yang Che3, Jing Huang4, Guo-Xing Li4, Guo-Xin Sang3, Zhi-Qiang Xuan5, Tian-Feng He3,4.
Abstract
The autoregressive integrated moving average with exogenous regressors (ARIMAX) modeling studies of pulmonary tuberculosis (PTB) are still rare. This study aims to explore whether incorporating air pollution and meteorological factors can improve the performance of a time series model in predicting PTB. We collected the monthly incidence of PTB, records of six air pollutants and six meteorological factors in Ningbo of China from January 2015 to December 2019. Then, we constructed the ARIMA, univariate ARIMAX, and multivariate ARIMAX models. The ARIMAX model incorporated ambient factors, while the ARIMA model did not. After prewhitening, the cross-correlation analysis showed that PTB incidence was related to air pollution and meteorological factors with a lag effect. Air pollution and meteorological factors also had a correlation. We found that the multivariate ARIMAX model incorporating both the ozone with 0-month lag and the atmospheric pressure with 11-month lag had the best performance for predicting the incidence of PTB in 2019, with the lowest fitted mean absolute percentage error (MAPE) of 2.9097% and test MAPE of 9.2643%. However, ARIMAX has limited improvement in prediction accuracy compared with the ARIMA model. Our study also suggests the role of protecting the environment and reducing pollutants in controlling PTB and other infectious diseases.Entities:
Keywords: air pollution; meteorological factor; pulmonary tuberculosis; time series
Mesh:
Year: 2022 PMID: 35564780 PMCID: PMC9105987 DOI: 10.3390/ijerph19095385
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Descriptive statistics of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019.
| Variables |
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| Monthly incidence of PTB | 3.85 | 0.50 | 2.77 | 3.48 | 3.80 | 4.17 | 5.03 | 0.69 |
| Air pollutants concentration | ||||||||
| PM2.5 (μg/m3) | 35.79 | 14.73 | 14.35 | 24.30 | 32.64 | 43.95 | 81.42 | 19.65 |
| PM10 (μg/m3) | 57.42 | 21.51 | 24.48 | 41.62 | 52.55 | 69.57 | 121.16 | 27.95 |
| SO2 (μg/m3) | 10.90 | 3.97 | 5.130 | 8.182 | 10.36 | 12.665 | 26.900 | 4.483 |
| CO (mg/m3) | 0.78 | 0.15 | 0.53 | 0.67 | 0.75 | 0.85 | 1.25 | 0.18 |
| NO2 (μg/m3) | 37.47 | 12.25 | 14.68 | 28.76 | 35.36 | 46.19 | 64.52 | 17.43 |
| O3 (μg/m3) | 95.26 | 26.79 | 31.68 | 71.86 | 104.70 | 113.32 | 143.55 | 41.46 |
| Meteorological factors | ||||||||
| MAT (°C) | 17.79 | 7.85 | 5.32 | 10.62 | 18.21 | 24.66 | 30.57 | 14.04 |
| MAHT (°C) | 21.88 | 7.97 | 8.61 | 14.66 | 23.25 | 28.24 | 35.97 | 13.58 |
| MALT (°C) | 14.53 | 8.01 | 2.83 | 6.35 | 14.20 | 22.06 | 26.74 | 15.70 |
| MAH (%) | 76.70 | 5.23 | 63.40 | 73.49 | 77.74 | 80.78 | 86.70 | 7.29 |
| MAP (hPa) | 1015.58 | 7.77 | 1002.69 | 1009.205 | 1015.89 | 1022.41 | 1027.19 | 15.18 |
| MAS (m/s) | 21.36 | 4.72 | 10.67 | 18.64 | 21.75 | 25.18 | 32.81 | 6.54 |
Notes: IQR = P75 − P25; MAT: monthly average temperature; MAHT: monthly average highest temperature; MALT: monthly average lowest temperature; MAH: monthly average relative humidity; MAP: monthly average atmospheric pressure; MAS: monthly average wind speed.
Figure 1Time series of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019.
The optimal ARIMA model for monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2018.
| Variables | Model | BIC | Ljung–Box Test | |
|---|---|---|---|---|
|
|
| 18.2228 | 2.1833 | 0.1395 |
| Air pollutants concentration | ||||
| PM2.5 (μg/m3) | ARIMA (1,0,0)(0,1,0)12 | 266.6336 | 0.7367 | 0.3907 |
| PM10 (μg/m3) | ARIMA (1,0,0)(0,1,0)12 | 298.6734 | 0.6880 | 0.4068 |
| SO2 (μg/m3) | ARIMA (1,0,0)(0,1,0)12 | 161.6301 | 0.5341 | 0.4649 |
| CO (mg/m3) | ARIMA (1,0,0)(1,1,0)12 | −74.6422 | 1.3417 | 0.2467 |
| NO2 (μg/m3) | ARIMA (1,0,0)(0,1,1)12 | 238.207 | 0.7107 | 0.3992 |
| O3 (μg/m3) | ARIMA (0,0,1)(1,1,0)12 | 297.4298 | 0.0011 | 0.9736 |
| Meteorological factors | ||||
| MAT (°C) | ARIMA (0,0,0)(0,1,1)12 | 126.2939 | 2.2553 | 0.1332 |
| MAHT (°C) | ARIMA (1,0,0)(0,1,1)12 | 148.7069 | 0.1210 | 0.7280 |
| MALT (°C) | ARIMA (0,0,0)(0,1,0)12 | 139.3857 | 2.5996 | 0.1069 |
| MAH (%) | ARIMA (0,0,0)(1,1,0)12 | 224.0759 | 0.0009 | 0.9757 |
| MAP (hPa) | ARIMA (0,0,0)(1,1,0)12 | 250.4339 | 0.2565 | 0.6125 |
| MAS (m/s) | ARIMA (0,1,1) | 262.4965 | 0.4851 | 0.4861 |
CCF coefficients between the prewhitened ambient factors residuals series with different time lags and the prewhitened PTB incidence residuals series from 2015 to 2018.
| Factors | Lag Periods (Months) | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | |
| PM2.5 | 0.040 | 0.145 | −0.125 | 0.353 * | 0.028 | 0.124 | −0.018 | 0.164 | −0.175 | 0.151 | 0.013 | −0.122 | 0.112 |
| PM10 | 0.068 | 0.166 | −0.114 | 0.343 * | 0.041 | 0.076 | −0.049 | 0.209 | −0.172 | 0.190 | −0.007 | −0.110 | 0.077 |
| SO2 | 0.202 | 0.069 | −0.127 | 0.176 | 0.125 | −0.161 | −0.034 | 0.151 | −0.211 | 0.031 | 0.103 | −0.270 | 0.145 |
| CO | −0.013 | 0.075 | −0.028 | −0.086 | −0.111 | 0.100 | −0.036 | 0.115 | −0.072 | 0.291 * | −0.083 | 0.165 | 0.000 |
| NO2 | 0.034 | −0.100 | −0.106 | 0.044 | 0.073 | 0.013 | −0.164 | 0.008 | −0.209 | 0.278 | −0.075 | −0.064 | 0.073 |
| O3 | −0.321 * | −0.135 | −0.279 | −0.085 | −0.014 | 0.026 | −0.064 | 0.072 | 0.111 | 0.081 | 0.113 | −0.151 | 0.216 |
| MAT | −0.206 | −0.406 * | −0.102 | −0.316 * | −0.002 | −0.081 | 0.034 | 0.093 | 0.022 | −0.126 | −0.095 | 0.146 | 0.124 |
| MAHT | 0.093 | −0.215 | −0.040 | −0.171 | 0.155 | −0.014 | 0.148 | 0.241 | 0.162 | 0.080 | −0.029 | 0.177 | 0.174 |
| MALT | −0.198 | −0.310 * | −0.040 | −0.423 * | −0.094 | −0.085 | −0.077 | −0.158 | −0.018 | −0.133 | 0.014 | 0.149 | 0.122 |
| MAH | −0.136 | −0.016 | −0.019 | −0.240 | −0.238 | 0.073 | −0.053 | −0.307 * | −0.074 | −0.273 | −0.036 | −0.085 | −0.157 |
| MAP | 0.029 | −0.136 | −0.299 * | −0.030 | 0.105 | −0.252 | −0.151 | 0.066 | 0.053 | −0.248 | 0.027 | −0.335 * | −0.066 |
| MAS | −0.091 | 0.125 | −0.112 | −0.205 | −0.169 | −0.098 | 0.068 | 0.045 | 0.091 | −0.199 | 0.038 | −0.053 | −0.240 |
Note: *, p < 0.05.
CCF coefficients between the prewhitened air pollutants concentration and meteorological factors residuals series at lag 0 between 2015 and 2018.
| Air Pollutants | Meteorological Factors | |||||
|---|---|---|---|---|---|---|
| MAT | MAHT | MALT | MAH | MAP | MAS | |
| PM2.5 | −0.317 * | −0.154 | −0.527 * | −0.477 * | 0.022 | −0.289 * |
| PM10 | −0.319 * | −0.136 | −0.586 * | −0.605 * | 0.066 | −0.248 |
| SO2 | −0.249 | −0.059 | −0.457 * | −0.529 * | 0.428 * | −0.098 |
| CO | −0.249 | −0.097 | −0.115 | −0.059 | −0.199 | −0.150 |
| NO2 | −0.272 | −0.201 | −0.308 * | −0.400 * | −0.016 | −0.437 * |
| O3 | 0.048 | 0.122 | −0.161 | −0.403 * | 0.318 * | 0.027 |
Note: *, p < 0.05.
Summary of the fitted parameters of the optimal ARIMA, univariate ARIMAX, and the multivariate ARIMAX model analysis in Ningbo, 2016–2018.
| Model | BIC | MAPE(%) | Risk Factors | |||||
|---|---|---|---|---|---|---|---|---|
| Fitted | Test | Vars | Coef | S.E. | T | |||
| (1) ARIMA(0,0,0)(1,1,0)12 | 9.0376 | 3.3269 | 10.6693 | sar1 | −0.5829 | 0.1907 | 3.0566 | 0.0021 * |
| (2) ARIMA(0,0,0)(1,1,0)12+PM2.5(lag3) d | 12.1682 | 3.3748 | 10.5262 d | sar1 | −0.5458 | 0.2720 | 2.0064 | 0.0264 * |
| PM2.5(lag3) | 0.0019 | 0.0087 | 0.2147 | 0.4156 | ||||
| (3) ARIMA(0,0,0)(1,1,0)12+PM10(lag3) d | 12.0662 | 3.3833 | 10.4819 d | sar1 | −0.5355 | 0.2460 | 2.1767 | 0.0183 * |
| PM10(lag3) | 0.0018 | 0.0048 | 0.3809 | 0.3528 | ||||
| (4) ARIMA(0,0,0)(1,1,0)12+CO(lag9) d | 11.1900 | 3.2306 c | 9.5569 d | sar1 | −0.5843 | 0.1903 | 3.0707 | 0.0021 * |
| PM10(lag3) | 0.0042 | 0.0041 | 1.0221 | 0.1570 | ||||
| (5) ARIMA(0,0,0)(1,1,0)12+O3(lag0) abcd | 8.2634 a | 3.0226 c | 9.7944 d | sar1 | −0.6418 | 0.1717 | 3.7374 | 0.0003 * |
| O3(lag0) | −0.0061 | 0.0029 | 2.0751 | 0.0228 * b | ||||
| (6) ARIMA(0,0,0)(1,1,0)12+MAT(lag1) abc | −1.4075 a | 2.3205c | 12.0531 | sar1 | −0.5673 | 0.1978 | 2.8679 | 0.0035 * |
| MAT(lag1) | −0.1317 | 0.0310 | 4.2421 | <0.0001 * b | ||||
| (7) ARIMA(0,0,0)(1,1,0)12+MAT(lag3) | 10.5244 | 3.3860 | 11.0464 | sar1 | −0.5146 | 0.2169 | 2.3730 | 0.0117 * |
| MAT(lag3) | −0.0595 | 0.0443 | 1.3410 | 0.0944 | ||||
| (8) ARIMA(0,0,0)(1,1,0)12+MALT(lag1) abc | 7.8580 a | 2.8932 c | 12.4087 | sar1 | −0.4583 | 0.2670 | 1.7165 | 0.0476 * |
| MALT(lag1) | −0.0697 | 0.0357 | 1.9497 | 0.0297 * b | ||||
| (9) ARIMA(0,0,0)(1,1,0)12+MALT(lag3) abc | 8.0334 a | 3.2171 c | 11.4344 | sar1 | −0.5035 | 0.2217 | 2.2709 | 0.0148 * |
| MALT(lag3) | −0.0736 | 0.0347 | 2.1198 | 0.0207 * b | ||||
| (10) ARIMA(0,0,0)(1,1,0)12+MAH(lag7) cd | 10.2365 | 3.1969 c | 10.4618 d | sar1 | −0.6337 | 0.1732 | 3.6587 | 0.0004 * |
| MAH(lag7) | −0.0157 | 0.0107 | 1.4625 | 0.0764 | ||||
| (11) ARIMA(0,0,0)(1,1,0)12+MAP(lag2) c | 9.9590 | 3.1324 c | 10.9889 | sar1 | −0.6733 | 0.1607 | 4.1897 | <0.0001 |
| MAP(lag2) | −0.0095 | 0.0059 | 1.6023 | 0.0592 | ||||
| (12) ARIMA(0,0,0)(1,1,0)12+MAP(lag11) abcd | 8.4574 a | 3.2063 c | 10.0108 d | sar1 | −0.4940 | 0.2186 | 2.2601 | 0.0152 * |
| MAP(lag11) | −0.0125 | 0.0060 | 2.0680 | 0.0232 * b | ||||
| (13) ARIMA(0,0,0)(1,1,0)12+ O3(lag0)+MAP(lag11) abcd | 8.1092 a | 2.9097 c | 9.2643 d | sar1 | −0.5608 | 0.2018 | 2.7791 | 0.0045 * |
| O3(lag0) | −0.0054 | 0.0028 | 1.9383 | 0.0306 * b | ||||
| MAP(lag11) | −0.0115 | 0.0059 | 1.9461 | 0.0301 * b | ||||
Notes: Fitted: fitted results; Test: test results; BIC: Bayesian information criterion; MAPE: mean absolute percentage error; Coef: coefficient of risk factors; lag: time lag of risk factors; S.E.: standard error; T: t statistic; sar1: seasonal AR (1); a: meet criteria (a): the BIC value smaller than the optimal ARIMA model; b: meet criteria (b): the coefficients of the regression term all significant (p < 0.05); c: meet criteria (c): the fitted MAPE value smaller than the optimal ARIMA model; d: meet criteria (d): the test MAPE value smaller than the optimal ARIMA model; *: p < 0.05.
Figure 2Actual incidence, fitted and predicted incidences of ARIMA (0,0,0) (1,1,0)12 and ARIMA (0,0,0) (1,1,0)12 with O3 (0-month lag) and MAP (11-month lag) in Ningbo.