Literature DB >> 35564780

Modeling and Predicting Pulmonary Tuberculosis Incidence and Its Association with Air Pollution and Meteorological Factors Using an ARIMAX Model: An Ecological Study in Ningbo of China.

Yun-Peng Chen1, Le-Fan Liu2, Yang Che3, Jing Huang4, Guo-Xing Li4, Guo-Xin Sang3, Zhi-Qiang Xuan5, Tian-Feng He3,4.   

Abstract

The autoregressive integrated moving average with exogenous regressors (ARIMAX) modeling studies of pulmonary tuberculosis (PTB) are still rare. This study aims to explore whether incorporating air pollution and meteorological factors can improve the performance of a time series model in predicting PTB. We collected the monthly incidence of PTB, records of six air pollutants and six meteorological factors in Ningbo of China from January 2015 to December 2019. Then, we constructed the ARIMA, univariate ARIMAX, and multivariate ARIMAX models. The ARIMAX model incorporated ambient factors, while the ARIMA model did not. After prewhitening, the cross-correlation analysis showed that PTB incidence was related to air pollution and meteorological factors with a lag effect. Air pollution and meteorological factors also had a correlation. We found that the multivariate ARIMAX model incorporating both the ozone with 0-month lag and the atmospheric pressure with 11-month lag had the best performance for predicting the incidence of PTB in 2019, with the lowest fitted mean absolute percentage error (MAPE) of 2.9097% and test MAPE of 9.2643%. However, ARIMAX has limited improvement in prediction accuracy compared with the ARIMA model. Our study also suggests the role of protecting the environment and reducing pollutants in controlling PTB and other infectious diseases.

Entities:  

Keywords:  air pollution; meteorological factor; pulmonary tuberculosis; time series

Mesh:

Year:  2022        PMID: 35564780      PMCID: PMC9105987          DOI: 10.3390/ijerph19095385

Source DB:  PubMed          Journal:  Int J Environ Res Public Health        ISSN: 1660-4601            Impact factor:   4.614


1. Introduction

Tuberculosis (TB) is a chronic infectious disease caused by Mycobacterium tuberculosis, and it often affects the lungs. WHO proposed an End TB Strategy in 2014, with the targets to reduce TB deaths by 95% and to cut incident cases by 90% between 2015 and 2035 [1]. The cumulative reduction in TB incidence from 2015 to 2020 was 11%, just over half-way to the 2020 milestone of the strategy [2]. To achieve this ambitious goal, accurate prediction of disease trends, as well as related factors, is of great importance. The autoregressive integrated moving average (ARIMA) model, also known as the Box–Jenkins model, is a commonly used model in time series analysis. Despite its effectiveness to study time series, the ARIMA model applies only to one variable. The autoregressive integrated moving average with exogenous regressors (ARIMAX) model, however, adds other variables related to the target series as input variables to improve the prediction accuracy. Unlike the ARIMA model, the autoregressive integrated moving average with exogenous regressors (ARIMAX) model adds other variables related to the target series as input variables to improve the prediction accuracy. Several ARIMAX studies suggested that weather parameters such as temperature, rainfall, humidity, wind speed, and air pollutants may influence the occurrence of disease [3,4,5,6,7]. ARIMAX modeling studies of tuberculosis are still rare. A time series study in Jiangsu, China, showed that the prediction accuracy of the ARIMA model for PTB was improved by adding monthly PM2.5 with 0-month lag as an external variable [8]. Another time series study in eastern China also indicated that the predictive performance of the ARIMA model was improved after incorporating meteorological factors [9]. Several studies have proposed the biological mechanisms linking air pollution exposure and the risk of PTB. For example, PM2.5 exposure disrupted the synthesis and secretion of inflammatory cytokines including interferon (IFN)-γ, tumor necrosis factor (TNF)-α, and interleukin (IL)-10 and impaired key anti-mycobacterial T cell immune functions [10]. In addition, blocking the IL-10 pathway and downregulating TNF-α by CO in lung macrophages may promote the reactivation of PTB [11]. Therefore, we predicted a positive correlation between pollutant concentration and tuberculosis incidence. The possible link between PTB and meteorological factors may be attributable to the following reasons. The risk of Mycobacterium tuberculosis transmission appears to be the greatest during the cold winter, particularly in overcrowded and poorly ventilated settings [12]. Less humidity leads to the evaporation of droplets, reduces their size, and escalates their ability to travel further, which increases the possibility of transmission [13]. The atmosphere pressure has an indirect negative effect on the incidence of PTB. Low-level air rises under low-pressure conditions, and surface pollutants diffuse vertically into the air, resulting in increased air pollution. Therefore, we predicted a negative correlation between meteorological factors and tuberculosis incidence [14]. To our knowledge, few studies used the ARIMAX model to incorporate both the air pollution and meteorological factors to predict PTB. Thus, in the current study, we performed a time series analysis in Ningbo, China, and applied ARIMA models (ARIMA, univariate ARIMAX and multivariate ARIMAX model) to explore whether the inclusion of air pollution and meteorological factors can improve the performance of prediction modeling.

2. Materials and Methods

2.1. Study Site and Data Collection

As a city in Zhejiang province located along the eastern coast of China, Ningbo covers an area of 98,000 thousand square kilometers. It governed 10 counties and had a permanent population of 9.4 million at the end of 2020. All newly diagnosed TB cases are registered in an online Tuberculosis Management Information System (TBIMS), which is operated by the Center for Disease Control and Prevention (CDC) of China. We extracted monthly incidence of PTB from January 2015 to December 2019 as the study subjects. Population data were obtained from the Ningbo Statistical Yearbook. We used the monthly incidence from January 2015 to December 2018 as the model-construction datasets and incidence from January 2019 to December 2019 as the validation datasets. The monthly average concentrations of the ambient air pollutants including nitrogen dioxide (NO2), carbon monoxide (CO), sulfur dioxide (SO2), ozone (O3), particulate matter 2.5 μm in diameter (PM2.5), and particulate matter 10μm in diameter (PM10) for the same period were obtained from the Ningbo Environment Monitoring Center. Meteorological factors included monthly average temperature (MAT, °C), monthly average highest temperature (MAHT, °C), monthly average lowest temperature (MALT, °C), monthly average relative humidity (MAH, %), monthly average atmospheric pressure (MAP, hPa), and monthly average wind speed (MAS, m/s) and data were obtained from the Ningbo Meteorological Bureau.

2.2. Construction of the ARIMA Model

Following Li et al. [9], we constructed a seasonal ARIMA model, which was expressed as ARIMA (p, d, q) (P, D, Q)s. The variable p is the order of the autoregression (AR) process, q is the number of moving average (MA) terms, d represents the differencing process to form a stationary times series, and P, D, and Q are the seasonal orders of the AR, differencing, and MA processes, respectively [3]. Additionally, s denotes the seasonal period. The number of PTB incidence predicted at time t (Y) was determined by the formula: , where θ(B) is the operator of the moving-average model, Θ(B) is the operator of the seasonal-moving average model, φ(B) is the operator of the auto-regressive model, Φ(B) is the operator of the seasonal autoregressive model, (1−B) is the component of the ordinary differences, (1−B) is the component of the seasonal differences, a is white noise, and Y is the predicted variable [9]. Based on the monthly incidence of PTB, we used the auto. arima ( ) function in R software (the R Core Team, Vienna, Austria) to undertake automatic ARIMA model selection and fitting according to Bayesian information criterion (BIC). BIC takes into account the number of observations and has a larger penalty compared with Akaike Information Criterion (AIC) [15]. When the number of observations is too large, it can effectively prevent the model from being too complicated and over-fitting. The model with the lowest BIC is defined as the optimal model. The Ljung–Box test was used to test whether the model residual sequence showed auto-correlation. Finally, we selected the optimal ARIMA model to predict PTB incidence in 2019 and mean absolute percentage error (MAPE) was used to model validation.

2.3. Cross-Correlation Analysis

Due to strong auto-correlations in the data, correlations of the time series of the monthly incidence of PTB with air pollution and meteorological factors were difficult to identify. In this study, a prewhitening process was applied to the data among the multiple exogenous regressors [3]. Prewhitening is used to avoid common trends between incidence and ambient factors [16]. We calculated the cross-correlation function (CCF) between the residual series from ARIMA model of the monthly incidence and ambient factors to identify the significant time lags. Based on the results from previous studies as well as biological and epidemiological plausibility, within a lag length of 12, we selected the lag periods with positive and significant values for the next step of the analysis [17].

2.4. Construction of the ARIMAX Model

The lag value of the statistically significant factors identified by the cross-correlation analysis was incorporated as exogenous regressors into the ARIMAX model constructed above. The ARIMAX model is described by equation: , where X represents the exogenous regressor, which can be univariate or multivariate [9,16]. The other parameters are as described in the ARIMA model above. In this study, we first took a single lag period of risk factor in the univariate ARIMAX model. The coefficients of the model were estimated using the maximum likelihood method. Fitted MAPE was used to measure the performance of the ARIMA and the ARIMAX model in predicting PTB incidence from 2016 to 2018, and test MAPE was used to measure the performance in 2019. We determined the suitable univariate ARIMAX models according to four criteria: (a) the BIC value smaller than the optimal ARIMA model; (b) the coefficients of the regression term all significant (p < 0.05); (c) the fitted MAPE value smaller than the optimal ARIMA model; and (d) the test MAPE value smaller than the optimal ARIMA model. For the multivariate ARIMAX analysis, the incorporated factors were selected from the factors in the suitable univariate ARIMAX models. The suitable multivariate ARIMAX models were also determined by four criteria above. The optimal ARIMAX model should have the lowest BIC, fitted MAPE, and test MAPE values.

2.5. Statistical Software

We used the packages of “forecast”, “Stats”, and “ggplot2” of R 3.5.1 (the R Core Team, Vienna, Austria) (https://www.r-project.org/, accessed on 8 Janunary 2022) to estimate the ARIMA and ARIMAX models and present data visualization. The R code can be found in Supplementary File S1. The significance level was set at 0.05.

3. Results

3.1. Descriptive Analysis

Descriptive statistics of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019 are listed in Table 1. Time series plots of these are shown in Figure 1. The annual PTB notification rates from 2015 to 2019 of Ningbo were 48.20/100,000, 45.81/100,000, 45.99/10,000, 46.92/100,000, and 44.14/100,000, respectively. The monthly incidence series plot showed a seasonal fluctuation. The peak incidence mainly occurred in March, April, and May, and the trough was more common in November, December, and February. The mean concentrations of PM2.5, PM10, SO2, CO, NO2, O3, were 35.79 μg/m3, 57.42 μg/m3, 10.90 μg/m3, 0.78 mg/m3, 37.47 μg/m3, and 95.26 μg/m3, respectively, compared with the World Health Organization air quality guideline levels of 5 μg/m3, 15 μg/m3, 40 μg/m3, 4 mg/m3, 10 μg/m3, and 60 μg/m3, respectively [18].
Table 1

Descriptive statistics of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019.

Variables Mean S.D. Min P 25 P 50 P 75 MAX IQR
Monthly incidence of PTB3.850.502.773.483.804.175.030.69
Air pollutants concentration
PM2.5 (μg/m3)35.7914.7314.3524.3032.6443.9581.4219.65
PM10 (μg/m3)57.4221.5124.4841.6252.5569.57121.1627.95
SO2 (μg/m3)10.903.975.1308.18210.3612.66526.9004.483
CO (mg/m3)0.780.150.530.670.750.851.250.18
NO2 (μg/m3)37.4712.2514.6828.7635.3646.1964.5217.43
O3 (μg/m3)95.2626.7931.6871.86104.70113.32143.5541.46
Meteorological factors
MAT (°C)17.797.855.3210.6218.2124.6630.5714.04
MAHT (°C)21.887.978.6114.6623.2528.2435.9713.58
MALT (°C)14.538.012.836.3514.2022.0626.7415.70
MAH (%)76.705.2363.4073.4977.7480.7886.707.29
MAP (hPa)1015.587.771002.691009.2051015.891022.411027.1915.18
MAS (m/s)21.364.7210.6718.6421.7525.1832.816.54

Notes: IQR = P75 − P25; MAT: monthly average temperature; MAHT: monthly average highest temperature; MALT: monthly average lowest temperature; MAH: monthly average relative humidity; MAP: monthly average atmospheric pressure; MAS: monthly average wind speed.

Figure 1

Time series of the monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2019.

3.2. ARIMA Model

The optimal ARIMA model constructed by monthly incidence of PTB from 2015 to 2018 according to BIC was the ARIMA (0,0,0)(1,1,0)12 model with a BIC of 18.2228. The optimal ARIMA model for PM2.5, PM10, SO2, CO, NO2, O3, MAT, MAHT, MALT, MAH, MAP, and MAS from 2015 to 2018 is listed in Table 2. The Ljung–Box test indicated that residual sequence from all models did not significantly depart from a white noise sequence (p > 0.05).
Table 2

The optimal ARIMA model for monthly incidence of PTB, the monthly average air pollutants concentration, and meteorological factors in Ningbo from 2015 to 2018.

VariablesModelBICLjung–Box Test
X-Squaredp-Value
Monthly incidence of PTB ARIMA(0,0,0)(1,1,0)12 18.22282.18330.1395
Air pollutants concentration
PM2.5 (μg/m3)ARIMA (1,0,0)(0,1,0)12266.63360.73670.3907
PM10 (μg/m3)ARIMA (1,0,0)(0,1,0)12298.67340.68800.4068
SO2 (μg/m3)ARIMA (1,0,0)(0,1,0)12161.63010.53410.4649
CO (mg/m3)ARIMA (1,0,0)(1,1,0)12−74.64221.34170.2467
NO2 (μg/m3)ARIMA (1,0,0)(0,1,1)12238.2070.71070.3992
O3 (μg/m3)ARIMA (0,0,1)(1,1,0)12297.42980.00110.9736
Meteorological factors
MAT (°C)ARIMA (0,0,0)(0,1,1)12126.29392.25530.1332
MAHT (°C)ARIMA (1,0,0)(0,1,1)12148.70690.12100.7280
MALT (°C)ARIMA (0,0,0)(0,1,0)12139.38572.59960.1069
MAH (%)ARIMA (0,0,0)(1,1,0)12224.07590.00090.9757
MAP (hPa)ARIMA (0,0,0)(1,1,0)12250.43390.25650.6125
MAS (m/s)ARIMA (0,1,1)262.49650.48510.4861

3.3. Cross-Correlation Analysis

3.3.1. CCF between the PTB and Ambient Factors

After prewhitening, we calculated the cross-correlation function (CCF) coefficients between the PTB and each ambient factor at different lag periods. Table 3 shows that the PTB was positively correlated with the PM2.5 (3-month lag), PM10 (3-month lag), and CO (9-month lag) and negatively correlated with O3 (0-month lag), MAT (1, 3-month lag), MALT (1,3-month lag), MAH (7-month lag), and MAP (2, 11-month lag) (p < 0.05).
Table 3

CCF coefficients between the prewhitened ambient factors residuals series with different time lags and the prewhitened PTB incidence residuals series from 2015 to 2018.

FactorsLag Periods (Months)
0123456789101112
PM2.50.0400.145−0.1250.353 *0.0280.124−0.0180.164−0.1750.1510.013−0.1220.112
PM100.0680.166−0.1140.343 *0.0410.076−0.0490.209−0.1720.190−0.007−0.1100.077
SO20.2020.069−0.1270.1760.125−0.161−0.0340.151−0.2110.0310.103−0.2700.145
CO−0.0130.075−0.028−0.086−0.1110.100−0.0360.115−0.0720.291 *−0.0830.1650.000
NO20.034−0.100−0.1060.0440.0730.013−0.1640.008−0.2090.278−0.075−0.0640.073
O3−0.321 *−0.135−0.279−0.085−0.0140.026−0.0640.0720.1110.0810.113−0.1510.216
MAT−0.206−0.406 *−0.102−0.316 *−0.002−0.0810.0340.0930.022−0.126−0.0950.1460.124
MAHT0.093−0.215−0.040−0.1710.155−0.0140.1480.2410.1620.080−0.0290.1770.174
MALT−0.198−0.310 *−0.040−0.423 *−0.094−0.085−0.077−0.158−0.018−0.1330.0140.1490.122
MAH−0.136−0.016−0.019−0.240−0.2380.073−0.053−0.307 *−0.074−0.273−0.036−0.085−0.157
MAP0.029−0.136−0.299 *−0.0300.105−0.252−0.1510.0660.053−0.2480.027−0.335 *−0.066
MAS−0.0910.125−0.112−0.205−0.169−0.0980.0680.0450.091−0.1990.038−0.053−0.240

Note: *, p < 0.05.

3.3.2. CCF between the Air Pollutants and Meteorological Factors

We also calculated the CCF coefficients between prewhitened air pollutants concentration and meteorological factors residual series at lag 0 between 2015 and 2018. Table 4 shows that PM2.5 and PM10 were significant negatively correlated with MAT, MALT, and MAH; SO2 was significant negatively correlated with MALT and MAH; NO2 was significant negatively correlated with MALT, MAH, and MAS; O3 was significant negatively correlated with MAH, but significant positively correlated with MAP.
Table 4

CCF coefficients between the prewhitened air pollutants concentration and meteorological factors residuals series at lag 0 between 2015 and 2018.

Air PollutantsMeteorological Factors
MATMAHTMALTMAHMAPMAS
PM2.5−0.317 *−0.154−0.527 *−0.477 *0.022−0.289 *
PM10−0.319 *−0.136−0.586 *−0.605 *0.066−0.248
SO2−0.249−0.059−0.457 *−0.529 *0.428 *−0.098
CO−0.249−0.097−0.115−0.059−0.199−0.150
NO2−0.272−0.201−0.308 *−0.400 *−0.016−0.437 *
O30.0480.122−0.161−0.403 *0.318 *0.027

Note: *, p < 0.05.

3.4. Univariate and Multivariate ARIMAX Analyses

Based on the cross-correlation analysis, we tested the univariate ARIMAX model by incorporating different lag periods of risk factors as exogenous regressors based on the ARIMA (0,0,0)(1,1,0)12 model from PTB time series. Because different ambient factors have different lag periods, from 1 to 11 months, in order to compare the performance of different univariate ARIMAX models and the optimal ARIMA model, it is necessary to make the sequence length of modeling consistent. Therefore, the monthly incidence of PTB and environmental factors with different lag periods from January 2016 to December 2018 were selected to construct the ARIMAX model and the modeling times were all 36 months. As shown in Table 5, the ARIMAX (0,0,0)(1,1,0)12 model with the O3 (0-month lag) and ARIMAX (0,0,0)(1,1,0)12 model with the MAP (11-month lag) is the optimal univariate ARIMAX model in terms of the four criteria being considered. The BIC value, fitted and test MAPE of the two univariate ARIMAX models were all smaller than the ARIMA model, and the coefficients of the regression term of these two univariate ARIMAX models were all significant (p < 0.05). Then, we tested the multivariate ARIMAX model by incorporating both the O3 (0-month lag) and the MAP (11-month lag) as exogenous regressors based on the ARIMA (0,0,0)(1,1,0)12 model, and the coefficients of the regression term of the multivariate ARIMAX model were all significant (p < 0.05). Compared with the optimal ARIMA and two suitable univariate ARIMAX models, the multivariate ARIMAX model was the best fitting model, with the lowest BIC (8.1092), fitted MAPE (2.9097%), and test MAPE (9.2643%). Finally, the actual incidence, fitted and predicted incidences of PTB using ARIMA (0,0,0)(1,1,0)12 and ARIMA (0,0,0)(1,1,0)12 with O3 (0-month lag) and MAP (11-month lag) in Ningbo were shown in Figure 2.
Table 5

Summary of the fitted parameters of the optimal ARIMA, univariate ARIMAX, and the multivariate ARIMAX model analysis in Ningbo, 2016–2018.

ModelBICMAPE(%)Risk Factors
FittedTestVarsCoefS.E.Tp-Value
(1) ARIMA(0,0,0)(1,1,0)129.03763.326910.6693sar1−0.58290.19073.05660.0021 *
(2) ARIMA(0,0,0)(1,1,0)12+PM2.5(lag3) d12.16823.374810.5262 dsar1−0.54580.27202.00640.0264 *
PM2.5(lag3)0.00190.00870.21470.4156
(3) ARIMA(0,0,0)(1,1,0)12+PM10(lag3) d12.06623.383310.4819 dsar1−0.53550.24602.17670.0183 *
PM10(lag3)0.00180.00480.38090.3528
(4) ARIMA(0,0,0)(1,1,0)12+CO(lag9) d11.19003.2306 c9.5569 dsar1−0.58430.19033.07070.0021 *
PM10(lag3)0.00420.00411.02210.1570
(5) ARIMA(0,0,0)(1,1,0)12+O3(lag0) abcd8.2634 a3.0226 c9.7944 dsar1−0.64180.17173.73740.0003 *
O3(lag0)−0.00610.00292.07510.0228 * b
(6) ARIMA(0,0,0)(1,1,0)12+MAT(lag1) abc−1.4075 a2.3205c12.0531sar1−0.56730.19782.86790.0035 *
MAT(lag1)−0.13170.03104.2421<0.0001 * b
(7) ARIMA(0,0,0)(1,1,0)12+MAT(lag3)10.52443.386011.0464sar1−0.51460.21692.37300.0117 *
MAT(lag3)−0.05950.04431.34100.0944
(8) ARIMA(0,0,0)(1,1,0)12+MALT(lag1) abc7.8580 a2.8932 c12.4087sar1−0.45830.26701.71650.0476 *
MALT(lag1)−0.06970.03571.94970.0297 * b
(9) ARIMA(0,0,0)(1,1,0)12+MALT(lag3) abc8.0334 a3.2171 c11.4344sar1−0.50350.22172.27090.0148 *
MALT(lag3)−0.07360.03472.11980.0207 * b
(10) ARIMA(0,0,0)(1,1,0)12+MAH(lag7) cd10.23653.1969 c10.4618 dsar1−0.63370.17323.65870.0004 *
MAH(lag7)−0.01570.01071.46250.0764
(11) ARIMA(0,0,0)(1,1,0)12+MAP(lag2) c9.95903.1324 c10.9889sar1−0.67330.16074.1897<0.0001
MAP(lag2)−0.00950.00591.60230.0592
(12) ARIMA(0,0,0)(1,1,0)12+MAP(lag11) abcd8.4574 a3.2063 c10.0108 dsar1−0.49400.21862.26010.0152 *
MAP(lag11)−0.01250.00602.06800.0232 * b
(13) ARIMA(0,0,0)(1,1,0)12+ O3(lag0)+MAP(lag11) abcd8.1092 a2.9097 c9.2643 dsar1−0.56080.20182.77910.0045 *
O3(lag0)−0.00540.00281.93830.0306 * b
MAP(lag11)−0.01150.00591.94610.0301 * b

Notes: Fitted: fitted results; Test: test results; BIC: Bayesian information criterion; MAPE: mean absolute percentage error; Coef: coefficient of risk factors; lag: time lag of risk factors; S.E.: standard error; T: t statistic; sar1: seasonal AR (1); a: meet criteria (a): the BIC value smaller than the optimal ARIMA model; b: meet criteria (b): the coefficients of the regression term all significant (p < 0.05); c: meet criteria (c): the fitted MAPE value smaller than the optimal ARIMA model; d: meet criteria (d): the test MAPE value smaller than the optimal ARIMA model; *: p < 0.05.

Figure 2

Actual incidence, fitted and predicted incidences of ARIMA (0,0,0) (1,1,0)12 and ARIMA (0,0,0) (1,1,0)12 with O3 (0-month lag) and MAP (11-month lag) in Ningbo.

4. Discussion

In this study, we developed and evaluated time series models to characterize the ambient factors of pulmonary tuberculosis transmission in Ningbo to inform control measures. To our knowledge, this is the first time series study to construct ARIMAX models to explore the role of both air pollution and meteorological factors in predicting PTB. We found that both air pollution and meteorological factors were associated with the incidence of tuberculosis in Ningbo with a lag effect. Additionally, the multivariate ARIMAX model that included both the ozone at lag 0 and the monthly mean atmospheric pressure at lag 11 had a better predicting performance than the inclusion of one variable. This modeling technique can be a useful tool for planning control interventions and could be implemented during routine tuberculosis surveillance in Ningbo. Although preventive measures have made great progress, the prevention and treatment of tuberculosis still involves enormous challenges, such as increased drug resistance [19,20], dual infection of tuberculosis and AIDS [21], and increased migrant population [22]. Furthermore, urban air quality is an important potential factor in the contribution of tuberculosis infection. There is a growing body of evidence suggesting an association between air pollution exposure and PTB incidence. However, available evidence on the association of air pollution and PTB risk is inconsistent [23]. Our study found a significantly positive correlation of particulate matter with an aerodynamic diameter ≤2.5 μm (PM2.5) or ≤10 μm (PM10) with PTB incidence, which is consistent with other studies [24,25]. Moreover, we also found that CO was positively correlated with PTB incidence, while O3 was negatively correlated with PTB incidence, which is consistent with other studies [11,24]. Limited evidence from vitro studies has suggested that the survival rate of mice intravenously inoculated with Mycobacterium tuberculosis after intravenous injection of dissolved ozone was significantly higher than that of mice not treated with ozone [26]. The association between PTB and air pollution exposure varied across regions, which may be partially attributed to pollutant concentration or analytic methods. Our study site is a coastal city of South China, the concentrations of air pollutants in our study (PM2.5: 35.79 ± 14.73 μg/m3, SO2: 10.90 ± 3.97 μg/m3, NO2: 37.47 ± 12.25 μg/m3) were much lower than those in mega cities such as Beijing (PM2.5: 105.10 ± 80.90 μg/m3, SO2: 48.60 ± 49.10 μg/m3, NO2: 64.20 ± 25.70 μg/m3) [27]. Evidence from the area with lower air pollution levels will be important to strengthen the basis for policy making [28]. Meteorological factors, including MAT, MALT, MAH, and MAP, had a negative effect on PTB with a lag effect in our study, which are largely consistent with the findings in previous studies [14,29]. Climate change affects tuberculosis through diverse pathways: changes in climatic factors such as temperature, humidity, and precipitation influence host response through alterations in vitamin D distribution, ultraviolet radiation, malnutrition, and other risk factors [30]. Our research also found that PM2.5 and PM10 were significant negatively correlated with MAT, MALT, and MAH, and O3 was significant negatively correlated with MAH, but significant positively correlated with MAP. The results of our study support previous findings that climatic factors could affect the incidence of tuberculosis by indirectly regulating urban air quality [14]. Our study used incidence of PTB from 2016 to 2018 and delayed ambient factors modeling to predict TB incidence in 2019 and compare it with the actual incidence. Cross-validating to calculate test MAPE could avoid the over-fitting of the model caused by incorporating ambient factors. For example, ARIMA (0,0,0)(1,1,0)12 + MAT (lag1) has the smallest fitted MAPE of 2.3205%, but its test MAPE is as high as 12.0531%, which is an obvious over-fitting phenomenon. The fitted and test MAPE of ARIMA (0,0,0)(1,1,0)12 + O3 (lag0) + MAP (lag11) are both smaller than the optimal ARIMA model, so over-fitting is effectively avoided. Through ARIMA model prewhitening, we found a lag correlation between the residual sequence of ambient factors and the residual sequence of tuberculosis incidence, which suggests that it is possible to control tuberculosis and other infectious diseases by protecting the environment and reducing pollutants. Admittedly, there were limitations in our study. Firstly, because there was a lag relationship between ambient factors and the incidence of PTB, we could accurately predict the incidence in these future lag periods by using the ARIMAX model. For example, the lag time of MAP is 11 months, so we can accurately predict the incidence of the next 11 months. However, there was no lag relationship between O3 and PTB in this study. Therefore, using O3 to predict tuberculosis in the same month does not produce significant result. In addition, the ARIMA model can only extract the seasonal and long-term trends of ambient factors and tuberculosis incidence. This model cannot extract and predict the information of residual sequences. Thus, it is invalid to use the ARIMA model to predict future ambient factors and then incorporate them into the ARIMAX model. Because we are temporarily unable to accurately predict the future ambient factors, we can only rely on factors such as MAP to accurately predict the incidence of PTB in the next 11 months. It is our future direction to improve the accuracy of the ARIMAX model in predicting long-term incidence of PTB by finding a more advanced model to predict the future ambient factors. Secondly, the ARIMAX model can only identify a correlation between PTB incidence and ambient factors but cannot identify causal relationships. Thirdly, data from fixed monitoring stations assume that all participants in the target area were at the same level of air pollution exposure, and personal exposure data were not collected [31].

5. Conclusions

Ambient air pollutants and meteorological factors were associated with monthly incidence of PTB with a lag effect. Meteorological factors may affect the incidence of PTB by indirectly regulating urban air quality. The multivariate ARIMAX model that included both the ozone with a 0-month lag and the monthly average atmospheric pressure with an 11-month lag had the best performance to predict the incidence of PTB. However, ARIMAX has limited improvement in prediction accuracy compared with the ARIMA model. This modeling technique can be a useful tool for planning control interventions and could be implemented during routine tuberculosis surveillance in Ningbo. When the peak of tuberculosis and low atmosphere pressure arise, health education on tuberculosis should be strengthened to remind people to seek medical treatment as soon as symptoms appear.
  27 in total

Review 1.  Model selection and psychological theory: a discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC).

Authors:  Scott I Vrieze
Journal:  Psychol Methods       Date:  2012-02-06

Review 2.  A systematic literature review and critical appraisal of epidemiological studies on outdoor air pollution and tuberculosis outcomes.

Authors:  Igor Popovic; Ricardo J Soares Magalhaes; Erjia Ge; Guy B Marks; Guang-Hui Dong; Xiaolin Wei; Luke D Knibbs
Journal:  Environ Res       Date:  2018-12-07       Impact factor: 6.498

3.  [Action of dissolved ozone on mycobacterium tuberculosis and alveolar macrophages in experimental tuberculosis].

Authors:  I I Belianin; G M Nikolaeva; L P Martynova
Journal:  Probl Tuberk       Date:  1997

4.  Risk of Treatment Failure in Patients with Drug-susceptible Pulmonary Tuberculosis in China.

Authors:  Ni Wang; Yan Ma; Yu Hong Liu; Jian DU; Hui Zhang; Shi Heng Xie; Kun Zhu; Xiao Ya Lyu; Wei Shu; Hong Hong Wang; Guo Feng Zhu; Shou Yong Tan; Yan Yong Fu; Li Ping Ma; Lian Ying Zhang; Fei Ying Liu; Dai Yu Hu; Yan Ling Zhang; Xiang Qun Li; Liang Li
Journal:  Biomed Environ Sci       Date:  2016-08       Impact factor: 3.118

5.  Association between short-term exposure to ambient air pollutants and the risk of tuberculosis outpatient visits: A time-series study in Hefei, China.

Authors:  Kai Huang; Kun Ding; Xiao-Jing Yang; Cheng-Yang Hu; Wen Jiang; Xiao-Guo Hua; Jie Liu; Ji-Yu Cao; Tao Zhang; Xiao-Hong Kan; Xiu-Jun Zhang
Journal:  Environ Res       Date:  2020-03-03       Impact factor: 6.498

6.  Migration-related tuberculosis: epidemiology and characteristics of tuberculosis cases originating outside the European Union and European Economic Area, 2007 to 2013.

Authors:  Csaba Ködmön; Phillip Zucs; Marieke J van der Werf
Journal:  Euro Surveill       Date:  2016

7.  Epidemiology of anti-tuberculosis drug resistance in a Chinese population: current situation and challenges ahead.

Authors:  Yan Shao; Dandan Yang; Weiguo Xu; Wei Lu; Honghuan Song; Yaoyao Dai; Hongbing Shen; Jianming Wang
Journal:  BMC Public Health       Date:  2011-02-17       Impact factor: 3.295

8.  Imported cases and minimum temperature drive dengue transmission in Guangzhou, China: evidence from ARIMAX model.

Authors:  Q L Jing; Q Cheng; J M Marshall; W B Hu; Z C Yang; J H Lu
Journal:  Epidemiol Infect       Date:  2018-05-21       Impact factor: 4.434

9.  Effects of particulate air pollution on tuberculosis development in seven major cities of Korea from 2010 to 2016: methodological considerations involving long-term exposure and time lag.

Authors:  Honghyok Kim; Sarah Yu; Hongjo Choi
Journal:  Epidemiol Health       Date:  2020-03-12

10.  The burden of air pollution on years of life lost in Beijing, China, 2004-08: retrospective regression analysis of daily deaths.

Authors:  Yuming Guo; Shanshan Li; Zhaoxing Tian; Xiaochuan Pan; Jinliang Zhang; Gail Williams
Journal:  BMJ       Date:  2013-12-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.