| Literature DB >> 33148337 |
Zhong-Qi Li1, Hong-Qiu Pan2, Qiao Liu1, Huan Song1, Jian-Ming Wang3.
Abstract
BACKGROUND: Many studies have compared the performance of time series models in predicting pulmonary tuberculosis (PTB), but few have considered the role of meteorological factors in their prediction models. This study aims to explore whether incorporating meteorological factors can improve the performance of time series models in predicting PTB.Entities:
Keywords: Meteorological factor; Predicting; Pulmonary tuberculosis; Time series
Mesh:
Year: 2020 PMID: 33148337 PMCID: PMC7641658 DOI: 10.1186/s40249-020-00771-7
Source DB: PubMed Journal: Infect Dis Poverty ISSN: 2049-9957 Impact factor: 4.520
Fig. 1Geographical locations of the three cities in Jiangsu Province
Fig. 2The recurrent neural network (RNN). a The structure diagram of the RNN; b The unfolding diagram of the forward propagation of the RNN. : input at time ; : hidden state at time ; : output at time ; : weight of the input; : weight of the input at the moment; : weight of the output
Fig. 3Monthly pulmonary tuberculosis cases in the three cities between 2005 and 2017
The monthly number of pulmonary tuberculosis cases in the three cities in 2018 predicted by the ARIMA, ARIMAX, and RNN models
| Month | Xuzhou city | Nantong city | Wuxi city | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Observation | ARIMA | ARIMAX | RNN | Observation | ARIMA | ARIMAX | RNN | Observation | ARIMA | ARIMAX | RNN | |
| January | 237 | 239 | 227 | 258 | 193 | 145 | 177 | 194 | 158 | 137 | 135 | 141 |
| February | 188 | 259 | 249 | 249 | 177 | 149 | 163 | 202 | 112 | 143 | 141 | 170 |
| March | 300 | 323 | 304 | 227 | 261 | 216 | 258 | 209 | 194 | 174 | 175 | 177 |
| April | 273 | 291 | 277 | 285 | 221 | 195 | 202 | 219 | 188 | 177 | 184 | 139 |
| May | 271 | 288 | 283 | 282 | 251 | 196 | 212 | 206 | 206 | 178 | 181 | 194 |
| June | 273 | 293 | 286 | 285 | 230 | 196 | 204 | 224 | 183 | 202 | 204 | 188 |
| July | 229 | 242 | 237 | 250 | 204 | 167 | 186 | 200 | 218 | 183 | 187 | 193 |
| August | 248 | 257 | 237 | 242 | 166 | 195 | 205 | 218 | 206 | 198 | 200 | 214 |
| September | 169 | 255 | 247 | 215 | 170 | 189 | 170 | 207 | 203 | 195 | 199 | 192 |
| October | 208 | 206 | 238 | 192 | 151 | 134 | 125 | 178 | 150 | 160 | 170 | 177 |
| November | 193 | 209 | 243 | 180 | 149 | 146 | 150 | 183 | 191 | 183 | 181 | 199 |
| December | 241 | 206 | 240 | 201 | 162 | 195 | 212 | 190 | 177 | 178 | 181 | 179 |
ARIMA autoregressive integrated moving average, ARIMAX autoregressive integrated moving average with exogenous variables, RNN recurrent neural network
Fig. 4Cross-correlation function plots of the residual series of pulmonary tuberculosis and meteorological factors. a: PTB and MAT; b PTB and MAP; c PTB and MAS; d PTB and MAH; e PTB and MP; f PTB and MST; 1: Xuzhou; 2: Nantong; 3: Wuxi. PTB: Pulmonary tuberculosis; MAT: Monthly average temperature; MAP: Monthly average atmospheric pressure; MAS: Monthly average wind speed; MAH: Monthly average relative humidity; MP: Monthly precipitation; MST: Monthly sunshine time
Alternative ARIMAX models for the three cities
| City | Model | Normalized BIC value | MAPE (%)a | |
|---|---|---|---|---|
| Xuzhou | ARIMA (1,1,1)(0,1,1)12 | 8.857 | 0.861 | 12.54 |
| ARIMA (1,1,1)(0,1,1)12 + MAS2 | 8.595 | 0.714 | 14.05 | |
| ARIMA (1,1,1)(0,1,1)12 + MAH1 | 8.467 | 0.399 | 24.09 | |
| ARIMA (1,1,1)(0,1,1)12 + MP2 | 8.617 | 0.356 | 11.96 | |
| ARIMA (1,1,1)(0,1,1)12 + MST1 | 8.593 | 0.767 | 17.62 | |
| ARIMA (1,1,1)(0,1,1)12 + MAS2 + MAH1 | 8.609 | 0.338 | 25.02 | |
| ARIMA (1,1,1)(0,1,1)12 + MAS2 + MP2 | 8.658 | 0.691 | 17.22 | |
| ARIMA (1,1,1)(0,1,1)12 + MAS2 + MST1 | 8.679 | 0.902 | 17.34 | |
| ARIMA (1,1,1)(0,1,1)12 + MAH1 + MP2 | 8.560 | 0.431 | 20.68 | |
| ARIMA (1,1,1)(0,1,1)12 + MAH1 + MST1 | 8.604 | 0.416 | 24.30 | |
| ARIMA (1,1,1)(0,1,1)12 + MP2 + MST1 | 8.674 | 0.751 | 17.55 | |
| ARIMA (1,1,1)(0,1,1)12 + MAS2 + MAH1 + MP2 | 8.700 | 0.371 | 20.71 | |
| ARIMA (1,1,1)(0,1,1)12 + MAS2 + MAH1 + MST1 | 8.755 | 0.427 | 23.01 | |
| ARIMA (1,1,1)(0,1,1)12 + MAS2 + MP2 + MST1 | 8.755 | 0.851 | 17.21 | |
| ARIMA (1,1,1)(0,1,1)12 + MAH1 + MP2 + MST1 | 8.692 | 0.241 | 39.17 | |
| ARIMA (1,1,1)(0,1,1)12 + MAS2 + MAH1 + MP2 + MST1 | 8.831 | 0.581 | 17.44 | |
| Nantong | ARIMA (0,1,1)(0,1,1)12 | 8.609 | 0.433 | 15.57 |
| ARIMA (0,1,1)(0,1,1)12 + MAT0 | 8.288 | 0.981 | 16.77 | |
| ARIMA (0,1,1)(0,1,1)12 + MAP1 | 8.183 | 0.777 | 11.16 | |
| ARIMA (0,1,1)(0,1,1)12 + MAS2 | 8.323 | 0.730 | 16.29 | |
| ARIMA (0,1,1)(0,1,1)12 + MAT0 + MAP1 | 8.340 | 0.836 | 14.99 | |
| ARIMA (0,1,1)(0,1,1)12 + MAT0 + MAS2 | 8.419 | 0.965 | 16.97 | |
| ARIMA (0,1,1)(0,1,1)12 + MAP1 + MAS2 | 8.314 | 0.766 | 11.90 | |
| ARIMA (0,1,1)(0,1,1)12 + MAT0 + MAP1 + MAS2 | 8.470 | 0.892 | 13.06 | |
| Wuxi | ARIMA (0,1,1)(0,1,1)12 | 6.933 | 0.176 | 9.70 |
| ARIMA (0,1,1)(0,1,1)12 + MAH0 | 6.845 | 0.119 | 9.66 | |
| ARIMA (0,1,1)(0,1,1)12 + MST0 | 6.818 | 0.068 | 10.51 | |
| ARIMA (0,1,1)(0,1,1)12 + MAH0 + MST0 | 7.003 | 0.088 | 9.74 |
BIC Bayesian information criterion, MAPE mean absolute percentage error, MAT monthly average temperature; MAP monthly average atmospheric pressure, MAS monthly average wind speed, MAH monthly average relative humidity, MP Monthly precipitation, MST monthly sunshine time, 0 0-month lag, 1 1-month lag, 2 2-month lag
*Ljung-Box test
a MAPE of the model in predicting the monthly number of PTB cases in 2018
Alternative recurrent neural network models for the three cities
| City | Model | Learning rate | Dimensions of hidden layer | Number of epochs | MAPE (%)a | MAPE (%)b | MAPE (%)c |
|---|---|---|---|---|---|---|---|
| Xuzhou | RNN1 | 0.05 | 3 | 500 | 16.14 | 15.99 | 16.46 |
| RNN2 | 0.05 | 3 | 500 | 13.42 | 13.30 | 14.41 | |
| RNN3 | 0.2 | 3 | 150 | 13.08 | 11.95 | 12.07 | |
| RNN4 | 0.05 | 3 | 600 | 10.33 | 10.33 | 10.40 | |
| RNN5 | 0.05 | 5 | 600 | 8.45 | 8.25 | 8.54 | |
| RNN6 (RNN5 + MAS1) | 0.05 | 3 | 1000 | 7.36 | 7.33 | 7.33 | |
| RNN7 (RNN5 + MAS2 + MST2) | 0.05 | 3 | 800 | 6.38 | 6.31 | 6.42 | |
| RNN8 (RNN5 + MAT3 + MAS3 + MP3 + MST3) | 0.05 | 5 | 600 | 4.78 | 4.89 | 4.97 | |
| RNN9 (RNN5 + MAS1 + MAS2 + MST2 + MAT3 + MAS3 + MP3 + MST3) | 0.05 | 10 | 600 | 5.75 | 5.40 | 5.90 | |
| Nantong | RNN1 | 0.05 | 3 | 500 | 21.91 | 21.99 | 21.78 |
| RNN2 | 0.2 | 5 | 80 | 16.92 | 17.81 | 16.31 | |
| RNN3 | 0.2 | 3 | 150 | 13.82 | 14.26 | 13.86 | |
| RNN4 | 0.2 | 3 | 150 | 12.78 | 12.84 | 12.80 | |
| RNN5 | 0.2 | 5 | 100 | 11.38 | 11.44 | 11.24 | |
| RNN6 (RNN5 + MAS1 + MAH1) | 0.05 | 5 | 1000 | 9.19 | 8.82 | 8.84 | |
| RNN7 (RNN5 + MAS2 + MAH2) | 0.05 | 5 | 1000 | 8.58 | 8.26 | 8.52 | |
| RNN8 (RNN5 + MAS3 + MAH3) | 0.05 | 10 | 800 | 8.87 | 8.79 | 8.69 | |
| RNN9 (RNN5 + MAS1 + MAH1 + MAS2 + MAH2 + MAS3 + MAH3) | 0.05 | 5 | 800 | 8.79 | 9.21 | 9.19 | |
| Wuxi | RNN1 | 0.1 | 10 | 150 | 23.76 | 23.81 | 23.77 |
| RNN2 | 0.05 | 5 | 400 | 19.93 | 19.54 | 20.17 | |
| RNN3 | 0.05 | 10 | 250 | 18.23 | 17.84 | 18.59 | |
| RNN4 | 0.05 | 10 | 400 | 17.15 | 17.40 | 17.31 | |
| RNN5 | 0.05 | 5 | 600 | 14.10 | 13.93 | 13.95 | |
| RNN6 (RNN5 + MAT1 + MAP1 + MAS1 + MAH1 + MST1) | 0.05 | 3 | 1500 | 13.01 | 13.39 | 13.04 | |
| RNN7 (RNN5 + MAS2) | 0.1 | 5 | 800 | 12.62 | 12.36 | 12.80 | |
| RNN8 (RNN5 + MAT3 + MAS3 + MAH3) | 0.05 | 10 | 1000 | 12.71 | 13.06 | 12.94 | |
| RNN9 (RNN5 + MAT1 + MAP1 + MAS1 + MAH1 + MST1 + MAS2 + MAT3 + MAS3 + MAH3) | 0.1 | 3 | 1000 | 12.81 | 12.80 | 13.46 |
RNN recurrent neural network, MAPE mean absolute percentage error, MAT monthly average temperature, MAP monthly average atmospheric pressure, MAS monthly average wind speed, MAH monthly average relative humidity, MP monthly precipitation, MST monthly sunshine time, 1 1 month prior, 2 2 months prior, 3 3 months prior
a MAPE of the model with the testing set after the first training
b MAPE of the model with the testing set after the second training
c MAPE of the model with the testing set after the third training
Evaluation of the performance of the ARIMA, ARIMAX, and RNN models in predicting the monthly number of pulmonary tuberculosis cases in the three cities in 2018
| City | Diagnostic indicator | Model | ||
|---|---|---|---|---|
| ARIMA | ARIMAX | RNN | ||
| Xuzhou | MAPE (%) | 12.54 | 11.96 | 12.36 |
| RMSE | 36.194 | 33.956 | 34.785 | |
| Nantong | MAPE (%) | 15.57 | 11.16 | 14.09 |
| RMSE | 34.073 | 25.884 | 31.828 | |
| Wuxi | MAPE (%) | 9.70 | 9.66 | 12.50 |
| RMSE | 19.545 | 19.026 | 26.019 | |
ARIMA autoregressive integrated moving average, ARIMAX autoregressive integrated moving average with exogenous variables, RNN recurrent neural network, MAPE mean absolute percentage error, RMSE root mean square error