| Literature DB >> 35162879 |
Yu-Tse Tsan1,2,3, Der-Yuan Chen4,5, Po-Yu Liu6, Endah Kristiani7,8, Kieu Lan Phuong Nguyen9, Chao-Tung Yang7,10.
Abstract
This paper proposed the forecasting model of Influenza-like Illness (ILI) and respiratory disease. The dataset was extracted from the Taiwan Environmental Protection Administration (EPA) for air pollutants data and the Centers for Disease Control (CDC) for disease cases from 2009 to 2018. First, this paper applied the ARIMA method, which trained based on the weekly number of disease cases in time series. Second, we implemented the Long short-term memory (LSTM) method, which trained based on the correlation between the weekly number of diseases and air pollutants. The models were also trained and evaluated based on five and ten years of historical data. Autoregressive integrated moving average (ARIMA) has an excellent model in the five-year dataset of ILI at 2564.9 compared to ten years at 8173.6 of RMSE value. This accuracy is similar to the Respiratory dataset, which gets 15,656.7 in the five-year dataset and 22,680.4 of RMSE value in the ten-year dataset. On the contrary, LSTM has better accuracy in the ten-year dataset than the five-year dataset. For example, on average of RMSE in the ILI dataset, LSTM has 720.2 RMSE value in five years and 517.0 in ten years dataset. Also, in the Respiratory disease dataset, LSTM gets 4768.6 of five years of data and 3254.3 of the ten-year dataset. These experiments revealed that the LSTM model generally outperforms ARIMA by three to seven times higher model performance.Entities:
Keywords: ARIMA; ILI; LSTM; PM2.5; air pollution; influenza-like illness; respiratory disease
Mesh:
Substances:
Year: 2022 PMID: 35162879 PMCID: PMC8835266 DOI: 10.3390/ijerph19031858
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1System Architecture.
Figure 2System Workflow.
Matrix Correlation of ILI and Respiratory Disease.
| Order | ILI | Respiratory |
|---|---|---|
| 1 | AMB_TEMP | AMB_TEMP |
| 2 | NOx | NO2 |
| 3 | NO2 | NOx |
| 4 | NO | CO |
| 5 | CO | NO |
Extra Trees Classifier of ILI and Respiratory.
| Order | ILI | Respiratory |
|---|---|---|
| 1 | NO2 | NOx |
| 2 | O3 | WIND_SPEED |
| 3 | WIND_DIREC | AMB_TEMP |
| 4 | SO2 | PM10 |
| 5 | NOx | WS_HR |
Chemical Base of Feature Selection.
| Order | Correlation |
|---|---|
| 1 | PM2.5 |
| 2 | PM10 |
| 3 | NO2 |
| 4 | SO2 |
Dataset Air Pollutant Parameters.
|
| |||
|
|
|
|
|
| 1 | CO | ppm | Carbon Monoxide |
| 2 | NO | ppb | Nitrogen Monoxide |
| 3 | NO2 | ppb | Nitrogen Dioxide |
| 4 | NOx | ppb | Nitrogen Oxide |
| 5 | O3 | ppb | Ozone |
| 6 | PM2.5 | fine aerosol | |
| 7 | PM10 | aerosol | |
| 8 | SO2 | ppb | Sulfur Dioxide |
|
| |||
| 1 | AMB_TEMP | Celsius | ambient temperature |
| 2 | RAINFALL | mm | Rainfall |
| 3 | RH | % | Relative Humidity |
| 4 | WD_HR | degrees | Wind Direction Hourly |
| 5 | WIND_DIREC | degrees | Wind Direction |
| 6 | WIND_SPEED | m/s | Wind Speed |
| 7 | WS_HR | m/s | Wind Speed Hourly |
Figure 3ARIMA model statespace.
Figure 4LSTM model architecture.
The RMSE of ARIMA Model Comparison.
| ARIMA Model | RMSE |
|---|---|
| ILI ARIMA 5 years | 2174.8 |
| ILI ARIMA 10 years | 8173.6 |
| Respiratory ARIMA 5 years | 5581.6 |
| Respiratory ARIMA 10 years | 15,093.1 |
Figure 5ILI and Respiratory ARIMA models. (a) ILI from 5 years dataset training; (b) ILI from 10 years dataset training; (c) Respiratory from 5 years dataset training; (d) Respiratory from 10 years dataset training.
ILI RMSE using 5 and 10 years data.
|
|
| |||
|
|
|
|
| |
| 1 | 754.3 | 703.6 | 780.9 | 706.4 |
| 2 | 705.1 | 701.0 | 682.1 | 717.4 |
| 3 | 738.8 | 703.6 | 738.0 | 715.2 |
| 4 | 723.3 | 696.7 | 726.3 | 708.5 |
| 5 | 731.5 | 734.8 | 716.3 | 720.4 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
|
|
|
| |
| 1 | 336.19 | 574.67 | 433.57 | 459.57 |
| 2 | 734.85 | 449.82 | 594.45 | 385.61 |
| 3 | 431.1 | 458.10 | 633.23 | 612.43 |
| 4 | 335.31 | 430.23 | 770.90 | 649.33 |
| 5 | 693.32 | 454.50 | 435.67 | 468.72 |
|
|
|
|
|
|
|
|
|
|
|
|
Respiratory RMSE using 5 and 10 years data.
|
|
| |||
|
|
|
|
| |
| 1 | 4813.9 | 4731.2 | 4629.1 | 4690.1 |
| 2 | 4637.6 | 4817.0 | 4916.0 | 4783.7 |
| 3 | 4608.9 | 4720.7 | 4858.2 | 4853.7 |
| 4 | 4916.9 | 4818.8 | 4795.5 | 4718.4 |
| 5 | 4783.1 | 4663.6 | 4791.6 | 4822.4 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
|
|
|
| |
| 1 | 3273.5 | 3015.2 | 3355.9 | 3046.1 |
| 2 | 3084.6 | 3130.4 | 3160.6 | 3596.6 |
| 3 | 3210.2 | 3131.0 | 3247.6 | 3326.3 |
| 4 | 3687.0 | 3007.9 | 3066.8 | 3931.4 |
| 5 | 3034.6 | 3038.8 | 3102.6 | 3638.3 |
|
|
|
|
|
|
|
|
|
|
|
|
Figure 6The bloxplot prediction distribution in 5 times repetitions. (a) ILI LSTM boxplot from 5 years dataset training; (b) ILI LSTM boxplot from 10 years dataset training; (c) Respiratory LSTM boxplot from 5 years dataset training; (d) Respiratory LSTM boxplot from 10 years dataset training.
Figure 7The model comparison results in graphs. (a) The RMSE of Influenza-like Illness; (b) The RMSE of Respiratory Disease.
Figure 8The Prediction sampling of ARIMA. (a) ARIMA ILI 5 years; (b) ARIMA Respiratory 10 years.
Figure 9ILI LSTM sampling predictions. (a) ILI LSTM MC model for 10 years dataset training; (b) The Prediction Result of LSTM MC 10 years.
Figure 10Respiratory LSTM sampling predictions. (a) Respiratory LSTM ALL model for 5 years dataset training; (b) The Prediction Result of LSTM ALL 5 years.