| Literature DB >> 35551008 |
Weiqiu Jin1, Shuqing Dong2, Chengqing Yu2, Qingquan Luo3.
Abstract
The COVID-19 outbreak poses a huge challenge to international public health. Reliable forecast of the number of cases is of great significance to the planning of health resources and the investigation and evaluation of the epidemic situation. The data-driven machine learning models can adapt to complex changes in the epidemic situation without relying on correct physical dynamics modeling, which are sensitive and accurate in predicting the development of the epidemic. In this paper, an ensemble hybrid model based on Temporal Convolutional Networks (TCN), Gated Recurrent Unit (GRU), Deep Belief Networks (DBN), Q-learning, and Support Vector Machine (SVM) models, namely TCN-GRU-DBN-Q-SVM model, is proposed to achieve the forecasting of COVID-19 infections. Three widely-used predictors, TCN, GRU, and DBN are used as elements of the hybrid model ensembled by the weights provided by reinforcement learning method. Furthermore, an error predictor built by SVM, is trained with validation set, and the final prediction result could be obtained by combining the TCN-GRU-DBN-Q model with the SVM error predictor. In order to investigate the forecasting performance of the proposed hybrid model, several comparison models (TCN-GRU-DBN-Q, LSTM, N-BEATS, ANFIS, VMD-BP, WT-RVFL, and ARIMA models) are selected. The experimental results show that: (1) the prediction effect of the TCN-GRU-DBN-Q-SVM model on COVID-19 infection is satisfactory, which has been verified in three national infection data from the UK, India, and the US, and the proposed model has good generalization ability; (2) in the proposed hybrid model, SVM can efficiently predict the possible error of the predicted series given by TCN-GRU-DBN-Q components; (3) the integrated weights based on Q-learning can be adaptively adjusted according to the characteristics of the data in the forecasting tasks in different countries and multiple situations, which ensures the accuracy, robustness and generalization of the proposed model.Entities:
Keywords: Artificial intelligence; COVID-19; Hybrid model; Infection prediction
Mesh:
Year: 2022 PMID: 35551008 PMCID: PMC9042415 DOI: 10.1016/j.compbiomed.2022.105560
Source DB: PubMed Journal: Comput Biol Med ISSN: 0010-4825 Impact factor: 6.698
A summary of recent COVID-19 forecast models.
| (A) Recent COVID-19 forecast models | |||||||
|---|---|---|---|---|---|---|---|
| Types | Method | Performance | Religion | Ref. | |||
| MAE | RMSE | Pearson | Spearman | ||||
| Dynamics | SAHQD model (Susceptible, infected, hospitalized, quarantined, deceased) | N.P. | N.P. | N.P. | N.P. | U.S. | [ |
| SCUAQIHMRD model (Susceptible, close contact, uninfected under home quarantine, asymptomatic under home quarantine, mild symptoms under home quarantine, severe symptoms under home quarantine, infectious in Designed Hospitals, infectious in Fangcang Hospitals, Recovered, Death) | N.P. | N.P. | N.P. | N.P. | Wuhan, China | [ | |
| SEPIAHR model (Susceptible, exposed, pre-symptomatic infectious, ascertained infectious, unascertained infectious, isolation in hospital and removed | N.P. | N.P. | N.P. | N.P. | Wuhan, China | [ | |
| SEIAIR model (Susceptible, incubation, asymptotic infected, recovered) | N.P. | N.P. | N.P. | N.P. | Wuhan, China | [ | |
| SEIRMH model (Susceptible, exposed without symptoms, infected with symptoms, with medical care, and removed from the system) | N.P. | N.P. | 0.84 | N.P. | Belgium | [ | |
| Adaptive interacting cluster-based SEIR (AICSEIR) model | N.P. | N.P. | 0.84 | N.P. | Italy, the U.S., and India | [ | |
| modified SEIR model (Including vaccination) | N.P. | N.P. | N.P. | N.P. | NYC, U.S. | [ | |
| SEIR model with Bayesian inference | N.P. | N.P. | N.P. | N.P. | Israel | [ | |
| SLIR model (Susceptible, latent, infected, recovered | N.P. | N.P. | N.P. | N.P. | China | [ | |
| SEIR model | N.P. | N.P. | N.P. | N.P. | Texas, USA | [ | |
| Sequential compartmental models | N.P. | N.P. | N.P. | N.P. | Homeless Shelter, Chicago, Illinois, USA | [ | |
| Time series | smooth transition autoregressive (STAR) model | 0.208 | 0.297 | N.P. | N.P. | Africa sub-region | [ |
| Linear AR model | 0.251 | 0.385 | N.P. | N.P. | Africa sub-region | [ | |
| ARIMA | 27.86 | 35.69 | N.P. | N.P. | Malaysia | [ | |
| ARIMA | N.P. | N.P. | N.P. | N.P. | France | [ | |
| Modified VAR regression | 47.43 | N.P. | N.P. | N.P. | NYC, U.S. | [ | |
| Linear regression | N.P. | 7.562 | N.P. | N.P. | Iran | [ | |
| Poisson count time series model (Disease surveillance and Twitter-based population mobility data) | N.P. | N.P. | N.P. | N.P. | South Carolina | [ | |
| ARIMA | 50.109 | 95.322 | N.P. | N.P. | India | [ | |
| Grey forecast | Fractional Order Accumulation Grey Model (FGM) | N.P. | 109496/96411/14560/64253/15/1123/106223 | N.P. | N.P. | U.S., France, UK, Germany, China, Japan, India | [ |
| Hybrid grey exponential smoothing approach | N.P. | 5.05 | N.P. | N.P. | Sri Lanka | [ | |
| Internally Optimized Grey Prediction Models (IOGMs) | N.P. | N.P. | N.P. | N.P. | Rajasthan, Maharashtra, Delhi | [ | |
| ML methods | random forest regression algorithm | 5.42 | 9.27 | 0.89 | 0.84 | 215 countries and territories | [ |
| long short-term memory (LSTM) models | N.P. | 27.187 | N.P. | N.P. | Iran | [ | |
| multilayer perceptron (MLP) neural network | 0.36 ( | 0.64 ( | 0.36 ( | N.P. | U.S. | [ | |
| Pearson correlation test and general linear model | N.P. | N.P. | 0.978 | N.P. | U.S. | [ | |
| a simple random forest statistical model | N.P. | N.P. | 0.89 | N.P. | Ohio, U.S. | [ | |
| WEKA tool | ≈1200 | ≈1000 | N.P. | N.P. | Pakistan | [ | |
| deep interval type-2 fuzzy LSTM (DIT2FLSTM) | N.P. | N.S. | N.P. | N.P. | USA, Brazil, etc. | [ | |
| generalized linear and tree-based machine learning models | 0.21 | N.P. | 0.99 | N.P. | Tennessee | [ | |
| an ensemble of 10 LSTM-based networks | 90.38 | N.P. | N.P. | N.P. | The county-level in the US | [ | |
| LSTM + | N.P. | N.P. | 0.872 | N.P. | West Virginia | [ | |
| Least-Square Boosting Classification algorithm | 1200 | N.P. | N.P. | N.P. | Countries having maximum number >2000 of confirmed cases in a day | [ | |
N.P. = Not provided. Meanwhile, it is worth noting that although we give specific model performance in above table, it is not generalizable and comparable across datasets due to the different number of infections within different geographic regions.
Fig. 1(A) Dynamic modeling of COVID-19 pandemic, SEIR model as an example; (B) Basic idea of forecast with Grey Theory; (C) A neuron net structure in Deep-learning forecast of COVID-19; (D) ARIMA model, a combination of differential operation and ARMA method.
Fig. 2(A) The architecture of the proposed model; (B) The experimental procedures of this study, where the total experiment was divided into four parts: training the component model, training the hybrid model, training the SVM model for error predicting, and calculating the final prediction value by hybrid ensemble model composed of TCN-GRU-DBN-Q hybrid model and error-predicting SVM model.
Fig. 3Architecture and elements in a TCN [57]. (A) A dilated casual convolution where the dilation factors d = 1, 2, 4 and filter size is 3; (B) The residual block of TCN, where an unit convolution (1 × 1 conv) is added to adapt the model structure to situations that the residual input is of different dimension with the output; (C) A possible residual connection in a TCN, where the solid purple lines are filters and the dashed purple lines are identity mappings. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Fig. 4The principle of Q-learning [64].
Data information [70].
| (A) Content and split of data | |||
|---|---|---|---|
| Nation | Training set (60%) | Validation set (20%) | Test set (20%) |
| India | 2020/2/19–2020/8/16 (180 days) | 2020/8/17–2020/10/15 (60 days) | 2020/10/16–2020/12/14 (60 days) |
| UK | |||
| US | |||
Essential parameters used in model training.
| Name of parameter | Essential parameters |
|---|---|
| Maximum iteration | 50 |
| Learning rate | 0.95 |
| Discount parameter | 0.5 |
| Size of input units | 3/5/7/9 |
| Size of hidden units | 100 |
| Size of output units | 1 |
| Number of the Hidden layers | 16 |
| Optimizer | Adam |
| Learning rate | 0.01 |
| Training epochs | 200 |
| Size of input units | 3/5/7/9 |
| Size of hidden units | 20 |
| Size of output units | 1 |
| number of the Hidden layer | 1 |
| Momentum factor | 0 |
| Optimizer | Adam |
| Learning rate | 0.01 |
| Training epochs | 200 |
| Size of input units | 3/5/7/9 |
| Size of hidden units | 60 |
| Size of output units | 1 |
| number of the Hidden layer | 6 |
| Learning rate | 0.01 |
| Optimizer | Adam |
| Filter size | 2 |
| Training epochs | 100 |
| Dropout | 0.05 |
| Size of input units | 3/5/7/9 |
| Size of output units | 1 |
| Kernel function | RBF |
| Gamma | 10 |
| σ2 | 20 |
Fig. 5The prediction results (X-1) and the model structure (X-2) (X = A, B, C, D) for different numbers of input neurons: (A) three input neurons; (B) five input neurons; (C) seven input neurons; (D) nine input neurons.
Comparative results of performance indices from models with different input neuron numbers.
Fig. 6(A) Model training processes (object function: RMSE, data: the number of infected people in UK/India/US); (B) to (D) Predictive output of elements (GRU, DBN, TCN) and error output predicted by SVM: (B) UK; (C) India; (D) US.
The weights of ensemble model (TCN, GRU, DBN) determined by Q-learning.
| Weights | |||
|---|---|---|---|
| UK | 0.46557 | 0.16110 | 0.29218 |
| India | 0.32941 | 0.65251 | 0.02186 |
| USA | 0.08401 | 0.21992 | 0.74092 |
The performance indices of models in case studies†.
Fig. 7Case studies: infection prediction of the UK, India, and the US.
The performance indices in case studies.
| Nation | MAE | MAPE% | RMSE | PCC |
|---|---|---|---|---|
| UK | 1952.11 | 9.95 | 2779.90 | 0.76 |
| India | 13760.00 | 10.72 | 20602.58 | 0.93 |
| US | 2744.22 | 7.06 | 3527.82 | 0.91 |
Fig. 8Contrast studies of different models (TCN-GRU-DBN-Q-SVM, TCN-GRU-DBN-Q, LSTM, N-BEATS, ANFIS, VMD-BP, WT-RVFL, ARIMA) used for infection prediction in the UK, India, and the US.
Comparative results.
The promoting percentages of the proposed model comparing to other experimental models.
| INDICES | COMPARISON MODELS | UK | INDIA | US |
|---|---|---|---|---|
| Model 1 v.s. Model 2 | 12.75% | 14.09% | 11.34% | |
| Model 1 v.s. Model 3 | 16.49% | 29.78% | 22.42% | |
| Model 1 v.s. Model 4 | 18.08% | 21.29% | 14.40% | |
| Model 1 v.s. Model 5 | 24.29% | 45.80% | 26.14% | |
| Model 1 v.s. Model 6 | 20.82% | 45.57% | 29.72% | |
| Model 1 v.s. Model 7 | 28.28% | 48.17% | 21.13% | |
| Model 1 v.s. Model 8 | 29.20% | 19.23% | 18.94% | |
| Model 1 v.s. Model 2 | 15.12% | 13.63% | 9.62% | |
| Model 1 v.s. Model 3 | 18.65% | 29.60% | 16.76% | |
| Model 1 v.s. Model 4 | 15.78% | 25.07% | 16.11% | |
| Model 1 v.s. Model 5 | 27.51% | 45.35% | 19.07% | |
| Model 1 v.s. Model 6 | 23.79% | 45.12% | 25.45% | |
| Model 1 v.s. Model 7 | 31.40% | 47.73% | 19.03% | |
| Model 1 v.s. Model 8 | 28.59% | 22.29% | 19.53% | |
| Model 1 v.s. Model 2 | 7.34% | 15.09% | 9.06% | |
| Model 1 v.s. Model 3 | 10.39% | 29.93% | 11.02% | |
| Model 1 v.s. Model 4 | 4.83% | 21.66% | 7.01% | |
| Model 1 v.s. Model 5 | 16.91% | 46.62% | 14.37% | |
| Model 1 v.s. Model 6 | 14.02% | 46.57% | 27.65% | |
| Model 1 v.s. Model 7 | 20.83% | 48.67% | 19.98% | |
| Model 1 v.s. Model 8 | 20.41% | 22.82% | 19.65% | |
| Model 1 v.s. Model 2 | 6.79% | 3.71% | 1.08% | |
| Model 1 v.s. Model 3 | 7.16% | 3.80% | 0.50% | |
| Model 1 v.s. Model 4 | 3.32% | 6.33% | 0.95% | |
| Model 1 v.s. Model 5 | 7.18% | 8.67% | 0.35% | |
| Model 1 v.s. Model 6 | 7.30% | 8.98% | 0.98% | |
| Model 1 v.s. Model 7 | 7.27% | 9.88% | 0.71% | |
| Model 1 v.s. Model 8 | 3.58% | 4.69% | 1.89% |
: Where Model 1 is TCN-GRU-DBN-Q-SVM (the proposed model), Model 2 is TCN-GRU-DBN-Q, Model 3 is LSTM, Model 4 is N-BEATS, Model 5 is ANFIS, Model 6 is VMD-BP, Model 7 is WT-RVFL, Model 8 is ARIMA.Based on the experimental results, the following conclusions can be drawn.
Fig. 9Trade-offs to consider in data-driven and dynamics model-driven COVID-19 prevalence modeling.