| Literature DB >> 35039713 |
Arinjita Bhattacharyya1, Tanujit Chakraborty2, Shesh N Rai1,3,4,5,6.
Abstract
An unprecedented outbreak of the novel coronavirus (COVID-19) in the form of peculiar pneumonia has spread globally since its first case in Wuhan province, China, in December 2019. Soon after, the infected cases and mortality increased rapidly. The future of the pandemic's progress was uncertain, and thus, predicting it became crucial for public health researchers. These predictions help the effective allocation of health-care resources, stockpiling, and help in strategic planning for clinicians, government authorities, and public health policymakers after understanding the extent of the effect. The main objective of this paper is to develop a hybrid forecasting model that can generate real-time out-of-sample forecasts of COVID-19 outbreaks for five profoundly affected countries, namely the USA, Brazil, India, the UK, and Canada. A novel hybrid approach based on the Theta method and autoregressive neural network (ARNN) model, named Theta-ARNN (TARNN) model, is developed. Daily new cases of COVID-19 are nonlinear, non-stationary, and volatile; thus, a single specific model cannot be ideal for future prediction of the pandemic. However, the newly introduced hybrid forecasting model with an acceptable prediction error rate can help healthcare and government for effective planning and resource allocation. The proposed method outperforms traditional univariate and hybrid forecasting models for the test datasets on an average.Entities:
Keywords: Asymptotic stationarity; Autoregressive Neural Networks; COVID-19 Forecasting; Hybrid model; Theta model
Year: 2022 PMID: 35039713 PMCID: PMC8754528 DOI: 10.1007/s11071-021-07099-3
Source DB: PubMed Journal: Nonlinear Dyn ISSN: 0924-090X Impact factor: 5.741
Fig. 1The main workflow of the proposed TARNN model
Description of COVID-19 datasets
| Country name | Start–end date | No. of days | Minimum value | Maximum value | Train size | Test size |
|---|---|---|---|---|---|---|
| USA | 22/01/2020–26/02/2021 | 402 | 0 | 251,161 | 342 | 60 |
| Brazil | 26/02/2020–26/02/2021 | 367 | 0 | 70,574 | 307 | 60 |
| India | 30/01/2020–26/02/2021 | 394 | 0 | 97,894 | 334 | 60 |
| UK | 31/01/2020–26/02/2021 | 393 | 0 | 41,460 | 333 | 60 |
| Canada | 26/01/2020–26/02/2021 | 398 | 0 | 10,100 | 338 | 60 |
Training datasets (new daily cases) and corresponding ACF and PACF plots for all the countries considered in this study
Fig. 2Time series forecasting tools (available and proposed) used in this study
R functions and packages for standard forecasting model implementations
| Model | R function | R package | References |
|---|---|---|---|
| ARIMA | Auto.arima | Forecast | [ |
| ETS | Ets | Forecast | [ |
| TBATS | Tbats | Forecast | [ |
| Theta | Thetaf | Forecast | [ |
| ARNN | Nnetar | Forecast | [ |
| WARIMA | WaveletFittingarma | WaveletArima | [ |
Quantitative measure of performances of forecasting methods on the COVID-19 test datasets for five countries
| Methods | USA | UK | India | Canada | Brazil |
|---|---|---|---|---|---|
| ARIMA | 3.55 | 5.53 | 0.67 | 2.08 | 1.55 |
| Theta | 3.24 | 5.88 | 1.89 | 2.08 | 1.31 |
| WARIMA | 4.97 | 6.46 | 5.26 | 1.8 | 1.53 |
| ETS | 2.47 | 5.62 | 1.17 | 1.65 | 1.86 |
| TBATS | 5.58 | 1.44 | 1.81 | 4.77 | |
| ARNN | 3.29 | 5.67 | 3.16 | 2.23 | 2.83 |
| ARIMA-ARNN | 3.59 | 5.50 | 2.06 | 1.53 | |
| ARIMA-WARIMA | 3.64 | 5.69 | 0.72 | 2.05 | 1.58 |
| Proposed TARNN | 3.27 | 1.60 | |||
| ARIMA | 85,378.21 | 19,598.5 | 5325.44 | 2879.54 | 28,661.69 |
| Theta | 77,538.27 | 21,033.15 | 9159.85 | 2976.44 | 22,950.91 |
| WARIMA | 116,791.17 | 43,298.22 | 25,609.64 | 2764.52 | 28,139.06 |
| ETS | 60,473.05 | 20,017.92 | 6527.45 | 2425.78 | 33,242.02 |
| TBATS | 19,855.08 | 7262.95 | 2647.24 | 86,202.46 | |
| ARNN | 79,694.81 | 16,796.84 | 14,859.34 | 3063.08 | 48,693.24 |
| ARIMA-ARNN | 86,626.36 | 19,557.8 | 2876.86 | 27,825.87 | |
| ARIMA-WARIMA | 86,938.13 | 20,249.21 | 5428.55 | 2845.54 | 28,908.41 |
| Proposed TARNN | 78,320.37 | 7854.66 | |||
| ARIMA | 72,823.9 | 17,672.58 | 2833.76 | 2390.47 | 24,714.92 |
| Theta | 66,491.15 | 18,797.76 | 8024.95 | 2497.19 | 20,853.47 |
| WARIMA | 101,923.84 | 36,890.4 | 22,279.76 | 2069.01 | 24,294.93 |
| ETS | 50,537.67 | 17,953.84 | 4972.05 | 1893.77 | 29,506.42 |
| TBATS | 17,839.37 | 6091.65 | 2074.26 | 75,834.12 | |
| ARNN | 67,409.69 | 14,941.82 | 13,389.13 | 2556.93 | 44,939.99 |
| ARIMA-ARNN | 73,690.5 | 17,585.56 | 2386.66 | 24,259.25 | |
| ARIMA-WARIMA | 74,637.97 | 18,187.05 | 3056.05 | 2353.5 | 25,162.57 |
| Proposed TARNN | 66,939.65 | 5487.11 | |||
Best results are made ‘bold’. Proposed TARNN beats all the models for three out of five datasets and remains competitive for other two datasets
Fig. 3Out-of-sample forecast of COVID-19 cases for March 20–29, 2021, using proposed TARNN model