| Literature DB >> 34056624 |
Mintodê Nicodème Atchadé1, Yves Morel Sokadjo2, Aliou Djibril Moussa1, Svetlana Vladimirovna Kurisheva3, Marina Vladimirovna Bochenina3.
Abstract
Many papers have proposed forecasting models and some are accurate and others are not. Due to the debatable quality of collected data about COVID-19, this study aims to compare univariate time series models with cross-validation and different forecast periods to propose the best one. We used the data titled "Coronavirus Pandemic (COVID-19)" from "'Our World in Data" about cases for the period of 31 December 2019 to 21 November 2020. The Mean Absolute Percentage Error (MAPE) is computed per model to make the choice of the best fit. Among the univariate models, Error Trend Season (ETS), Exponential smoothing with multiplicative error-trend, and ARIMA; we got that the best one is ETS with additive error-trend and no season. The findings revealed that with the ETS model, we need at least 100 days to have good forecasts with a MAPE threshold of 5%.Entities:
Keywords: COVID-19; Cross-validation; Forecast; Statistical modeling; Time series
Year: 2021 PMID: 34056624 PMCID: PMC8150153 DOI: 10.1007/s42979-021-00699-1
Source DB: PubMed Journal: SN Comput Sci ISSN: 2661-8907
Different models in ETS modeling
| Trend | N | A | M |
| N | N, N | N,A | N, M |
| A | A, N | A, A | A, M |
| AD | AD, N | AD, A | AD, M |
| M | M, N | M, A | M, M |
| MD | MD, N | MD, A | MD, M |
N none, A additive, M multiplicative, D damped, AD additive damped, MD multiplicative damped
Fig. 1Study analysis process
Fig. 2Cumulative number of daily confirmed cases of COVID-19
MAPE of ARIMA with different training data sets
| TDS | 1 week MAPE | 2 week MAPE | 3 week MAPE | ARIMA | AC | Norm | Hetero | Stat |
|---|---|---|---|---|---|---|---|---|
| 1–30 | 17.76 | 28.82 | 36.70 | 020 | 0.35 | 0.00 | 0.09 | 0.99 |
| 1–44 | 22.24 | 17.87 | 13.77 | 020 | 0.65 | 0.00 | 0.21 | 0.71 |
| 1–58 | 8.40 | 18.36 | 021 | 0.93 | 0.00 | 0.33 | 0.02 | |
| 1–72 | 13.96 | 30.13 | 43.43 | 021 | 0.99 | 0.00 | 0.67 | 0.06 |
| 1–86 | 9.74 | 16.58 | 21.32 | 020 | 0.07 | 0.00 | 0.25 | 0.26 |
| 1–100 | 020 | 0.29 | 0.00 | 0.04 | 0.01 | |||
| 1–114 | 020 | 0.92 | 0.00 | 0.03 | 0.01 | |||
| 1–128 | 021 | 0.92 | 0.00 | 0.02 | 0.01 | |||
| 1–142 | 121 | 0.97 | 0.00 | 0.05 | 0.01 | |||
| 1–156 | 422 | 0.42 | 0.00 | 0.05 | 0.01 | |||
| 1–170 | 222 | 0.22 | 0.00 | 0.14 | 0.01 | |||
| 1–184 | 222 | 0.34 | 0.00 | 0.04 | 0.01 | |||
| 1–198 | 520 | 0.06 | 0.00 | 0.03 | 0.01 | |||
| 1–212 | 020 | 0.53 | 0.00 | 0.00 | 0.01 | |||
| 1–226 | 222 | 0.00 | 0.16 | 0.01 | ||||
| 1–240 | 222 | 0.00 | 0.11 | 0.01 | ||||
| 1–254 | 222 | 0.00 | 0.10 | 0.01 | ||||
| 1–268 | 222 | 0.00 | 0.04 | 0.01 | ||||
| 1–282 | 222 | 0.00 | 0.01 | 0.01 | ||||
| 1–296 | 222 | 0.00 | 0.01 | 0.01 |
TDS training dataset, AC autocorrelation, Norm normality, Hetero heteroscedasticity, Stat stationarity, values in bold mean MAPE and values in italic mean a problem of residuals autocorrelation (p-value < 0.05)
MAPE of the best ETS with different training data sets
| TDS | 1 week MAPE | 2 week MAPE | 3 week MAPE | ETS | AC | Norm | Hetero | Stat |
|---|---|---|---|---|---|---|---|---|
| 1–30 | 13.99 | 24.68 | 32.62 | M,A,N | 0.17 | 0.00 | 0.62 | 0.10 |
| 1–44 | 21.90 | 17.36 | 13.13 | M,A,N | 0.16 | 0.00 | 0.77 | 0.10 |
| 1–58 | 9.30 | 19.41 | M,A,N | 0.15 | 0.00 | 0.50 | 0.03 | |
| 1–72 | 12.30 | 28.07 | 41.41 | M,A,N | 0.14 | 0.00 | 0.36 | 0.04 |
| 1–86 | 9.12 | 15.75 | 20.37 | M,A,N | 0.09 | 0.00 | 0.28 | 0.01 |
| 1–100 | M,A,N | 0.10 | 0.00 | 0.23 | 0.01 | |||
| 1–114 | A,A,N | 0.92 | 0.00 | 0.03 | 0.01 | |||
| 1–128 | A,A,N | 0.90 | 0.00 | 0.02 | 0.01 | |||
| 1–142 | A,A,N | 0.82 | 0.00 | 0.03 | 0.01 | |||
| 1–156 | A,A,N | 0.95 | 0.00 | 0.02 | 0.01 | |||
| 1–170 | A,A,N | 0.93 | 0.00 | 0.01 | 0.01 | |||
| 1–184 | A,A,N | 0.51 | 0.00 | 0.00 | 0.01 | |||
| 1–198 | A,A,N | 0.97 | 0.00 | 0.00 | 0.01 | |||
| 1–212 | A,A,N | 0.53 | 0.00 | 0.00 | 0.01 | |||
| 1–226 | A,A,N | 0.07 | 0.00 | 0.00 | 0.01 | |||
| 1–240 | A,A,N | 0.06 | 0.00 | 0.00 | 0.01 | |||
| 1–254 | A,A,N | 0.00 | 0.00 | 0.01 | ||||
| 1–268 | A,A,N | 0.05 | 0.00 | 0.00 | 0.01 | |||
| 1–282 | A,A,N | 0.05 | 0.00 | 0.00 | 0.01 | |||
| 1–296 | A,A,N | 0.00 | 0.00 | 0.01 |
TDS training dataset, AC autocorrelation, Norm normality, Hetero heteroscedasticity, Stat stationarity, values in bold mean MAPE and values in italic mean a problem of residuals autocorrelation (p-value < 0.05)
MAPE of ESM with different training data sets
| TDS | 1 week MAPE | 2 week MAPE | 3 week MAPE | MMN | AC | Norm | Hetero | Stat |
|---|---|---|---|---|---|---|---|---|
| 1–30 | 61.09 | 292.57 | 1241.40 | M,M,N | 0.05 | 0.00 | 0.64 | 0.07 |
| 1–44 | 20.02 | 13.31 | 16.42 | M,M,N | 0.05 | 0.00 | 0.74 | 0.01 |
| 1–58 | 9.05 | 18.95 | M,M,N | 0.05 | 0.00 | 0.48 | 0.01 | |
| 1–72 | 10.66 | 24.56 | 36.58 | M,M,N | 0.00 | 0.33 | 0.01 | |
| 1–86 | 10.29 | 29.51 | M,M,N | 0.00 | 0.26 | 0.01 | ||
| 1–100 | 8.12 | 17.22 | M,M,N | 0.00 | 0.21 | 0.01 | ||
| 1–114 | 5.47 | 9.54 | M,M,N | 0.00 | 0.18 | 0.01 | ||
| 1–128 | M,M,N | 0.00 | 0.16 | 0.01 | ||||
| 1–142 | M,M,N | 0.00 | 0.15 | 0.01 | ||||
| 1–156 | M,M,N | 0.00 | 0.14 | 0.01 | ||||
| 1–170 | M,M,N | 0.00 | 0.13 | 0.01 | ||||
| 1–184 | M,M,N | 0.00 | 0.12 | 0.01 | ||||
| 1–198 | M,M,N | 0.00 | 0.11 | 0.01 | ||||
| 1–212 | M,M,N | 0.00 | 0.11 | 0.01 | ||||
| 1–226 | M,M,N | 0.00 | 0.10 | 0.01 | ||||
| 1–240 | M,M,N | 0.00 | 0.10 | 0.01 | ||||
| 1–254 | M,M,N | 0.00 | 0.10 | 0.01 | ||||
| 1–268 | M,M,N | 0.00 | 0.09 | 0.01 | ||||
| 1–282 | M,M,N | 0.00 | 0.10 | 0.01 | ||||
| 1–296 | M,M,N | 0.00 | 0.09 | 0.01 |
TDS training dataset, AC autocorrelation, Norm normality, Hetero heteroscedasticity, Stat stationarity,values in bold mean MAPE and values in italic mean a problem of residuals autocorrelation (p-value < 0.05)
Mimima, means, and maxima of the MAPEs
| Model | Min | Mean | Max | Error |
|---|---|---|---|---|
| ARIMA | 3.70 | 22.24 | 1 week | |
| ETS | 0.18 | 1 week | ||
| MMN | 0.21 | 5.34 | 61.09 | 1 week |
| ARIMA | 5.76 | 30.13 | 2 weeks | |
| ETS | 0.17 | 2 weeks | ||
| MMN | 0.19 | 18.53 | 292.57 | 2 weeks |
| ARIMA | 0.20 | 7.72 | 43.43 | 3 weeks |
| ETS | 3 weeks | |||
| MMN | 0.23 | 69.02 | 1241.40 | 3 weeks |
Values in bold mean smallest MAPEs level for 1, 2 and, 3 weeks forecasts respectively
Fig. 3Different forecasts of the kept ETS (A,A,N) model using each training data set
Fig. 4Best model ETS (A,A,N) actual, fitted, and predicted values