Literature DB >> 34658470

Short-term Covid-19 forecast for latecomers.

Marcelo C Medeiros1, Alexandre Street2, Davi Valladão3, Gabriel Vasconcelos4, Eduardo Zilberman5,1.   

Abstract

The number of new Covid-19 cases is still high in several countries, despite vaccination efforts. A number of countries are experiencing new and severe waves of infection. Therefore, the availability of reliable forecasts for the number of cases and deaths in the coming days is of fundamental importance. We propose a simple statistical method for short-term real-time forecasting of the number of Covid-19 cases and fatalities in countries that are latecomers-i.e., countries where cases of the disease started to appear some time after others. In particular, we propose a penalized LASSO regression model with an error correction mechanism to construct a model of a latecomer country in terms of other countries that were at a similar stage of the pandemic some days before. By tracking the number of cases in those countries, we use an adaptive rolling-window scheme to forecast the number of cases and deaths in the latecomer. We apply this methodology to 45 countries and we provide detailed results for four of them: Brazil, Chile, Mexico, and Portugal. We show that the methodology performs very well when compared to alternative methods. These forecasts aim to foster better short-run management of the healthcare system and can be applied not only to countries but also to different regions within a country. Finally, the modeling framework derived in the paper can be applied to other infectious diseases.
© 2021 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Covid-19; Forecasting; Infectious diseases; LASSO; Pandemics

Year:  2021        PMID: 34658470      PMCID: PMC8511688          DOI: 10.1016/j.ijforecast.2021.09.013

Source DB:  PubMed          Journal:  Int J Forecast        ISSN: 0169-2070


Introduction

Accurate forecasts of the number of cases and deaths of infectious diseases, like Covid-19, in the short run (e.g., in the next few days or weeks) are crucial for managing healthcare systems properly. Depending on the expected pressure on the healthcare system, one can make more informed decisions on how to allocate hospital beds and ventilators, whether to setup more field hospitals, whether to train more healthcare workers, and so on. Furthermore, accurate forecasting models can help trace the evolution of new variants of the disease and the effects of vaccination. In this paper, we propose a statistical method to forecast in real time the very short-run evolution of the number of Covid-19 cases and deaths in countries that are latecomers. Given that these latecomers were hit by the Covid-19 pandemic after other countries, we can use information from these other countries when they were at a similar stage of the pandemic, a few days or weeks before. Here, a latecomer country refers to one where Covid-19 cases and deaths were identified after others. Note that the labeling of a given country as a latecomer depends on the number of countries we consider. We use the penalized least absolute selection and shrinkage operator (LASSO) regression model proposed by Tibshirani (1996) to construct an error correction model (ECM) of a latecomer in terms of the other countries. The idea behind the ECM is to adjust the short-run dynamics of the latecomer to departures from the equilibrium (long-run relation) between the latecomer country and its peers. By tracking the number of cases in those countries, we can forecast the evolution of the disease in the latecomer a few days ahead. Forecasts of the number of deaths are constructed as a linear regression on the number of cases. As the pandemic evolves, one can run the model on a daily basis, and in an adaptive rolling-window scheme, to obtain updated forecasts for the next few days. The model is easily estimated and confidence intervals can be computed in a straightforward manner by simulation techniques. An (adaptive) rolling-window scheme is important in order to acknowledge the dynamic nature of the pandemic and to attenuate the effects of outliers and potential structural breaks (due to, for example, more or less testing after a given period, policy changes, the start of vaccination campaigns, new variants of the virus, and changes in the relations between countries used as explanatory variables and the latecomer). Nonetheless, it is important to emphasize that despite this attenuation, one might expect worse forecasts a few days after a structural break as the model adapts. Hence, and needless to say, the use of the proposed forecasting method should be complemented by evaluations of how the pandemic is evolving.1 We apply the methodology to 45 different countries and present detailed results for four of them: Brazil, Chile, Mexico, and Portugal. The 45 countries were selected based on the following criteria: (1) there must be at least six countries ahead of it by 14 days or more; (2) it has a population of at least 10 million people, and (3) it had at least 30,000 reported cases of Covid-19 by the end of 2020. We show that our method performs very well at forecasting the out-of-sample number of cases and deaths up to the next 14 days. The number of cases correspond to those that are detected by the healthcare system, which is the proper measure to track if the concern is to evaluate the impact on its capacity. Tracking the evolution of Covid-19 poses several challenges. The proposed method overcomes some of them. First, standard epidemiological models that are used to track the evolution of an epidemic are hard to discipline quantitatively to a new disease. Despite the enormous effort worldwide to understand transmission, recovery, and death rates, many parameters that need to calibrated remain uncertain Atkeson (2020), and the behavioral responses of individuals, as well as containment policies, affect these parameters (Eichenbaum, Rebelo, & Trabandt, 2020).2 By contrast, the proposed forecasting method has the advantage of being model-free, and makes projections based solely on available data. Second, even if it were possible to discipline those epidemiological models reliably, they speak to the evolution of the infected population. From the perspective of managing healthcare resources, the relevant figure is the number of infected individuals that end up pressuring the healthcare system. Many individuals with Covid-19 are asymptomatic and do not need access to the healthcare system. Hence, the evolution of the virus among the sub-population that actually needs healthcare—a sub-population with certain characteristics that differ from the rest of the population—might be different from the evolution of the virus among the whole population. The proposed method avoids this problem by directly forecasting the number of infected individuals detected by the healthcare system or specific parts of it that are of interest, e.g., specific regions within a country. Finally, alternative methods to track the evolution of Covid-19 and forecast the pressure on healthcare resources, such as mass testing, are expensive and unavailable in many countries. The methodology and our codes are immediately and cheaply reproducible to any latecomer that tracks the number of Covid-19 cases (and deaths). Note that the proposed methodology can also be applied to different regions within a country. This is particularly useful in large countries such as Brazil and India, where the disease hits distinct regions with delays.3 The aforementioned challenges are even harder to overcome in poor and developing countries, mostly latecomers, due to the lack of high-quality research, reliable data, and budget limitations. By tracking the very short-run evolution of the number of Covid-19 cases (and deaths) in real time, we hope to provide a methodology that can inform policymakers and the general public. In our view, an adaptive and accurate data-driven forecast is critical to foster better management of the healthcare system, especially in those countries that lack proper resources.

Main takeaways

The ECM method proposed in this paper provides forecasts of cases and deaths. In general, our method showed lower mean absolute percentage errors (MAPEs) than two benchmark models—namely, a simple quadratic trend regression (trend) that has been shown to be quite precise for short-term forecasts of Covid-19 fatalities (Coroneo, Iacone, & Monteiro, 2020), and an integrated autoregressive model of order one (AR).4 Over the 14 forecasting horizons, the ECM approach produced smaller MAPEs on average than the AR benchmark in 51.10% of the countries studied when the dependent variable was the number of Covid-19 cases, and in 65.50% in the case of deaths. Compared to the trend model, the percentages were 85.24% and 82.06% for cases and deaths, respectively. Compared with the AR model, the gains of the ECM alternative were more noticeable at forecasting horizons of more than two days. On the other hand, the ECM outperformed the trend benchmark at all horizons. Finally, the results improved remarkably when we considered a simple average combination of the ECM and AR forecasts. Regarding the four countries for which we provide an in-depth analysis, the following results emerge. For Brazil, the ECM delivered MAPEs for Covid-19 cases between 0.44% (one day ahead) and 3.94% (14 days ahead), whereas the trend model MAPEs ranged between 0.76% and 4.86%. The AR benchmark MAPEs were between 0.38% and 6.40%. Although, the performance of the ECM was inferior to the AR model for one-day-ahead forecasts, it considerably outperformed the latter at horizons of two or more days. For Chile, the ECM yielded MAPEs between 0.32% and 3.99%, whereas the trend approach produced MAPEs between 0.48% and 4.09%. The MAPEs from the AR models ranged between 0.32% and 5.22%. The superiority of the proposed ECM in thus clear in the case of Chile. For Mexico, our approach delivered MAPEs ranging from 0.23% to 2.41%, whereas the trend alternative yielded MAPEs between 0.41% and 2.78%. The MAPEs from the AR model were between 0.23% and 4.16%. For Portugal, the ECM delivered MAPEs between 0.25% and 5.18%, whereas the trend approach produced MAPEs between 0.74% and 6.83%. However, in this case, the AR performed best, with MAPEs between 0.22% and 4.17%. Nevertheless, when we considered a simple average of the forecasts from the ECM and AR models, the MAPEs reduced to 0.21% and 3.94%. Now, we analyze the results concerning deaths. For Brazil, the MAPEs of the ECM ranged between 0.49% and 4.13%. The trend (AR) model yielded MAPEs between 0.83% (0.35%) and 4.96% (5.39%). For Chile the ECM delivered a minimum MAPE of 0.73% for one-day-ahead forecasts and a maximum of 5.77% at the two-week-ahead horizon. The trend benchmark had MAPEs between 1.44% and 10.60%. The minimum MAPE of the AR model was 0.58%. The maximum MAPE was of the order of , due to a major outlier. When this forecast was removed, the maximum MAPE was 8.25%. For Mexico, our model produced MAPEs between 0.61% and 3.17%. The trend method yielded MAPEs between 0.55% and 8.23%. The MAPEs of the AR benchmark were between 0.39% and 6.70%. For Portugal, the minimum (one-day-ahead) and maximum (14-day-ahead) MAPEs of the ECM were 0.30% and 4.14%, respectively. The trend model yielded MAPEs between 0.77% and 7.09%, whereas the AR alternative had MAPEs ranging from 0.14% to 3.59%. For Portugal, the AR benchmark was the model with the lowest MAPEs. However, as before, a simple average combination of the ECM and AR models reduced the MAPEs to 0.17% and 3.53%. Although the forecast combination was not able to beat the AR model for one-day-ahead forecasts, it outperformed the AR benchmark at horizons of more than four days. Since the outbreak of the Covid-19 pandemic, ample research dealing with short-term forecasts of cases and deaths has been published in a wide collection of academic journals. The models range from different versions of epidemiological compartmental models to pure statistical and machine learning approaches. The models can be as simple as pure trend regression or as complicated as deep learning neural networks. Nevertheless, a number of studies provide strong evidence that it is quite difficult to beat the simplest alternatives. Our approach keeps the simplicity of several statistical models, explores equilibrium relations between a latecomer country and its peers, and shows robustness against many breaks in the dynamics of the series over the years 2020 and 2021. Coroneo et al. (2020) compare the predictive accuracy of forecasts for the number of fatalities produced by several forecasting teams and collected by the United States Centers for Disease Control and Prevention (CDC) and a simple benchmark alternative. The set of models includes both statistical (dynamic growth models) and compartmental approaches (different versions of SEIRD models). The authors find that a simple quadratic trend regression outperforms all alternatives at horizons up to one week ahead. For longer horizons, some of the models outperform the simple benchmark. However, the authors show that an ensemble of models outperforms all the other alternatives. Similar quadratic trend models are also considered with some adjustments by Jiang, Zhao, and Sha (2020) and Li and Linton (2021). Due to the previous satisfactory performance of the quadratic trend model, we adopted it as a benchmark specification in our study. Hendry (2022) consider a flexible trend model and apply it to forecasts of confirmed cases and deaths in a large number of countries. As with ours, their model has no epidemiological component. For confirmed cases, the authors report MAPEs between 0.4% and 2.1% for one-day-ahead forecasts and from 1.7% to 7.6% for four-day-ahead forecasts. In the case of death counts, the MAPEs are higher. Although their results are not directly comparable to ours, as we do not analyze the same countries, our MAPEs are lower than the ones reported by Hendry (2022). See also Petropoulos, Makridakis, and Stylianou (2022) for a similar approach to Hendry (2022). Nonlinear machine learning methods, such as long short-term memory, deep neural networks, random forests, and support vector machines, have also been used to forecast cases and death counts in the short run. For example, Ribeiro, da Silva, Mariani, and Coelho (2020) estimate a vast amount of statistical and machine learning methods to forecast future cases in Brazil. However, their models are much more complex than the ones considered here, and their MAPEs range from 0.87%—3.51%, 1.02%—5.63%, and 0.95%—6.90% for one-, three-, and six-day-ahead forecasts, respectively. These MAPEs are higher than the MAPEs from our ECM specification. Other examples of the application of nonlinear machine learning models include Zeroual, Harrou, Dairi, and Sun (2020), Chimmula and Zhang (2020), and da Silva, Ribeiro, , Mariani, and Coelho (2020), among many others.5

Organization of the paper

The remainder of the paper is organized as follows. Section 2 presents the methodology. Section 3 gives some guidance to practitioners. Section 4 describes the results, and Section 5 concludes.

Methodology

Let represent the number of days after the 100th confirmed case of Covid-19 in a given country or region.6 Define as the natural logarithm of the number of confirmed cases days after the 100th case of the disease in this specific country or region. In addition, let be a vector containing the natural logarithm of the number of reported cases for other countries days after the 100th case has been reported and a quadratic trend. That is, also includes and . The idea is that, in the regular time scale, may be ahead of time of . For example, in France and Spain, the 100th day was reported on February 29, 2020 and March 2, 2020, respectively. On the other hand, in Brazil, a latecomer, the 100th case was confirmed only on March 14, 2020. Therefore, the idea is to use, for instance, data from France on February 29 and Spain on March 2 to explain the number of cases in Brazil on March 14. Note that we do not claim any causal link between the countries and the latecomers. Our proposal explores the fact that the evolution of the disease in different countries shares similar patterns.7 The statistical approach considered in this paper is a simple error correction model (ECM) which maps into as follows: where is zero-mean random noise with variance , , , and , , and are unknown parameters to be estimated. As can be seen in Fig. 1, the number of cases in -scale in different countries displays a strong common exponential trend. The logarithm transformation is important to turn the exponential trend into a linear one.
Fig. 1

Evolution of cases in different countries in epidemic time.

The figure illustrates the evolution of the cases of Covid-19 in different countries according to the epidemic calendar, i.e., the -axis represents days from the first confirmed case of Covid-19. It is clear that come countries are in front of others in epidemic time.

Evolution of cases in different countries in epidemic time. The figure illustrates the evolution of the cases of Covid-19 in different countries according to the epidemic calendar, i.e., the -axis represents days from the first confirmed case of Covid-19. It is clear that come countries are in front of others in epidemic time. The model is estimated in two steps. In the first step, we estimate in a long-run equilibrium model: where is a zero-mean second-order stationary error term. Due to the limited amount of data and the large dimension of as compared to the sample size, we use the least absolute and shrinkage operator (LASSO) to recover the parameter vector. The goal of the LASSO is to balance the trade-off between bias and variance. It is a useful tool to select the relevant peers in an environment with very few data points. Therefore, the estimator of the unknown parameter in Eq. (2.2) is defined as where is the number of days in the estimation window, and is the penalty parameter. Theoretical justification for the use of LASSO to estimate the parameters in this setup with trends can be found in Masini and Medeiros (2021). Once we estimate Eq. (2.2), we proceed to a second step: estimating the ECM using ordinary least squares (OLS) with the variables selected in the first step with the LASSO. The final prediction steps ahead from reads as where is the forecast for the previous day. Confidence intervals are obtained through simulation by assuming that the error term in (2.1) is normally distributed. The intuition behind the proposed ECM is to model the dynamics and the reactions to departures from the equilibrium between and : the disease behaves in a somewhat similar fashion in different countries. Note that we do not claim a causal link, for instance, from the cases in Germany to the cases in Brazil, due to mobility in these two countries. What the model explores is that the evolution of diseases like Covid-19 seems to share similar patterns in different locations. Furthermore, as the first-stage LASSO regression is a model selection tool, if our hypothesis of common dynamics among countries is not valid, the LASSO will not select any country to explain the latecomer behavior and/or the residuals of the first-stage LASSO regression present statistical evidence of non-stationarity. Our interest relies on the forecasts for the number of cases in levels, not in logs: . Therefore, for the horizon , the forecasts are constructed as follows: where is a correction which is essential to attenuate the induced bias when we take the exponential of the forecasted value of ; see Wooldridge (2019). Note that, in the -scale, the peers are ‘in the future’ and we can plug in actual values of to construct our forecasts. Further, note that a rolling estimation window of days induces an adaptive forecasting framework suitable for capturing the dynamic nature of the pandemic and to attenuate the effects of outliers and potential structural breaks. Finally, in order to give more weight to the newest observations, we inflate the data by repeating the last four observations, where the last observation is repeated four times with a linear decay for the observations before. It is worth emphasizing that this model is only a local approximation of a more complex and dynamic process. Therefore, its best use relies on fresh and updated data, and the rolling-window scheme takes care of that. Although the model provides excellent adherence, the proposed forecasting method may be complemented with indexes, such as proxies for social distancing, to guide the evaluation of the future dynamics affecting the number of cases and deaths. However, the inclusion of other regressors such as Google Mobility did not show any improvement to the quality of the forecasts.

Guide to practice

The implementation of the proposed forecasting method requires the choice of three tuning parameters: the penalty term in the first-stage LASSO regression (), the length of the estimation window (), and the data inflation mechanism. The penalty term is selected by the Bayesian information criterion (BIC), as discussed in Medeiros and Mendes (2016). The degrees of freedom of the LASSO are determined by the nonzero estimated coefficients. Cross-validation can also be used to determine the penalty parameters. However, we prefer the BIC in order to avoid any extra computational burden. The estimation window length () and the inflation scheme for the most recent observations can be estimated in a rolling-window process. The motivation for the rolling-window estimation and the data inflation is to give weight to more recent observations and attenuate potential structural breaks. Before computing the actual forecasts, one could select these tuning parameters from a rolling window using previous data and choose the values that minimize the out-of-sample error measure (MAPE, for example). However, this procedure gives us the best model for past data, especially because we need a significant number of windows that go back several weeks to obtain stable estimates. Although a procedure like this could lead to some local improvements, it could also lead to situations where the forecast explodes, especially when a very small is selected with no data inflation. To avoid unnecessary data mining that could lead to unreliable results, we use a fixed value of and the inflation scheme for the four most recent lags. This sample size is enough to get a satisfactory model, given the number of variables, and it is not so big that it includes many structural breaks. The inflation scheme merely gives more weight to the most recent data, which is likely to be more similar to future data in the short run. Small changes in the inflation strategy do not affect the overall results. However, no inflation yields higher errors. Another important point is to check whether the first-stage errors are stationary. This can be conducted by common unit-root tests. If the null hypothesis of unit-root is not rejected, the first stage is clearly misspecified and the forecasts will not be reliable. In our study, we ran unit-root tests after every LASSO estimation, and there was no evidence of misspecification. Finally, the proposed ECM methodology is flexible enough to include other regressors, such as mobility data. However, in our experience the inclusion of such data did not bring any evident improvement in the performance of the model.

Data

We used the John Hopkins compiled data8 for all countries with Covid-19 cases. We also used the Brazilian Ministry of Health official data.9 The data were organized in epidemiological time, i.e., representing the number of days after the 100th case. The models were computed on a rolling-window scheme with 28 in-sample observations per window. For each country, model estimation started when the number of confirmed cases of Covid-19 reached 20,000. The last in-sample day for every country was June 30, 2021. As described in Section 3, we set the window length to 28 days because this turned out to be a good trade-off between the quality of the in-sample adjustment and robustness to potential structural breaks. Fig. 1 illustrates the evolution of Covid-19 cases in several countries. The data are displayed in epidemiological time. That is, the -axis represents the number of days since the first registered case. It is clear from the figure that some countries are ahead of others in epidemiological time.

Results

Benchmark models

In other to compare the performance of the proposed ECM model, we considered the benchmark model described in Coroneo et al. (2020). The model is a simple quadratic trend regression (trend), defined as where is a zero-mean error term. As mentioned above, although this benchmark is simple, it proved to be quite precise for short-term forecasts. We also considered an integrated first-order autoregressive AR model: where is a zero-mean error term. Forecasting Results for Cases: Distribution of Mean Absolute Percentage Error Ratios. The table presents results with respect forecasting models for the number of cases of Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean absolute percentage error (MAPE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Results for Deaths: Distribution Mean Absolute Percentage Error Ratios. The table presents results with respect forecasting models for the number of deaths by Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean absolute percentage error (MAPE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Results for Cases: Mean Absolute Percentage Error Ratios. The table shows the ratios of the forecasting mean absolute percentage error (MAPE) of either the ECM or the trend model over the AR benchmark. Numbers below one indicates that the ECM or the trend model outperforms the AR. The table presents results for forecasting horizons of 1 to 14 days ahead of the Covid-19 accumulated number of cases. The models were computed on a rolling window scheme with 28 in-sample observations per window. For each country, the model estimation started when the number of cases of Covid-19 reached 20,000. The last out-of-sample day for every country is July 1, 2021. Values in parenthesis are -values for the Giacomini & White test. (Giacomini & White, 2006). Forecasting Results for Deaths: Mean Absolute Percentage Error Ratios. The table shows the ratios of the forecasting mean absolute percentage error (MAPE) of either the ECM or the trend model over the AR benchmark. Numbers below one indicates that the ECM or the trend model outperforms the AR. The table presents results for forecasting horizons of 1 to 14 days ahead of the Covid-19 accumulated number of deaths. The models were computed on a rolling window scheme with 28 in-sample observations per window. For each country, the model estimation started when the number of cases of Covid-19 reached 20,000. The last out-of-sample day for every country is July 1, 2021. Values in parenthesis are -values for the Giacomini & White test. (Giacomini & White, 2006).

Mean absolute percentage errors

We start by reporting summary results for all 45 countries considered. Table 1, Table 2 report summary statistics for the forecasts of cases and deaths, respectively. Specifically, the tables show descriptive statistics for the ratio of the forecasting mean absolute percentage error (MAPE) of either ECM or trend alternatives and the AR benchmark. Ratios below one indicate that the alternative models outperform the benchmark. For cases, the medians of the ratio of the MAPEs from the ECM and the AR models are below one for horizons between three and 12 days ahead. When deaths are considered, the medians are below one for all horizons of more than two days ahead. Furthermore, the ECM is much better than the trend benchmark, which is frequently beaten by the AR model. The medians of the ratios over the 14 forecasting horizons and all countries are 0.99 for cases and 0.78 for deaths, indicating the superiority of the ECM over the AR model.
Table 1

Forecasting Results for Cases: Distribution of Mean Absolute Percentage Error Ratios. The table presents results with respect forecasting models for the number of cases of Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean absolute percentage error (MAPE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Absolute Percentage Errors
Days aheadCases: ECM x AR
1234567891011121314
Min0.8850.5360.1030.0060.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
5% prct0.9840.7480.5140.4140.3650.3400.3290.3350.3440.3450.3550.3580.3700.377
10% prct1.0120.7870.6680.5820.5560.5470.5680.5790.5900.5860.5920.5990.6070.615
25% prct1.0900.8990.7870.7450.7240.7250.7090.7090.7190.7360.7660.7770.7790.798
Median1.1721.0340.9550.9210.9080.9620.9510.9750.9750.9890.9930.9961.0421.103
75% prct1.3831.1491.0731.0971.0901.0901.0981.1381.1471.1841.1921.2061.2161.267
90% prct1.8121.5721.5931.5601.4831.6481.8361.9271.9151.8861.8621.8381.8301.856
95% prct2.1162.0031.8421.8041.8461.8731.9731.9792.0092.0332.1012.1772.2482.325
Max2.7102.5702.4392.3042.2382.1402.0352.1342.6673.4084.6536.4039.45812.029
Mean1.3011.1231.0190.9730.9630.9700.9911.0101.0271.0531.0941.1551.2441.330
Std0.3680.3890.4160.4250.4310.4360.4510.4610.5000.5690.7080.9291.3431.708

Days aheadCases: Trend x AR
1234567891011121314

Min1.1510.8590.2470.0110.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
5% prct1.5520.9170.5850.4550.3940.3580.3390.3520.3610.3800.3910.3980.4050.413
10% prct1.6941.0720.7800.7040.6680.6350.6190.6160.6240.6220.6220.6260.6320.639
25% prct1.9751.2991.0110.9020.8790.9170.9340.9210.9080.9060.8910.8530.8110.783
Median2.6851.7261.4681.3221.2921.2561.2531.2301.1951.1671.1511.1591.1511.157
75% prct3.4522.3731.9101.6911.5981.6001.6071.5621.5311.5081.5241.5381.5311.526
90% prct5.7283.1572.6232.3492.2292.1282.0622.0241.9561.9041.8561.8151.7861.777
95% prct6.9204.6153.6443.0832.7222.5102.3382.1832.0591.9751.9501.9631.9902.004
Max11.6527.2105.4604.7374.1703.8153.5723.3863.2403.1303.0703.0553.0743.104
Mean3.1952.0291.6181.4311.3401.3081.2971.2701.2371.2111.1971.1931.1961.203
Std1.9121.1950.9560.8330.7400.6810.6420.6080.5820.5600.5470.5420.5420.545
Table 2

Forecasting Results for Deaths: Distribution Mean Absolute Percentage Error Ratios. The table presents results with respect forecasting models for the number of deaths by Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean absolute percentage error (MAPE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Absolute Percentage Errors
Days aheadDeaths: ECM x AR
1234567891011121314
Min0.0490.0000.0000.1050.1190.1360.1530.0760.0270.0070.0020.0000.0000.000
5% prct0.1380.1560.2110.4130.4210.3120.2020.1570.1530.1580.1630.0610.0100.001
10% prct0.8540.6460.5170.4940.4270.4280.4460.4340.3770.3770.2620.1920.0790.020
25% prct1.1580.8960.7530.6690.6060.5880.5830.5760.5790.5780.5650.5540.5560.573
Median1.3101.0080.8540.7900.7430.7230.7170.7090.7010.7320.7000.7430.7510.766
75% prct1.5911.4691.2611.1331.0671.0411.0361.0711.0881.0971.1021.1231.1661.171
90% prct2.2131.9781.7501.5951.6181.4341.4771.4111.3661.3491.3531.3571.4221.511
95% prct2.5682.0901.9821.8471.8151.9712.2362.3382.4162.5012.6122.7902.9503.128
Max3.6252.9982.2502.1922.3712.4872.5743.2834.2393.4884.3845.0085.9007.125
Mean1.4091.1180.9780.9280.8860.8630.8670.8820.9010.8830.8980.9160.9451.013
Std0.6610.5510.4800.4480.4550.4730.5270.6010.7090.6490.7580.8600.9961.194

Deaths: Trend x AR
Days ahead1234567891011121314

Min0.0920.0000.0000.1620.1850.2060.2270.1310.0470.0120.0020.0000.0000.000
5% prct0.3090.3260.3930.5590.5180.4680.2870.2180.2120.2180.2290.1020.0160.001
10% prct1.0480.7680.6880.6210.6400.6090.6140.6050.5900.5160.4020.2470.1030.026
25% prct1.6801.1200.9330.8490.7690.7450.7460.7230.6880.6590.6440.6270.6310.636
Median2.1691.4661.2001.0741.0110.9790.9440.9190.9020.8920.8320.8440.8610.873
75% prct2.7461.9591.6291.5041.4511.4021.3271.2611.2311.2271.2161.2081.2051.211
90% prct4.0463.2642.6662.2572.0031.9021.7751.7041.6591.7201.7771.8471.9081.966
95% prct5.3663.5433.0102.8542.7732.6662.5932.5212.4702.3922.3852.4242.4822.553
Max18.9579.5636.5475.1654.3783.8903.5663.3333.1543.1633.2633.4033.5243.655
Mean2.6081.7261.4151.3011.2111.1571.1201.0881.0631.0381.0140.9950.9880.993
Std2.7321.4291.0240.8230.7300.6800.6540.6400.6340.6350.6510.6830.7110.733
Now we turn our attention to four specific countries. Table 3, Table 4 present the forecasting results for the full out-of-sample period. The tables show the MAPE for forecasting horizons of 1 to 14 days ahead of the Covid-19 accumulated number of cases (Table 3) and deaths (Table 4) for Brazil, Chile, Mexico, and Portugal. Values in parenthesis are -values for the Giacomini & White test of superior predictive ability (Giacomini & White, 2006). The null hypothesis of the test is that both forecasts have the same MAPE.
Table 3

Forecasting Results for Cases: Mean Absolute Percentage Error Ratios. The table shows the ratios of the forecasting mean absolute percentage error (MAPE) of either the ECM or the trend model over the AR benchmark. Numbers below one indicates that the ECM or the trend model outperforms the AR. The table presents results for forecasting horizons of 1 to 14 days ahead of the Covid-19 accumulated number of cases. The models were computed on a rolling window scheme with 28 in-sample observations per window. For each country, the model estimation started when the number of cases of Covid-19 reached 20,000. The last out-of-sample day for every country is July 1, 2021. Values in parenthesis are -values for the Giacomini & White test. (Giacomini & White, 2006).

Cases: Forecasting Mean Absolute Percentage Errors
Days aheadBrazil
1234567891011121314
Trend1.9831.1260.8880.8130.8130.8320.8470.8250.7880.7670.7610.7610.7600.759
(0.000)(0.180)(0.184)(0.071)(0.088)(0.114)(0.131)(0.093)(0.051)(0.029)(0.021)(0.016)(0.012)(0.014)
ECM1.1570.8990.7390.6560.6380.6210.6160.6100.5900.5860.5920.5990.6070.615
(0.024)(0.074)(0.000)(0.000)(0.000)(0.001)(0.004)(0.011)(0.016)(0.018)(0.020)(0.022)(0.027)(0.034)

Days aheadChile
1234567891011121314

Trend1.4760.8690.6710.5870.5490.5430.5980.6160.6310.6460.6700.7000.7360.783
(0.003)(0.203)(0.017)(0.010)(0.017)(0.030)(0.054)(0.071)(0.081)(0.089)(0.102)(0.119)(0.144)(0.181)
ECM0.9940.7580.6070.5360.4980.4890.5410.5790.6030.6280.6590.6880.7200.764
(0.479)(0.037)(0.004)(0.003)(0.006)(0.014)(0.025)(0.034)(0.039)(0.045)(0.054)(0.067)(0.082)(0.106)

Days aheadMexico
1234567891011121314

Trend1.8001.1070.8500.7450.7100.7040.7190.7140.6920.6740.6660.6610.6620.668
(0.000)(0.116)(0.038)(0.002)(0.001)(0.001)(0.003)(0.003)(0.002)(0.001)(0.001)(0.002)(0.003)(0.008)
ECM1.0030.8310.6680.5990.5560.5500.5680.5770.5730.5660.5720.5720.5820.580
(0.478)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)

Days aheadPortugal
1234567891011121314

Trend3.3532.0791.6771.5361.5401.6051.6641.6811.6451.6021.5891.6011.6211.638
(0.000)(0.000)(0.005)(0.023)(0.031)(0.026)(0.023)(0.025)(0.031)(0.040)(0.045)(0.045)(0.044)( 0.043)
ECM1.1470.9650.8910.8810.9060.9621.0241.0721.0901.0981.1171.1531.1951.241
(0.011)(0.315)(0.072)(0.078)(0.174)(0.375)(0.435)(0.336)(0.315)(0.310)(0.288)(0.250)(0.212)(0.182)
Table 4

Forecasting Results for Deaths: Mean Absolute Percentage Error Ratios. The table shows the ratios of the forecasting mean absolute percentage error (MAPE) of either the ECM or the trend model over the AR benchmark. Numbers below one indicates that the ECM or the trend model outperforms the AR. The table presents results for forecasting horizons of 1 to 14 days ahead of the Covid-19 accumulated number of deaths. The models were computed on a rolling window scheme with 28 in-sample observations per window. For each country, the model estimation started when the number of cases of Covid-19 reached 20,000. The last out-of-sample day for every country is July 1, 2021. Values in parenthesis are -values for the Giacomini & White test. (Giacomini & White, 2006).

Deaths: Forecasting Mean Absolute Percentage Errors
Days aheadBrazil
1234567891011121314
Trend2.3691.2591.0210.9690.9731.0071.0541.0431.0060.9720.9560.9460.9330.920
(0.000)(0.141)(0.463)(0.444)(0.453)(0.489)(0.420)(0.437)(0.490)(0.457)(0.433)(0.417)(0.398)(0.375)
ECM1.4000.8810.7450.6950.6860.6960.7250.7380.7380.7320.7380.7430.7510.766
(0.000)(0.138)(0.027)(0.010)(0.002)(0.000)(0.000)(0.001)(0.016)(0.024)(0.032)(0.036)(0.039)(0.059)

Days aheadChile
1234567891011121314

Trend2.4711.5111.2070.9660.7450.5220.2960.1310.0470.0130.0030.0000.0000.000
(0.000)(0.031)(0.268)(0.465)(0.294)(0.210)(0.175)(0.162)(0.158)(0.156)(0.156)(0.156)(0.155)(0.155)
ECM1.2560.8180.6680.5420.4180.2950.1690.0760.0270.0070.0020.0000.0000.000
(0.079)(0.208)(0.137)(0.103)(0.086)(0.103)(0.130)(0.146)(0.153)(0.155)(0.156)(0.156)(0.155)(0.155)

Days aheadMexico
1234567891011121314

Trend1.7990.9870.7960.7190.6770.6560.6490.6340.6100.5910.5760.5650.5570.552
(0.000)(0.453)(0.024)(0.006)(0.002)(0.000)(0.000)(0.000)(0.000)(0.001)(0.001)(0.001)(0.001)(0.001)
ECM1.5520.9830.7560.6270.5280.4810.4920.5110.5130.4940.4760.4550.4500.454
(0.000)(0.418)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.000)(0.001)(0.001)(0.001)(0.001)

Days aheadPortugal
1234567891011121314

Trend5.4253.8813.3412.9832.7362.5702.4452.3402.2582.1822.1132.0612.0161.974
(0.000)(0.000)(0.000)(0.000)(0.001)(0.002)(0.004)(0.006)(0.009)(0.012)(0.015)(0.018)(0.021)(0.024)
ECM2.1141.6401.5261.4191.3471.2921.2671.2371.2171.1971.1831.1721.1611.153
(0.000)(0.000)(0.000)(0.001)(0.010)(0.034)(0.057)(0.092)(0.122)(0.151)(0.171)(0.191)(0.212)(0.227)
We start by comparing the models with respect to the forecasts for case counts. For Brazil, the ECM has lower MAPEs than the AR benchmark at all horizons of more than one day ahead, and the differences are all significant. It is also clear from the table that the ECM outperforms the trend model. For Chile, the ECM is better than the benchmark at all horizons. The differences are statistically significant at 12 out of 14 horizons. For Mexico, the ECM is superior to the benchmark at all horizons of more than one day ahead, and the differences are significant. Finally, for Portugal, the ECM has a much worse performance and is superior to the AR alternative only at horizons between two and six days ahead, but the differences are only significant for three-day-ahead forecasts. Still, the ECM considerably outperforms the trend model. Turning to fatalities, the ECM model is clearly superior to the benchmark, and the differences are much more pronounced. For Brazil, the ECM is better than the AR for all horizons of more than one day ahead, and the differences are strongly significant. Our proposal also shows much better results than the trend alternative. For Chile and Mexico, the results are similar. However, we should point out the for Chile there is a strong outlier in the forecasts of the AR models that distorts the results, with ratios close to zero at larger horizons. When we remove this specific outlier, the ratio for one-day-ahead forecasts is 1.323 and ranges between 0.302 and 0.868. This still indicates the superiority of the ECM for most horizons. For Portugal, the AR model delivers the best results at all horizons, followed by the ECM. In order to complement the analysis, we compute the frequency of days that the ECM has a lower absolute percentage error than the benchmark and the median of the ratio of the daily absolute errors of the ECM and benchmark specification over the forecasting sample. Fig. 2, Fig. 3 report, for each forecasting horizon, the frequency of days over the sample when the daily absolute percentage error of the ECM is smaller than the one from the AR and trend alternatives, respectively. The first column in the figures presents the results for cases, whereas the second column shows the numbers of deaths. To quantify these gains, Fig. 4, Fig. 5 present, for each horizon, the median of the ratios of the daily absolute errors. A number less than one favors the ECM model. As before, the first (second) column concerns cases (deaths). We prefer the use the median instead of the mean to avoid the potential effects of outliers.
Fig. 2

Frequency of days when ECM is better than the AR model.

The figure illustrates for different countries and horizons, the frequency of days when the absolute error of the ECM is smaller than the one from the AR specification.

Fig. 3

Frequency of days when ECM is better than the trend model.

The figure illustrates for different countries and horizons, the frequency of days when the absolute error of the ECM is smaller than the one from the trend specification.

Fig. 4

Median of daily absolute error ratios.

The figure illustrates for different countries and horizons, the median of the daily ratio of absolute errors of the ECM the one from the AR specification.

Fig. 5

Median of daily absolute error ratios.

The figure illustrates for different countries and horizons, the median of the daily ratio of absolute errors of the ECM the one from the AR specification.

Frequency of days when ECM is better than the AR model. The figure illustrates for different countries and horizons, the frequency of days when the absolute error of the ECM is smaller than the one from the AR specification. Frequency of days when ECM is better than the trend model. The figure illustrates for different countries and horizons, the frequency of days when the absolute error of the ECM is smaller than the one from the trend specification. For Brazil, the results favor the ECM over the AR alternative for horizons of more than one day ahead. Fig. 3, Fig. 5 both point to the superiority of the benchmark. However, when looking at the results in Table 3, Table 4 above, we reach a different conclusion. Therefore, it is important to uncover the drivers of the best MAPE of the ECM over the entire out-of-sample period. The reason for this finding is that the ECM is far superior to the benchmark during the first 100 days of the sample. This was the period when the number of cases and deaths in Brazil was accelerating the most. Median of daily absolute error ratios. The figure illustrates for different countries and horizons, the median of the daily ratio of absolute errors of the ECM the one from the AR specification. Median of daily absolute error ratios. The figure illustrates for different countries and horizons, the median of the daily ratio of absolute errors of the ECM the one from the AR specification. For Chile, the one-day-ahead forecasts of ECM are the best ones in 72.17% (61.75%) of the days when cases (deaths) are considered. These numbers drop to 46.52% (53.91%) when the forecasting horizon is set to 14 days. For almost all the horizons, the proportion of days when the ECM is better than the benchmark is larger than 50%. From the analysis of the results in Fig. 4, it is clear that the ECM is better than the benchmark at almost all horizons when cases are considered. When fatalities are analyzed, the ECM is always superior. For Mexico, the results are very supportive of the ECM specification. The ECM is superior to the benchmark in more than 50% of the days in almost every case considered in the analysis. The median ratios of the absolute percentage errors are always bellow one when Covid-19 cases are considered. For deaths, the ratios are below one at horizons of more than four days ahead. Finally, for Portugal, the superiority of the ECM draws attention. For all horizons considered, the ECM is better than the benchmark in terms of the number of days with lower errors, as well as in terms of the relative magnitude of the absolute percentage errors. First-stage Residual AR Coefficients. The figure illustrates the empirical distribution of the estimated AR coefficients of an AR(1) model estimated with the residuals of the first-stage LASSO regression.

Diagnostic tests

We report two diagnostic tests. First, Fig. 6 illustrates the empirical distribution of the estimated coefficient of a first-order autoregressive (AR(1)) model estimated with the residuals from the first-stage LASSO regression. The distribution is over all the rolling windows and each of the four countries analyzed. It is clear from the figure that, apart from a single case, all the estimates are bellow one in absolute values. Unit-root tests also strongly reject the null of unit roots in all but one case. This specific negative case is related to a huge outlier in the data that distorts the estimation of the AR coefficient and, consequently, the unit-root test. Based on this analysis, we are confident that our methodology is adequate for the present data.
Fig. 6

First-stage Residual AR Coefficients.

The figure illustrates the empirical distribution of the estimated AR coefficients of an AR(1) model estimated with the residuals of the first-stage LASSO regression.

The second diagnostic is related to the data inflation heuristic. Table 6 presents the MAPEs of the ECM with data inflation divided by the MAPEs of the ECM without data inflation. Numbers lower than one favor the inflation heuristic. For Brazil, Chile, and Portugal, it is clear that data inflation is superior to no inflation at all. For Mexico, we see improvements when the forecasts for cases are considered but not deaths. Changing the number of inflated observations seemed to have no significant effect. These additional results are available upon request.
Table 6

Effects of data inflation. Forecasting MAPEs of the ECM with data inflation divided by the forecasting MAPEs of the ECM without data inflation. Numbers lower than one favors the inflation heuristic.

HorizonCountry
Brazil
Chile
Mexico
Portugal
CasesDeathsCasesDeathsCasesDeathsCasesDeaths
10.8681.0400.8280.9420.9151.0090.8070.880
20.9500.9990.8990.9120.9711.0360.8140.833
30.9710.9640.8470.9150.9891.0440.8340.820
40.9440.9250.8280.9140.9891.0660.8600.822
50.9070.8890.8270.9150.9741.0830.8730.852
60.8700.8540.8170.9250.9461.0520.8780.865
70.8570.8400.8180.9220.9481.0510.8830.875
80.8590.8250.8340.9220.9591.0540.8920.873
90.8490.8270.8420.9250.9661.0520.9000.861
100.8540.8250.8480.9290.9741.0600.9160.855
110.8660.8200.8520.9270.9921.0640.9320.862
120.8640.8110.8500.9310.9911.0560.9470.872
130.8650.8110.8440.9291.0011.0560.9580.878
140.8600.8130.8370.9410.9831.0500.9700.882
Proportion of times each variable is selected by the LASSO. The table shows the frequency of times each variable is selected by the LASSO in the first-stage regression. Effects of data inflation. Forecasting MAPEs of the ECM with data inflation divided by the forecasting MAPEs of the ECM without data inflation. Numbers lower than one favors the inflation heuristic. Finally, it is important to understand which variables are being selected by the LASSO during the first stage of the methodology. Table 5 shows the frequency of selection of each variable over the rolling windows. Mexico is the latecomer country where each variable in the pool is selected at least once. Portugal seems to be the country with the most parsimonious model. Note that the frequency of selection of each variable differs from country to country.
Table 5

Proportion of times each variable is selected by the LASSO. The table shows the frequency of times each variable is selected by the LASSO in the first-stage regression.

Targettt2FranceIranItalyJapanSouth KoreaSingaporeGermanySpainUnited KingdomUS
Brazil0.520.070.500.640.700.370.530.57
Chile0.630.160.550.550.380.450.320.690.25
Mexico0.410.090.080.660.170.310.320.250.390.290.330.67
Portugal0.540.420.690.690.400.52

Robustness

In this section, we report a number of robustness checks. We start by reporting results concerning a number of alternative loss functions. In addition, we provide results showing the potential effects of vaccination on the performance of the ECM model.

Alternative loss functions

In addition to the MAPE, we present forecasting results for the following loss functions: (i) the median absolute deviation from the median (MAD), (ii) the mean absolute error (MAE), and (iii) the mean squared error. The results for the MAD, which is a loss function that is robust to outliers, are shown in Tables 7, 8, and 14. The tables provide descriptive statistics of the MAD ratios with respect to the AR benchmark. With respect to cases, the ECM outperforms the AR alternative in more than 50% of the countries at horizons between three and 12 days. Furthermore, the ECM is much better than the trend model. With respect to deaths, the performance of the ECM improves. In Table 14, we report the results for the combination of ECM and AR models. It is clear from the table that the combination of the models strongly improves the results for both cases and deaths.
Table 7

Forecasting Results for Cases: Distribution of Median Absolute Deviation from the Median Ratios. The table presents results with respect forecasting models for the number of cases of Covid-19. The table shows descriptive statistics for the ratio of the forecasting median absolute deviation from the median (MAD) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Median Absolute Deviation from the Median Ratios
Days aheadCases: ECM x AR
1234567891011121314
Min0.9100.6860.4800.4810.5360.5230.4780.5460.5690.5480.5700.5870.6220.604
5% prct0.9720.7730.6250.5420.5410.5830.5860.5880.5800.6080.6340.6660.7060.699
10% prct1.0270.8100.6730.5700.5890.6120.6520.6810.6940.6740.6940.7210.7360.771
25% prct1.0880.8600.7480.7200.7240.6990.8320.8240.8000.8240.8330.8700.8940.922
Median1.1931.0210.8690.9040.8910.8950.9420.9500.9280.9980.9630.9941.0421.102
75% prct1.4411.2551.0941.0981.1221.1941.3301.2821.3851.3661.3351.4161.4591.506
90% prct1.9061.5931.3981.3701.3401.5741.5241.5651.7911.7321.6961.7941.8851.941
95% prct2.1581.7311.6651.6201.7531.9021.8171.8381.8831.9932.0401.9121.9462.238
Max2.6563.0012.6222.4312.2932.2902.1822.0812.5502.4202.5662.2902.2312.340
Mean1.3201.1250.9990.9640.9601.0191.0581.0731.1061.1071.1271.1391.1721.232
Std0.3730.4060.3780.3710.3640.4070.3800.3680.4280.4190.4320.4020.3990.452

Days aheadCases: Trend x AR
1234567891011121314

Min1.2300.9040.6360.5610.5390.5800.5890.5690.5600.5310.5330.5370.5430.570
5% prct1.6240.9850.7400.6700.7370.7350.7210.7660.7130.7350.7530.7250.7200.702
10% prct1.6841.0930.8350.7220.7450.7970.8280.8960.8370.8070.8430.8390.8930.881
25% prct2.1541.3041.0380.9380.9661.0261.0691.0411.0461.0191.0171.0211.0561.116
Median2.6511.6221.3491.2631.2771.3181.3671.3651.3181.2921.3281.3451.3901.419
75% prct3.6292.1851.9151.6471.6251.7331.7351.7451.7061.6851.6751.7161.7381.806
90% prct5.2973.2512.3702.2032.1522.1382.0702.0332.0852.0742.0111.9781.9592.052
95% prct7.3044.0053.3252.6262.6082.3382.1942.1302.2982.4512.5972.3482.2572.544
Max12.2377.2715.9755.2654.7964.4603.9383.8323.7623.7733.7443.6993.6373.563
Mean3.2362.0071.6001.4311.4051.4361.4401.4361.4431.4341.4361.4371.4501.499
Std2.0591.1900.9310.7970.7160.6560.5820.5580.5850.5900.5920.5660.5480.562
Table 8

Forecasting Results for Deaths: Distribution Median Absolute Deviation from the Median Ratios. The table presents results with respect forecasting models for the number of deaths by Covid-19. The table shows descriptive statistics for the ratio of the forecasting median absolute deviation from the median (MAD) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Median Absolute Deviation from the Median Ratios
Days aheadDeaths: ECM x AR
1234567891011121314
Min0.9440.6660.6760.5140.4730.3980.4710.4860.4800.4200.4400.4500.4560.433
5% prct1.0400.8160.6890.6430.5810.5960.5800.5680.5680.5610.5320.5200.5270.562
10% prct1.1280.9040.7470.6630.6140.6090.6540.6370.6180.6280.6220.6010.5890.598
25% prct1.2900.9630.8430.7440.7000.7010.7190.7340.7160.7360.7130.7330.7490.767
Median1.4551.1620.9860.8980.9000.9100.8990.9260.9300.9270.8940.8910.9310.935
75% prct1.8271.2601.2091.0771.0491.0701.1161.1521.1851.2061.1971.1561.2081.198
90% prct2.2161.5521.3261.3561.4101.3051.4171.3551.3771.3471.4181.4281.4801.492
95% prct3.9341.9152.2301.6161.4651.4651.4411.4191.4531.5231.5421.4991.5851.586
Max5.0912.6602.3052.1211.7591.5771.5581.5721.5451.5431.6661.5971.6911.744
Mean1.7321.2091.0800.9780.9410.9280.9520.9690.9610.9600.9680.9570.9830.998
Std0.8690.3820.3760.3170.2920.2770.2870.2730.2780.2890.3090.3020.3190.319

Days aheadDeaths: Trend x AR
1234567891011121314

Min1.2580.6610.5920.5190.4610.4250.4320.4250.4480.4510.4440.4720.4720.497
5% prct1.4490.8420.6840.5700.5730.5450.5710.5880.6190.6100.6000.6030.6400.534
10% prct1.5171.0970.8320.6410.7230.7310.7120.7660.7600.7960.7980.8410.7230.737
25% prct1.8171.2611.0400.9810.9680.9720.9921.0210.9690.9810.9800.9730.9941.020
Median2.2381.5921.3931.2581.2361.2421.2251.1981.2121.1911.2141.1781.1901.205
75% prct2.6491.9731.6681.4861.4701.4211.4681.4611.4871.5171.5521.4851.4781.545
90% prct3.5142.2902.1032.1341.7911.8251.9141.8861.8571.8661.9542.0432.1212.127
95% prct9.1702.7952.5312.3722.4012.3682.4092.3642.4122.3782.3482.4022.4392.517
Max10.0186.2184.8164.2103.5023.0033.0513.1373.0392.9062.9923.0633.1093.326
Mean2.7331.7061.4641.3401.2951.2751.2901.2921.2761.2751.2881.2991.3111.341
Std1.9350.8520.7110.6340.5650.5170.5330.5140.5140.4950.5090.5190.5400.576
Table 14

Forecasting Combination Results for Cases and Deaths: Distribution of Median Absolute Deviation from the Median Ratios. The table presents results with respect forecasting models for the number of cases and deaths. The table shows descriptive statistics for the ratio of the forecasting median absolute deviation from the median (MAD) of either the combination ECM and AR or ECM and Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Median Absolute Deviation from the Median
Days aheadCases: ECM and AR x AR
1234567891011121314
Min0.3740.2800.2560.2500.2500.2500.2500.2500.2500.2490.2470.2430.2410.240
5% prct0.5850.3970.3310.2890.2780.2630.2600.2550.2540.2500.2500.2500.2800.250
10% prct0.6460.5080.4530.3970.3680.3150.3380.2930.3230.3110.3320.3100.3480.301
25% prct0.7020.5560.5000.4720.4460.4410.4500.4660.4600.4610.4660.4650.4880.498
Median0.8540.6850.6300.5990.5630.5850.6640.6830.6860.6960.7030.7010.7460.802
75% prct1.0400.8690.8120.8160.8710.9300.9280.9380.9280.9070.9310.9771.0421.090
90% prct1.4411.3571.2211.3961.5871.6031.7031.6031.5141.4851.4471.4141.4601.562
95% prct1.6931.9331.7622.9082.4502.2841.9531.8851.9262.0772.2482.6373.4014.638
Max2.7303.3747.7225.5604.4912.9274.2635.26111.15821.24846.52091.392205.177355.921
Mean0.9580.8410.8760.8940.8550.8050.8400.8620.9971.2291.8102.8385.5369.048
Std0.4070.5461.1041.0540.8350.6040.6950.7971.6123.0916.83813.51630.80653.518

Deaths: ECM and AR x AR
Days ahead1234567891011121314

Min0.7660.6630.5970.5930.5430.5080.5160.5020.5420.5350.5280.5220.5560.536
5% prct0.8240.7130.6980.6240.5970.5720.5730.6000.5820.5730.5550.5660.5630.581
10% prct0.8560.7730.7130.6490.6210.6280.6120.6060.6050.5980.5890.5950.5830.601
25% prct0.9650.8610.7750.7430.7170.6850.6820.6750.6940.6930.6900.6990.7170.671
Median1.0160.9210.8870.8400.8200.8110.7840.7940.8380.8180.8010.7820.7820.785
75% prct1.2831.0230.9450.9400.9240.9330.9350.9700.9700.9931.0050.9600.9600.978
90% prct1.6491.0981.0491.0350.9841.0401.1361.0671.0771.0981.0901.0971.0711.117
95% prct2.3121.4121.4641.2351.1231.1851.2481.1041.1311.1851.1561.1371.1671.174
Max4.0061.7581.6411.5271.3511.2331.4481.3111.3161.3871.2751.3491.4811.506
Mean1.2220.9630.9050.8600.8260.8230.8300.8290.8410.8430.8410.8260.8330.831
Std0.5840.2090.2040.1820.1670.1760.2050.1810.1840.1980.1930.1890.2010.208
The results concerning the MAE are reported in Tables 9, 10, and 15. The results in Table 9 for the ECM are not very encouraging. However, the combination of the ECM and the AR model is clearly superior to the AR benchmark. For deaths, the results for the ECM are much better.
Table 9

Forecasting Results for Cases: Distribution of Mean Absolute Error Ratios. The table presents results with respect forecasting models for the number of cases of Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean absolute error (MAE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Absolute Error Ratios
Days aheadCases: ECM x AR
1234567891011121314
Min0.1500.0140.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
5% prct0.5280.3390.2030.1520.1330.1240.1250.1350.1430.1470.1410.1170.1140.064
10% prct0.7090.5690.4070.3380.2630.2520.2430.2390.2670.3200.3890.2700.2210.233
25% prct0.9430.7500.6380.5950.6140.6540.6450.7050.6700.7440.7790.8800.9050.940
Median1.5171.1401.0040.9310.9491.0361.0731.1611.1661.1641.1831.2871.2961.615
75% prct2.0971.5301.3881.4301.5701.5801.7691.8171.8401.8911.9582.0172.1412.342
90% prct3.3893.5492.7613.6764.4423.9963.5853.6533.0092.8372.7192.9353.6354.516
95% prct4.2564.7464.3018.6018.0017.7366.1355.1105.1605.7516.4067.4859.88314.446
Max7.48810.52827.35719.75316.4539.44515.98920.13944.06784.623186.017365.585820.8551424.237
Mean1.7831.5981.8161.9701.8571.6841.8181.9062.4383.3845.7139.84220.14433.878
Std1.3121.7844.0313.9263.0822.1172.6003.0856.51112.49927.55954.285122.105212.001

Days aheadCases: Trend x AR
1234567891011121314

Min0.5510.0270.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
5% prct0.8050.3260.1960.1550.1320.1230.1180.1170.1180.1200.1230.1270.1330.095
10% prct1.1840.6200.4440.3380.2940.2990.3340.3330.3070.2850.2760.2870.2820.252
25% prct2.6321.2470.8480.6900.6240.6260.7120.7430.7250.7160.7030.7280.7670.790
Median5.4132.0771.5541.3161.2281.3891.4341.4851.4211.3101.2971.3391.3671.469
75% prct12.0895.5613.7053.0722.8022.5092.5672.6322.5562.3652.1912.1372.2132.215
90% prct31.33914.5999.4407.4676.5485.5345.1884.5143.9153.5233.2032.9643.0163.100
95% prct46.28219.31811.8978.9777.2036.4265.6425.2454.9724.7264.4584.2964.2114.168
Max65.54123.31716.25612.49010.2539.1358.4247.9617.6617.4617.4157.4207.4617.557
Mean10.9804.7943.2162.5582.2392.1202.0781.9731.8391.7381.6731.6431.6441.662
Std14.1295.9873.8732.9082.3762.0761.8781.6961.5631.4691.4141.3881.3781.395
Table 10

Forecasting Results for Deaths: Distribution Mean Absolute Error Ratios. The table presents results with respect forecasting models for the number of deaths by Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean absolute ratio (MAE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Absolute Error Ratios
Days aheadDeaths: ECM x AR
1234567891011121314
Min0.0340.0000.0000.0740.0860.1000.1140.1070.0390.0090.0010.0000.0000.000
5% prct0.0930.0850.1180.3450.3200.2990.2120.1270.1190.1210.0910.0240.0030.000
10% prct0.8510.6610.5030.5120.4100.3240.3120.3770.4330.2980.2730.1540.0600.015
25% prct1.1510.8780.7730.7060.6430.6170.6010.6070.5880.5730.5920.5660.5540.551
Median1.3991.0890.9070.8280.8020.7700.7730.7610.7610.7490.7100.7200.7410.749
75% prct1.7551.3541.2231.1781.1181.1101.1471.1101.0591.0811.0981.1061.1141.141
90% prct1.9781.7891.8481.7091.5611.5731.5981.6211.6941.6721.6341.6331.6151.601
95% prct2.7442.2132.0442.0141.9641.8971.8071.7451.7411.7711.8692.0312.1852.628
Max10.69210.3837.8567.4106.5996.4066.7856.8166.7826.7556.7547.0217.7468.455
Mean1.5951.3091.1351.0771.0090.9790.9770.9740.9670.9530.9480.9540.9781.023
Std1.4991.4631.1241.0560.9480.9220.9760.9850.9900.9931.0081.0621.1781.304

Days aheadDeaths: Trend x AR
1234567891011121314

Min0.0560.0000.0000.1020.1180.1340.1500.1260.0460.0110.0020.0000.0000.000
5% prct0.2780.1480.1860.4450.4440.3780.3000.1680.1540.1560.1650.0600.0090.001
10% prct1.0150.8160.6600.6090.5930.5030.5400.5410.5280.5090.3470.2010.0790.020
25% prct1.5741.0010.8350.7480.7080.6840.6980.7110.6800.6590.6030.5900.5890.591
Median2.0271.4511.2171.0911.0120.9810.9400.9170.9010.8430.8160.8190.8230.826
75% prct2.5631.8351.6471.5821.5171.4841.4711.4781.3441.3151.2911.2731.2621.288
90% prct4.1112.6222.3272.2032.0541.9621.9651.8861.8781.8441.8131.7951.7711.772
95% prct4.7493.2982.7973.0342.7172.4442.1642.1412.1682.1872.1572.1112.1222.142
Max10.7536.4774.8584.0723.5923.2572.9972.8002.6282.4932.3892.3302.4262.552
Mean2.3491.6111.3471.2811.1941.1441.1061.0681.0310.9970.9680.9460.9370.937
Std1.6891.0850.8630.7730.6940.6400.6000.5740.5590.5560.5690.5900.6100.625
Table 15

Forecasting Combination Results for Cases and Deaths: Distribution of Mean Absolute Error Ratios. The table presents results with respect forecasting models for the number of cases and deaths. The table shows descriptive statistics for the ratio of the forecasting mean absolute error (MAE) of either the combination ECM and AR or ECM and Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Absolute Error
Days aheadCases: ECM and AR x AR
1234567891011121314
Min0.8090.6450.5390.5020.5000.4910.4830.4750.4710.4640.4620.4620.4580.457
5% prct0.8320.7040.5950.5260.5060.5000.5000.5000.5000.5000.5000.5000.5000.500
10% prct0.8520.7160.6710.6320.5890.5660.5500.5610.5700.5740.5740.5690.5650.563
25% prct0.8800.7600.7160.6890.6680.6640.6590.6620.6630.6640.6650.6650.6530.644
Median0.9430.8390.7770.7530.7630.7550.7530.7660.7750.7710.7710.7780.8090.825
75% prct1.0300.9320.8950.8830.8960.9290.9580.9660.9670.9680.9780.9791.0001.040
90% prct1.2281.1521.0681.0331.0521.0461.0681.0771.0931.0981.1101.1331.1821.262
95% prct1.2941.3051.2921.2741.2481.2371.2271.2111.1911.1871.2111.2521.3281.357
Max1.7711.7311.6641.5711.5081.4531.4151.3851.3711.3631.3581.3561.4621.712
Mean0.9890.8880.8370.8120.8020.8020.8060.8100.8130.8160.8210.8300.8440.862
Std0.1740.1990.2130.2170.2140.2110.2100.2070.2070.2110.2160.2280.2480.273

Deaths: ECM and AR x AR
Days ahead1234567891011121314

Min0.5120.5000.5000.5210.5240.5240.5130.5070.5030.5010.5000.5000.5000.500
5% prct0.5310.5270.5340.5520.5470.5280.5230.5200.5140.5080.5040.5020.5010.500
10% prct0.7670.6430.5830.5940.5670.5580.5320.5290.5300.5310.5310.5220.5040.501
25% prct0.8880.7610.6960.6620.6360.6120.6140.5990.6080.6000.5900.5800.5740.565
Median0.9840.8530.7830.7440.7120.6860.6810.6840.6690.6620.6600.6590.6440.640
75% prct1.1250.9680.9040.8610.8670.8430.8140.8030.8010.7800.7750.7720.7730.778
90% prct1.3451.1321.1901.1691.0911.0301.0491.1161.1351.1521.1401.1441.1371.140
95% prct1.6071.3941.3571.2821.2571.2421.2071.1921.2281.2121.2411.3171.3951.612
Max5.4875.4004.1953.9803.5803.4853.6773.6903.6703.6553.6623.7964.1574.514
Mean1.1030.9730.8920.8590.8230.8050.8000.7990.8000.7960.7940.7970.8070.827
Std0.7150.7050.5420.5130.4600.4490.4760.4800.4810.4810.4880.5140.5720.637
Forecasting Results for Cases: Distribution of Median Absolute Deviation from the Median Ratios. The table presents results with respect forecasting models for the number of cases of Covid-19. The table shows descriptive statistics for the ratio of the forecasting median absolute deviation from the median (MAD) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Results for Deaths: Distribution Median Absolute Deviation from the Median Ratios. The table presents results with respect forecasting models for the number of deaths by Covid-19. The table shows descriptive statistics for the ratio of the forecasting median absolute deviation from the median (MAD) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Results for Cases: Distribution of Mean Absolute Error Ratios. The table presents results with respect forecasting models for the number of cases of Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean absolute error (MAE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Results for Deaths: Distribution Mean Absolute Error Ratios. The table presents results with respect forecasting models for the number of deaths by Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean absolute ratio (MAE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Results for Cases: Distribution of Mean Squared Error Ratios. The table presents results with respect forecasting models for the number of cases of Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean squared error (MSE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Results for Deaths: Distribution Mean Squared Error Ratios. The table presents results with respect forecasting models for the number of deaths by Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean squared error ratio (MSE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Combination Results for Cases and Deaths: Distribution of Mean Absolute Percentage Error Ratios. The table presents results with respect forecasting models for the number of cases and deaths. The table shows descriptive statistics for the ratio of the forecasting mean absolute percentage error (MAPE) of either the combination ECM and AR or ECM and Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Combination Results for Cases and Deaths: Distribution of Median Absolute Deviation from the Median Ratios. The table presents results with respect forecasting models for the number of cases and deaths. The table shows descriptive statistics for the ratio of the forecasting median absolute deviation from the median (MAD) of either the combination ECM and AR or ECM and Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Combination Results for Cases and Deaths: Distribution of Mean Absolute Error Ratios. The table presents results with respect forecasting models for the number of cases and deaths. The table shows descriptive statistics for the ratio of the forecasting mean absolute error (MAE) of either the combination ECM and AR or ECM and Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Forecasting Combination Results for Cases and Deaths: Distribution of Mean Squared Error Ratios. The table presents results with respect forecasting models for the number of cases and deaths. The table shows descriptive statistics for the ratio of the forecasting mean squared error (MSE) of either the combination ECM and AR or ECM and Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window. Finally, the results for the MSE are shown in Tables 11, 12, and 16. The results for the ECM are not particularly good, especially due to some outliers. However, the combination of the ECM and the AR model seems to be very robust no matter which loss function is chosen.
Table 11

Forecasting Results for Cases: Distribution of Mean Squared Error Ratios. The table presents results with respect forecasting models for the number of cases of Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean squared error (MSE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Squared Error Ratios
Days aheadCases: ECM x AR
1234567891011121314
Min0.1500.0140.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
5% prct0.5280.3390.2030.1520.1330.1240.1250.1350.1430.1470.1410.1170.1140.064
10% prct0.7090.5690.4070.3380.2630.2520.2430.2390.2670.3200.3890.2700.2210.233
25% prct0.9430.7500.6380.5950.6140.6540.6450.7050.6700.7440.7790.8800.9050.940
Median1.5171.1401.0040.9310.9491.0361.0731.1611.1661.1641.1831.2871.2961.615
75% prct2.0971.5301.3881.4301.5701.5801.7691.8171.8401.8911.9582.0172.1412.342
90% prct3.3893.5492.7613.6764.4423.9963.5853.6533.0092.8372.7192.9353.6354.516
95% prct4.2564.7464.3018.6018.0017.7366.1355.1105.1605.7516.4067.4859.88314.446
Max7.48810.52827.35719.75316.4539.44515.98920.13944.06784.623186.017365.585820.8551424.237
Mean1.7831.5981.8161.9701.8571.6841.8181.9062.4383.3845.7139.84220.14433.878
Std1.3121.7844.0313.9263.0822.1172.6003.0856.51112.49927.55954.285122.105212.001

Cases: Trend x AR
Days ahead1234567891011121314

Min0.5510.0270.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.000
5% prct0.8050.3260.1960.1550.1320.1230.1180.1170.1180.1200.1230.1270.1330.095
10% prct1.1840.6200.4440.3380.2940.2990.3340.3330.3070.2850.2760.2870.2820.252
25% prct2.6321.2470.8480.6900.6240.6260.7120.7430.7250.7160.7030.7280.7670.790
Median5.4132.0771.5541.3161.2281.3891.4341.4851.4211.3101.2971.3391.3671.469
75% prct12.0895.5613.7053.0722.8022.5092.5672.6322.5562.3652.1912.1372.2132.215
90% prct31.33914.5999.4407.4676.5485.5345.1884.5143.9153.5233.2032.9643.0163.100
95% prct46.28219.31811.8978.9777.2036.4265.6425.2454.9724.7264.4584.2964.2114.168
Max65.54123.31716.25612.49010.2539.1358.4247.9617.6617.4617.4157.4207.4617.557
Mean10.9804.7943.2162.5582.2392.1202.0781.9731.8391.7381.6731.6431.6441.662
Std14.1295.9873.8732.9082.3762.0761.8781.6961.5631.4691.4141.3881.3781.395
Table 12

Forecasting Results for Deaths: Distribution Mean Squared Error Ratios. The table presents results with respect forecasting models for the number of deaths by Covid-19. The table shows descriptive statistics for the ratio of the forecasting mean squared error ratio (MSE) of either ECM or Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Squared Error Ratios
Days aheadDeaths: ECM x AR
1234567891011121314
Min0.0000.0000.0000.0010.0020.0020.0010.0000.0000.0000.0000.0000.0000.000
5% prct0.0000.0010.0010.0350.0160.0090.0030.0030.0040.0020.0000.0000.0000.000
10% prct0.6150.2490.1070.1140.1180.1090.0800.0590.0190.0080.0050.0010.0000.000
25% prct0.9560.6660.4800.4470.3700.3500.3270.3420.3360.2610.2920.3450.3800.373
Median1.5911.1070.8420.7190.6640.6750.6810.6370.6340.7200.7640.8070.8590.921
75% prct2.7932.5322.1681.8681.6131.6171.5931.6721.7331.7701.8902.1132.1282.102
90% prct4.3213.9374.0923.9073.7824.1454.2283.8074.6604.2484.0044.0203.8655.081
95% prct7.3417.3688.9448.6746.6535.3295.5565.5897.2306.64710.36714.76418.93930.781
Max1201.247603.935273.855237.143189.338194.820323.517387.550381.101461.763575.176753.3661072.2661496.674
Mean28.73715.0497.7246.7595.5345.5648.3879.8319.78211.57714.26018.45025.82735.895
Std178.77089.79640.63935.18228.07628.89348.06657.60656.63968.66085.550112.088159.595222.807

Deaths: Trend x AR
Days ahead1234567891011121314

Min0.0000.0000.0000.0010.0020.0030.0020.0000.0000.0000.0000.0000.0000.000
5% prct0.0020.0020.0030.0570.0230.0130.0040.0040.0050.0050.0010.0000.0000.000
10% prct0.6600.4210.2990.2260.1930.1370.0800.0410.0390.0160.0070.0000.0000.000
25% prct1.7520.8010.4920.4640.4200.4110.4330.3710.2520.2540.2570.2610.2670.279
Median2.8671.5551.1631.0790.9640.8960.7190.6620.6520.6300.6220.6200.6450.676
75% prct6.3543.2232.5072.3152.1921.7241.7251.6731.5901.5331.5501.5801.6251.684
90% prct11.2507.3125.3804.6854.7204.6994.2623.9174.0614.2584.1783.7953.4723.496
95% prct35.12016.40010.6928.2967.0696.6996.7036.5686.1295.8335.6655.9956.4806.985
Max114.43337.69420.86914.36511.0588.9477.4687.5578.92210.52412.53415.02718.13622.141
Mean8.0333.6752.5082.1311.8211.6581.5681.5021.4721.4771.5141.5801.6731.793
Std19.2066.9474.1923.0922.4632.1311.9941.9602.0262.1692.3982.7223.1503.716
Table 16

Forecasting Combination Results for Cases and Deaths: Distribution of Mean Squared Error Ratios. The table presents results with respect forecasting models for the number of cases and deaths. The table shows descriptive statistics for the ratio of the forecasting mean squared error (MSE) of either the combination ECM and AR or ECM and Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Squared Error
Days aheadCases: ECM and AR x AR
1234567891011121314
Min0.3740.2800.2560.2500.2500.2500.2500.2500.2500.2490.2470.2430.2410.240
5% prct0.5850.3970.3310.2890.2780.2630.2600.2550.2540.2500.2500.2500.2800.250
10% prct0.6460.5080.4530.3970.3680.3150.3380.2930.3230.3110.3320.3100.3480.301
25% prct0.7020.5560.5000.4720.4460.4410.4500.4660.4600.4610.4660.4650.4880.498
Median0.8540.6850.6300.5990.5630.5850.6640.6830.6860.6960.7030.7010.7460.802
75% prct1.0400.8690.8120.8160.8710.9300.9280.9380.9280.9070.9310.9771.0421.090
90% prct1.4411.3571.2211.3961.5871.6031.7031.6031.5141.4851.4471.4141.4601.562
95% prct1.6931.9331.7622.9082.4502.2841.9531.8851.9262.0772.2482.6373.4014.638
Max2.7303.3747.7225.5604.4912.9274.2635.26111.15821.24846.52091.392205.177355.921
Mean0.9580.8410.8760.8940.8550.8050.8400.8620.9971.2291.8102.8385.5369.048
Std0.4070.5461.1041.0540.8350.6040.6950.7971.6123.0916.83813.51630.80653.518

Deaths: ECM and AR x AR
Days ahead1234567891011121314

Min0.2470.2500.2500.2470.2510.2500.2500.2500.2490.2450.2440.2440.2460.247
5% prct0.2500.2500.2700.2510.2550.2530.2520.2520.2500.2500.2500.2500.2500.250
10% prct0.4730.3440.3210.2760.2600.2580.2570.2570.2530.2520.2500.2500.2500.250
25% prct0.6790.5050.4370.4240.3830.3590.3020.3260.3250.3000.3020.3290.3450.342
Median0.8710.6850.5990.5460.5090.4850.4700.4610.4350.4440.4410.4520.4260.431
75% prct1.2441.0870.9370.8560.7880.7750.7830.7450.7490.7590.7850.8240.8590.914
90% prct1.5231.3511.6561.5161.5391.3971.4871.5821.6251.5281.4661.4791.4561.657
95% prct2.3432.2992.6802.5372.1731.9921.9451.8932.3292.1923.1984.2015.2318.144
Max303.101152.26469.16559.95547.86949.24581.82598.13896.613116.896145.307190.037270.110376.467
Mean7.6814.1692.3542.0501.7261.7232.4342.8022.7913.2403.9084.9516.7999.317
Std4 5.04322.58310.3258.8467.0537.26012.11214.54314.31417.33721.56928.23140.16056.002

Forecast errors and vaccination

One question of interest is whether the evolution of vaccination affects the performance of the ECM. To answer this question, we ran the following regression for each country: where and are the out-of-sample forecasting errors of the ECM and AR models, respectively, at the -step-ahead horizon, and is the proportion of the population that is fully vaccinated. In Fig. 7, we report the t-statistic for the null hypothesis . The red lines indicate the threshold. Standard errors for the coefficients are robust to heteroskedasticity and autocorrelation. As we can see from the plots, for most countries, the null hypothesis is not rejected, such that there is no evidence that vaccination is correlated with the performance of the models. For the case where the null is rejected, the effects can be either positive or negative. The adaptiveness of the locally regularized linear models capture the slow-varying dynamics in the relationship between dependent and independent variables. As such, we conclude that cross-country differences in vaccination do not notably affect the performance of the ECM.
Fig. 7

t-statistic.

The figure illustrates the t-statistic of the regression of on the proportion of the population vaccinated in each country.

t-statistic. The figure illustrates the t-statistic of the regression of on the proportion of the population vaccinated in each country.

Conclusion

We proposed a statistical model for very short-run forecasts of Covid-19 cases and deaths in countries and regions that are latecomers. We believe this is a useful tool to inform healthcare management. Nonetheless, structural breaks might worsen forecasts a few days after such breaks. So the use of this tool should be complemented by other external information, such as proxies for social distancing, to guide subjective or objective assessments of potential dynamic changes in the pandemic’s evolution. We hope to enhance the model by improving the methodology and incorporating more information. And we aim to keep the forecasts, methodology, and codes updated on a daily basis at https://covid19analytics.com.br/.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Table 13

Forecasting Combination Results for Cases and Deaths: Distribution of Mean Absolute Percentage Error Ratios. The table presents results with respect forecasting models for the number of cases and deaths. The table shows descriptive statistics for the ratio of the forecasting mean absolute percentage error (MAPE) of either the combination ECM and AR or ECM and Trend models and the AR benchmark. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Ratio of Forecasting Mean Absolute Percentage Errors
Days aheadCases: ECM and AR x AR
1234567891011121314
Min0.8210.6790.5410.5020.5000.4950.4550.4610.4690.4780.4750.4660.4580.446
5% prct0.8360.7050.5850.5460.5180.5000.5000.5000.4990.4970.4970.4960.4950.495
10% prct0.8670.7180.6260.5700.5590.5340.5270.5280.5170.5320.5420.5620.5590.543
25% prct0.8920.7900.7170.6770.6510.6310.6220.6160.6140.6090.6150.6140.6180.622
Median0.9370.8350.7910.7680.7670.7730.7620.7590.7640.7710.7700.7620.7690.775
75% prct1.0140.8900.8480.8430.8430.8570.8730.8850.8980.9100.9290.9440.9580.976
90% prct1.1371.0991.0681.0831.1171.0991.0911.1421.1861.1991.2101.2401.2791.322
95% prct1.3921.3471.2721.2151.1941.2081.2981.3341.3401.3431.3431.3611.3991.433
Max1.6711.6111.5581.4781.4471.4111.3881.3621.6031.9732.5963.4725.0016.279
Mean0.9910.8880.8320.8050.7950.7900.7940.7970.8060.8180.8350.8610.9020.941
Std0.1730.1910.1980.2030.2090.2120.2250.2300.2490.2830.3510.4620.6710.852

Deaths: ECM and AR x AR
Days ahead1234567891011121314

Min0.5170.5000.4870.4590.4430.4480.4610.4670.4380.4450.4450.4720.4910.489
5% prct0.5460.5410.5190.5250.5270.5160.5090.5050.5080.5020.5000.4990.4980.497
10% prct0.7210.6250.5610.5800.5700.5480.5330.5220.5190.5190.5070.5020.5010.500
25% prct0.8580.7360.6800.6440.6080.5940.5790.5690.5700.5640.5550.5440.5270.525
Median0.9360.8090.7350.6970.6680.6610.6470.6410.6430.6390.6290.6210.6110.609
75% prct1.0860.9760.8910.8380.7870.7540.7320.7300.7280.7270.7300.7480.7660.780
90% prct1.3181.2011.0931.1051.1111.0000.9900.9851.0271.0030.9830.9790.9850.992
95% prct1.5551.3081.2581.2081.2511.3441.4761.5021.5211.5431.5831.6521.7361.828
Max2.0651.8561.5321.4811.3901.4271.5781.9632.4312.0532.5032.8233.2763.891
Mean0.9950.8640.8010.7710.7460.7310.7260.7310.7420.7320.7400.7500.7630.794
Std0.2930.2510.2210.2130.2150.2260.2550.2930.3460.3120.3660.4150.4840.584
Table 17

Forecasting Results: Distribution of Exception Rates (90% Prediction Interval). The table presents results with respect the prediction intervals produced by the ECM for both cases and deaths. The table shows descriptive statistics for the frequency that the absolute out-of-sample errors of the ECM model exceeds the 90% prediction interval. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Exception Rates (90% Confidence Interval)
Days aheadCases: ECM
1234567891011121314
Min0.0260.0220.0140.0030.0030.0030.0020.0020.0020.0020.0020.0020.0020.002
5% prct0.0380.0420.0410.0270.0240.0130.0120.0210.0210.0210.0180.0180.0140.005
10% prct0.0510.0540.0460.0370.0350.0320.0410.0350.0370.0370.0450.0400.0220.021
25% prct0.0720.0680.0680.0660.0630.0700.0710.0680.0660.0670.0670.0620.0600.061
Median0.0860.0860.0870.0910.0870.0880.0880.0890.0850.0870.0820.0820.0810.079
75% prct0.0980.1020.1010.1010.1030.1040.1000.0970.1000.1020.1020.0950.0970.099
90% prct0.1080.1100.1110.1130.1130.1200.1200.1200.1240.1200.1250.1280.1270.125
95% prct0.1140.1130.1150.1160.1210.1250.1310.1330.1370.1410.1370.1320.1340.131
Max0.1150.1180.1280.1280.1260.1350.1400.1430.1480.1530.1550.1480.1400.136
Mean0.0820.0840.0840.0820.0820.0840.0840.0830.0830.0830.0820.0790.0780.076
Std0.0210.0220.0250.0280.0290.0310.0310.0310.0320.0320.0320.0330.0340.035

Deaths: ECM
Days ahead1234567891011121314

Min0.0160.0160.0140.0180.0180.0230.0220.0160.0160.0180.0180.0150.0110.011
5% prct0.0210.0270.0270.0270.0300.0300.0290.0240.0250.0230.0190.0180.0170.015
10% prct0.0260.0350.0300.0310.0320.0350.0320.0320.0310.0280.0260.0260.0220.022
25% prct0.0540.0440.0440.0450.0510.0500.0580.0510.0550.0520.0510.0470.0530.052
Median0.0840.0770.0760.0730.0760.0740.0720.0740.0720.0750.0770.0720.0770.074
75% prct0.0950.0900.0910.0930.0970.0960.0920.0880.0900.0940.0930.0890.0910.091
90% prct0.1060.1070.1030.1110.1050.1050.1050.1050.1020.1040.1020.1000.0970.099
95% prct0.1110.1140.1140.1130.1140.1100.1080.1060.1070.1140.1110.1080.1030.104
Max0.1340.1240.1500.1350.1220.1240.1220.1290.1160.1290.1350.1500.1540.150
Mean0.0760.0710.0710.0700.0720.0720.0720.0710.0710.0720.0720.0700.0690.069
Std0.0290.0270.0300.0300.0290.0260.0250.0270.0260.0290.0290.0300.0310.030
Table 18

Forecasting Results: Distribution of Exception Rates (95% Prediction Interval). The table presents results with respect the prediction intervals produced by the ECM for both cases and deaths. The table shows descriptive statistics for the frequency that the absolute out-of-sample errors of the ECM model exceeds the 95% prediction interval. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Exception Rates (95% Confidence Interval)
Days aheadCases: ECM
1234567891011121314
Min0.0260.0130.0110.0030.0030.0020.0020.0020.0020.0020.0020.0020.0020.002
5% prct0.0290.0340.0280.0200.0160.0110.0120.0080.0150.0180.0180.0150.0070.005
10% prct0.0330.0420.0380.0310.0240.0280.0290.0270.0290.0240.0330.0240.0210.019
25% prct0.0490.0500.0530.0490.0500.0560.0540.0520.0510.0500.0490.0510.0450.045
Median0.0590.0610.0630.0640.0640.0660.0670.0660.0640.0620.0640.0620.0610.061
75% prct0.0720.0710.0730.0730.0760.0740.0740.0710.0710.0750.0770.0750.0760.075
90% prct0.0760.0800.0800.0810.0830.0900.0950.0900.0910.0920.0900.0920.0900.086
95% prct0.0780.0850.0850.0920.0930.0970.1020.0980.0980.0960.0990.0990.0980.097
Max0.0800.0920.0930.0980.1000.1000.1030.1050.1050.0980.1020.1080.1050.120
Mean0.0590.0600.0610.0600.0610.0620.0630.0610.0610.0610.0610.0610.0590.058
Std0.0150.0160.0170.0200.0230.0230.0240.0230.0230.0230.0230.0240.0250.026

Deaths: ECM
Days ahead1234567891011121314

Min0.0160.0130.0110.0140.0160.0180.0150.0160.0140.0150.0140.0110.0110.011
5% prct0.0170.0200.0210.0240.0260.0240.0200.0220.0190.0180.0150.0170.0150.013
10% prct0.0230.0230.0220.0260.0290.0270.0300.0270.0210.0200.0180.0190.0190.019
25% prct0.0420.0370.0350.0330.0350.0330.0350.0350.0360.0370.0370.0330.0410.038
Median0.0540.0500.0520.0540.0520.0520.0530.0550.0550.0560.0540.0540.0520.054
75% prct0.0650.0650.0630.0640.0630.0620.0640.0640.0640.0630.0640.0670.0660.068
90% prct0.0720.0710.0720.0720.0740.0750.0720.0750.0730.0740.0720.0750.0720.074
95% prct0.0790.0790.0780.0760.0770.0800.0760.0780.0790.0790.0780.0770.0740.075
Max0.0910.1050.0870.0820.0790.0850.0820.0880.0820.0860.0900.0970.1010.097
Mean0.0520.0510.0490.0500.0500.0510.0510.0520.0510.0520.0510.0510.0500.050
Std0.0180.0200.0190.0180.0170.0170.0170.0180.0180.0190.0190.0210.0200.021
Table 19

Forecasting Results: Distribution of Exception Rates (99% Prediction Interval). The table presents results with respect the prediction intervals produced by the ECM for both cases and deaths. The table shows descriptive statistics for the frequency that the absolute out-of-sample errors of the ECM model exceeds the 99% prediction interval. The results are shown for forecasting horizons of 1 to 14 days ahead. The models were computed on a rolling window scheme with 28 in-sample observations per window.

Descriptive Statistics: Exception Rates (99% Confidence Interval)
Days aheadCases: ECM
1234567891011121314
Min0.0120.0020.0040.0030.0020.0020.0020.0020.0020.0020.0020.0020.0020.002
5% prct0.0150.0140.0130.0110.0090.0110.0090.0060.0080.0080.0090.0050.0050.005
10% prct0.0190.0180.0180.0190.0160.0160.0190.0190.0190.0180.0150.0130.0150.015
25% prct0.0250.0240.0280.0280.0260.0270.0290.0270.0250.0270.0240.0260.0250.026
Median0.0330.0310.0340.0370.0350.0360.0360.0370.0360.0350.0360.0340.0350.033
75% prct0.0380.0380.0410.0420.0420.0410.0430.0420.0410.0420.0440.0420.0410.041
90% prct0.0420.0450.0480.0460.0480.0470.0480.0500.0480.0530.0530.0540.0530.054
95% prct0.0470.0520.0520.0500.0570.0560.0550.0560.0560.0590.0620.0620.0640.058
Max0.0530.0550.0560.0520.0600.0670.0670.0820.0820.0790.0820.0750.0710.064
Mean0.0320.0310.0340.0340.0340.0340.0350.0350.0340.0350.0350.0340.0340.033
Std0.0090.0110.0110.0110.0130.0130.0130.0140.0140.0150.0160.0160.0150.014

Deaths: ECM
Days ahead1234567891011121314

Min0.0100.0050.0070.0110.0110.0110.0110.0110.0110.0110.0070.0070.0070.009
5% prct0.0110.0100.0110.0120.0140.0140.0120.0130.0130.0120.0110.0110.0130.012
10% prct0.0130.0130.0140.0160.0150.0150.0150.0150.0140.0140.0140.0130.0150.014
25% prct0.0200.0210.0200.0230.0230.0230.0190.0180.0190.0190.0200.0190.0210.019
Median0.0270.0270.0270.0260.0280.0280.0280.0300.0300.0300.0300.0290.0300.029
75% prct0.0340.0330.0340.0330.0340.0330.0360.0380.0360.0340.0350.0370.0380.040
90% prct0.0380.0350.0400.0420.0440.0400.0410.0420.0410.0410.0450.0460.0460.047
95% prct0.0430.0430.0460.0440.0480.0510.0530.0530.0540.0540.0540.0580.0560.056
Max0.0470.0460.0500.0500.0510.0590.0600.0600.0640.0640.0620.0590.0590.064
Mean0.0270.0260.0270.0280.0290.0290.0290.0290.0300.0290.0300.0290.0300.030
Std0.0100.0090.0100.0100.0100.0100.0120.0120.0120.0120.0120.0130.0130.013
  13 in total

1.  Short-term forecasting COVID-19 cumulative confirmed cases: Perspectives for Brazil.

Authors:  Matheus Henrique Dal Molin Ribeiro; Ramon Gomes da Silva; Viviana Cocco Mariani; Leandro Dos Santos Coelho
Journal:  Chaos Solitons Fractals       Date:  2020-05-01       Impact factor: 5.944

2.  Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study.

Authors:  Joseph T Wu; Kathy Leung; Gabriel M Leung
Journal:  Lancet       Date:  2020-01-31       Impact factor: 79.321

3.  Modeling and forecasting the early evolution of the Covid-19 pandemic in Brazil.

Authors:  Saulo B Bastos; Daniel O Cajueiro
Journal:  Sci Rep       Date:  2020-11-10       Impact factor: 4.379

4.  When will the Covid-19 pandemic peak?

Authors:  Shaoran Li; Oliver Linton
Journal:  J Econom       Date:  2020-09-08       Impact factor: 2.388

5.  Time dynamics of COVID-19.

Authors:  Cody Carroll; Satarupa Bhattacharjee; Yaqing Chen; Paromita Dubey; Jianing Fan; Álvaro Gajardo; Xiner Zhou; Hans-Georg Müller; Jane-Ling Wang
Journal:  Sci Rep       Date:  2020-12-03       Impact factor: 4.379

6.  Testing the predictive accuracy of COVID-19 forecasts.

Authors:  Laura Coroneo; Fabrizio Iacone; Alessia Paccagnini; Paulo Santos Monteiro
Journal:  Int J Forecast       Date:  2022-01-31

7.  Early dynamics of transmission and control of COVID-19: a mathematical modelling study.

Authors:  Adam J Kucharski; Timothy W Russell; Charlie Diamond; Yang Liu; John Edmunds; Sebastian Funk; Rosalind M Eggo
Journal:  Lancet Infect Dis       Date:  2020-03-11       Impact factor: 25.071

View more
  1 in total

1.  COVID-19: average time from infection to death in Poland, USA, India and Germany.

Authors:  Antoni Wiliński; Łukasz Kupracz; Aneta Senejko; Grzegorz Chrząstek
Journal:  Qual Quant       Date:  2022-02-15
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.