Literature DB >> 32984666

Modeling Palestinian COVID-19 Cumulative Confirmed Cases: A Comparative Study.

Issam Dawoud1.   

Abstract

COVID-19 is still a major pandemic threatening all the world. In Palestine, there were 26,764 COVID-19 cumulative confirmed cases as of 27th August 2020. In this paper, two statistical approaches, autoregressive integrated moving average (ARIMA) and k-th moving averages - ARIMA models are used for modeling the COVID-19 cumulative confirmed cases in Palestine. The data was taken from World Health Organization (WHO) website for one hundred seventy-six (176) days, from March 5, 2020 through August 27, 2020. We identified the best models for the above mentioned approaches that are ARIMA (1,2,4) and 5-th Exponential Weighted Moving Average - ARIMA (2,2,3). Consequently, we recommended to use the 5-th Exponential Weighted Moving Average - ARIMA (2,2,3) model in order to forecast new values of the daily cumulative confirmed cases in Palestine. The forecast values are alarming, and giving the Palestinian government a good picture about the next number of COVID-19 cumulative confirmed cases to review her activities and interventions and to provide some robust structures and measures to avoid these challenges.
© 2020 The Author.

Entities:  

Keywords:  ARIMA models; COVID-19; Forecasting; Moving average models; Pandemic

Year:  2020        PMID: 32984666      PMCID: PMC7508169          DOI: 10.1016/j.idm.2020.09.001

Source DB:  PubMed          Journal:  Infect Dis Model        ISSN: 2468-0427


Introduction

The SARS-CoV-2 (COVID-19) is a new pandemic which spreads rapidly from person to person and to many countries (Ceylan, 2020). In most countries, the health systems are tenuous and especially in war regions. Now, This pandemic is the world crisis. According to studies, COVID-19 originated from Wuhan, China, in the end of 2019 and has caused an economic crisis whose impact will be felt for the next years (WHO, 2020). On January 30, 2020, the World Health Organization (WHO) declared the outbreak as a Public Health Emergency of International concern (International Health Regulations, 2020 30 January 2020; WHO, 2020). According to the ministry of health in Palestine, the first case was detected on March 5, 2020.152 deaths are reported on August 27, 2020 with more than 26,764 total number of infected cases. The phase of COVID-19 epidemics can be decomposed as an exponential growth. Since forecasting is an important issue nowadays, there are many different methods have been proposed and developed to get accurate forecasts. The well-known used method is called the Box and Jenkins method using autoregressive integrated moving average (ARIMA) models. Also, many studies in different fields discussed using different forecasting models as ARIMA models and k-th moving averages (k-th simple moving average (k-th SMA), k-th weighted moving average (k-th WMA) and k-th exponential weighted moving average (k-th EWMA)) with ARIMA models in order to find the best accurate forecasting models under some conditions, for example, Crane and Crotty (1967), Shami and Snyder (1998), Billah, King, Snyder, & Koehler (2006), Burman and Shumway (2006), Shih and Tsokos (2008), Tsokos (2010), Safi and Dawoud (2013) and Dawoud and Kaciranlar (2017a, 2017b, 2017c)). Recently, there are many studies about building models for forecasting the ongoing trend with data-driven analysis and estimating the COVID-19, to mention a few, Li et al. (2020), Fanelli and Piazza (2020), Roda et al. (2020), Wei et al. (2016), Al-qaness et al. (2020), Anastassopoulou et al. (2020), Wang et al. (2020), Ayinde et al. (2020), Ghosal et al. (2020), Ceylan (2020) and Ogundokun et al. (2020). The main purpose of this article is modeling COVID-19 cumulative confirmed cases in Palestine using different existing methods as ARIMA and k-th moving averages – ARIMA models for giving accurate forecast values. This article is organized as follows, in section 2 we present some fundamental definitions. In section 3 we give and describe measures of forecast accuracy. In section 4 we present data description, forecasting techniques and the fitting models for COVID-19 data. Finally, in section 5 some conclusions are given.

Fundamental definitions

In this section, we present the definitions of ARIMA, the k-th SMA, k-th WMA and k-th EWMA models. (Box et al., 1994) (ARIMA Model)

The classical model is defined as

where , is the difference filter, is the degree of differencing of the series, and . Stationarity and invertibility requires the roots of and to lie outside the unit circle, respectively. (Shih & Tsokos, 2008) (The k-th SMA method) The k-th SMA method of a time series is defined aswhere . (Shih & Tsokos, 2008) (The k-th SMA Back-Shift) The k-th SMA Back-Shift operator in order to obtain the estimates of the original time series data , that is. . (2.3). (Tsokos, 2010) (The k-th WMA Method) The k-th WMA method of a time series is defined as:where . (Tsokos, 2010) (The k-th WMA Back-Shift) The k-th WMA Back-Shift operator in order to obtain the estimates of the original time series data , that is. . (2.5). (Tsokos, 2010) (The k-th EWMA Method) The k-th EWMA method of a time series is defined as:where and the smoothing factor . (Tsokos, 2010) (The k-th EWMA Back-Shift) The k-th EWMA Back-Shift operator in order to obtain the estimates of the original time series data , that is,

Measures of forecast accuracy

Several measures of forecast accuracy have been introduced by many authors. These measures are recommended to use in comparing the forecast methods accuracy which are applied to univariate time series data. For example, Hyndman and Koehler (2006) introduced the Mean Square Error (MSE) as the most used measure of deviation between the actual and the predicted value. The MSE measure is defined as:where is the actual value and, is the predicted value. MSE is one of the most commonly used measures of forecast accuracy and the .

Fitting forecasting models for COVID-19 data

This section presents data description, forecasting techniques and the fitting models for COVID-19 data by using the two different approaches, ARIMA (p,d,q) and k-th moving averages-ARIMA (p,d,q) models. Consider the daily COVID-19 cumulative confirmed cases in Palestine, from 5-3–2020 through 27-8-2020, the forecasting results are presented in the following sub-sections.

Data description

We use a daily COVID-19 cumulative confirmed cases dataset of Palestine from 5-3–2020 through 27-8-2020 (data source: WHO website., 2020). The time-series plot of the daily COVID-19 cumulative confirmed cases is in Fig. 1.
Fig. 1

Daily COVID-19 cumulative confirmed cases over time.

Daily COVID-19 cumulative confirmed cases over time. Fig. 1 shows that the Daily COVID-19 cumulative confirmed cases over time is exponentially increasing which the cumulative number of confirmed cases reaches 26,764 cases as of 27, August 2020.

Forecasting technique

In this study, we are fitting the classical ARIMA, a k-th SMA, a k-th WMA and k-th EWMA with ARIMA models for 90% (approximately 158 observations) of the available data which is called the in-sample forecast or the training data and the remaining 10% (approximately 18 observations) is used for the out-of-sample forecast or testing the models. The forecast accuracy measures to these forecasting models are given for the in-sample and the out-of-sample. R-statistical software is used for fitting the above mentioned models.

Fitting the classical ARIMA model for COVID-19 data

After taking the second difference of the original data in order to make it stationarity, different combinations of ARIMA models with and their corresponding RMSE values are shown in Table 1.
Table 1

The RMSE of forecast accuracy for the different-ARIMA models for in-sample.

Model Order(1,2,0)(2,2,0)(3,2,0)(4,2,0)(5,2,0)(0,2,1)
RMSE74.7092763.6152163.1909862.134261.8354266.02938
Continued
Model Order(0,2,2)(0,2,3)(0,2,4)(0,2,5)(1,2,1)(1,2,2)
RMSE65.7185361.4393661.1992861.1861265.9433163.84385
Continued
Model Order(1,2,3)(1,2,4)(2,2,1)(2,2,2)(2,2,3)(3,2,1)
RMSE61.1479160.6634662.4458261.9661761.1100862.21753
Continued
Model Order(3,2,2)(4,2,1)(0,2,0)
RMSE61.9579661.7425178.79781
The RMSE of forecast accuracy for the different-ARIMA models for in-sample. Table 1 shows that the lowest RMSE value among all models is for ARIMA (1,2,4) which is equal to 60.66346 for in-sample forecasts. This result indicates that ARIMA (1,2,4) model is more efficient than other mentioned ARIMA models with for in-sample forecasts. We use maximum likelihood estimation and show the results obtained from the R statistical software in Table 2. Here we see that , , , and . We also see that the estimated noise variance is .
Table 2

Maximum Likelihood Estimates from R Software: COVID-19 series. The estimated model would be written.

AR (1)MA (1)MA (2)MA (3)MA (1)
Coefficients−0.78800.2011−0.9299−0.00990.4589
StandardError0.11560.12990.09840.09950.1121
Maximum Likelihood Estimates from R Software: COVID-19 series. The estimated model would be written. The RMSE of forecast accuracy for the k-th moving averages-ARIMA models for in-sample.

Fitting the k-th moving averages - ARIMA models for COVID-19 data

Different combinations of k-th moving averages-ARIMA models with and their corresponding RMSE values are shown in Table 3.
Table 3

The RMSE of forecast accuracy for the k-th moving averages-ARIMA models for in-sample.

Models2nd SMA-ARIMA2nd WMA-ARIMA2nd EWMA-ARIMA3rd SMA-ARIMA3rd WMA-ARIMA3rd EWMA-ARIMA
RMSE62.1436661.4407761.3554562.1413161.1475461.32235
Continued
Models4thSMA-ARIMA4thWMA-ARIMA4thEWMA-ARIMA5thSMA-ARIMA5thWMA-ARIMA5thEWMA-ARIMA
RMSE74.3468362.8760562.8856868.7791962.070959.9422
Table 3 shows that the lowest RMSE value among all models is for 5-th EWMA-ARIMA (2,2,3) which is equal to 59.9422 for in-sample forecasts. This result indicates that 5-th EWMA-ARIMA (2,2,3) model is more efficient than other k-th moving averages-ARIMA models with for in-sample forecasts. We use maximum likelihood estimation and show the results obtained from the R statistical software in Table 4. Here we see that , , , and . We also see that the estimated noise variance is . Noting the P-values, the estimates of all autoregressive and moving average coefficients are significantly different from zero statistically.
Table 4

Maximum Likelihood Estimates from R Software: COVID-19 series.

AR (1)AR (2)MA (1)MA (2)MA (3)
Coefficients−0.0086−0.87880.15840.91120.3873
StandardError0.04370.03920.10280.10000.1107
Maximum Likelihood Estimates from R Software: COVID-19 series. Table 5 shows the actual and forecasting results for COVID-19 cumulative confirmed cases from August 14, 2020 to August 31, 2020 based on the classical ARIMA (1,2,4) and 5th EWMA-ARIMA (2,2,3) models.
Table 5

Actual and Forecasting results of ARIMA (1,2,4) and 5-th EWMA-ARIMA (2,2,3) models daily COVID 19 cumulative confirmed cases from August 10, 2020 to August 27, 2020.

DateActual dataForecast
ARIMA5th EWMA-ARIMA
1008202018,65118,76618,787
1108202019,12119,25419,296
1208202019,59419,70219,776
1308202020,09320,19620,267
1408202020,52520,65320,704
1508202021,05621,14021,151
1608202021,55421,60421,575
1708202021,93522,08622,007
1808202022,39122,55322,459
1908202023,00323,03222,912
2008202023,42723,50123,321
2108202023,93923,97823,763
2208202024,39624,45024,177
2308202024,70724,92524,603
2408202025,02425,39825,101
2508202025,57725,87225,630
2608202026,16226,34526,100
2708202026,76426,82026,521
Actual and Forecasting results of ARIMA (1,2,4) and 5-th EWMA-ARIMA (2,2,3) models daily COVID 19 cumulative confirmed cases from August 10, 2020 to August 27, 2020. Table 5 shows that the 5-th EWMA-ARIMA forecasts are so near to the actual values of the COVID-19 cumulative confirmed cases than that of the classical ARIMA. Also, the RMSE for ARIMA and 5-th EWMA-ARIMA equal 158.1097 and 138.3910, respectively for the out-of sample forecasts. This result shows that RMSE of 5-th EWMA-ARIMA is 87.53% of RMSE for ARIMA. In other words, the RMSE of ARIMA model is 1.14 times RMSE of the 5-th EWMA-ARIMA model. This means 5-th EWMA-ARIMA model for forecasting is much more accurate and efficient than the ARIMA forecasting model. Table 6 shows the forecasting results for COVID-19 cumulative confirmed cases from 28 to 08–2020 to 30-10-2020 (62 values) based on the classical ARIMA (1,2,4) and 5th EWMA-ARIMA (2,2,3) models.
Table 6

Forecast values for daily COVID 19 cumulative confirmed cases from 01 to 09–2020 to 30-10-2020 (64 values) based on ARIMA (1,2,4), 5th EWMA-ARIMA (2,2,3).

DateARIMA5th EWMA-ARIMADateARIMA5th EWMA-ARIMA
2808202027,29326,8972909202042,45341,797
2908202027,76727,5313009202042,92742,258
3008202028,24027,9940110202043,40142,718
3108202028,71528,4630210202043,87443,177
0109202029,18828,9370310202044,34843,638
0209202029,66229,3510410202044,82244,098
0309202030,13629,8310510202045,29644,558
0409202030,60930,2910610202045,76945,018
0509202031,08330,7550710202046,24345,479
0609202031,55731,2180810202046,71745,939
0709202032,03131,6690910202047,19146,399
0809202032,50432,1311010202047,66446,859
0909202032,97832,5941110202048,13847,320
1009202033,45233,0551210202048,61247,780
1109202033,92633,5141310202049,08548,240
1209202034,39933,9721410202049,55948,700
1309202034,87334,4341510202050,03349,160
1409202035,34734,8951610202050,50749,621
1509202035,82135,3541710202050,98050,080
1609202036,29435,8141810202051,45450,541
1709202036,76836,2751910202051,92851,001
1809202037,24236,7362010202052,40251,461
1909202037,71637,1952110202052,87551,921
2009202038,18937,6552210202053,34952,381
2109202038,66338,1162310202053,82352,842
2209202039,13738,5762410202054,29753,302
2309202039,61139,0362510202054,77053,762
2409202040,08439,4962610202055,24454,222
2509202040,55839,9572710202055,71854,683
2609202041,03240,4172810202056,19255,143
2709202041,50640,8772910202056,66555,603
2809202041,97941,3363010202057,13956,063
Forecast values for daily COVID 19 cumulative confirmed cases from 01 to 09–2020 to 30-10-2020 (64 values) based on ARIMA (1,2,4), 5th EWMA-ARIMA (2,2,3).

Conclusion

In previous studies, many statistical methods and time series models were used to forecast epidemic cases. In this article, we have applied the ARIMA and the k-th moving averages-ARIMA models on the data of the cumulative confirmed cases of COVID-19 in Palestine. We have examined the model selection sensitivity based on forecast accuracy criterion, RMSE, for the above mentioned models. The main finding in general is that some of k-th moving averages ARIMA models give better results for in-sample and out-of-sample forecasts than the classical ARIMA models. In particular, the best model is 5-th EWMA-ARIMA (2,2,3) among all models. It is recommended for practitioners to use the k-th moving averages – ARIMA models for modeling any phenomenon and getting accurate forecast values.

Declaration of competing interest

The author declares that he has no conflict of interest.
  10 in total

1.  Optimization Method for Forecasting Confirmed Cases of COVID-19 in China.

Authors:  Mohammed A A Al-Qaness; Ahmed A Ewees; Hong Fan; Mohamed Abd El Aziz
Journal:  J Clin Med       Date:  2020-03-02       Impact factor: 4.241

2.  Application of a Combined Model with Autoregressive Integrated Moving Average (ARIMA) and Generalized Regression Neural Network (GRNN) in Forecasting Hepatitis Incidence in Heng County, China.

Authors:  Wudi Wei; Junjun Jiang; Hao Liang; Lian Gao; Bingyu Liang; Jiegang Huang; Ning Zang; Yanyan Liao; Jun Yu; Jingzhen Lai; Fengxiang Qin; Jinming Su; Li Ye; Hui Chen
Journal:  PLoS One       Date:  2016-06-03       Impact factor: 3.240

3.  Modeling Nigerian Covid-19 cases: A comparative analysis of models and estimators.

Authors:  Kayode Ayinde; Adewale F Lukman; Rauf I Rauf; Olusegun O Alabi; Charles E Okon; Opeyemi E Ayinde
Journal:  Chaos Solitons Fractals       Date:  2020-06-09       Impact factor: 5.944

4.  Trend and forecasting of the COVID-19 outbreak in China.

Authors:  Qiang Li; Wei Feng; Ying-Hui Quan
Journal:  J Infect       Date:  2020-02-27       Impact factor: 6.072

5.  Linear Regression Analysis to predict the number of deaths in India due to SARS-CoV-2 at 6 weeks from day 0 (100 cases - March 14th 2020).

Authors:  Samit Ghosal; Sumit Sengupta; Milan Majumder; Binayak Sinha
Journal:  Diabetes Metab Syndr       Date:  2020-04-02

6.  Data-based analysis, modelling and forecasting of the COVID-19 outbreak.

Authors:  Cleo Anastassopoulou; Lucia Russo; Athanasios Tsakris; Constantinos Siettos
Journal:  PLoS One       Date:  2020-03-31       Impact factor: 3.240

7.  Why is it difficult to accurately predict the COVID-19 epidemic?

Authors:  Weston C Roda; Marie B Varughese; Donglin Han; Michael Y Li
Journal:  Infect Dis Model       Date:  2020-03-25

8.  Real-time estimation and prediction of mortality caused by COVID-19 with patient information based algorithm.

Authors:  Lishi Wang; Jing Li; Sumin Guo; Ning Xie; Lan Yao; Yanhong Cao; Sara W Day; Scott C Howard; J Carolyn Graff; Tianshu Gu; Jiafu Ji; Weikuan Gu; Dianjun Sun
Journal:  Sci Total Environ       Date:  2020-04-08       Impact factor: 7.963

9.  Analysis and forecast of COVID-19 spreading in China, Italy and France.

Authors:  Duccio Fanelli; Francesco Piazza
Journal:  Chaos Solitons Fractals       Date:  2020-03-21       Impact factor: 5.944

  10 in total
  1 in total

1.  EKF-SIRD model algorithm for predicting the coronavirus (COVID-19) spreading dynamics.

Authors:  Abdennour Sebbagh; Sihem Kechida
Journal:  Sci Rep       Date:  2022-08-04       Impact factor: 4.996

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.