Literature DB >> 34335122

Grey forecasting models based on internal optimization for Novel Corona virus (COVID-19).

Akash Saxena1.   

Abstract

Pandemic forecasting has become an uphill task for the researchers on account of the paucity of sufficient data in the present times. The world is fighting with the Novel Coronavirus to save human life. In a bid to extend help to the concerned authorities, forecasting engines are invaluable assets. Considering this fact, the presented work is a proposal of two Internally Optimized Grey Prediction Models (IOGMs). These models are based on the modification of the conventional Grey Forecasting model (GM(1,1)). The IOGMs are formed by stacking infected case data with diverse overlap periods for forecasting pandemic spread at different locations in India. First, IOGM is tested using time series data. Its two models are then employed for forecasting the pandemic spread in three large Indian states namely, Rajasthan, Gujarat, Maharashtra and union territory Delhi. Several test runs are carried out to evaluate the performance of proposed grey models and conventional grey models GM(1,1) and NGM(1,1,k). It is observed that the prediction accuracies of the proposed models are satisfactory and the forecasted results align with the mean infected cases. Investigations based on the evaluation of error indices indicate that the model with a higher overlap period provides better results.
© 2021 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Corona; Grey forecasting models; Mean Absolute Percentage Error (MAPE); Optimization

Year:  2021        PMID: 34335122      PMCID: PMC8310466          DOI: 10.1016/j.asoc.2021.107735

Source DB:  PubMed          Journal:  Appl Soft Comput        ISSN: 1568-4946            Impact factor:   6.725


Background value coefficient Grey development coefficient Grey control parameter First order accumulated sequence of mean infected cases for a week considering overlap period Background value at th instant Forecasted value at th instant th Benchmark time series th In-sample for th benchmark time series th Out-sample for th benchmark time series Error tolerance Forecasted value of th element of time series Actual value of th element of time series Error in prediction of th element of time series Period of overlap Mean infected cases during time span of a week Forecasted value at instant considering 6 days of overlap Forecasted value at instant considering 5 days of overlap

Introduction

The month of December in 2019 was a watershed in the history of mankind with a series of cases reported suffering from inexplicable pneumonia at Wuhan, Hubei, China. The clinical investigation of the cases revealed their resemblance with viral pneumonia. Further, deep sequencing analysis from lower respiratory tract samples revealed that the root cause of this pneumonia is a new virus. People’s Republic of China (Centre for Disease Control) attributed the cause of this unidentified pneumonia to a novel form of Coronavirus and later World Health Organization (WHO) declared it as COVID-19. Coronaviruses came from a family of Coronaviridae and the order Nidovirales. These are enveloped non-segmented positive-sense RNA viruses that are broadly distributed in humans and other mammals [1]. The WHO declared the deadly disease as a global pandemic on March 11, 2020 [2]. The initial cluster was developed in a local seafood market in Wuhan. An epidemiological alert was issued by WHO at the end of April 2020 which spread like a wildfire world over. It ultimately turn out to be the nemesis for the mankind in the form of pandemic. With the spread of this disease, implications of the pandemic appeared as loss of life and several health-related issues. For combating such situations, many scientists have come forward to help the community by forecasting certain parameters so that authorities concerned can frame preventive healthcare policies. The prediction models based on Data science and Machine Learning Methods (MLMs) have assumed importance in providing better understanding about growth and trend of pandemic with respect to time. The forecasting of pandemic spread is quite challenging due to dependency of the pandemic spread on several factors. These factors are governed by the psychological behaviour of the community, combating strategies deployed by the authorities and of course,volatility and reliability of the data [3]. Moreover, the future does not repeat it in the same way as did in the past. Hence, considering the same policies which were employed in the past for forecasting the pandemic spread may not be applicable directly. Evolving fear amongst the population due to the enhanced death rate and concerns of politicians to take adequate steps as preventive measures create a strong perception. Hence, this pandemic has emerged as an infodemic as well. However, cutthroat competition among the vaccine manufacturers is a welcome move and it is furthering the cause of protecting the lives of people against COVID. These facts set the strong foundation for discussion and research in the area of forecasting the pandemic spread and healthcare-related parameters. Recently many prediction approaches have been reported by the researchers for fairly accurate forecasting about the spread of COVID, prediction of recovery rate and death rate. In Ref. [4], Maximum-Hasting (MH) parameter estimation method and the modified Susceptible Exposed Infectious Recovered (SEIR) model for prediction of COVID was developed. A study on the worst-hit states of India has been conducted in Ref. [5]. The study was based on system modelling and identification techniques. Time series-based approach in amalgamation with Long Short Term Associated Memory (LSTM) for prediction of COVID has been employed in Canada [6]. A similar approach based on LSTM has been reported in Ref. [7]. SIR model-based prediction has been developed in Ref. [8] for spread of Corona in Italy, China and France. Likewise, a prediction analysis of COVID based on deep learning models has also been presented in the study [9]. Real-time forecasting of COVID epidemic in China has been carried out in Ref. [10]. The Akaike Information Criterion (AIC) for model selection has been employed by the authors of Ref. [11] for comparing the SIR model and SEIR model. Along with this analysis authors also described and mentioned that forecasting a pandemic is not an easy task. From this analysis it can be observed that forecasting of pandemic spread is a current research domain and new forecasting methods are welcomed to encounter this deadly disease. In the past, various models based on grey prediction theories have been employed to predict various important parameters such as energy consumption [12], fuel consumption [13], load growth [14], [15] and many more. Research in grey forecasting theory can be subdivided into four parts broadly: Change in accumulation sequence operation. Change in Grey equations. Transformation of the original series with the use of some mathematical operators. Parameter optimization by heuristic and metaheuristic methods. Grey systems are defined as the systems that possess lacuna in information [16]. In other words, Grey systems contain some known information and a part of the information is unknown. Grey systems use accumulation operators to deal with randomness in data. Grey prediction Models (GPMs) have an inherent characteristic to transfer hidden original irregular data to strong regular data by using accumulation operation. The operator is known as Accumulating Generation Operator (AGO). For improving system performance, several research attempts have been made to formulate fractional-order accumulation operators. A concept of fractional order accumulation operator was put forward in researches [17] and [18]. An experiment of putting weights in accumulation sequence has also been conducted in the Ref. [19]. Another interesting approach based on time delay and multiple fractional-order grey system for forecasting the natural gas consumption in China was employed in Ref. [20]. Another interesting domain of research in GPMs is the transformation of the original data series into some other representative series. This transformation helps GPMs to achieve higher accuracy. In [21] a modified model of the Grey system has been presented and the transformation of the original time series was conducted with the help of logarithmic transformation. A technique based on background value optimization and data transformation was put forward in research for forecasting the energy consumption in Shanghai [22]. GPMs are based on Grey mathematical equations. Hence, the other aspect of conducted research in this area is related to change in the Grey system equations. A good example of this can be found in the development of Novel Grey Prediction Model (NGM) in Ref. [23]. An additional constant term has also been added in Grey system equation in order to overcome shortcomings of NGM and it has been named as NGM(1,1,k,c) [24]. Apparently, classical discrete model of Grey prediction can also be determined by changing the original Grey equations [25]. A novel discrete model was proposed for forecasting the emission in China [26]. NGM method has been applied for forecasting the consumption of natural gas in China [27]. An alternative approach based on integrating heuristic time series and fuzzy theory for forecasting of the renewable energy in Taiwan was described in [28]. Optimization-based approaches and especially the approaches which are based on some nature-inspired algorithms have been employed with GPMs in recent years. These approaches are based on the parameter estimation of the Grey models. A novel time delay forecasting model based on a nature-inspired optimizer was performed in [29]. Further, Refs. [12] and [30] are fine examples of such approaches. In addition to these published approaches, some experiments have been done to alter initial conditions of the Grey model in Ref. [26]. Authors of the paper employed alterable weighted coefficients in initial conditions. Another application of Grey prediction model for predicting sales in global integrated circuit industry is seen in Ref. [31]. Further, the Grey model has also been applied in power demand forecasting in Ref. [32]. The author employed residual modification with an artificial neural network for the modification of GM(1,1) model. From the literature review of GPMs, the motivation for employing GPMs in pandemic forecasting is very clear and pragmatic. However, in some studies, it has been reported that GM(1,1) models fail to predict with required accuracy when data mutate swiftly or the associated variables are volatile. In the past, it has been identified that forecasting accuracy of the grey models can be questionable when the initial conditions and starting points are not chosen correctly [15]. From this point of view, it is pragmatic that the involvement of optimization can enhance the forecasting accuracy. This involvement can be done at two levels during grey forecast. First, at the macro level by choosing the external optimization parameters such as data or selection of time series to develop different forecasting strategies. Secondly, by developing an internal optimization routine that integrates a few changes in forecast modelling and try to reduce the error between measured and sample data. Moreover, some researchers have emphasized in applying corrections in the internal parameter aggregation process by modifying the grey equation to achieve better results. Considering a variable (infectious cases) as a grey variable that increases with every passing day and mutates swiftly, is difficult to forecast. Hence, the presented work primarily focuses on an internal optimization model that aims at enhancing the forecasting accuracy by choosing internal processing parameters through the optimization process. The research objectives of the paper are as follows: To investigate the applicability of the GPMs on variety of benchmark time series data. To propose the prediction model based on internal optimization and analyse the results based on average ranks obtained by stacking the forecaster’s performance chronologically. To employ the proposed internal optimization-based model and other grey models on the real data by taking two different overlap periods and construct two grey prediction models for forecasting pandemic spread in different hot-spots of India. To evaluate the performance of these pandemic forecasting models based on error indices obtained for individual cities, average values of error for a particular city for both models. Remaining part of the paper is organized as follows: In Section 2, proposal of IOGM is presented and explained. In Section 3, verification of proposed model is conducted on benchmark data series and comparative analysis is presented. Section 4, presents the simulation and results analysis of proposed grey models on different parts of India. Finally, Section 5, presents the concluding remarks of the work and throws light on the future directions of the research work.

Grey internal optimization prediction model

In this section, basics of GPM and its application for COVID-19 spread prediction are explained.

GM(1,1) model

Based on the above explanation, following mathematical expressions are considered for the proposed grey forecasting model. Following steps are followed for constructing forecasting engine. The mean values of infected cases is considered in construction the initial time series. is representative denominator of time series. For the evolution of this series, successive elements are calculated based on overlap period of one week (7 days). In general, consider a time series having an overlap period “” for the duration ’’. The time series comprises of ‘k’ elements. The representation of this data series can be given as follows: By obtaining a one-time accumulating generation operation, the following series can be generated. Where m=1, 2, 3.....k. Where ‘a’ is Grey development coefficient and ‘b’ is Grey control parameter (driving coefficient). It is to be noted that the value of ‘a’ has a potential impact on background values (Z) of Grey derivatives, hence, the forecasted values get compromised due to the large value of ‘a’. Where m=2, 3.....k. Here, is the background value production coefficient. The values of this coefficient should be optimized between an interval of [0, 1]. Further, the native Grey Model (1, 1) can be derived while keeping . The following expression is for background values of grey derivatives for the native GM(1,1) model. The expression (4) can be solved with the help of least square estimation method and expression for Grey development coefficient and driving coefficient can be expressed as follows: in simplified form it can be written as The solution of Eq. (4) can be written as Where is the associated value. For obtaining predicted values of original time series Inverse Accumulation generation operation is required and can be represented as per following equations. This expression holds good for m=1. Generalized equation can be written as Eq. (13) is the generalized expression for m 2, 3, …...k. after rearranging the expressions one can get a generalized expression for forecasted values at instance. From these expressions, it can be concluded that the internal optimization of tunable parameters can have a potential impact on the forecast accuracy. Hence, these parameters should be tuned properly. The following subsection presents a discussion on the need of this optimization.

Discussion

After considering the facts involved in the development of a forecasting engine it appears that there is a potential impact of internal parameters on the accuracy of the forecast. In references  [33], [34], it has been pointed out that near accurate estimation of the background values can be expressed as per Eq. (5). The relationship between background value production coefficient and development coefficient can be defined as follows: Further, Chang et al. [35] proved that proper optimization of background value production coefficient can enhance the forecasting accuracy. A detailed explanation regarding this can be found in Ref. [36]. By using the L-Hopital rule as applied in [15] it can be concluded that for diverse values of ‘a’ the parameter revolves around 0.5 value. Moreover, it can be said that higher values of ‘a’ can lead to erroneous results. As ‘a’ approaches to zero, approaches to 0.5. Fig. 1 exhibits this relationship where 16 samples of ‘a’ are considered between span [−1,1] and is calculated as per expression (15). As indicated in different researches, larger values of ‘a’ yield erroneous forecast because of greater difference between and a. Hence, in a defined search (objective) space the error function is employed to bridge this difference through optimization.
Fig. 1

Relationship between a and .

Relationship between a and . Flow Diagram of Proposed Internal Optimization Grey Models. Proposed Grey Forecasting Models based on Internal Optimization.

Proposed internal optimization model

Based on the discussion in previous subsection, an optimization routine is formulated for predicting COVID-19 infected cases. Following are the steps involved in constructing the model. Start the iterative search by taking , while taking the model becomes conventional grey model. Calculate the background values as per Eq. (5) and further calculate values of a and b. Now by substituting the values of ‘a’, in expression (15) new value of that is designated as is obtained. Calculate the absolute error between obtained and initial value i.e. (0.5), if the value of error is greater than tolerance then stop the loop otherwise perturb the value of . The absolute error can be defined for two successive iterations by , where ‘t’ denotes current iteration. Now, update as Where is perturbed value, again calculate the values of a and b from the expression (5)–(7) and compute the absolute error between and . If the error reduces then increase the loop counter by 1 and accept the perturbation vector and append as as in same direction. Otherwise reject the perturbation value and assign opposite perturbation (). Now, update the alpha as and . Repeat the process, till the error between the successive iterations of the becomes less than tolerance value. Now the optimized model can be realized with the modified as represented by Eq. (14). For simulating the time series and prediction, is taken 10E-8 in this work. In this work, an internal optimization scheme is employed for forecasting COVID-19 cases. The flow of algorithm along with data stacking process are shown in Fig. 2, Fig. 3 respectively. Following steps are considered for framing GPMs for forecasting the pandemic:
Fig. 2

Flow Diagram of Proposed Internal Optimization Grey Models.

Fig. 3

Proposed Grey Forecasting Models based on Internal Optimization.

For staking data into Model-I and Model-II, two different overlap periods (=5 and 6) are considered. The data of three different states and Delhi are depicted in the result Section 4. Further, the data stacked in model array are segregated into two parts simulation and validation (forecasted) parts. On the basis of tolerance value ’’ obtained from simulated data, grey models have been constructed and coefficients ‘a’ and ‘b’ are calculated. However, it is quite necessary to judge the forecasting performance of the IOGM in comparison to classical GM model and Novel Grey Model (NGM) on benchmark time series. Following section depicts this analysis in depth.

Verification of proposed optimization model with benchmark time series data

For understanding the impact of internal optimization, let us consider a homogeneous Geometric Progression data series. This series is defined as follows: While forecasting the next value of series from GM(1,1) model, is set to 0.5. From this value of , undermentioned series is obtained. For this experiment obtained value of a is -0.667. The forecasted value of the series is written in bold face. From this forecasted value (201.56), it can be observed that the error is very high. A huge difference exists between actual value of the series (256) and the forecasted value of GM(1,1) model. Further, using optimized model, following series is obtained. For this model, value of is 0.5573 and value of a is -0.6931. It is observed that forecasted value is 255.9963. This value is quite close to the actual value as compared to non-optimized model. This fact validates the necessity of internal optimization for improving the forecasting accuracy of the conventional GM(1,1). Further, for verification of the internal optimized model of grey forecasting, certain benchmarks time series are used here from [37] and same are defined as under: Homogeneous Exponential Sequence (B1): The series can be identified with the help of following formula: Non-homogeneous Exponential Sequence (B2): The series can be identified with the help of following formula: Approximate Non-homogeneous Exponential Sequence (B3): The series can be identified with the help of following formula: Random Number Sequence (B4): The series can be identified as per following sequence: For evaluation of the performance of the proposed IOGM, simulations are carried out on B1–B4. For a better understanding of the computed forecasting accuracy, the whole process is subdivided into two parts. In the first part, 10 out of 15 samples are considered for building the grey architecture and for calculation of internal parameters of the forecaster. In the second part, remaining five samples of each series are employed as Out-Samples for evaluating the performance of the forecaster on the basis of error and Mean Absolute Percentage Error (MAPE). Both of these indices are defined as under: Table 1 shows the results of B1. Similar to previously reported results on Geometric Progression Series, it is observed that the IOGM model is better as far as simulated value error analysis is concerned. It is further observed that NGM is not suitable for this kind of time series. Grey models are developed with current series having 10 data points. Values of ‘a ‘and ‘b’ are provided along with the name of models in respective rows. Further, as per the analysis conducted on forecasted values, MAPE of models have been calculated and it is observed that the IOGM model gives the best results as MAPE values are optimal for this model.
Table 1

Simulated and Forecasted results of different Grey Models on B1.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.400, b = 0.9600
a = −0.3867, b = 0.2266
a = −0.4055, b = 0.9731
Simulated valueSimulated valueSimulated value
XB1in(1)1.21.201.201.20

XB1in(2)1.81.7705689121.635060.97269080545.961621.80

XB1in(3)2.72.6413784312.1711691.7086130936.718032.70

XB1in(4)4.053.9404735792.7043562.79198504831.06214.050

XB1in(5)6.0755.8784958063.2346374.38684747427.788526.0750

XB1in(6)9.11258.7696852283.7620286.73468944726.093949.11250

XB1in(7)13.6687513.082833014.28654410.1910138625.4429713.668750

XB1in(8)20.50312519.517293414.80820215.2791665325.4788420.5031250

XB1in(9)30.754687529.116380335.32701622.7695796525.9638730.75468750

XB1in(10)46.1320312543.436535295.84300333.7964281226.7397846.132031250

MAPE3.75244612930.138853120

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

XB1out(1)69.1980468864.79986.35602750.0293628627.7011969.197209020.001211

XB1out(2)103.797070396.66996.86644773.9263240828.77802103.79570880.001312

XB1out(3)155.6956055144.21457.374072109.105714929.9237155.69340610.001413

XB1out(4)233.5434082215.14277.878924160.894288731.10733233.53987350.001514

XB1out(5)350.3151123320.95518.381029237.133709332.30846350.30945680.001614

MAPE7.3729.963741160.001412603
Simulated and Forecasted results of different Grey Models on B1. Further, Table 2 shows results of B2. High errors are there for the non-homogeneous exponential model. The coefficients and internal parameters have been calculated based on In-Samples and the rest five Out-Samples are simulated with the grey equations of associated models. It is observed that MAPE for simulated values is optimal for IOGM. MAPE for forecasted values is also very competitive. However, NGM model possesses the optimal value of MAPE for forecasted values.
Table 2

Simulated and Forecasted results of different Grey Models on B2.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.5544, b = −2.1190
a = −0.5622, b = −0.7212
a = −0.5693, b = −2.1749
Simulated valueSimulated valueSimulated value
XB2in(1)5.845.8405.8405.840

XB2in(2)7.7121.49536150580.609942.99963909561.104261.54946035179.90845

XB2in(3)11.08162.60337540876.507224.29502213461.241862.73802954375.29211

XB2in(4)17.14694.53239132573.567286.56778943461.696934.83833340571.78304

XB2in(5)28.06447.89074486371.8834410.5553908262.388688.54975075169.53524

XB2in(6)47.715913.7375283971.2097517.5516921263.2162615.1081440268.3373

XB2in(7)83.088623.9165870271.2155629.8267986564.1024226.6973883167.86877

XB2in(8)146.759541.6379947571.6284251.3636413565.0014947.176578567.8545

XB2in(9)261.367172.4903852472.2649289.150327165.8907683.3650667968.10422

XB2in(10)467.66126.203386773.01386155.447585366.76056147.31323430

MAPE73.5444870663.4892464163.18707065

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

XB2out(1)838.989331219.715963173.81183271.767033667.60781260.315139568.97277

XB2out(2)1507.380796382.518296174.62365475.851100368.43192459.999213169.48354

XB2out(3)2710.485433665.951826275.43053833.919393169.23358812.858124670.01061

XB2out(4)4876.0737791159.4003276.222671462.15512170.013681436.39013370.54208

XB2out(5)8774.1328012018.47798776.995132564.40331970.773142538.22475371.0715

MAPE75.416760469.2120264770.01609993
Simulated and Forecasted results of different Grey Models on B2. The B3 benchmark consists of an approximate non-homogeneous model, the simulated results are shown in Table 3 . From the Table, it can be observed that MAPE value is optimal for IOGM (7.63). However, it is worth mentioning here that other competitive models NGM and GM possess high simulation errors for this sequence. This fact indicates that the IOGM model suits well for this kind of time series. Further, inspecting the forecasted results of Out-Samples, it can be concluded that the IOGM model possesses least MAPE. On the other hand, the NGM model possesses highest simulated and forecasted errors.
Table 3

Simulated and Forecasted results of different Grey Models on B3.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.3026, b = 1.7313
a = −0.2810, b = 0.4217
a = −0.3049, b = 1.7449
Simulated valueSimulated valueSimulated value
XB3in(1)3.323.3203.3203.320

XB3in(2)4.2483.19490800524.79031.79599968157.721293.22359079424.11509

XB3in(3)5.16724.32383572416.321492.86554485144.543574.37286010415.37273

XB3in(4)5.60415.8516725164.4177034.28203850923.590975.9318650265.848665

XB3in(5)8.62177.9193737758.1460296.15802686328.575268.046683836.669406

XB3in(6)11.408410.717701796.0542958.64256485324.2438510.915474374.320725

XB3in(7)15.413814.504825115.8971511.9330591722.5819814.807041423.936463

XB3in(8)20.8858319.630136736.01217816.2909530421.9999720.086023573.829421

XB3in(9)28.559426.56648846.97812822.0624989322.7487327.24705984.595125

XB3in(10)39.303135.953815078.52168129.7062697324.4174936.961136930

MAPE9.68210684230.047011257.631959339

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

XB3out(1)55.2443448.6581741211.9218839.8295942827.9028550.141242579.237322

XB3out(2)75.9020865.8516461713.2413253.2368120629.8611968.0175162510.38781

XB3out(3)106.782989.1204691116.540570.993181333.5163492.2670097513.59383

XB3out(4)146.3561120.611381417.5904794.5095180835.42495125.16189314.48126

XB3out(5)203.8385163.229676419.92206125.654291438.35596169.78440616.70641

MAPE15.8432453433.0122593112.88132694
Simulated and Forecasted results of different Grey Models on B3. To test the applicability of the IOGM model further, a random sequence time series is considered. Comparative analysis of the performance of different grey models is depicted through errors in simulated and forecasted values of the series in Table 4. In-Samples are employed for building the grey prediction models, by observing the individual error component and accumulated MAPE for the simulation model. It is observed that the NGM model is not suitable for forecasting the random samples. Further, the GM model also possesses higher MAPE as compared to the IOGM model. By comparing the MAPEs of these models it can be concluded that the IOGM model possesses optimal MAPE.
Table 4

Simulated and Forecasted results of different Grey Models on B4.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.0854, b = 33.9309
a = 0.3757, b = 29.7504
a = −0.0854, b = 33.9549
Simulated valueSimulated valueSimulated value
XB4in(1)78.357178.3571078.3571078.35710

XB4in(2)35.089442.4083090820.8578913.4348563261.7124942.4376314120.94146

XB4in(3)40.804546.1894451213.1969434.0277931216.6077446.2232877713.27988

XB4in(4)48.91250.307708252.85350948.171219521.51451750.346644282.933113

XB4in(5)58.035254.793156815.58633957.885060360.25870454.837825535.509371

XB4in(6)66.45359.6785291510.1943864.556619372.85371759.7296433910.11746

XB4in(7)77.883164.9994825716.5422569.1387100611.2275865.0578367216.46733

XB4in(8)68.230870.794853613.75791272.285733875.94296770.861332473.855345

XB4in(9)73.00277.106941465.62305374.447139991.9795977.182530085.726597

XB4in(10)79.354183.981816745.8317375.931614434.31292884.067611240

MAPE9.38266716911.823360458.758949572

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

XB4out(1)79.481691.469657715.0828176.951165713.18367391.5668772715.20513

XB4out(2)105.757499.625116555.79844477.651403326.5759199.735116645.694432

XB4out(3)95.2562108.507718313.9114578.1323331917.97664108.632005214.04193

XB4out(4)119.923118.18229511.45151978.4626404534.57248118.32254231.334571

XB4out(5)133.4256128.71945983.52716478.6894986541.02369128.87752553.408697

MAPE7.95427707724.666480427.93695043
Simulated and Forecasted results of different Grey Models on B4. Comparative analysis of Forecasting engines on the basis of rank. By comparing the results, on all benchmarks, on the basis of MAPE and arranging ranks on the basis of performance, one can calculate the average rank obtained from models. It can be concluded that IOGM secured 1.25 average rank as the performance of IOGM is better than other models on three out of four benchmarks. NGM performance is very weak on these benchmarks as the average rank possessed by this model is 2.5. Except, non-homogeneous model, performance of NGM is comparatively weak. Further, the average rank obtained by the GM model is 2.25. The same analysis is depicted in Fig. 4, where (a) segment shows the average rank and the rank obtained on the basis of MAPE of simulated and forecasted data of benchmark time series. Segment (b) shows the graphical representation of rank-based comparison of forecasters for simulated data. Likewise, segment (c) and (d) show rank-based analysis of forecasted data and MAPEs of forecasting engines respectively. From this analysis, it can be concluded that for all types of time series, the performance of IOGM is satisfactory. Following points support the argument for choosing IOGM for COVID-19 forecasting:
Fig. 4

Comparative analysis of Forecasting engines on the basis of rank.

It is observed that the overall performance of IOGM is competitive with conventional GM and NGM models. This is on the basis of average rank obtained in simulated data (In-Samples) and forecasted data (Out-Samples). Application of IOGM on previously reported approaches, motivated the author to employ this prediction theory for forecasting the pandemic growth in terms of reported infected cases. As it is indicated in the results of the benchmark data series that IOGM is compatible for all types of data and it can give fruitful results. Further, it is apparent from the results that the average rank method is a suitable criterion for evaluating the performance of different prediction models. Based on the results on known time-series data, the following section presents an application of conventional GM, NGM and proposed IOGMs for forecasting the spread of pandemic at different locations in India.

Simulation results

The proposed internal optimization based model is implemented to predict the infected cases in different states (Gujarat, Rajasthan and Maharashtra) and union territory Delhi. The data for this study has been taken from [38],[39]. For better understanding, it is to be noted here that variable indicates the mean values of infected cases of COVID-19 for the duration of the first week of April to the Second week of May 2020. Time series is constructed by excluding two values for Model-1 and excluding only one entry for Model-2. For considering the uncertainty and unavailability of the data in a few cases, the mean of available data is considered for the forecast. A few points may be noted here: Forecasting of such time series is quite difficult due to the unavailability of data. Also, at the initial stage, the value of the variable is quite small and it takes large abrupt changes at a later stage. This change sometimes is quite higher than the previous value. Also, it is to be noted here that the forecasting of the day ahead cases are merely meaningful as this short time forecast will give very less time to authorities for taking any preventive action. On the basis of this fact, the work reported in this paper addresses two representative models that can forecast weekly mean infected cases. Forecast, on the basis of mean values of the infected cases, can be helpful for authorities to cater to the needs of the patients and to think on needed health care and infrastructure. The advantage associated with these models is that they can give predictions of infected cases for an upcoming week.

Results of Model-I

On the basis of the discussion and steps presented in Section 2.3, two models are constructed. These are named as Model-I and Model-II. As shown in Fig. 3, the period of overlap is 5 days, which indicates that new time series element consists of 5 same values and last two are replaced by new values of infected case. The results of Model-I are shown in Table 5, Table 6, Table 7, Table 8 for Rajasthan, Maharashtra, Delhi and Gujarat respectively.
Table 5

Simulated and Forecasted results for Rajasthan on Model-I.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.1411, b = 596.2662
a = 0.0154, b = 227.0934
a = −0.14135, b = 596.3051
Simulated valueSimulated valueSimulated value
Cˆ(W,5)(0)(1)355.5714286355.57142860355.57142860355.57142860

Cˆ(W,5)(0)(2)503.8571429694.26204937.78946332.870953933.93545695.321962337.99982

Cˆ(W,5)(0)(3)685799.46571616.71032553.124543419.25189800.867572616.91497

Cˆ(W,5)(0)(4)871.8571429920.61121885.59198770.005705811.68212922.43435935.80109

Cˆ(W,5)(0)(5)1061.4285711060.1142730.123824983.56607837.3356321062.4542390.096631

Cˆ(W,5)(0)(6)1256.1428571220.7566542.8170521193.8565074.958541223.7282772.580485

Cˆ(W,5)(0)(7)1493.7142861405.7416715.8895211400.9270616.2118461409.4827265.639068

Cˆ(W,5)(0)(8)1727.7142861618.7580376.3063811604.827047.1127061623.4335616.035762

Cˆ(W,5)(0)(9)1933.2857141864.0534293.5810681805.6049916.6043381869.8608213.280679

Cˆ(W,5)(0)(10)2111.7142862146.5191871.6481822003.3087185.1335342153.6942281.987956

Cˆ(W,5)(0)(11)2278.5714292471.7878538.4797182197.9852913.5366962480.6118058.866976

MAPE8.0852283949.6147961698.109403844

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

Cˆ(W,5)(0)(12)2467.2857142846.3454815.363432389.6810623.1453452853.65564215.65972

Cˆ(W,5)(0)(13)2681.5714293277.6609822.229112578.441673.845873286.8230722.57078

Cˆ(W,5)(0)(14)2920.5714293774.33504529.232762764.3120585.3503013785.74265829.62335

Cˆ(W,5)(0)(15)3171.4285714346.27166237.04462947.3364797.0659674360.39517937.48994

Cˆ(W,5)(0)(16)3437.7142865004.87559645.587313127.5585119.0221515022.2764346.09348

MAPE29.891442425.68592693230.28745404
Table 6

Simulated and Forecasted results for Maharashtra on Model-I.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.1884, b = 1316.9084
a = −0.0955, b = 431.4991
a = −0.1890, b = 1320.8668
Simulated valueSimulated valueSimulated value
Cˆ(W,5)(0)(1)1028.1428571028.14285701028.14285701028.1428570

Cˆ(W,5)(0)(2)1386.4285711662.27557319.89623778.535837843.845951667.74178320.29049

Cˆ(W,5)(0)(3)1834.7142862006.8841789.3840171309.28701328.63812014.6066089.804923

Cˆ(W,5)(0)(4)2352.5714292422.934062.9908821893.20263219.526242433.6140223.444852

Cˆ(W,5)(0)(5)2872.4285712925.2358081.8384182535.60808511.725982939.7685792.344358

Cˆ(W,5)(0)(6)3522.4285713531.6704130.2623713242.36227.9509453551.1955560.81668

Cˆ(W,5)(0)(7)4274.8571434263.8257980.2580524019.9106735.9638594289.7900080.349318

Cˆ(W,5)(0)(8)5234.7142865147.7653111.6610074875.3448556.865125182.0008281.006998

Cˆ(W,5)(0)(9)63556214.9555232.203695816.4664248.4741716259.7778761.498381

Cˆ(W,5)(0)(10)7500.4285717503.3863860.0394356851.8585418.6471067561.7160930.81712

Cˆ(W,5)(0)(11)8690.5714299058.9235994.2385267990.9641268.0501889134.4375795.107445

MAPE3.88842059813.607969094.134597266

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

Cˆ(W,5)(0)(12)10027.2857110936.941349.0718039244.171987.80982811034.2611210.04235

Cˆ(W,5)(0)(13)11578.2857113204.2934914.043610622.911538.2514313329.2189515.12256

Cˆ(W,5)(0)(14)13442.5714315941.693518.591112139.757099.69170516101.4929719.77986

Cˆ(W,5)(0)(15)15590.1428619246.5876123.4535713808.5424811.4277419450.3576524.76061

Cˆ(W,5)(0)(16)18037.1428623236.623828.8265215644.4872713.2651623495.7350530.26306

MAPE18.7973182810.0891714719.99368955
Table 7

Simulated and Forecasted results for Delhi on Model-I.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.1199, b = 965.4484
a = 0.1151, b = 461.3211
a = −0.1201, b = 966.620
Simulated valueSimulated valueSimulated value
Cˆ(W,5)(0)(1)744.8571429744.85714290744.85714290744.85714290

Cˆ(W,5)(0)(2)968.42857141120.64855315.71825576.827109940.436791122.08921515.86701

Cˆ(W,5)(0)(3)12391263.4123521.970327949.865880623.336091265.2163582.115929

Cˆ(W,5)(0)(4)1459.8571431424.3633882.4313171282.34268412.159711426.5999622.278112

Cˆ(W,5)(0)(5)1698.8571431605.8186065.4765371578.6679827.0747071608.5687165.314657

Cˆ(W,5)(0)(6)1865.4285711810.3901142.9504461842.772671.2145151813.7483412.770421

Cˆ(W,5)(0)(7)2066.2857142041.0227851.2226252078.160220.5746792045.099481.025329

Cˆ(W,5)(0)(8)2286.1428572301.0366530.6514812287.9531590.0791862305.9604180.866856

Cˆ(W,5)(0)(9)2563.5714292594.1747041.1937752474.934493.4575572600.0952531.424724

Cˆ(W,5)(0)(10)2899.1428572924.6567570.880052641.5846098.8839452931.7482091.124655

MAPE3.2494803579.7217182443.278769158

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

Cˆ(W,5)(0)(11)3236.7142863297.240211.869982790.11420713.797953305.7048772.131501

Cˆ(W,5)(0)(12)3683.5714293717.2885260.9153372922.493620.661413727.3612731.188788

Cˆ(W,5)(0)(13)41954190.8484370.0989653040.47886327.521364202.8016940.185976

Cˆ(W,5)(0)(14)4846.1428574724.7369962.5052063145.63512635.089924738.8865172.213231

Cˆ(W,5)(0)(15)5560.4285715326.6397054.2045123239.35733641.742675343.3511873.903969

Cˆ(W,5)(0)(16)6233.1428576005.2211523.656613322.88876346.696024.9178383.34061

MAPE2.208434930.917217222.160679097
Table 8

Simulated and Forecasted results for Gujarat on Model-I.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.1538, b = 1806.6475
a = 0.4067, b = 1634.4552
a = −0.1540, b = 1810.2249
Simulated valueSimulated valueSimulated value
Cˆ(W,5)(0)(1)1784.7142861784.71428601784.71428601784.7142860

Cˆ(W,5)(0)(2)2234.1428572249.7882150.7002851463.37854134.499332254.190280.897321

Cˆ(W,5)(0)(3)2650.8571432623.9025351.0168262317.28944112.583392629.4924190.805955

Cˆ(W,5)(0)(4)3077.1428573060.2278330.5496992885.8585556.2162963067.2789430.320554

Cˆ(W,5)(0)(5)3569.4285713569.1090910.008953264.4353358.5445963577.9529350.238816

MAPE0.45515202912.368722340.452529218

Forecasted results

Out-sampleSimulated ValueErrorSimulated ValueErrorSimulated ValueError

Cˆ(W,5)(0)(6)4125.1428574162.6115430.90833516.50737714.754294173.6494921.175878

Cˆ(W,5)(0)(7)4751.2857144854.8067372.17883684.34734422.455784868.5240972.467509

Cˆ(W,5)(0)(8)5467.5714295662.1061573.5579733796.10212130.57065679.0889913.868583

Cˆ(W,5)(0)(9)6224.4285716603.6503366.0924753870.51306337.817386624.6055536.429136

Cˆ(W,5)(0)(10)7011.1428577701.7626579.8503173920.05893944.088167727.54200610.21801

MAPE4.51757319829.937240994.831822784

Forecasted results of Rajasthan

Forecasted results of model-I for the state of Rajasthan are depicted in Table 5. The data of reported infected cases have been segregated into two parts. First, 11 samples are taken as simulated data and the remaining five samples are considered for validation of grey models and for generating forecasts. The data of 6th April 2020–12th April 2020 is considered as and the data of mean infected cases during 26th April 2020 to 5th May 2020 is denoted as . Simulation process with defined time series with 11 data points provides the coefficient ‘a’ and ‘b’ for representative models of GM, NGM and IOGM. These values are depicted with corresponding grey models. From the obtained results, it can be concluded that proposed model gives competitive performance as per Lewis’ criterion for model evaluation [30]. This criterion has been used for evaluating the performance of the grey models on the basis of calculated MAPE. Simulated and forecasted results of these grey models are depicted in Fig. 5. It is observed that the NGM model gives better results as the increment in the forecasted values are at a comparatively low exponential rate. Further, errors in the forecasting and simulation process have been encapsulated in Fig. 6.
Fig. 5

Forecasted Results of Proposed Grey Model-I for different states and Delhi.

Fig. 6

Error of Proposed Grey Model-I for different states and Delhi.

Simulated and Forecasted results for Rajasthan on Model-I.

Forecasted results of Maharashtra

Table 6 shows the comparative analysis of different grey forecasters with the proposed IOGM. The analysis is being depicted through the calculation of MAPE for simulated data and forecasted data. Data of the infected cases for the state of Maharashtra are employed for the construction of IOGM and other forecasters. The first 11 samples are taken for constructing grey architectures and the internal parameters (a, b) of these, are depicted with respective grey forecasters. For simplification, it is to be noted that the data sample from 6th April 2020 to 12th April 2020 is considered as while represents the data of mean infected cases during 26th April 2020 to 5th May 2020. After careful evaluation of the results, it can be concluded that for this particular data, IOGM’s performance indicator i.e. MAPE is competitive (4.134597266). However, it can be concluded from this result that further improvement in the performance of IOGM may be possible, if the overlap period is varied. It is observed that the NGM model gives better results in this case, as the forecasted values increase at a comparatively lower exponential rate. Graphical representation of simulated and forecasted results along with errors in forecasting and simulation process of this model for Maharashtra state have been encapsulated in Fig. 5, Fig. 6 respectively. Simulated and Forecasted results for Maharashtra on Model-I.

Forecasted results of Delhi

Results of grey models along with proposed IOGM (model-I) for Delhi are depicted in Table 7. Data of infected cases have been segregated into two parts, the first 10 samples are taken as simulated data and the remaining 6 samples have been considered for validation and forecasting purposes. The data from 7th April 2020 to 13th April 2020 are depicted in Table 7 as and the data of mean infected cases during 25th April 2020 to 1st May 2020 are denoted as . Coefficients of constructed grey models are depicted along with the models. In addition to that error in the prediction of each sample is also shown in this analysis. Careful observation of Table 7 yields the fact that the proposed model exhibits competitive performance as values of MAPEs are competitive (3.278769158) and (2.160679097) for simulated and forecasted data. Depiction of forecasting performance and errors is exhibited in Fig. 5, Fig. 6. It is observed that the IOGM model gives better results in this case, as the forecasted values increase with a comparatively higher exponential rate. Simulated and Forecasted results for Delhi on Model-I.

Forecasted results of Gujarat

Forecasted results of model-I for Gujarat state are depicted in Table 8. Similar to the results reported in the previous subsection for different states, the data of infected cases have been subdivided into two parts. First 5 samples are taken as simulated data and the remaining 5 samples are considered for validation and forecasting purpose. The time series of this model can be identified as - . The mean value of infected cases from 18th April 2020 to 24th April 2020 is considered as the first element of the time series and the mean value of infected cases from 21st April 2020 to 27th April 2020 is considered as the last element of the time series. Likewise, the data employed for simulation, yield the coefficients ‘a’ and ‘b’ for representative models of GM, NGM and IOGM respectively. The entries of these parameters are also depicted. Inspecting the obtained results, it is easily predictable that the proposed model gives a competitive performance as values of MAPEs are quite competitive. Proposed IOGM exhibits superior performance with simulated MAPE (0.4525292188769158) and forecasted MAPE (4.831822784). Simulated and forecasted results of these models are depicted in Fig. 5. It is observed that the IOGM model gives competitive results in this case as the forecasted values increase with a comparatively higher exponential rate. Errors in the forecasting and simulation process of this model for the state of Gujarat have been exhibited in Fig. 6. Simulated and Forecasted results for Gujarat on Model-I. Forecasted Results of Proposed Grey Model-I for different states and Delhi. Error of Proposed Grey Model-I for different states and Delhi.

Results of Model-II

The results of proposed Model-II are presented in this section. The difference between this model and the first model is that it employs an extended overlap period. The compilation of forecasted results is presented in Table 9, Table 10, Table 11, Table 12 for Delhi, Maharashtra, Rajasthan and Gujarat respectively.
Table 9

Simulated and Forecasted results for Delhi on Model-II.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.0611, b = 904.781
a = 0.0031, b = 159.4292
a = −0.0611, b = 904.9924
Simulated valueSimulated valueSimulated value
Cˆ(W,6)(0)(1)664664066406640

Cˆ(W,6)(0)(2)744.8571429974.773571930.86718236.769797168.21272975.089044130.90954

Cˆ(W,6)(0)(3)8351036.20660224.0966395.223538752.667841036.56155924.13911

Cˆ(W,6)(0)(4)968.42857141101.51131813.74213553.188979142.877671101.90948413.78325

Cˆ(W,6)(0)(5)1109.1428571170.9317245.570866710.66762335.926411171.3771375.611025

Cˆ(W,6)(0)(6)12391244.7272040.462244867.660970729.970871245.2242380.50236

Cˆ(W,6)(0)(7)13451323.1734871.6227891024.17051823.853491323.7268791.581645

Cˆ(W,6)(0)(8)1459.8571431406.563683.6505941180.19775419.156631407.178563.608475

Cˆ(W,6)(0)(9)1577.5714291495.2093635.2208141335.74416815.329081495.8912845.177588

Cˆ(W,6)(0)(10)1698.8571431589.4417516.4405291490.81123912.246231590.1967226.396089

Cˆ(W,6)(0)(11)1780.4285711689.6129355.1007741645.4004467.5840241690.4474555.053902

Cˆ(W,6)(0)(12)1865.4285711796.0971943.7166461799.5132613.5335211797.0182933.667269

Cˆ(W,6)(0)(13)1961.1428571909.2923972.643891953.1511520.4075021910.3076742.59212

Cˆ(W,6)(0)(14)2066.2857142029.6214871.7744032106.3155831.9372862030.7391541.720312

Cˆ(W,6)(0)(15)2181.5714292157.5340621.1018372259.0080123.5495782158.7629941.045505

Cˆ(W,6)(0)(16)2286.1428572293.5080550.3221672411.2298955.4715322294.8578370.381209

Cˆ(W,6)(0)(17)2416.8571432438.0515190.876942562.982686.0460972439.5325030.938217

Cˆ(W,6)(0)(18)2563.5714292591.7045271.0974182714.2678155.8783772593.3278921.160742

Cˆ(W,6)(0)(19)27292755.0411890.9542392865.0867394.9866892756.8189991.019384

Cˆ(W,6)(0)(20)2899.1428572928.6717971.018543015.4408914.0114632930.6170721.085639

Cˆ(W,6)(0)(21)3061.8571433113.2451041.6783273165.3317013.379473115.3718931.747787

Cˆ(W,6)(0)(22)3236.7142863309.4507512.2472323314.7605982.4112823311.7742082.319016

Cˆ(W,6)(0)(23)3450.5714293518.0218421.9547613463.7290060.3813163520.5583092.028269

Cˆ(W,6)(0)(24)3683.5714293739.7376811.5247773612.2383421.936523742.5047811.599897

Cˆ(W,6)(0)(25)3939.2857143975.426690.917453760.2900234.5438623978.4434190.994031

Cˆ(W,6)(0)(26)41954225.9694970.7382483907.8854596.8442084229.2563310.816599

Cˆ(W,6)(0)(27)44944492.3022310.0377794055.0260559.7680014495.8812350.041861

MAPE4.42145112413.811543334.441512436

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

Cˆ(W,6)(0)(28)4846.1428574775.4200181.4593634201.71321213.297784779.3149671.378991

Cˆ(W,6)(0)(29)5214.7142865076.3807022.6527554347.9483316.621545080.6172042.571514

Cˆ(W,6)(0)(30)5560.4285715396.3087922.9515674493.732819.18375400.9144312.868738

Cˆ(W,6)(0)(31)5899.5714295736.399672.7658244639.06801121.366025741.4041482.680996

Cˆ(W,6)(0)(32)6233.1428576097.9240512.1693524783.95534723.249716103.3593512.082152

MAPE2.39977225218.743749812.316478228
Table 10

Simulated and Forecasted results for Maharashtra on Model-II.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.0845, b = 1604.26
a = −0.0468, b = 236.57
a = −0.0845, b = 1604.921
Simulated valueSimulated valueSimulated value
Cˆ(W,6)(0)(1)1028.1428571028.14285701028.14285701028.1428570

Cˆ(W,6)(0)(2)1209.7142861764.74618445.88124411.552296965.979381765.35226345.93134

Cˆ(W,6)(0)(3)1386.4285711920.44304638.51727673.44098651.42621921.02560338.55929

Cˆ(W,6)(0)(4)1596.2857142089.87645330.9212947.864460340.620632090.42662230.95567

Cˆ(W,6)(0)(5)1834.7142862274.25832623.957081235.42267332.664032274.76586323.98475

Cˆ(W,6)(0)(6)2089.5714292474.90751318.440911536.74429126.456482475.36061518.4626

Cˆ(W,6)(0)(7)2352.5714292693.25921614.481511852.48807521.256882693.64433314.49788

Cˆ(W,6)(0)(8)2602.4285712930.87526212.620782183.34431116.103582931.17687512.63237

Cˆ(W,6)(0)(9)2872.4285713189.4552711.036892530.03632811.919963189.65565211.04386

Cˆ(W,6)(0)(10)3189.2857143470.8488128.8284062893.3220759.2799353470.9277598.830882

Cˆ(W,6)(0)(11)3522.4285713777.0686397.2291053273.9957767.0528843777.0031697.227247

Cˆ(W,6)(0)(12)3884.4285714110.305085.8149223672.8896745.4458184110.0691025.808847

Cˆ(W,6)(0)(13)4274.8571434472.9417084.6337124090.875844.30384472.5056534.623511

Cˆ(W,6)(0)(14)4735.5714294867.5723912.7874354528.868094.3649084866.9027992.773295

Cˆ(W,6)(0)(15)5234.7142865297.0198411.1902384987.8239744.7164055296.0789081.172263

Cˆ(W,6)(0)(16)5802.8571435764.3558120.6634895468.7468755.7576865763.1008790.685115

Cˆ(W,6)(0)(17)63556272.9230631.2915335972.6882036.0159216271.3060591.316978

Cˆ(W,6)(0)(18)6915.1428576826.3592741.2839016500.7496875.9925476824.3260891.313303

Cˆ(W,6)(0)(19)7500.4285717428.6230620.9573527054.0857925.9508977426.1128590.99082

Cˆ(W,6)(0)(20)8109.4285718084.0223010.3132937633.9062385.863828080.9667470.350972

Cˆ(W,6)(0)(21)8690.5714298797.2449281.2274628241.4786465.1675868793.5673491.185146

Cˆ(W,6)(0)(22)9360.4285719573.3924832.2751518878.1313075.1525139569.006922.228299

Cˆ(W,6)(0)(23)10027.2857110418.016593.8966769545.2560924.80717910412.826763.844919

Cˆ(W,6)(0)(24)10728.1428611337.158695.67680610244.311494.50992711331.056835.619929

Cˆ(W,6)(0)(25)11578.2857112337.39326.55630310976.825795.19472312330.25886.494684

Cˆ(W,6)(0)(26)1246513425.874617.70858111744.400455.78098313417.5737.641982

Cˆ(W,6)(0)(27)13442.5714314610.38868.68745412548.713556.64945614600.769378.615896

MAPE9.88439640613.645708729.881179166

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

Cˆ(W,6)(0)(28)14510.5714315899.407789.57120413391.523517.71194915888.303089.494675

Cˆ(W,6)(0)(29)15590.1428617302.1522310.9813614274.67298.43783117289.3748410.8994

Cˆ(W,6)(0)(30)16723.2857118828.6554912.5894515200.09259.10821718813.9967512.5018

Cˆ(W,6)(0)(31)18037.1428620489.8363613.5980216169.8054810.3527320473.0637713.50503

Cˆ(W,6)(0)(32)19302.8571422297.5769115.5143917185.9318610.966922278.4316215.4152

MAPE12.450882739.3155266112.36321992
Table 11

Simulated and Forecasted results for Rajasthan on Model-II.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.0586, b = 700.4246
a = 0.0160, b = 129.94
a = −0.0585, b = 700.6304
Simulated valueSimulated valueSimulated value
Cˆ(W,6)(0)(1)355.5714286355.57142860355.57142860355.57142860

Cˆ(W,6)(0)(2)427742.826613273.96408187.916908855.99136742.978321673.99961

Cˆ(W,6)(0)(3)503.8571429787.673605856.32876313.858540937.70882787.768348656.34756

Cˆ(W,6)(0)(4)588.2857143835.228165441.97662437.805807125.57939835.25851741.98178

Cˆ(W,6)(0)(5)685885.653756929.29252559.790289518.27879885.611603329.28637

Cˆ(W,6)(0)(6)776.4285714939.123714520.9543679.8430712.43971939.000196920.93839

Cˆ(W,6)(0)(7)871.8571429995.82183714.21846797.99473838.471847995.607291514.19386

Cˆ(W,6)(0)(8)968.42857141055.943029.036748914.27539985.591861055.6269139.004107

Cˆ(W,6)(0)(9)1061.4285711119.6939255.4893331028.7146833.0820621119.2647835.448903

Cˆ(W,6)(0)(10)1156.5714291187.2936912.6563221141.3417471.3167961186.7390262.608364

Cˆ(W,6)(0)(11)1256.1428571258.9746870.2254391252.185290.3150571258.2809160.170208

Cˆ(W,6)(0)(12)1369.8571431334.983312.5458011361.2735560.6266051334.1356672.607679

Cˆ(W,6)(0)(13)1493.7142861415.5808365.2308161468.6343391.6790321414.563285.298939

Cˆ(W,6)(0)(14)1612.7142861501.044316.924351574.2949952.3822751499.8394266.999061

Cˆ(W,6)(0)(15)1727.7142861591.6675087.8743791678.2824482.8611121590.2563957.956055

Cˆ(W,6)(0)(16)1832.2857141687.7619397.8876221780.6231942.8195671686.1241017.97701

Cˆ(W,6)(0)(17)1933.2857141789.6579217.4292071881.3433082.6867421787.7711377.526801

Cˆ(W,6)(0)(18)2031.2857141897.7057146.5761311980.4684562.5017291895.5459066.682458

Cˆ(W,6)(0)(19)2111.7142862012.2767224.7088552078.0238941.5954052009.8178174.825296

Cˆ(W,6)(0)(20)21902133.7647762.5678182174.0344790.7290192130.9785452.695044

Cˆ(W,6)(0)(21)2278.5714292262.587480.701492268.5246760.4409232259.4433790.839476

Cˆ(W,6)(0)(22)2368.8571432399.1876531.2803862361.518560.3097942395.6526421.131157

Cˆ(W,6)(0)(23)2467.2857142544.0348483.1106712453.0398270.5773912540.0732042.950104

Cˆ(W,6)(0)(24)2567.4285712697.6269655.0711592543.1117960.9471262693.2000774.898734

Cˆ(W,6)(0)(25)2681.5714292860.4919656.6722272631.7574191.8576422855.5581186.488236

Cˆ(W,6)(0)(26)27953033.1896858.5219922718.9992812.7191673027.703828.325718

Cˆ(W,6)(0)(27)2920.5714293216.31375810.126182804.8596143.9619583210.2272299.917778

MAPE12.643394787.31374764612.63328453

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

Cˆ(W,6)(0)(28)30413410.49366112.15042889.3602934.9865083403.75395811.92877

Cˆ(W,6)(0)(29)3171.4285713616.39687114.030532972.522856.2718023608.94733613.79564

Cˆ(W,6)(0)(30)3305.1428573834.73116516.023163054.3684757.5873993826.51068115.77444

Cˆ(W,6)(0)(31)3437.7142864066.24704918.283453134.9180238.8080694057.18970918.01998

Cˆ(W,6)(0)(32)3570.1428574311.74034220.772213214.1920179.9702134301.77509120.49308

MAPE16.251949717.52479827916.00238203
Table 12

Simulated and Forecasted results for Gujarat on Model-II.

Simulated results
SampleIn-sampleGM(1,1) [14]
ErrorNGM(1,1,k) [23]
ErrorIOGM
Error
a = −0.0738, b = 1904.2076
a = −0.1212, b = 730.2219
a = −0.0738, b = 1905.07252622308
Simulated valueSimulated valueSimulated value
Cˆ(W,6)(0)(1)1784.7142861784.71428601784.71428601784.7142860

Cˆ(W,6)(0)(2)2013.7142862112.8058274.920834834.784594858.545032113.8010784.970258

Cˆ(W,6)(0)(3)2234.1428572274.5338531.8078971427.20254836.118562275.6813551.859259

Cˆ(W,6)(0)(4)2443.7142862448.6416040.2016321951.99884620.121642449.9588350.255535

Cˆ(W,6)(0)(5)2650.8571432636.0767050.5575722416.8921758.8260122637.5829290.500752

Cˆ(W,6)(0)(6)2862.5714292837.859320.8632842828.720171.1825472839.5757550.803322

Cˆ(W,6)(0)(7)3077.1428573055.08770.7167413193.5399843.7826363057.0377070.653371

Cˆ(W,6)(0)(8)3316.4285713288.9441660.8287353516.7173736.0392923291.1534490.762119

Cˆ(W,6)(0)(9)3569.4285713540.7015410.8048083803.005626.5438223543.1983720.734857

Cˆ(W,6)(0)(10)3841.7142863811.7300780.780494056.6154445.5938873814.5455370.707204

Cˆ(W,6)(0)(11)4125.1428574103.5049180.5245384281.2769273.7849374106.673160.447735

Cˆ(W,6)(0)(12)44294417.614120.2570764480.2943851.1581484421.1726620.176729

Cˆ(W,6)(0)(13)4751.2857144755.7673020.0943244656.5949591.992954759.7573370.178302

Cˆ(W,6)(0)(14)5104.2857145119.8049480.3040434812.7716715.7111625124.271690.391553

Cˆ(W,6)(0)(15)5467.5714295511.7084250.807254951.1215649.445695516.7014810.898572

MAPE0.89794827311.256421550.889304425

Forecasted results

Out-sampleSimulated valueErrorSimulated valueErrorSimulated valueError

Cˆ(W,6)(0)(16)5841.4285715933.6107671.5780765073.67948413.143175936.5018431.627569

Cˆ(W,6)(0)(17)6224.4285716387.8082832.6248155182.24800716.743396391.1342912.678249

Cˆ(W,6)(0)(18)66166876.7730583.9415525278.42395720.217296880.583653.999148

Cˆ(W,6)(0)(19)7011.1428577403.1664075.5914365363.62188523.498617407.516295.653478

Cˆ(W,6)(0)(20)7402.1428577969.8533587.6695435439.0948826.527974.8027747.736407

MAPE4.28108418520.024492534.338970394
Table 9 depicts results of forecasters in terms of In-Samples and Out-Samples for three grey forecasting models as described in the previous section also. Internal parameters of grey models (‘a’ and ‘b’) have also been depicted along with the errors. For constructing the time series, data from 6th April 2020 to 12th April 2020 is considered as and represents the data of mean infected cases from 2nd May 2020 to 8th May 2020. Time series is divided into two parts. First 27 samples are considered for constructing the grey model and for obtaining parameters (‘a’, ‘b’). The remaining, 5 samples are employed to evaluate the performance of constructed model. As observed from the Table 9, IOGM outperforms other opponents, when MAPE of Out-Sample is considered. It is observed that values of MAPEs are quite competitive for the proposed IOGM. The MAPE values are quite competitive for simulated (4.441512436) and forecasted data (2.316478228). Pictorial representation of all these models is depicted in Fig. 7. It is also worth mentioning here that IOGM model gives competitive results in this case, as the forecasted value increases with every sample at a higher exponential rate. In addition to that, the graphical representation of simulation and forecasting errors for Delhi Model-II is presented in Fig. 8.
Fig. 7

Forecasted Results of Proposed Grey Model-II for different states and Delhi.

Fig. 8

Error of Proposed Grey Model-II for different states and Delhi.

Simulated and Forecasted results for Delhi on Model-II. Comparative analysis of the forecasting results for the state of Maharashtra is presented in Table 10. For constructing the time series for grey models like previous case studies, 27 such indicators have been considered for simulation and the remaining 5 samples are considered as Out-Samples for evaluating the constructed grey models. The internal parameters of grey forecasting systems such as (‘a’ and ‘b’) are shown in the respective columns. The data from 5th April 2020 to 11th April 2020 is considered as and represents the data of mean infected cases from 2nd May 2020 to 8th May 2020. A careful inspection of obtained MAPEs for simulated data In-Samples and Out-Samples indicate that MAPEs are quite competitive for proposed IOGM for simulated results (9.881179166) and competitive for forecasted results (12.36321992). Graphical representation of the forecasting performance along with errors for each model are depicted in Fig. 7, Fig. 8 respectively. It is concluded that Model-II provides quite competitive results with the proposed IOGM. Simulated and Forecasted results for Maharashtra on Model-II. Forecasting results of Rajasthan state for all three grey models are depicted in Table 11 and graphical representation of the forecasting results is presented in Fig. 7. For construction of the time series, from 6th April 2020 to 12th April 2020 is considered as and data of mean infected cases from 3rd May 2020 to 9th May 2020 is considered as . Inspecting the forecasting results from Table 11 and Fig. 7, it is observed that for this particular case NGM model provides better results. Similar to previously reported results all internal parameters of the grey system are shown in Table 11. Inspecting the values of MAPEs for IOGM, it has been observed that the performance of the IOGM is acceptable and competitive as per Lewis’s criterion. It is observed that values of MAPEs are (12.63328453) for proposed IOGM simulated data (27 data points) and (16.00238203) for the remaining 5 Out-Samples. Simulated and forecasted results of these models are depicted in Fig. 7. It is observed that the IOGM model gives competitive results in this case as the increment in infected cases and forecasted values increase with a higher exponential rate. Further, the error analysis for Rajasthan (Model-II) is depicted in Fig. 8. Simulated and Forecasted results for Rajasthan on Model-II. Table 12 shows the comparative analysis of different grey models along with the proposed IOGM (Model-II). The analysis is depicted through calculated error for each sample and the same is shown along with the sample. All parameters obtained for an internal grey mechanism such as ‘a’ and ‘b’ are shown in Table 12. For constructing the time series for the state of Gujarat infected cases, 15 samples are taken for constructing the model and the remaining five samples are kept for testing the efficacy of the proposed IOGM. The data from 18th April 2020 to 24th April 2020 represent , and represent the data of mean infected cases from 2nd May 2020 to 8th May 2020. After assessment of results, it can be concluded that for this particular data IOGM’s performance is better than competitors as the MAPE values obtained for simulated data is optimal (0.889304425) and competitive for forecasted results (4.338970394). It is also worth mentioning here that the rise in infected cases in this particular state is swift comparatively, hence the NGM model provides pessimistic results. The same fact can be observed from higher MAPEs obtained from the NGM model (11.25642155) for simulated data. The MAPE is very high (20.02449253) for forecasted data. Graphical representation of the forecasted results is presented in Fig. 7. Further, the analysis depicting simulation and forecasting errors for Gujarat Model-II is shown in Fig. 8. From this analysis, it can be concluded that proposed IOGM performs satisfactorily. Simulated and Forecasted results for Gujarat on Model-II. Forecasted Results of Proposed Grey Model-II for different states and Delhi. Error of Proposed Grey Model-II for different states and Delhi.

Comparative analysis of the proposed IOGM models

To showcase the efficacy of the proposed approach, the analysis based on errors reported in simulated and forecasted data have already been discussed in the previous section. On the basis of MAPE values of forecasting models, it can be concluded that proposed models based on different overlap periods and mean infected cases in the duration of a span of 6–7 days can be a potential tool for alignment of medical facilities and policy decisions. Further, to have a clear insight, comparative analysis of these proposed grey models are depicted in Fig. 9. The following points can be concluded from this analysis:
Fig. 9

Comparative Analysis of proposed grey models on the basis of average ranks..

Fig. 9(a) and (c) show the MAPE of forecasted results of IOGM models. It can be easily concluded that the proposed IOGM (Model-II) provides competitive results as compared to the results obtained by conventional GM and NGM models. Fig. 9(b) depicts the average rank analysis. For conducting this analysis, developed models have given rank as per the performance. The evaluation of performance is based on the calculated MAPE for simulated and forecasted samples. After taking the mean of the MAPE obtained from forecasted and simulated models, it has been observed that the average rank of IOGM (Model-II) is I (1.5) as compared to other grey models i.e. 2 and 2.5. However, the results for Model-I are quite comparable with the original GM model. Hence, it is to be noted that for Model-II proposed IOGM based methodology provides very competitive results. This method is suitable for forecasting as it produces meaningful results without knowing the pattern of variables. It can generate a reliable forecast for planning combating strategies. Fig. 9(d) depicts the average MAPE analysis obtained by IOGM models (state-wise). As shown, the average MAPE obtained by models I and II are (11.76 and 7.82), (16.29, 11.37), (13.09, 9.54) and (21.95, 13.25) for Delhi, Maharashtra, Gujarat and Rajasthan respectively. It can be concluded that the proposed IOGM model-II exhibits superior performance as the average MAPE calculated for forecasting is optimal. Comparative Analysis of proposed grey models on the basis of average ranks.. From the results reported in this section, it can be concluded that the estimated results are always higher than the actual infected cases. This indicator is sufficient enough to spark an alarm to the authorities. However, the authenticity of the forecast largely depends upon the removal of potential uncertainties in the data. Another problem with the forecasting of epidemic and pandemic is that the data of confirmed cases multi-folds with time. Apart from these issues, forecasting is immensely valuable as it allows us to foresee many preventive and corrective measures in health care. Model-I and II give a sufficient amount of accuracy in the prediction of mean weekly infected cases. Higher values of MAPE can be justified with the larger population in three states and Delhi. Following recommendations can be drawn from this forecast: It can be seen that the performance of the models relies on the mean infected cases of a duration of more than five days. It is also a known fact that by taking the average of infected cases, the forecaster can easily deal with the randomness in data. This randomness is due to environment, policies, strategic decisions, sentiments and medical conditions. Considering a large population and the density of population in the major states of India, it may be concluded that based on the predictions of pandemic spread in these states, authorities can take decisions on the availability of Intensive Care Units and for severe cases, more ventilators can be procured. It is also empirical to spread awareness of this deadly disease in rural areas. Here, the authorities can plan online/offline campaigns to educate people before it hits the masses. Also, the strategies can be framed to impose lockdown in certain states and the period of lockdown can also be calculated based on this forecast. In addition to the above-cited recommendations, special care is to be taken of those patients who are already suffering from other diseases and taking regular treatment from hospitals. Based on the infected cases forecast, local hospitals, schools and some unused official buildings can be converted into COVID relief and cure centres. Based on the prediction results, the supply of required first aid treatment equipment and medicines can be foreseen. An awareness programme for the first line of medication can be developed. The knowledge of these programmes can be disseminated at different levels.

Conclusion

Novel Coronavirus poses a threat to human beings. This has significantly changed the way of thinking towards life. As the pandemic hit masses, the prediction methods offer help to medical practitioners, policymakers, and leaders of the states and countries to combat this disease effectively. The work reported in this paper has discussed difficulties in the forecasting of spread of pandemic and it has offered a probable solution in the form of the proposed grey mathematics-based optimization model. Following are the major conclusions of this work: This paper has presented theoretical aspects of GPMs. Also, it has discussed how the accuracy of these models is compromised due to the inherent nature of the models. An Internal Optimization-Based Model has been proposed for addressing these issues. This model has been validated on benchmark time series data. After the validation of this model, two sub-models have been developed with the help of different overlap periods to conduct the forecast of pandemic spread. These models are based on the mean values of the infected cases in the three major states and Delhi consisting of different overlap patterns i.e. 5 and 6 days respectively. The proposed prediction models are based on internal optimization and also on the hypothesis that performance can substantially be enhanced with the help of a careful selection of the grey model’s internal parameters. Further, both models have been tested on three major states and Delhi and forecasting of the infected cases has been done. It is observed that the values of error indices are optimal as compared with non-optimized models. The comparison of optimized models and non-optimized conventional models such as GM and NGM has been done in terms of evaluation of error indices. Further, this analysis has been extended to the evaluation of the average rank associated with these models. It has been observed that the proposed models perform satisfactorily as ranks obtained by these models are optimal in comparison to other grey models. Further, it is stated that the results of the proposed models are closely aligned with the actual data. For extending the analysis,the average MAPE of proposed IOGM models (place wise) have been evaluated. Moreover, it is observed that the proposed model-II (with a higher overlap period) yields satisfactory results. Based on prediction results, certain suggestions and recommendations have been framed. These recommendations can be further utilized for framing the policies and preventive strategies for the COVID-19 by the Government of India. The comparative analysis of performance of other grey models for forecasting Corona spread can be a future research direction. It will be interesting to develop new grey models with the application of nature-inspired optimizers in future.

CRediT authorship contribution statement

Akash Saxena: Conceptualization, Methodology, Software, Data curation, Writing – original draft, Visualization, Investigation, Supervision, Software, Validation, Writing – review & editing.

Declaration of Competing Interest

The author declares that he has no known competing financial interests or personal relationships that could have appeared to influecne the work reported in this paper.
  11 in total

1.  Prediction for the spread of COVID-19 in India and effectiveness of preventive measures.

Authors:  Anuradha Tomar; Neeraj Gupta
Journal:  Sci Total Environ       Date:  2020-04-20       Impact factor: 7.963

2.  Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China.

Authors:  Chaolin Huang; Yeming Wang; Xingwang Li; Lili Ren; Jianping Zhao; Yi Hu; Li Zhang; Guohui Fan; Jiuyang Xu; Xiaoying Gu; Zhenshun Cheng; Ting Yu; Jiaan Xia; Yuan Wei; Wenjuan Wu; Xuelei Xie; Wen Yin; Hui Li; Min Liu; Yan Xiao; Hong Gao; Li Guo; Jungang Xie; Guangfa Wang; Rongmeng Jiang; Zhancheng Gao; Qi Jin; Jianwei Wang; Bin Cao
Journal:  Lancet       Date:  2020-01-24       Impact factor: 79.321

3.  Forecasting the novel coronavirus COVID-19.

Authors:  Fotios Petropoulos; Spyros Makridakis
Journal:  PLoS One       Date:  2020-03-31       Impact factor: 3.240

4.  Prediction of the COVID-19 spread in African countries and implications for prevention and control: A case study in South Africa, Egypt, Algeria, Nigeria, Senegal and Kenya.

Authors:  Zebin Zhao; Xin Li; Feng Liu; Gaofeng Zhu; Chunfeng Ma; Liangxu Wang
Journal:  Sci Total Environ       Date:  2020-04-25       Impact factor: 7.963

5.  Why is it difficult to accurately predict the COVID-19 epidemic?

Authors:  Weston C Roda; Marie B Varughese; Donglin Han; Michael Y Li
Journal:  Infect Dis Model       Date:  2020-03-25

6.  Prediction and analysis of COVID-19 positive cases using deep learning models: A descriptive case study of India.

Authors:  Parul Arora; Himanshu Kumar; Bijaya Ketan Panigrahi
Journal:  Chaos Solitons Fractals       Date:  2020-06-17       Impact factor: 9.922

7.  Time series forecasting of COVID-19 transmission in Canada using LSTM networks.

Authors:  Vinay Kumar Reddy Chimmula; Lei Zhang
Journal:  Chaos Solitons Fractals       Date:  2020-05-08       Impact factor: 5.944

8.  Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020.

Authors:  K Roosa; Y Lee; R Luo; A Kirpich; R Rothenberg; J M Hyman; P Yan; G Chowell
Journal:  Infect Dis Model       Date:  2020-02-14

9.  WHO Declares COVID-19 a Pandemic.

Authors:  Domenico Cucinotta; Maurizio Vanelli
Journal:  Acta Biomed       Date:  2020-03-19
View more
  2 in total

1.  Forecasting CO2 Emissions Using A Novel Grey Bernoulli Model: A Case of Shaanxi Province in China.

Authors:  Huiping Wang; Zhun Zhang
Journal:  Int J Environ Res Public Health       Date:  2022-04-19       Impact factor: 4.614

2.  A data-driven hybrid ensemble AI model for COVID-19 infection forecast using multiple neural networks and reinforced learning.

Authors:  Weiqiu Jin; Shuqing Dong; Chengqing Yu; Qingquan Luo
Journal:  Comput Biol Med       Date:  2022-04-27       Impact factor: 6.698

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.