Literature DB >> 34131231

A new grey quadratic polynomial model and its application in the COVID-19 in China.

Jianbo Zhang1, Zeyou Jiang2.   

Abstract

This paper develops a new grey prediction model with quadratic polynomial term. Analytical expressions of the time response function and the restored values of the new model are derived by using grey model technique and mathematical tools. With observations of the confirmed cases, the death cases and the recovered cases from COVID-19 in China at the early stage, the proposed forecasting model is developed. The computational results demonstrate that the new model has higher precision than the other existing prediction models, which show the grey model has high accuracy in the forecasting of COVID-19.

Entities:  

Mesh:

Year:  2021        PMID: 34131231      PMCID: PMC8206087          DOI: 10.1038/s41598-021-91970-1

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

At the beginning of 2020, a new strain of coronavirus (COVID-19) was found from some patients in January 2020. This disease can lead to severe fever, and mainly acute respiratory failure syndrome[1]. It is proven that this coronavirus can be transmitted from person to person. The number of confirmed cases rose sharply since the January 2020, and governments have to promulgate various laws and policies to alleviate the spread of COVID-19. At now, the total confirmed cases has reached 137,866,311 cases all over the world. Moreover, there is no indication that the virus will disappear within a few months. Thus accurately prediction the tendency, particularly at the early stage of the disease, can give a guidance for the control and prevention of the coronavirus. It is generally known that the statistical models like autoregressive model, moving average and autoregressive integrated moving average, and the computational intelligence methods are widely applied in COVID-19 diseases. Castillo and Melin[2] described a hybrid intelligent approach for efficient and accurate prediction COVID-19 time series combining fuzzy logic and fractal theory. Publicly available datasets of 10 countries are used to establish the fuzzy model, and the results show the new model can be considered good studying the complexity of this epidemic diseases. Chimmula and Zhang[3] proposed a new state-of-the-art Deep Learning forecasting model for COVID-19 outbreak in Canada. The possible trends and stopping time of COVID-19 in Canada are evaluated, and then compared transmission rates of Canada with Italy and USA. Anastassopoulou et al.[4] used a Susceptible-Infectious-Recovered-Dead (SIDR) model to study the basic reproduction number, the per day infection mortality and the recovery rates of Hubei in China. Petropoulos and Makridakis[5] introduced an objective method to predict the spread of confirmed cases, the number of deaths and recoveries of the COVID-19 under the assumption that the original data is reliable and the process of the disease following the past pattern. Shastri et al.[6] used neural network with Stacked LSTM, Convolutional LSTM and Bi-directional LSTM to study the confirmed cases and the death cases of COVID-19 in USA and India. Wang et al.[7] developed a deep learning method with rolling mechanism to forecast the epidemic trend for Russia, Peru and Iran. Hawas[8] introduced the recurrent neural networks for forecasting the virus’s daily infection in Brazil with limited raw data. Yonar et al.[9] estimated the number of COVID-19 epidemic cases of Turkey, Germany, United Kingdom, France, Italy, Russia, Canada and Japan by Box-Jenkins (ARIMA), curve estimation models and Brown/Holt linear exponential smoothing methods. Melin et al.[10] presented a multiple ensemble neural network with fuzzy logic method for the COVID-19 cases in Mexico where the errors are significantly lower than traditional neural networks. Sun and Wang[11] examined the data from January 23 to March 25 by ordinary differential equation model, which demonstrate that strongly controlled measured can minimize total infections. Castillo and Melin[12] proposed a hybrid intelligent fuzzy fractal method for COVID-19 classification of countries. Additionally, Luo et al.[13], Sahin and Sahin[14], Zhao et al.[15] used grey models to study the number of patients infected with COVID-19. The Chaos, Solitons and Fractals launched an open focus issue for understanding and mitigating the effects of the current pandemic[16]. For more details about this topic, the interested readers can refer to[17-23]. Moreover, the details of these work are summarized in Table 1.
Table 1

Studies on COVID-19 analysis and forecasting.

RefDescriptionForecasting methodfocusData amount
[2]Computational intelligence methodA hybrid intelligent approach based on fractal theory and fuzzy logicBelgium, China, France, Germany, Iran Italy, Spain, Turkey, UK, US01/22/2020–03/31/2020
[10]Computational intelligence methodMultiple ensemble neural network model with fuzzy response aggregationMexicoto 05/18/2020
[12]Computational intelligence methodHybrid intelligent fuzzy fractal approachAustria, Bolivia, Brazil, Ecuador, Finland, Greece, India, Morocco, New Zealand, Norway, Poland, Russia, Singapore, Sweden, Switzerland04/01/2020–07/12/2020
[21]Computational intelligence methodRNN based Long Short Term Memory (LSTM) modelChina, Australia, USA, others01/12/2020–05/11/2020
[6]Deep learning methodRNN based variants of long short term memory (LSTM)India, USA02/07/2020–07/07/2020
[7]Deep learning methodRecurrent neural networks (RNNs)Russia, Peru, Iran02/22/2020–07/07/2020
[3]State-of-the-art Deep Learning modelsthe Long short-term memory networksCanada20/22/2020–03/31/2020
[4]Nonlinear dynamics systemSusceptible Infectious-Recovered-Dead (SIDR) modelChina01/11/2020–02/10/2020
[11]Nonlinear dynamics systemOrdinary differential equation modelChina01/23/2020–03/25/2020
[18]Nonlinear dynamics systemSEIDIUQHRD deterministic compartmental modelRussia, Brazil, India, Bangladesh02/01/2020–05/08/2020
[23]Nonlinear dynamics systemSusceptible-Exposed-Infected-Recovered (SEIR) modelChina, Italy01/22/2020–03/30/2020
[9]Parameter modelBox-Jenkins (ARIMA) and Brown/Holt linear exponential smoothing methodsTurkey, Germany, UK, France, Italy, Russia, Canada, Japan01/22/2020–03/22/2020
[17]Parameter modelAutoregressive Integrated Moving Average (ARIMA) modelUSA, UK, Italy, Spain, France, China, India07/12/2020–09/11/2020
[22]Parameter modelExponential decay model (EDM)China02/27/2020–04/07/2020
[5]Statistical methodNullSouth Korea, Iran, and Europe01/22/2020–03/11/2020
[13]Grey prediction modelGRM(1,1) modelChina, Italy, United Kingdom, Russian01/23–02/06 03/10–03/21 04/11–04/25 06/01–08/12
[14]Grey prediction modelFractional nonlinear grey Bernoulli modelItaly, UK, USA04/22/2020–05/22/2020
[15]Grey Prediction modelRolling grey Verhulst modelChina01/21/2020–02/20/2020
Studies on COVID-19 analysis and forecasting. It can be seen that the neural network models and statistical prediction models are widely used to study the COVID-19, and the grey prediction model is relatively few. As we know, the statistical models often require a large amount of historical data, at least thirty or more datasets, which obey a certain distribution. The neural network method needs a substantial amount of datasets for training to obtain system optimized parameters. However, the transmission mechanism of COVID-19 is not very clear, especially in the early stage owing to the limited information available. Thus it is very important to select a favorable technique for prediction the trend of the COVID-19 with limited information. The grey prediction method, proposed by Deng Julong[24], is an efficient and accuracy method for solving uncertain problems with limited information. In the classical grey model GM(1,1), the grey action quantity is a constant number, which is essentially a homogenous exponent model. When the raw of data is not a homogeneous exponent sequence, the model accuracy maybe low. So Cui et al.[25], Xie et al.[26] put forward a non-homogeneous grey model with grey action quantity is bt. Chen and Yu[27] based on the work of[25,26] proposed a non-homogenous grey prediction model termed as NGM(1,1,k,c) in their work where the grey action quantity is bt + c. The whitening equation, the time response function and the restored values of the model are all derived with the grey techniques and mathematical tools. This model can simulate a homogeneous exponential sequence, a non-homogeneous exponential sequence and a linear sequence. However, we discover this non-homogeneous grey prediction model sometimes has large error with some sequences. To further improve the effectiveness and applicableness of grey models, we generalized the non-homogeneous grey forecasting model to a grey prediction model with quadratic polynomial term in this work. At the early stage, the spreading mechanism of the COVID-19 is not clear, and there is limited available data to collect for us. Thus it is important for us to select an appropriate method to deal with the COVID-19, and obtain acceptable results. Under this situation, the grey forecasting model is chose to study the confirmed cases, the death cases and the recovered cases of COVID-19 in China at the early stage. With the grey theory and mathematical analysis, the grey quadratic polynomial model GMQP(1,1) is systematically studied. The grey basic form, the system parameters, the time response function and the restored values are all derived. Based on these expressions, some special cases are all considered. Further, the new model is applied to study the confirmed cases, the death cases and the recovered cases from COVID-19 in China at the early stage. The computational results are compared with the classical grey model GM(1,1)[24], the discrete grey model DGM(1,1)[28,29], the non-homogeneous grey model NGM(1,1,k,c)[27,30], the grey Verhulst model GVM(1,1)[31-34] and the polynomial regression PR(2) in the application section. It is found that the new model outperforms the other prediction models and can obtain competitive results. In summary, the main contributions and originalities of this work are provided here. (1) The grey forecasting model with quadratic polynomial term is develop, which can solve quasi homogeneous and non-homogeneous exponential series, or even some fluctuating series. (2) The analytical solution of time response function and the matrix expression of system parameters are also determined by grey technique. (3) The proposed newly model is a general grey forecasting model, and the GM(1,1) model, the NGM(1,1,k) model and the NGM(1,1,k,c) model are all special cases of the proposed model. Moreover, the feasibility of the new model is verified through two examples. (4) The new model is used to study the confirmed cases, the death cases and the recovered cases of COVID-19 in China at the early stage, and results illustrate that the new model has higher precision than other forecasting models. The rest of this paper is arranged as follows. Section 2 discusses the existing grey forecasting models. The details of the grey prediction model with quadratic polynomial term is given in Sect. 3. Section 4 provides some numerical examples. Applications are studied in the Sect. 5. Conclusions are placed in the last section.

Some existing grey forecasting models

This section provides a brief overview of some grey forecasting models which will used in the application section. They are the classical grey model GM(1,1), the discrete grey model DGM(1,1), the non-homogeneous grey model NGM(1,1,k,c) and the grey Verhulst model GVM(1,1). For concise, we only provide the whitening equation, the time response function and the restored values of them. The GM(1,1) model The classical grey model GM(1,1) is the core of the grey forecasting theory. Since been putted forward, it has been widely applied in various fields including energy, economy and education. The whitening equation of GM(1,1) model is given by The time response function and the restored values are The DGM(1,1) model The discrete grey forecasting model DGM(1,1) is initially provided by Xie and Liu[28,29], the mathematical expression is and the recursive function is given by The NGM(1,1,k,c) model The whitening equation of the NGM(1,1,k,c) is The time response function and the restored values are The GVM(1,1) model. This nonlinear grey model is first appeared in the book of Deng[34], which is able to simulate and predict original observations with an inverted U shape or a signal peak feature. The whitening equation of GVM(1,1) model is Further, the time response function and the restored values are

The grey model with quadratic polynomial term

This section discusses the grey model with quadratic polynomial term which is abbreviated as GMQP(1,1) model in the present paper. We first provide the definition of the accumulated and inverse accumulated generation operators, and then discuss the new model GMQP(1,1) along with some properties.

Accumulated and inverse accumulated generation operator

Definition 1

(Accumulated generation operator) First, we assume the original non-negative sequence is , and A is a sequence operator such that , where the relationship is given by . The operator A is named as the first-order accumulated generation operator (1-AGO) of original sequence . It follows from definition 1 that , where .

Definition 2

(Inverse accumulated generation operator). The inverse accumulated generation operator is defined as , where and . It follows from the definition 1 and definition 2 that the inverse accumulated generation operator is the inverse operation of the accumulated generation operator.

The grey quadratic polynomial model

Definition 3

Assume and are stated in definition 1, then the whitening differential equation of the grey model with quadratic polynomial term is defined as. where a is the development coefficient, and is the grey action quantity. Obviously, when system parameter b = 0 in Eq. (12), the GMQP(1,1) model degenerates to the NGM(1,1,k,c) model. When the parameters b = 0 and c = 0 in Eq. (12), the GMQP(1,1) model reduces to the classical GM(1,1) model.

Theorem 1

The basic form of the GMQP(1,1) model is represented by. where is called the mean sequence or background values.

Proof

The whitening equation is integral on interval [k-1, k], It yields that With the trapezoid formula , and some mathematical calculations, we have this completes the proof.

Theorem 2

Let raw data sequence be the non-negative sequence, is the 1-AGO sequence of , and the background value is . The column parameter of the GMQP(1,1) model is presented by the following relationship. where Employing the mathematical induction considering k = 2,3,…,n into Theorem 1, we obtain that. . Converting the above equation system into the matrix form, we can get It is easily known that .

Theorem 3

The analytical expression of the time response sequence of the GMQP(1,1) model is given by. and the restored values can be derived by utilizing the 1-IAGO, that is It follows from the theory of the ordinary differential equation that the general solution of the whitening equation is Noting that , and , we can obtain Finally, we can discrete the expression of to get the time response function, and the restored values of the GMQP(1,1) model.

Error checking method

The performance of model should include two aspects: the simulation performance and the fitting performance. Assume a raw sequence where a subsequence composed of the first m entries of raw sequence is applied to develop the newly proposed model, and simulation sequence is . We utilize the grey forecasting model to forecast the left n-m steps data, and the prediction sequence is . The error sequence of the simulation sequence and the prediction sequence are, respectively, and , which are given as follows where and . Here the absolute percentage error (APE), the absolute error (MAE), the mean squares error (MSE), the mean absolute percentage error (MAPE), the root mean square percentage error (RMSPE), the index of agreement (IA) and the correlation coefficient (R) are provided below. The absolute percentage error The absolute error (MAE) where is the mean absolute simulation percentage error MAEsim, is the mean absolute fitting percentage error MAEfit, is the total mean absolute percentage error MAEall. The mean squares error (MSE) The mean absolute percentage error The root mean square percentage error The index of agreement (IA) where is the mean value of original sequence. The correlation coefficient (R)Moreover, the flowchart of the GMQP(1,1) model is listed in the following Fig. 1.
Figure 1

The flowchart of the GMQP(1,1) model.

The flowchart of the GMQP(1,1) model.

Validation of the GMQP(1,1) model

To validation of the feasibility of the new model, this section gives two numerical example where datasets are collected from published papers.

Example 1

In this example, data are all collected from Table 2 in Ref[35]. where the total energy consumption in China (unit: 10000tce). These data are used to build the GM(1,1) model, the DGM(1,1) model, the NGM(1,1,k,c) model, the GVM(1,1) model and the GMQP(1,1) model. The numerical results of these grey forecasting models are displayed in the following Tables 2, 3 and 4.
Table 2

The numerical results of the energy consumption of China (unit: 10,000 tce).

yearvaluesGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)GMQP(1,1)
1999140,568.82140,568.8200140,568.8200140,568.8200140,568.8200140,568.8200
2000145,530.86154,097.5428154,173.2782121,587.393441,622.9108131,646.4473
2001150,405.8166,155.7594166,243.3504142,213.090153,370.7429152,246.2752
2002159,430.99179,157.5379179,258.3764162,614.716868,053.9908172,683.2660
2003183,791.82193,176.7127193,292.3359182,794.708086,162.8285192,937.8565
2004213,455.99208,292.8956208,425.0000202,755.4712108,114.8464212,988.1330
2005235,996.65224,591.9280224,742.3853222,499.3882134,140.9801232,809.5492
2006258,676.3242,166.3685242,337.2425242,028.8146164,126.8251252,374.6097
2007280,507.94261,116.0186261,309.5834261,346.0807197,422.6059271,652.5157
2008291,448.29281,548.4891281,767.2500280,453.4912232,662.9912290,608.7684
2009306,647.15303,579.8116303,826.5269299,353.3259267,672.3260309,204.7243
2010324,939.15327,335.0970327,612.8026318,047.8398299,553.1267327,397.0971
2011348,001.66352,949.2464353,261.2821336,539.2636325,035.1990345,137.4001
2012361,732.01380,567.7169380,917.7554354,829.8034341,075.7973362,371.3216
Table 3

The APEs of these forecasting models in the energy consumption of China.

yearGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)GMQP(1,1)
19990.00000.00000.00000.00000.0000
20005.88655.938516.452571.39939.5405
200110.471610.52995.447164.51551.2237
200212.373112.43631.996957.31458.3122
20035.10635.16920.542553.11934.9763
20042.41882.35695.013049.35030.2192
20054.83264.76885.719343.15981.3505
20066.38256.31646.435636.55132.4361
20076.91316.84416.831129.61963.1569
20083.39683.32173.772520.17010.2881
20091.00030.91982.378612.71000.8340
20100.73740.82282.12087.81250.7564
20111.42171.51143.29386.59950.8231
20125.20715.30391.90815.71040.1767
Table 4

The evaluation measures of these models in the energy consumption of China.

GM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)GMQP(1,1)
MAE11,157.240911,173.171010,759.129072,426.87995099.5298
MSE161,263,808.4429161,615,292.1515153,318,142.59986,306,252,770.160446,603,353.8470
MAPE5.08835.09544.762435.23322.6226
RMSPE6.09816.11146.135941.67993.9827
IA0.99220.99220.99270.81820.9978
R0.98690.98690.99550.94790.9962
The numerical results of the energy consumption of China (unit: 10,000 tce). The APEs of these forecasting models in the energy consumption of China. The evaluation measures of these models in the energy consumption of China. We can from Tables 2, 3, and 4 that the new model has better performance measures than other grey forecasting models in the energy consumption of China, which show that the new structure of GMQP(1,1) model can improve the precision of grey model.

Example 2

In this example, the raw data of the electricity consumption of China are collected from Table 2 in Ref.[36], where the twelve data are all applied to build different kinds of grey models. Similarly, the computational results and evaluation measures are listed in the following Tables 5, 6, and 7.
Table 5

The results of the electricity of China by different grey forecasting models.

yearvaluesGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)GMQP(1,1)
200216541654.00001654.00001654.00001654.00001654.0000
20031910.52113.27552114.96881785.3338574.26171908.7598
20042203.32325.48962327.52322084.6507764.27212212.6902
20052500.22559.01422561.43932392.25181009.78242524.3652
20062865.72815.98932818.86402708.36631321.41592844.1501
20073281.53098.76963102.15983033.22991707.67373172.4276
20083495.73409.94663413.92693367.08482171.42433509.5984
20093714.63752.37193757.02663710.17972705.01993856.0821
20104192.34129.18344134.60784062.77043284.87964212.3181
20114692.84543.83414550.13594425.11983867.70674578.7665
20124959.15000.12385007.42464797.49794391.72554955.9092
20135322.35502.23395510.67085180.18234786.00895344.2508
Table 6

The APEs of these grey models in the electricity consumption in China.

yearGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)GMQP(1,1)
20020.00000.00000.00000.00000.0000
200310.613710.70246.551569.94180.0911
20045.54585.63815.385165.31240.4262
20052.35242.44944.317659.61190.9665
20061.73471.63445.490253.88850.7520
20075.56855.46527.565747.96063.3239
20082.45312.33923.679237.88300.3976
20091.01691.14220.119027.17873.8088
20101.50551.37613.089721.64490.4775
20113.17443.04015.704117.58212.4300
20120.82720.97453.258711.44110.0643
20133.38083.53932.670210.07630.4124
Table 7

The evaluation measures of these models in the electricity consumption of China.

GM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)GMQP(1,1)
MAE106.6169107.0326144.66661141.257243.6811
MSE14,943.904815,030.252925,372.15411,438,735.61444285.7338
MAPE3.47033.48194.348338.41101.1955
RMSPE4.41554.43394.780043.79621.7506
IA0.99690.99680.99460.79800.9991
R0.99500.99500.99820.94440.9986
The results of the electricity of China by different grey forecasting models. The APEs of these grey models in the electricity consumption in China. The evaluation measures of these models in the electricity consumption of China. It is shown that the GM(1,1) model, the DGM(1,1) model, the NGM(1,1,k,c) model and the GMQP(1,1) model successfully catch the trend of the electricity consumption of China. Moreover, the new model has the best performance measures, while the GVM(1,1) model has the worst performance measures. It follows from example 1 and example 2 that the new grey model has best performance measures, which shows the new grey models with a more flexible structure can be a good way of improving the accuracy of model.

Applications in the COVID-19 of China

In this section, we will use different grey forecasting models and the polynomial regression to study the confirmed cases, the death cases and the recovered cases from COVID-19 in China, which are the classical continuous grey model GM(1,1), the discrete grey model DGM(1,1), the non-homogeneous grey model NGM(1,1,k,c), the nonlinear grey Verhulst model GVM(1,1), the polynomial regression (PR) and the grey model with quadratic polynomial term GMQP(1,1). Moreover, the structure of the applications in the COVID-19 of China is shown in Fig. 2.
Figure 2

The structure of the application in the COVID-19 of China.

The structure of the application in the COVID-19 of China.

The confirmed cases from COVID-19 of China

In this subsection, we apply forecasting models to study the confirmed cases from COVID-19 of China. The raw data, starting 2020-01-21 to 2020-02-06, are collected from the website: http://www.nhc.gov.cn, and displayed in the following Table 8 and Fig. 3.
Table 8

The number of the confirmed cases from COVID-19 of China.

date01/21/202001/22/202001/23/202001/24/202001/25/202001/26/2020
raw data29144057183012871975
date01/27/202001/28/202001/29/202001/30/202001/31/202002/01/2020
raw data2744451559747711969211,791
date02/02/202002/03/202002/04/202002/05/202002/06/2020
raw data14,38017,20520,43824,32428,018
Figure 3

The plots of the confirmed cases from COVID-19 of China.

The plots of the confirmed cases from COVID-19 of China. The number of the confirmed cases from COVID-19 of China. With these raw data, we can deduce the mathematical expressions of different grey model. Here we take the GMQP(1,1) model as an example to details show the modelling procedures. Step 1 pre-process the raw data. It follows from Table 8 that the original sequence is X(0) = (291, 440, 571, 830, 1287, 1975, 2744, 4515, 5974, 7711, 9692, 11,791, 14,380, 17,205, 20,438, 24,324, 28,018). The first 14 data are used to develop the GMQP(1,1) model of the confirmed cases of COVID-19, and the remaining three data are used to test. From the definition 1, the first-order accumulating generated sequence is X(1) = (291, 731, 1302, 2132, 3419, 5394, 8138, 12,653, 18,627, 26,338, 36,030, 47,821, 62,201, 79,406, 99,844, 124,168, 52,186). Step 2 System parameter estimation. From theorem 2, and the values of X(0) and X(1), we calculate the matrix B and the matrix Y which are given by With the help of the Eq. (17), we can obtain the values of system parameters as Step 3 Model construction. Substituting the system parameters a, b, c and d into Eq. (12), we obtain that. . And then we can obtain the expressions of Eq. (20) and Eq. (21), respectively. Therefore, we can compute the simulation and prediction values of the confirmed cases of COVID-19 of China. By a similar argument to the other grey forecasting models which are provided below. The GM(1,1) model. We can obtain system parameters a = -0.2441, b = 1116.9454 of the GM(1,1) model by the least squares method. And then the mathematical expression is given by. . The DGM(1,1) model. We directly deduce system parameters a = 0.2441, b = 1116.9454 of the DGM(1,1) model. And the mathematical formula is given by. . The NGM(1,1,k,c) model. We can derive system parameters a = -0.1719, b = 463.7776 and c = -1124.6229 of the NGM(1,1,k,c) model. The whitening equation is built, there is. . The GVM(1,1) model. We deduce system parameters a = -0.3820 and b = -2.0528E-6 of the GVM(1,1) model with the least squares estimation method. Further, the whitening equation is put forward, there is. . The polynomial regression model. We compute the values of parameters of the polynomial regression model where a = 120.9911, b = − 535.4727 and c = 916.0495, respectively. And then the mathematical expression is. . Once the specific grey forecasting models are established, the computational results and error metrics can be easily obtained which are displayed in the following Tables 9, 10, 11 and Fig. 4. The MAEsim, MAEfit, and MAEall of the GMQP(1,1) model are 93.9043%, 871.5592% and 239.7146%, the MSEsim, MSEfit, and MSEall of the GMQP(1,1) model are 14,610.4784%, 924,128.4138% and 185,145.0913%, the MAPEsim, MAPEfit, and MAPEall of the GMQP(1,1) model are 4.8534%, 3.4346%, 4.5873%, the RMSEsim, RMSEfit, and RMSEall of the GMQP(1,1) model are 7.1669%, 3.6842%, 6.6542%, the IAsim, IAfit, and IAall of the GMQP(1,1) model are 0.9999%, 0.9990% and 0.9994%, the Rsim, Rfit, and Rall of the GMQP(1,1) model are 0.9998%, 0.9994% and 0.9996%, respectively.
Table 9

The computational results of the confirmed cases of COVID-19 of China.

datedataGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
01/21/2020291291.0000291.0000291.0000291.0000501.5679291.0000
01/22/20204401345.51551353.1171-420.6949135.0533329.0684506.8141
01/23/20205711717.49611729.31266.4145197.5213398.5511495.6951
01/24/20208302192.31432210.0987513.6543288.6436710.0159748.7279
01/25/202012872798.40062824.55351116.0578421.29171263.46291262.8613
01/26/202019753572.04523609.83991831.4788613.81162058.89202035.0789
01/27/202027444559.57124613.45282681.1206892.00523096.30333062.3995
01/28/202045155820.10835896.09173690.16451291.43744375.69674341.8762
01/29/202059747429.13297535.33164888.51601859.63035897.07235870.5960
01/30/202077119482.98789630.31546311.69152657.02777660.42997645.6799
01/31/2020969212,104.650412,307.74968001.87023754.38569665.76989664.2818
02/01/202011,79115,451.096815,729.567910,009.14495222.483611,913.091811,923.5887
02/02/202014,38019,722.700320,102.725212,393.00637108.510814,402.395914,420.8200
02/03/202017,20525,175.229425,691.713915,224.10629394.752517,133.682117,153.2273
02/04/202020,43832,135.162432,834.561418,586.351411,944.308420,106.950520,118.0938
02/05/202024,32441,019.235441,963.273622,579.390614,459.334923,322.201123,312.7343
02/06/202028,01852,359.395353,629.963627,321.567616,500.409126,779.433826,734.4943
Table 10

The APEs of different model in the confirmed cases of COVID-19 of China, (%).

dateGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
01/21/20200.00000.00000.00000.000072.36010.0000
01/22/2020205.7990207.5266195.612569.306125.211715.1850
01/23/2020200.7874202.856998.876665.407830.201213.1883
01/24/2020164.1343166.276938.113965.223714.45599.7918
01/25/2020117.4359119.468013.282267.26561.82881.8756
01/26/202080.863082.77677.266968.92094.24773.0420
01/27/202066.165168.12872.291567.492512.839011.6035
01/28/202028.906130.589018.268871.39673.08533.8344
01/29/202024.357826.135418.170168.87131.28771.7309
01/30/202022.980024.890618.146965.54240.65580.8471
01/31/202024.893226.988717.438461.26300.27060.2860
02/01/202031.041433.403215.112055.70791.03551.1245
02/02/202037.153739.796413.817850.56670.15570.2839
02/03/202046.325149.327011.513545.39520.41450.3009
02/04/202057.232460.65459.059841.55831.61981.5653
02/05/202068.636972.51807.172440.55534.11864.1575
02/06/202086.877791.41252.485741.10784.42064.5810
Table 11

The evaluation measures of different forecasting models in the confirmed cases.

GM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
MAEsim2481.24992624.5284989.88233482.9573105.533493.9043
MAEfit17,577.931018,549.26621430.89689958.6492857.1382871.5592
MAEall5311.87765610.41681072.57254697.1496246.4593239.7146
MSEsim10,028,334.630911,366,851.60251,456,440.322019,361,149.202118,247.150014,610.4784
MSEfit336,019,338.9233373,597,128.88262,319,094.1103100,703,105.1213882,413.6768924,128.4138
MSEall71,151,647.935779,285,028.59261,618,187.907334,612,765.9370180,278.3738185,145.0913
MAPEsim80.834082.935735.993263.25847.36074.8534
MAPEfit70.915774.86176.239341.07383.38633.4346
MAPEall78.974381.421830.414359.09886.61554.5873
RMSPEsim104.2863105.877962.987963.716012.26597.1669
RMSPEfit71.959075.92566.824041.07593.61153.6842
RMSPEEall99.0321100.941156.853360.123911.16646.6542
IAsim0.94140.93480.99030.87940.99990.9999
IAfit0.86290.85360.99740.77690.99900.9990
IAall0.88120.87210.99440.83910.99940.9994
Rsim0.99320.99300.99820.98110.99970.9998
Rfit0.99640.99640.99790.99900.99940.9994
Rall0.98580.98550.99790.99030.99950.9996
Figure 4

The plots of the confirmed cases of COVID-19 of China.

The plots of the confirmed cases of COVID-19 of China. The computational results of the confirmed cases of COVID-19 of China. The APEs of different model in the confirmed cases of COVID-19 of China, (%). The evaluation measures of different forecasting models in the confirmed cases. It follows from these results that the GM(1,1) model and the DGM(1,1) model has worst performance measures, the NGM(1,1,k,c) model and the GVM(1,1) model have worse performance measures, and the new model GMQP(1,1) have good performance measures. This also demonstrates that the grey model with quadratic polynomial term is more powerful to deal with the data of the confirmed cases of COVID-19 of China.

The death cases from COVID-19 of China

This subsection discusses the death cases from COVID-19 of China by employing grey models. The raw data are collected from the website: http://www.nhc.gov.cn, and displayed in the following Tables 12, 13, 14 and Fig. 5. The first 14 observations are used to build models, and the left three observation is used to test. Similar argument is applied to derive system parameters of each model, and then the mathematical expressions are given below.
Table 12

The computational results of the death cases of COVID-19 of China.

datedataGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
01/22/202099.00009.00009.00009.000012.33939.0000
01/23/20201743.350543.52373.56623.593816.204716.6241
01/24/20202553.341353.594617.03035.024024.667626.1427
01/25/20204165.634865.995832.49837.018537.728039.5647
01/26/20205680.761581.266450.26859.795855.386057.0369
01/27/20208099.3744100.070670.683513.654477.641578.7117
01/28/2020106122.2770123.225894.136918.9982104.4945104.7470
01/29/2020132150.4579151.7389121.080926.3669135.9451135.3069
01/30/2020170185.1336186.8495152.035136.4658171.9931170.5615
01/31/2020213227.8010230.0844187.596250.1892212.6387210.6874
02/01/2020259280.3017283.3233228.450068.6193257.8819255.8677
02/02/2020304344.9022348.8812275.384292.9689307.7225306.2925
02/03/2020361424.3911429.6083329.3037124.4206362.1607362.1591
02/04/2020425522.1995529.0149391.2482163.8024421.1964423.6721
02/05/2020490642.5497651.4229462.4120211.0520484.8297491.0438
02/06/2020563790.6366802.1549544.1674264.5072553.0604564.4946
02/07/2020636972.8529987.7645638.0906320.2373625.8887644.2531
Table 13

The Errors of different model in the death cases of COVID-19 of China, (%).

dateGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
01/22/20200.00000.00000.00000.000037.10320.0000
01/23/2020155.0027156.021779.022678.85994.67842.2112
01/24/2020113.3654114.378431.878879.90421.32974.5707
01/25/202060.084960.965320.735782.88167.98043.5008
01/26/202044.217045.118710.234882.50751.09641.8516
01/27/202024.218025.088211.645682.93212.94811.6104
01/28/202015.355716.250811.191682.07711.42031.1821
01/29/202013.983314.95378.272080.02512.98872.5052
01/30/20208.90219.911510.567678.54961.17240.3303
01/31/20206.94888.020811.926776.43700.16961.0857
02/01/20208.22469.391211.795473.50600.43171.2094
02/02/202013.454714.76359.413169.41811.22450.7541
02/03/202017.559919.00518.780165.53440.32150.3211
02/04/202022.870524.47417.941661.45830.89500.3124
02/05/202031.132632.94355.630256.92821.05520.2130
02/06/202040.432842.47873.345153.01831.76550.2655
02/07/202052.964355.30890.328749.64821.58981.2977
Table 14

The evaluation measures of different forecasting models in the death cases.

GM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
MAEsim31.609733.705918.1322120.62171.92171.5865
MAEfit239.0131250.780716.1704297.73458.40713.5972
MAEall70.497974.407517.7643153.83043.13771.9635
MSEsim1518.43191737.0336427.689921,254.56805.40333.2758
MSEfit62,853.232568,996.8802373.378488,872.023375.921623.8122
MSEall13,018.707114,348.2548417.506533,932.840818.62557.1264
MAPEsim38.783639.872517.954376.46852.05051.6496
MAPEfit41.509943.57703.101353.19821.47020.5921
MAPEall39.294840.567115.169372.10531.94171.4513
RMSPEsim58.617659.306025.938876.76412.93752.0616
RMSPEfit42.462844.53013.785853.28131.50090.7745
RMSPEEall55.945156.828923.438372.93922.72651.8883
IAsim0.98190.97940.99470.79250.99991.0000
IAfit0.92710.92200.99920.42520.99980.9999
IAall0.94330.93880.99720.69760.99991.0000
Rsim0.99480.99470.99960.97290.99980.9999
Rfit0.99820.99820.99920.99990.99980.9997
Rall0.98700.98680.99860.97560.99990.9999
Figure 5

The plots of the death cases of COVID-19 of China.

The plots of the death cases of COVID-19 of China. The computational results of the death cases of COVID-19 of China. The Errors of different model in the death cases of COVID-19 of China, (%). The evaluation measures of different forecasting models in the death cases. The GM(1,1) model. The DGM(1,1) model. The NGM(1,1,k,c) model. The GVM(1,1) model. The polynomial regression model. The GMQP(1,1) model. When the specific mathematical expression of each model is derived, the computational results and error metrics are straightforward obtained, which are provided in the following Tables 12, 13, 14 and Fig. 5. The MAEsim, MAEfit, and MAEall of the GMQP(1,1) model are 1.5865%, 3.5972% and 1.9635%, the MSEsim, MSEfit, and MSEall of the GMQP(1,1) model are 3.2758%, 23.8122% and 7.1264%, the MAPEsim, MAPEfit, and MAPEall of the GMQP(1,1) model are 1.6496%, 0.5921%, 1.4513%, the RMSEsim, RMSEfit, and RMSEall of the GMQP(1,1) model are 2.0616%, 0.7745%, 1.8883%, the IAsim, IAfit, and IAall of the GMQP(1,1) model are 1.0000%, 0.9999% and 1.0000%, the Rsim, Rfit, and Rall of the GMQP(1,1) model are 0.9999%, 0.9997% and 0.9999%, respectively. Similarly, the GM(1,1) model, the DGM(1,1) model and the GVM(1,1) model have the worst computational results, the NGM(1,1,k,c) model has the worse computational results, and the GMQP(1,1) has the most computational results. It indicates that the new model has higher precision than the other forecasting models in the death cases from COVID-19 of China.

The recovered cases from COVID-19 in China

This subsection discusses the recovered cases from COVID-19 of China by employing grey models. The raw data are collected from the website: http://www.nhc.gov.cn, and displayed in the following Tables 15, 16, 17 and Fig. 6. The first 14 observations are used to build models, and the left three observation is used to test. Similar argument is applied to derive system parameters of each model, and then the mathematical expressions are given below.
Table 15

The computational results of the recovered cases of COVID-19 of China.

datedataGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
01/23/20203434.000034.000034.000034.0000103.046434.0000
01/24/20203826.296726.543923.043813.560749.704144.3017
01/25/20204935.820136.247132.231718.961417.692944.9811
01/26/20205148.792349.497444.717726.50537.012651.2829
01/27/20206066.462467.591361.685437.035717.663564.8046
01/28/202010390.531692.299684.743751.720849.645387.5974
01/29/2020124123.3176126.0400116.078872.1723102.9582122.2954
01/30/2020171167.9769172.1143158.6615100.6009177.6022172.2811
01/31/2020243228.8096235.0313216.5292140.0143273.5772241.8978
02/01/2020328311.6728320.9477295.1684194.4569390.8832336.7231
02/02/2020475424.5448438.2711402.0350269.2765529.5203463.9190
02/03/2020632578.2933598.4826547.2611371.3702689.4885632.6827
02/04/2020892787.7216817.2598744.6160509.3095870.7876854.8239
02/05/202011531072.99421116.01171012.8110693.14141073.41791145.5082
02/06/202015401461.57801523.97341377.2742933.53351297.37911524.2097
02/07/202020501990.88712081.06671872.56081239.78181542.67142015.9351
02/08/202126492711.88482841.80732545.62971616.17191809.29482652.7965
Table 16

The Errors of different model in the recovered cases of COVID-19 of China, (%).

dateGM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
01/23/20200.00000.00000.00000.0000203.07770.0000
01/24/202030.798030.147639.358564.313930.800316.5833
01/25/202026.897826.026234.221061.303363.89218.2019
01/26/20204.32882.946312.318348.028986.24970.5547
01/27/202010.770612.65222.809138.273870.56098.0076
01/28/202012.105210.388817.724549.785651.800714.9540
01/29/20200.55031.64526.388141.796516.96921.3747
01/30/20201.76790.65167.215541.16913.86090.7492
01/31/20205.83973.279310.893342.381012.58320.4536
02/01/20204.97782.150110.009640.714319.17172.6595
02/02/202010.62217.732415.361143.310211.47802.3328
02/03/20208.49795.303413.408041.23899.09630.1080
02/04/202011.69048.378916.522942.90252.37814.1677
02/05/20206.93893.208012.158639.88376.90220.6498
02/06/20205.09231.040710.566639.380915.75461.0253
02/07/20202.88361.51548.655639.522824.74771.6617
02/08/20202.37397.27853.902238.989431.69900.1433
Table 17

The evaluation measures of different forecasting models in the recovered cases.

GM(1,1)DGM(1,1)NGM(1,1,k,c)GVM(1,1)PR(2)GMQP(1,1)
MAEsim28.360818.781044.8298140.067239.73837.6964
MAEfit66.806679.9669147.8451816.5043529.884917.8839
MAEall35.569430.253464.1452266.8992131.64089.6065
MSEsim1822.7206770.27444365.663339,193.11392009.7999150.7354
MSEfit4532.950112,798.882622,883.2485696,996.3883340,450.6755474.7209
MSEall2330.88863025.63847837.7105162,531.227865,467.4640211.4826
MAPEsim10.44508.808515.260645.777129.67264.6767
MAPEfit3.44993.27827.708139.297724.06710.9435
MAPEall9.13357.771513.844644.562228.62153.9767
RMSPEsim13.546112.557118.276246.449340.41007.1431
RMSPEfit3.64614.33428.201639.298424.93651.1304
RMSPEEall12.311911.473316.852445.194837.99186.4573
IAsim0.99770.99900.99460.95300.99750.9998
IAfit0.99950.99870.99730.85870.94440.9999
IAall0.99900.99880.99650.89860.96390.9999
Rsim0.99930.99930.99930.99920.99050.9996
Rfit0.99910.99910.99910.99990.99980.9996
Rall0.99870.99850.99880.99980.98320.9999
Figure 6

The plots of the recovered cases of COVID-19 of China.

The plots of the recovered cases of COVID-19 of China. The computational results of the recovered cases of COVID-19 of China. The Errors of different model in the recovered cases of COVID-19 of China, (%). The evaluation measures of different forecasting models in the recovered cases. The GM(1,1) model. The DGM(1,1) model. The NGM(1,1,k,c) model. The GVM(1,1) model. The polynomial regression model. The GMQP(1,1) model. When the specific mathematical expression of each model is derived, the computational results and error metrics are straightforward obtained, which are provided in the following Tables 15, 16, 17 and Fig. 6, respectively. The MAEsim, MAEfit, and MAEall of the GMQP(1,1) model are 7.6964%, 17.8839% and 9.6065%, the MSEsim, MSEfit, and MSEall of the GMQP(1,1) model are 150.7354%, 474.7209% and 211.4826%, the MAPEsim, MAPEfit, and MAPEall of the GMQP(1,1) model are 4.6767%, 0.9435%, 3.9767%, the RMSEsim, RMSEfit, and RMSEall of the GMQP(1,1) model are 7.1431%, 1.1304%, 6.4573%, the IAsim, IAfit, and IAall of the GMQP(1,1) model are 0.9998%, 0.9999% and 0.9999%, the Rsim, Rfit, and Rall of the GMQP(1,1) model are 0.9996%, 0.9996% and 0.9999%, respectively. Similarly, the GVM(1,1) model has the worst computational results, the GM(1,1) model, the DGM(1,1) model and the NGM(1,1,k,c) model have the better computational results, and the GMQP(1,1) has the most best computational results. It indicates that the new model has higher precision than the other forecasting models in the recovered cases from COVID-19 of China.

Conclusion

This paper studied the grey forecasting model with quadratic polynomial term, and applied it to the confirmed cases, the death cases and the recovered cases from COVID-19 of China at the early stage. By using the grey technique and some mathematical derivations, the grey basic form, the time response function and the restored values are all systematically analyzed. With raw datasets of COVID-19 in China, we compute the simulation and fitting values by different forecasting models. It follows from the computational results, we can observed the new model has higher precision than other models. This also implied that our generalized model has applicable value in the COVID-19. In this work, the GMQP(1,1) model is an univariate grey forecasting model and some factors such as social isolation and lockdown, vaccines, active treatment cannot be considered. In addition, the integer order accumulating generated operation is used to preprocess the raw data. It is generally known that the fractional order accumulating generated operation or the new information priority to preprocess raw data can get more accurate results. Thus in the future, we will continuous consider such a model with other accumulating generated operator including new information priority, fractional accumulating generated operator. Further, other multivariate grey forecasting models can be constructed to study the COVID-19.
  22 in total

1.  Early forecasting of the potential risk zones of COVID-19 in China's megacities.

Authors:  Hongyan Ren; Lu Zhao; An Zhang; Liuyi Song; Yilan Liao; Weili Lu; Cheng Cui
Journal:  Sci Total Environ       Date:  2020-04-26       Impact factor: 7.963

2.  Time series prediction of COVID-19 by mutation rate analysis using recurrent neural network-based LSTM model.

Authors:  Refat Khan Pathan; Munmun Biswas; Mayeen Uddin Khandaker
Journal:  Chaos Solitons Fractals       Date:  2020-06-13       Impact factor: 5.944

3.  A novel grey model based on traditional Richards model and its application in COVID-19.

Authors:  Xilin Luo; Huiming Duan; Kai Xu
Journal:  Chaos Solitons Fractals       Date:  2020-11-17       Impact factor: 5.944

4.  Short-term forecast in the early stage of the COVID-19 outbreak in Italy. Application of a weighted and cumulative average daily growth rate to an exponential decay model.

Authors:  Nicola Bartolomeo; Paolo Trerotoli; Gabriella Serio
Journal:  Infect Dis Model       Date:  2020-12-30

5.  On the uncertainty of real-time predictions of epidemic growths: A COVID-19 case study for China and Italy.

Authors:  Tommaso Alberti; Davide Faranda
Journal:  Commun Nonlinear Sci Numer Simul       Date:  2020-06-01       Impact factor: 4.260

6.  Data-based analysis, modelling and forecasting of the COVID-19 outbreak.

Authors:  Cleo Anastassopoulou; Lucia Russo; Athanasios Tsakris; Constantinos Siettos
Journal:  PLoS One       Date:  2020-03-31       Impact factor: 3.240

7.  Generated time-series prediction data of COVID-19's daily infections in Brazil by using recurrent neural networks.

Authors:  Mohamed Hawas
Journal:  Data Brief       Date:  2020-08-19
View more
  4 in total

1.  Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models.

Authors:  Yanding Wang; Zehui Yan; Ding Wang; Meitao Yang; Zhiqiang Li; Xinran Gong; Di Wu; Lingling Zhai; Wenyi Zhang; Yong Wang
Journal:  BMC Infect Dis       Date:  2022-05-25       Impact factor: 3.667

2.  Forecasting the transmission trends of respiratory infectious diseases with an exposure-risk-based model at the microscopic level.

Authors:  Ziwei Cui; Ming Cai; Yao Xiao; Zheng Zhu; Mofeng Yang; Gongbo Chen
Journal:  Environ Res       Date:  2022-05-12       Impact factor: 8.431

3.  A novel grey model based on Susceptible Infected Recovered Model: A case study of COVD-19.

Authors:  Huiming Duan; Weige Nie
Journal:  Physica A       Date:  2022-05-30       Impact factor: 3.778

4.  Deathdaily: A Python Package Index for predicting the number of daily COVID-19 deaths.

Authors:  Yoshiyasu Takefuji
Journal:  Netw Model Anal Health Inform Bioinform       Date:  2022-03-20
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.