Literature DB >> 36105539

An Overview of Discrete Distributions in Modelling COVID-19 Data Sets.

Ehab M Almetwally1,2, Sanku Dey3, Saralees Nadarajah4.   

Abstract

The mathematical modeling of the coronavirus disease-19 (COVID-19) pandemic has been attempted by a large number of researchers from the very beginning of cases worldwide. The purpose of this research work is to find and classify the modelling of COVID-19 data by determining the optimal statistical modelling to evaluate the regular count of new COVID-19 fatalities, thus requiring discrete distributions. Some discrete models are checked and reviewed, such as Binomial, Poisson, Hypergeometric, discrete negative binomial, beta-binomial, Skellam, beta negative binomial, Burr, discrete Lindley, discrete alpha power inverse Lomax, discrete generalized exponential, discrete Marshall-Olkin Generalized exponential, discrete Gompertz-G-exponential, discrete Weibull, discrete inverse Weibull, exponentiated discrete Weibull, discrete Rayleigh, and new discrete Lindley. The probability mass function and the hazard rate function are addressed. Discrete models are discussed based on the maximum likelihood estimates for the parameters. A numerical analysis uses the regular count of new casualties in the countries of Angola,Ethiopia, French Guiana, El Salvador, Estonia, and Greece. The empirical findings are interpreted in-depth. © Indian Statistical Institute 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Entities:  

Keywords:  COVID-19; discrete distributions; hazard rate; maximum likelihood estimation.; survival discretization

Year:  2022        PMID: 36105539      PMCID: PMC9461386          DOI: 10.1007/s13171-022-00291-6

Source DB:  PubMed          Journal:  Sankhya Ser A        ISSN: 0976-836X


Introduction

Corona-Virus “COVID-19” was first reported in early December 2019 in Wuhan, China, and within three months spread like a pandemic around the whole globe. The World Health Organization (WHO) described COVID-19 as a pandemic on March 11, 2020. Refer to Figs. 1 and 2. Despite the drastic, large-scale containment measures implemented in most countries, these numbers rapidly increased every day—posing an unprecedented threat to the global health and economy of interconnected human societies. Countries around the world have therefore increased their efforts to decrease the COVID-19 spread rate.
Figure 1

The situation for the daily new cases over the world by the WHO Region has been shown in Figure 1

Figure 2

The situation for the daily new deaths over the world by the WHO Region has been shown in Figure 2

The situation for the daily new cases over the world by the WHO Region has been shown in Figure 1 The situation for the daily new deaths over the world by the WHO Region has been shown in Figure 2 To model daily cases and deaths in the world, there are some mathematical/statistical models in the literature which are used to describe the dynamics of the evolution of COVID-19. The comparison of the COVID-19 epidemic dynamics among different countries is of great concern. In this regard, the researchers are making their best efforts to provide medical solutions for drugs and vaccines in reducing the risk of virus spread. The study of this aspect of science requires discrete distributions. For any researcher, the first question comes to mind- Why do we need discrete distributions? We are aware that most of the current continuous distributions do not fit adequately for modeling the cases of COVID-19 in count data analysis. In the current situation, it is of great interest to study more about COVID-19 and compare different countries as many as possible. Therefore, in this article, an effort has been made to compare the COVID-19 pandemic outbreak in several countries around the world. Recently, many authors introduced different discrete distributions such as natural discrete Lindley distribution has been implemented by Al-Babtain et al. (2020a, ??) to model everyday cases and deaths in the world. Almetwally et al. (2020) introduced a discrete Marshall-Olkin generalized exponential distribution to discuss the recent Egyptian cases regularly. Elbatal et al. (2022) obtained discrete odd Perks-G class of distributions. Almetwally et al. (2022) introduced discrete Marshall-Olkin inverse Toppe-Leone distribution with application to COVID-19 data. Nagy et al. (2021) discussed discrete extended odd Weibull exponential with different applications. Gillariose et al. (2021) proposed discrete generalization of the exponential model. A new discrete distribution, called discrete generalized Lindley, was analyzed by El-Morshedy et al. (2020) to examine the counts of daily coronavirus cases in Hong Kong and new daily fatalities in Iran. Maleki et al. (2020) have used an autoregressive time series model based on normal distribution of the two-piece scale mixture to estimate the recovered and reported cases of COVID-19. The study carried out by Hasab et al. (2020) where they used the susceptible infected recovered (SIR) epidemic dynamics of the COVID-19 pandemic for modelling the novel Coronavirus epidemic in Egypt. Nesteruk (2020) and Batista (2020b) have predicted regular new COVID-19 cases in China by using the mathematical model, named SIR. The logistic growth regression model used by Batista (2020a) for estimating the final size of the coronavirus outbreak and its peak time. This research work aims to model the daily new fatalities of COVID-19 using a review of statistical models to determine the best model fitting of COVID-19 data for different countries as Angola, Ethiopia, French Guiana, El Salvador, Estonia, and Greece and aware of the risks resulting from the spread of Corona-Virus in the world. To accomplish this goal: First, we study separate models such as Poisson, geometric, negative binomial, discrete Burr, discrete Lindley, discrete alpha power inverse Lomax, discrete generalized exponential, discrete Marshall-Olkin Generalized exponential, discrete Gompertz-G-exponential, discrete Weibull, discrete inverse Weibull, exponentiated discrete Weibull, discrete Rayleigh, and new discrete Lindley. Second, in some countries such as Angola, Ethiopia, French Guiana, El Salvador, Estonia, and Greece, we define the best discrete models that match different regular Coronavirus death datasets. The remainder of the paper is structured as follows. Discrete models are analyzed in Section 2. In Section 3, review for discrete models has been done based on survival discretization method. We discuss the parameter estimation of the discrete models in Section 4. Section 5 presents the new regular death of COVID-19 in the case of Angola, El Salvador, Estonia, and Greece to validate the use of models in suitable lifetime count results. Lastly, in Section 6, conclusions are made.

Review for Classical Discrete Models

In this Section, survival discretization method and some discrete distributions have been reviewed, such as Binomial, Poisson, Hypergeometric, discrete Burr, discrete Lindley, discrete alpha power inverse Lomax,discrete generalized exponential, discrete Marshall-Olkin Generalized exponential, discrete Gompertz-G-exponential, discrete Weibull, discrete inverse Weibull, exponentiated discrete Weibull, discrete Rayleigh, new discrete Lindley, negative binomial, beta-binomial, Skellam, beta negative binomial and Conway–Maxwell–Poisson distribution. We don’t use Logarithmic, Borel, discrete compound Poisson, Boltzmann, Benford’s law, Yule–Simon, Zipf’s law, and Zeta distribution because the range of x doesn’t support 0.

Binomial Distribution

The binomial distribution (bionm) can be defined, using the binomial expansion as the distribution of a random variable X for which where p + q = 1, 0 < p < 1, and n is a positive integer. Ifn = 1, the distribution is called the distribution of Bernoulli.

Poisson Distribution

If a random variable X has a Poisson (Pois) distribution with parameter 𝜃,then its PMF is given by where 𝜃 > 0. For more information, see Johnson et al. (2005) chapter 4.

Hypergeometric Distribution

In a sample of n balls drawn without substitution from a population of (N) balls, (N𝜃) of which are white and (N − N𝜃) are black. The PMF of hypergeometric distribution is given by For more information, see Johnson et al. (2005) chapter 6.

Waring Distribution

The distribution of Waring is a generalization of the distribution of Yule, see Johnson et al. (2005). Taking Pr[X = x] , Waring expansion proportional to the (x + 1) term in the sequence. where α,𝜃 > 0. If α = 1 then, Yule distribution is the special case of Waring distribution.

Yule–Simon Distribution

The Yule–Simon distribution is a discrete probability distribution named after Udny Yule and Herbert A. Simon in probability and statistics. Originally, Simon named it the distribution of Yule by Simon (1955). The Yule-Simon (𝜃) PMF is where 𝜃 > 0. The CDF of Yule-Simon distribution is The hr function of the Yule-Simon distribution is given by

Discrete Rectangular Distribution

In its most general form, the discrete rectangular distribution (sometimes called the discrete uniform distribution) is defined by

Distribution of Leads

The distribution of leads in coin tossing (Johnson et al. 2005) has the PMF

Beta-binomial Distribution

The beta-binomial (Bbinom) distribution in probability theory and statistics is a family of discrete probability distributions on finite support of non-negative integers occurring when the probability of success is either unknown or random in each of a fixed or known number of Bernoulli trials, is discussed by Griffiths (1973). The PMF of Bbinom is given by the CDF of Bbinom is given by where FGH (a,b,x) is the generalized hypergeometric function.

Negative Binomial Distribution

The negative binomial(Nbionm) distribution in probability theory and statistics is a discrete probability distribution that models the number of successes before a given non-random number of failures (denoted by r) occur in a series of independent and identically distributed Bernoulli trials. The PMF of Nbinom is given by where r is a number of failures until the experiment is stopped (integer, but the definition can also be extended to real numbers) and p is success probability in each experiment. For more information, see Johnson et al. (2005) chapter 5.

Geometric Distribution

The geometric distribution in probability theory and statistics is the probability distribution of the number X − 1 of failures before the first success, supported by the set{0,1,2,... }. The PMF is given as where0 < p < 1.

Beta Negative Binomial Distribution

A beta negative binomial (BNbinom) distribution in probability theory is the probability distribution of a discrete random variable X equal to the number of failures needed to achieve achievements in a series of independent Bernoulli trials in which the probability p of success in each trial, though constant in any given experiment, is itself a random variable following a beta distribution. Wang (2011) discussed this distribution as a compound probability distribution. The PMF of BNbinom is given by where x = 0, 1, 2,… and r,α,β > 0.

Logarithmic Distribution

A random variable X is said to have a logarithmic distribution with parameter 𝜃 if its PMF is in the form For more information of logarithmic distribution, see Johnson et al. (2005) chapter 7.

Skellam Distribution

Let μ1,μ2 > 0, Skellam (1946) introduced the Skellam distribution (distribution of the difference between two independent Poisson random variables) and is denoted by Skellam (μ1,μ2) with PMF is given by wherex = …,− 2,− 1, 0, 1, 2, … Ik (2μ1 μ2) is the modified Bessel function of the first kind.

Conway–Maxwell–Poisson Distribution

Shmueli et al. (2005) discussed the Conway–Maxwell–Poisson (CMP) distribution with PMF as where Z (λ,𝜃) = ∑ j= 0∞ λ (j!) is normalization constant.

Review for Discrete Models Based on Survival Discretization Method

In the statistics literature, sundry methods are available to obtain a discrete distribution from a continuous one. The most commonly used technique to generate discrete distribution is called a survival discretization method, it requires the existence of cumulative distribution function (CDF), survival function should be continuous and non-negative and times are divided into unit intervals. The PMF of discrete distribution is defined in Roy (2003) as wherex = 0,1,2,…,S (x) = P (X ≥ x) = F (x; Θ), F (x; Θ) is a CDF of continuous distribution, and Θ is a vector of parameters. The random variable X is said to have the discrete distribution if its CDF is given by The hazard rate is given by hr (x) = P (X=x) S(x). The reversed failure rate of discrete distribution is given as

Discrete Burr Distribution

The PMF of the discrete Burr (DB) distribution has been defined by Krishna and Pundir (2009) is given by where x = 0,1,2,…, α > 0,0 < 𝜃 < 1, the CDF of the DBu distribution is The hazard rate (hr) of the discrete Burr distribution is

Discrete Lindley Distribution

The PMF of the discrete Lindley (DLi) distribution has been defined by Gómez-Déniz and Calderín-Ojeda (2011) is given as follows where x = 0,1,2,…, 0 < 𝜃 < 1. The CDF of the DLi distribution is The hazard rate of the DLi distribution is

Discrete Alpha Power Inverse Lomax

The discrete alpha power inverse Lomax (DAPIL) distribution is introduced by Almetwally and Ibrahim (2020). The PMF and the CDF of the DAPIL distribution are respectively given by The hr function of the DAPIL distribution is given by

Discrete Generalized Exponential Distribution

The PMF of the discrete generalized exponential (DGE) distribution has been defined by Nekoukhou et al. (2013) is given as follows where x = 0,1,2,…,α > 0, 0 < 𝜃 < 1, when 𝜃 = e−;λ > 0, the CDF of the DGEx distribution is The hazard rate of the DGEx distribution is

The DMOGEx Distribution

The discrete Marshall-Olkin Generalized exponential (DMOGEx) distribution is introduced by Almetwally et al. (2020). The PMF and the CDF of the DMOGEx distribution are respectively given by and where 0 < ρ < 1, λ,𝜃 > 0. The hr function of the DMOGEx distribution is given by

Discrete Gompertz-G Exponential

The discrete Gompertz-G- exponential (DGzEx) distribution was introduced by Eliwa et al. (2020). The PMF and the CDF of the DGzEx distribution are respectively given by and The hr function of the DGzEx distribution is given by

Discrete Weibull

A discrete Weibull (DW) distribution was introduced by ? Nakagawa-and-Osaki:1975 (), and is defined by the cumulative distribution function (CDF)as: The DW distribution has PMF: and the hazard rate of DW is

Discrete Inverse Weibull

A discrete inverse Weibull (DIW) distribution was introduced by Jazi et al. (2010), and is defined by the CDF: The DIW distribution has PMF: and the hazard rate of DIW is

Exponentiated Discrete Weibull

The exponentiated discrete Weibull (EDW) distribution was introduced by Nekoukhou and Bidram (2015), and is defined by the CDF: The DIW distribution has PMF: and the hazard rate of DIW is The discrete Rayleigh (DR) distribution was introduced by Roy (2004), and can be defined when α = 2 and β = 1as follows: The DR distribution has PMF: and the hazard rate of DR is

New Discrete Lindley

The new discrete Lindley (NDL) distribution was introduced by Al-Babtain et al. (2020a, ??), and is defined by the CDF: The NDL distribution has PMF: and the hazard rate of NDL is

Parameter Estimation of Discrete Model

In this section, we estimate the parameters of the models using a maximum likelihood method. It is noted that the maximum likelihood method is also used to estimate unknown parameters of a statistical model because maximum likelihood estimates (MLEs) have several desirable properties; For example, they are asymptotically unbiased, symmetrical, consistent, asymptomatically normally distributed, etc. Let x1, x2,…, x be a random sample of size n from the discrete distribution, and then the log-likelihood function is given by where Θ = (Θ1,…,Θ), k is a length of Θ. The MLEs can be obtained by partially first derivatives of the log-likelihood function and equal to zero provide the MLEs of Θ, say Θ̂ = (Θ̂1,…,Θ̂), then using a computational process such as the k variable Newton-Raphson Algorithm are given by the solutions of the equations. For interval estimation and hypothesis tests on the model parameters, we require the information matrix. The k × k observed information matrix is One can use the normal distribution of Θ̂ to construct approximate confidence interval regions for some parameters. Indeed, an asymptotic 100(1 − ξ) confidence interval for each parameterΘ;j = 1,…,k, is given by where ℶ ^ denotes the (i,i) diagonal element of I− 1 (Θ̂) and z is the (1 − ξ 2) thquantile of the standard normal distribution.

Applications of Real Data

In this section, we illustrate the empirical importance of the discrete distributions, such as DB, DLi, Binom, Pois, DR, DGE, Geometric, DW, DIW, DE, NDL, DGzEx, DMOGE, DAPLo, and EDW distributions using four applications to real data sets. The fitted models are compared using some criteria; namely, Akaike information criterion (AIC), corrected AIC (CAIC), Hannan-Quinn information criterion (HQIC), Chi-square (X2) with a degree of freedom and its p-value. and

African Continent

Angola

This data represents the daily new deaths of 51 days from 10 October to 29 November 2020 belong to Angola country (see World Health Organization). The MLEs and the goodness of fit statistics are reported in Tables 1, 2 and 3.
Table 1

The goodness of fit and estimation models with one parameter for Dataset of Angola

xFreq.DLbinompoisDRGeomDENDLNbinom
0510.38513.71963.47483.640913.835313.837711.17283.7451
11310.38329.49299.33449.436110.08210.083210.21929.5336
288.661212.351112.537411.73717.3477.34738.308412.3726
376.620110.919411.226310.5945.35395.35386.332710.9106
484.80527.37687.53927.58623.90153.90124.63387.3521
563.37074.06064.05054.46112.84312.84273.29654.0368
632.30781.89661.81352.19122.07182.07142.29721.8806
711.55160.77280.69590.90771.50981.50931.57590.7644
𝜃 0.59280.952.68630.92860.27130.72870.39020.9501
χ2 7.47316.30937.16925.075113.898613.90358.56156.3249
P-Value0.38130.50410.41150.65080.0530.05290.28570.5024
AIC212.2015205.5942206.245204.4382221.7812221.7812214.0158207.5957
CAIC212.2832205.6759206.3267204.5199221.8628221.8628214.0975207.8457
BIC214.1334207.526208.1769206.37223.713223.713215.9477211.4593
HQIC212.9397206.3324206.9832205.1764222.5194222.5194214.7541209.0721
Table 2

The goodness of fit and estimation models with two parameters for Dataset of Angola

xFreq.DBDGEDWDIWBbinomBNbinomskellam
056.78084.70334.96713.38969.69025.45473.5321
11318.900411.920610.218116.93357.15346.80339.2664
288.339911.637911.220410.95315.74996.669412.4241
374.23618.599.52656.04584.73855.961811.1673
482.54745.63256.77483.58613.94745.08277.5452
561.69923.4844.1722.29383.30554.21434.0829
631.21392.08952.26481.5592.77453.43331.8423
710.91041.2321.09591.11132.33042.76480.7128
α 4.85970.57660.90260.06650.845251.13072.7117
𝜃 0.8142.77311.78641.55918.38950.95010.0251
χ2 24.86124.58213.556313.326810.05516.32496.9532
P-Value0.00080.71080.82920.06450.18550.50240.4337
AIC233.0403208.2782205.5087222.5174219.2958207.5957208.2394
CAIC233.2903208.5282205.7587222.7674216.9786207.8457208.4894
BIC236.9039212.1418209.3723226.381220.5679211.4593212.1031
HQIC234.5167209.7546206.9851223.9938219.5569209.0721209.7159
Table 3

The goodness of fit and estimation models with three parameters for Dataset of Angola

xFreq.DGzExDMOGEDAPLEDW
057.02375.39434.41786.0749
1138.39229.718411.18269.0453
289.291511.036711.80779.9774
379.23099.60269.10199.4497
487.86526.72415.98947.6045
565.39414.03693.61995.0134
632.72622.20482.09322.5727
710.8991.14241.18710.9666
α 0.89451.656703.0214
β 1.10540.49039.83330.4415
𝜃 0.3234.11680.09980.9919
χ2 3.92664.06034.54843.2125
P-Value0.78820.77280.71490.8647
AIC206.4348208.7333210.1451206.0902
CAIC206.9454209.2439210.6558206.6009
BIC212.2303214.5287215.9406211.8857
HQIC208.6494210.9479212.3598208.3048
The goodness of fit and estimation models with one parameter for Dataset of Angola The goodness of fit and estimation models with two parameters for Dataset of Angola The goodness of fit and estimation models with three parameters for Dataset of Angola From Tables 1, 2 and 3, it is evident that all distributions are fitted and work quite well for analyzing these data except for the DB distribution. However, we always search for the best model to get the best evaluation of the data, and therefore, using AIC, BIC, CAIC, HQIC, χ2 and p-values, we can say that the DMKEx model provides the best fit among all the tested models because it has the largest p-value and the smallest values of AIC, CAIC, BIC, HQIC and χ2 statistics. Figure 1 supports the results of Tables 1, 2 and 3. The fitted PMFs for Dataset of Angola

Ethiopia

This data represents the daily new deaths of 68 days from 1 April to 7 June 2020 belong to Ethiopia country (see World Health Organization). The MLEs and the goodness of fit statistics are reported in Tables 4, 5, and 6.
Table 4

The goodness of fit and estimation models with one parameter for Dataset of Ethiopia

ValueCountDLbinomPoisDRGeomDENDLNbinom
05351.901550.662950.672842.435752.545452.548652.145348.4267
11212.890714.878514.903824.20611.942211.940412.521213.9393
212.62042.21692.19171.34812.71412.71322.67254.0123
320.48510.22340.21490.01020.61690.61650.53481.1549
𝜃 0.14250.99570.29410.37590.77270.22720.83990.7122
χ2 5.808115.457216.1469397.42954.17654.17895.08733.5538
P-Value0.12130.00150.001100.2430.24280.16550.3138
AIC96.832899.387299.5043119.835796.328996.328996.604998.2034
CAIC96.893499.447899.565119.896396.389596.389596.665598.3881
BIC99.0523101.6067101.7239122.055298.548498.548498.8244102.6425
HQIC97.7122100.2666100.3838120.715197.208497.208497.484399.9623
Table 5

The goodness of fit and estimation models with two parameters for Dataset of Ethiopia

ValueCountDBDGEDWDIWBbinomBNbinomskellam
05353.035553.056353.107152.990653.20378.54150.6655
11211.692411.175311.14811.99847.28613.695214.9089
212.21842.80522.75721.88513.176713.78352.1935
320.61540.71620.71920.56891.695411.16080.2152
α 1.59060.25660.2190.77930.1855100.07410.2943
𝜃 0.11260.83670.93292.461225.08614.09650.0001
χ2 3.76573.513.45113.9784.316167.784916.12652
P-Value0.28790.31950.32720.26390.229300.0011
AIC98.265398.232698.207698.374598.6101109.5831101.5044
CAIC98.4598.417298.392398.559198.821108.5832101.689
BIC102.7044102.6716102.6467102.813597.1454110.8358105.9434
HQIC100.024299.991599.9665100.1333100.1186105.8331103.2632
Table 6

The goodness of fit and estimation models with three parameters for Data set of Ethiopia

ValueCountDGzExDMOGEDAPLEDW
05353.051953.020752.958853.0357
11211.301911.703711.878111.4966
212.66632.24822.11592.4583
320.69170.66640.57470.6694
α 0.38513.17470.00010.6074
β 1.64480.37391.51394.551
𝜃 0.04330.08260.13810.0531
χ2 3.54293.34974.09243.5125
P-Value0.31520.34080.25170.3191
AIC100.212299.9264100.4083100.1279
CAIC100.5872100.3014100.7833100.5029
BIC106.8707106.5849107.0668106.7864
HQIC102.8505102.5647103.0466102.7662
The goodness of fit and estimation models with one parameter for Dataset of Ethiopia The goodness of fit and estimation models with two parameters for Dataset of Ethiopia The goodness of fit and estimation models with three parameters for Data set of Ethiopia From Tables 4, 5, and 6, it is evident that all distributions are fitted and works quite well for analyzing these data except for the DR, binomial, Poisson, Skellam, and BNbinom distributions. However, we always search for the best model to get the best evaluation by using AIC, BIC, CAIC, HQIC, χ2, and p-values. Figure 2 supports the results of Tables 4, 5 and 6. The fitted PMFs for Dataset of Ethiopia

El Salvador

This data represents the daily new deaths of 81 days from 1 April to 20 June 2020 belong to El Salvador country (see World Health Organization). The MLEs and the goodness of fit statistics are reported in Tables 7, 8 and 9.
Table 7

The goodness of fit and estimation models with one parameter for Dataset of El Salvador

xFreq.DLbinompoisDRGeomDENDLNbinom
03435.894428.20828.014618.500839.287439.291336.89828.2086
12522.758529.562129.743833.788620.231820.23221.888129.5622
21111.963615.681915.789820.858410.418810.417911.541515.6815
365.7515.61355.58816.57365.36545.36445.70545.6133
442.62291.52521.48331.15462.7632.76222.70761.5251
511.15550.33550.3150.11681.42291.42231.24920.3354
𝜃 0.37190.98711.06170.77160.4850.51490.60450.9871
χ2 1.13198.64169.271433.66862.54522.54711.34758.642
P-Value0.95120.12420.098700.76970.76940.930.1242
AIC230.5771235.0331235.4473253.2005233.3614233.3614231.1279237.0331
CAIC230.6278235.0837235.4979253.2511233.4121233.4121231.1785237.1869
BIC232.9716237.4275237.8417255.595235.7559235.7559233.5223241.822
HQIC231.5378235.9938236.408254.1612234.3221234.3221232.0885238.9545
Table 8

The goodness of fit and estimation models with two parameters for Dataset of El Salvador

xFreq.DBDGEDWDIWBbinomBNbinomskellam
03434.403633.838833.709533.079540.05020.934928.0132
12528.005225.229724.660329.501616.45093.026929.7437
2119.33912.253912.83158.93159.16635.626215.7905
363.79075.48735.85213.68575.53867.88295.5886
441.86682.38972.44651.86773.46959.25241.4835
511.04961.02930.95711.07862.21569.6080.315
α 2.41360.42740.58380.40840.580488.71661.0618
𝜃 0.45041.56561.24461.795534.45667.11570.00001
χ2 4.13371.24571.24874.80495.9714578.71719.2705
P-Value0.53030.94040.94010.44010.3090.00020.0988
AIC238.4529232.4455232.0191239.7789242.0565246.0948237.4473
CAIC238.6068232.5994232.1729239.9327241.9607246.466237.6011
BIC243.2418237.2344236.808244.5678248.2354251.5662242.2362
HQIC240.3743234.3669233.9405241.7002249.4564253.6535239.3687
Table 9

The goodness of fit and estimation models with three parameters for Dataset of El Salvador

xFreq.DGzExDMOGEDAPLEDW
03434.454534.042234.145534.3273
12522.817724.166925.247222.9832
21113.271412.956912.029813.3583
366.59655.83055.27896.4936
442.71122.41422.30422.6292
510.88560.96491.03010.886
α 0.6610.939901.7099
β 1.21160.390312.30950.542
𝜃 0.16152.32930.24180.7948
χ2 1.27981.35691.411.3566
P-Value0.9370.9290.92320.929
AIC233.5703234.2826234.9581233.7504
CAIC233.882234.5943235.2698234.0621
BIC240.7537241.4659242.1415240.9337
HQIC236.4524237.1646237.8402236.6324
The goodness of fit and estimation models with one parameter for Dataset of El Salvador The goodness of fit and estimation models with two parameters for Dataset of El Salvador The goodness of fit and estimation models with three parameters for Dataset of El Salvador From Tables 7, 8 and 9, it is evident that all distributions are Fitted and work immensely well for analyzing these data except for the BNbionm, and DR distribution. However, we always search for the best model to get the best evaluation byusing AIC, BIC, CAIC, HQIC, χ2, and p-values. Figure 3 supports the results of Tables 7, 8 and 9.
Figure 3

The fitted PMFs for Dataset of Angola

French Guiana

This data represents the daily new deaths of 153 days from 1 June to 31 October 2020 belong to French Guiana country (see World Health Organization). The MLEs and the goodness of fit statistics are reported in Tables 10, 11 and 12.
Table 10

The goodness of fit and estimation models with one parameter for Dataset of French Guiana

ValueCountDLDbinomDPDRGeomDENDLNbinom
0102102.892595.863797.461476.8051105.446105.4415103.772495.8073
13936.075644.748943.953366.784132.773732.775434.853339.9856
2710.401210.51269.9119.122610.186410.187910.405312.5161
342.73521.65711.48990.28613.1663.16682.91233.4824
410.68160.19720.1680.00220.9840.98440.78250.9084
𝜃 0.20280.99690.4510.4980.68920.31080.77610.7913
χ2 2.08748.88499.973564.01512.50362.50292.10052.9355
P-Value0.71970.0640.040900.6440.64410.71730.5687
AIC276.2944280.1661280.288319.2933277.1681277.1681276.4623278.2455
CAIC276.3208280.2907280.3145319.3198277.1945277.1945276.4888278.3255
BIC279.3248283.2946283.3184322.3237280.1985280.1985279.4927284.3064
HQIC277.5254281.4952281.519320.5243278.3991278.3991277.6933280.7075
Table 11

The goodness of fit and estimation models with two parameters for Dataset of French Guiana

ValueCountDBDGEDWDIWBbinomBNbinomskellam
0102102.0825102.176102.491101.8402106.249390.780697.4627
13939.205437.343436.6340.16421.408137.398243.9525
277.860310.010410.35756.82510.065214.96499.9106
342.20692.5832.67542.09415.62515.94481.4897
410.81240.66070.65120.87063.38952.35750.1679
α 2.01720.25510.33010.66560.2721240.0330.4509
𝜃 0.20451.37121.11472.448354.63571.06970.0002
χ2 1.58231.92742.08381.768616.7327.02439.9743
P-Value0.8120.74910.72040.77820.00220.13460.0409
AIC278.7042278.082278.2295279.5632288.2087297.1347282.288
CAIC278.7842278.162278.3095279.6432288.2088297.1348282.368
BIC284.7651284.1429284.2904285.6241288.2147307.1408288.3489
HQIC281.1662280.544280.6915282.0252285.2111289.1372284.75
Table 12

The goodness of fit and estimation models with three parameters for Dataset of French Guiana

ValueCountDGzExDMOGEDAPLEDW
0102103.1603102.0371102.0898102.089
13935.372338.572738.516738.1064
2710.76928.68168.92119.2162
342.87652.51612.30312.4825
410.66530.80.70490.7394
α 0.51333.061100.7349
β 1.6010.3334.28593.599
𝜃 0.06110.20350.20420.1063
χ2 2.3091.25141.78681.5687
P-Value0.67910.86960.77490.8144
AIC280.4584279.6547280.2424279.9279
CAIC280.6195279.8158280.4035280.089
BIC289.5497288.7461289.3337289.0192
HQIC284.1514283.3478283.9355283.621
The goodness of fit and estimation models with one parameter for Dataset of French Guiana The goodness of fit and estimation models with two parameters for Dataset of French Guiana The goodness of fit and estimation models with three parameters for Dataset of French Guiana From Tables 10, 11 and 12, it is evident that all distributions are fitted and work immensely well for analyzing these data except for the DR and skellam distributions. However, we always search for the best model to get the best evaluation of the data, and therefore, using AIC, BIC, CAIC, HQIC, X2, and p-values, we can say that the DL in Table 7, DGE in Table 8, and DMOGE in Table 9 model provides the best fit among all the tested models because it has the largest p-value and the smallest values of AIC, CAIC, BIC, HQIC and χ2 statistics. Figure 4 supports the results of Tables 10, 11 and 12.
Figure 4

The fitted PMFs for Dataset of Ethiopia

Europe Continent

Estonia

This data represents the daily new deaths of 81 days from 1 April to 20 May 2020 belong to Estonia country (see World Health Organization). The MLEs and the goodness of fit statistics are reported in Tables 13, 14 and 15.
Table 13

The goodness of fit and estimation models with one parameter for Dataset of Estonia

xFreq.DLBinompoisDRGeomDENDLNbinom
02020.41315.26815.0589.51622.72722.72521.0915.278
11513.9117.89918.07118.99412.39712.39613.39317.901
267.86610.70110.84414.0116.7626.7627.5610.697
354.0694.3494.3385.7723.6883.6894.0014.345
431.9981.3511.3011.4512.0122.0122.0331.349
500.9470.3420.3120.231.0971.0981.0040.341
610.4380.0740.0620.0230.5990.5990.4860.073
𝜃 0.40.9771.20.810.4550.5450.5770.977
χ2 2.89618.12321.00159.6923.2213.2212.80118.156
P-Value0.8220.0060.00200.7810.7810.8330.006
AIC152.29157.901158.584170.826153.582153.582152.457159.902
CAIC152.373157.985158.667170.91153.665153.665152.541160.157
BIC154.202159.814160.496172.738155.494155.494154.369163.726
HQIC153.018158.63159.312171.554154.31154.31153.185161.358
Table 14

The goodness of fit and estimation models with two parameters for Dataset of Estonia

xFreq.DBDGEDWDIWBbinomBNbinomskellam
02020.313719.94619.890819.377826.44321.180315.0634
11516.758914.683614.364317.748810.125510.646118.0724
266.05147.80548.04255.85755.39826.324410.8412
352.61333.8974.10762.55323.1323.96734.3356
431.34761.90121.97811.34451.88362.56261.3004
500.78560.91830.91170.79951.1531.68780.312
610.49920.44160.40570.51690.71211.1280.0624
α 2.33350.47890.60220.38760.554923.00721.1998
𝜃 0.47141.40981.18791.670923.45530.74140
χ2 5.4452.9693.0325.7996.843.58821.021
P-Value0.4880.8130.8050.4460.3360.7320.002
AIC158.573154.399154.216159.305160.349153.155160.584
CAIC158.828154.655154.472159.56160.559154.059160.839
BIC162.397158.223158.04163.129165.287157.388164.408
HQIC160.029155.855155.673160.761162.71155.095162.04
Table 15

The goodness of fit and estimation models with three parameters for Dataset of Estonia

xFreq.DGzExDMOGEDAPLEDW
02020.373420.049519.587920.1969
11513.366714.147915.653213.6021
268.08648.08527.75028.1771
354.45844.10153.5744.3901
432.21061.95581.66772.1237
500.97070.90510.80910.932
610.3710.41310.41150.3734
α 0.67581.023101.4865
β 1.24620.45118.57690.6436
𝜃 0.10961.76320.22050.7555
χ2 3.11863.05913.66853.1415
P-Value0.79380.80140.72140.7909
AIC156.043156.336157.083156.116
CAIC156.565156.858157.605156.638
BIC161.779162.072162.819161.852
HQIC158.228158.521159.267158.301
The goodness of fit and estimation models with one parameter for Dataset of Estonia The goodness of fit and estimation models with two parameters for Dataset of Estonia The goodness of fit and estimation models with three parameters for Dataset of Estonia From Tables 13, 14, and 15, it is evident that all distributions are Fitted and work quite well for analyzing these data except for the Binom, Pois, and DR distributions. However, we always search for the best model to get the best evaluation of the data, and therefore, using AIC, BIC, CAIC, HQIC, X2, and p-values, we can say that the DL model provides the best fit among all the tested models because it has the smallest values of AIC, CAIC, BIC, HQIC, and χ2 statistics, as well as having the highest p-value. Figure 5 supports the results of Tables 13, 14 and 15.
Figure 5

The fitted PMFs for Dataset of El Salvador

The fitted PMFs for Dataset of El Salvador

Greece

This data represents the daily new deaths of 111 days from 12 March to 30 June 2020 belong to Greece country (see World Health Organization). The MLEs and the goodness of fit statistics are reported in Tables 16, 17 and 18.
Table 16

The goodness of fit and estimation models with one parameter for Dataset of Greece

xFreq.DLBinompoisDRGeomeDENDLNbinom
03934.353520.12119.860112.202340.798140.796836.078620.12694
12628.329834.09934.175629.132125.802725.802427.464934.10319
21719.439929.15429.40530.748816.318916.31918.584629.15261
3912.212716.765816.866921.69410.320910.321111.789616.76212
467.28337.29527.256211.18456.52756.52777.17997.292363
574.19682.56172.49734.36124.12834.12854.25112.560298
662.360.75620.71621.30812.61092.61112.46570.755599
701.30310.1930.17610.30471.65131.65141.40770.192785
800.70940.04350.03790.05551.04441.04450.79380.043407
910.38190.00880.00720.00790.66050.66060.44310.008761
𝜃 0.48680.98471.72080.89010.36760.63250.49250.984735
χ2 12.6465184.8257212.9279218.33499.47799.476810.9923184.9933
P-Value0.17930000.39440.39450.27624.59E-35
AIC399.5498440.2502442.207462.0504399.2138399.2138398.6729442.2502
CAIC399.5865440.2869442.2437462.0871399.2505399.2505398.7096442.3614
BIC402.2594442.9597444.9165464.7599401.9233401.9233401.3824447.6693
HQIC400.649441.3494443.3061463.1496400.3129400.3129399.7721444.4486
Table 17

The goodness of fit and estimation models with two parameters for Dataset of Greece

xFreq.DBDGEDWDIWBbinomBNbinomskellam
03939.972838.433837.669936.667144.7225019.6302
12633.012327.135227.173935.506723.43250.000234.0085
21714.343217.284317.685314.477814.4720.001429.4593
397.181710.79111.14647.25019.37160.005717.0124
464.13656.68066.89134.21266.20010.0187.3683
572.63074.11834.20282.70154.14920.04692.5531
661.79542.53282.53621.85612.79510.10590.7372
701.2911.55561.51731.34171.89010.21230.1824
800.9660.95470.90111.00811.28070.38560.0395
910.7460.58560.53180.7810.86870.64390.0076
α 2.0970.6130.66060.33030.7334131.2061.7325
𝜃 0.52511.11731.08181.363645.361827.20010.00003
χ2 21.52479.901910.003521.504410.1022227.2067205.3209
P-Value0.01050.35850.35020.01060.342300
AIC416.9434400.8569400.5058418.3186401.0155438.4688444.2158
CAIC417.0545400.968400.6169418.4298401.027438.4569444.3269
BIC422.3625406.276405.9249423.7377406.0137432.5658449.6349
HQIC419.1418403.0553402.7042420.517402.9877439.1251446.4142
Table 18

The goodness of fit and estimation models with three parameters for Dataset of Greece

xFreq.DGzExDMOGEDAPLEDW
03937.141839.100438.256139.6469
12626.213524.51828.749823.2576
21717.91817.657317.39416.952
3911.833311.776510.234412.0579
467.53087.38526.05698.1187
574.60564.44513.64845.1014
662.69852.60832.24692.963
701.511.50741.41671.5797
800.80410.86360.91450.7693
910.40590.49220.60390.3412
α 0.75640.562101.893
β 1.4060.566116.77260.3775
𝜃 0.05213.07010.29290.9346
χ2 9.56839.741912.19029.085
P-Value0.38660.37180.20280.4295
AIC401.4063402.2136405.1989400.9745
CAIC401.6306402.4379405.4232401.1988
BIC409.5349410.3422413.3275409.1031
HQIC404.7038405.5111408.4964404.272
The goodness of fit and estimation models with one parameter for Dataset of Greece The goodness of fit and estimation models with two parameters for Dataset of Greece The goodness of fit and estimation models with three parameters for Dataset of Greece From Tables 16, 17 and 18, it is evident that all distributions are Fitted and work quite well for analyzing these data except for the DB, Binom, Pois, DIW, and DR distribution. However, we always search for the best model to get the best evaluation of the data, and therefore, concerning the AIC, BIC, CAIC, HQIC, χ2and p-values, we can say that the DE model provides the best fit among all the tested models because it has the smallest values of AIC, CAIC, BIC, HQIC and χ2statistics, as well as having the highest p-value. Figures 6, 7 and 8 support the results of Tables 16, 17 and 18.
Figure 6

The fitted PMFs for Dataset of Guiana

Figure 7

The fitted PMFs for Dataset of Estonia

Figure 8

The fitted PMFs for Dataset of Greece

The fitted PMFs for Dataset of Guiana The fitted PMFs for Dataset of Estonia The fitted PMFs for Dataset of Greece

Concluding Remarks

In this article, we use 8 discrete distributions with one parameter, 7 discrete distributions with two parameters, and 4 discrete distributions with three parameters to fit and determine the best model of daily Coronavirus deaths in some countries, such as Angola,Ethiopia, French Guiana, El Salvador, Estonia, and Greece. In the case of discrete distributions with one parameter, we discussed DL, binomial, Poisson, DR, Geometric, DE, Nbinom, and NDL distributions. In the case of discrete distributions with two parameters, we discussed DB, DGE, DW, DIW, Bbinom, BNbinom, and skellam distributions. In the case of discrete distributions with three parameters, we discussed DGzEx, DMOGE, DAPL, and EDW distributions. A review of some important discrete distributions has been provided as DB, DL, DMOGE, DGE, DAPL, DR, DE, Geometric, Binomial, NDL, DGzEx, and EDW distribution. The maximum likelihood estimation method is discussed to estimate the parameters of the discrete distributions. We prove empirically that the discrete models fit different datasets of daily Coronavirus deaths in some countries as Angola, Ethiopia, French Guiana, El Salvador, Estonia, and Greece. DW and DB reveal its superiority over other competitive models for the analysis of daily deaths of the COVID-19 in the case of Angola, Ethiopia, French Guiana, El Salvador, Estonia, and Greece.
  6 in total

1.  The frequency distribution of the difference between two Poisson variates belonging to different populations.

Authors:  J G SKELLAM
Journal:  J R Stat Soc Ser A       Date:  1946

2.  Maximum likelihood estimation for the beta-binomial distribution and an application to the household distribution of the total number of cases of a disease.

Authors:  D A Griffiths
Journal:  Biometrics       Date:  1973-12       Impact factor: 2.571

3.  A new statistical approach to model the counts of novel coronavirus cases.

Authors:  M El-Morshedy; Emrah Altun; M S Eliwa
Journal:  Math Sci (Karaj)       Date:  2021-03-16

4.  Time series modelling to forecast the confirmed and recovered cases of COVID-19.

Authors:  Mohsen Maleki; Mohammad Reza Mahmoudi; Darren Wraith; Kim-Hung Pho
Journal:  Travel Med Infect Dis       Date:  2020-05-13       Impact factor: 6.211

5.  A New Discrete Analog of the Continuous Lindley Distribution, with Reliability Applications.

Authors:  Abdulhakim A Al-Babtain; Abdul Hadi N Ahmed; Ahmed Z Afify
Journal:  Entropy (Basel)       Date:  2020-05-28       Impact factor: 2.524

6.  The new discrete distribution with application to COVID-19 Data.

Authors:  Ehab M Almetwally; Doaa A Abdo; E H Hafez; Taghreed M Jawa; Neveen Sayed-Ahmed; Hisham M Almongy
Journal:  Results Phys       Date:  2021-12-05       Impact factor: 4.476

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.