Literature DB >> 34900522

The new discrete distribution with application to COVID-19 Data.

Ehab M Almetwally1,2, Doaa A Abdo3, E H Hafez4, Taghreed M Jawa5, Neveen Sayed-Ahmed5, Hisham M Almongy3.   

Abstract

This research aims to model the COVID-19 in different countries, including Italy, Puerto Rico, and Singapore. Due to the great applicability of the discrete distributions in analyzing count data, we model a new novel discrete distribution by using the survival discretization method. Because of importance Marshall-Olkin family and the inverse Toppe-Leone distribution, both of them were used to introduce a new discrete distribution called Marshall-Olkin inverse Toppe-Leone distribution, this new distribution namely the new discrete distribution called discrete Marshall-Olkin Inverse Toppe-Leone (DMOITL). This new model possesses only two parameters, also many properties have been obtained such as reliability measures and moment functions. The classical method as likelihood method and Bayesian estimation methods are applied to estimate the unknown parameters of DMOITL distributions. The Monte-Carlo simulation procedure is carried out to compare the maximum likelihood and Bayesian estimation methods. The highest posterior density (HPD) confidence intervals are used to discuss credible confidence intervals of parameters of new discrete distribution for the results of the Markov Chain Monte Carlo technique (MCMC).
© 2021 The Authors.

Entities:  

Keywords:  60E05; 62F10; Bayesian estimation; Inverse Toppe–Leone; Marshall–Olkin family; Maximum likelihood estimation; Survival discretization

Year:  2021        PMID: 34900522      PMCID: PMC8645255          DOI: 10.1016/j.rinp.2021.104987

Source DB:  PubMed          Journal:  Results Phys        ISSN: 2211-3797            Impact factor:   4.476


Introduction

Corona viruses are a huge family of viruses that can cause a variety of diseases varying from the common cold to much more serious conditions such as Middle East Respiratory Syndrome (MERS) and Severe Acute Respiratory Syndrome (SARS). In Wuhan, China, a new Coronavirus (COVID-19) was discovered in 2019. This is an extremely new coronavirus that has not been found in people before. The coronavirus disease 2019 (COVID-19) has been declared a pandemic by the World Health Organization (WHO). To stop the virus from spreading further, a concerted global effort is required. A pandemic affected a wide geographic area and affecting an exceptionally high proportion of the population”. The H1N1 flu pandemic in 2009 is the last pandemic reported in the world. There are numerous scientists that examined the pandemic Covid-19 and created models to match the data and offer predictions about the projected number of cases to aid the nations to make choices about prevention strategies. For example, see El-Morshedy et al. [1] he presented a new discrete distribution, a discrete generalized Lindley, for analyzing everyday coronavirus infections in Hong Kong and daily new fatalities in Iran. Maleki et al. [2] he predicted recovered and verified COVID19 cases using an autoregressive time series model based on the two-piece scale mixture normal distribution. Nesteruk [3] and Batista [4] they studied the daily new COVID-19 cases in China were anticipated using a mathematical model dubbed susceptible, infected, and recovered (SIR). Almongy et al. [5] introduced a new modeling of the COVID-19 mortality rates in Italy, Mexico, and the Netherlands. Liu et al. [6] discussed new modeling of the survival times for the COVID-19 patients in China. By using the inverse transformation to random variables, we proposed the inverse distributions. These distributions display different features in the behavior of the density and hazard rate shapes. Many authors discussed the inverted distributions and their applications. Some of the well-known inverted models are inverse Weibull distribution (Calabria and Pulcini [7], Muhammed and Almetwally [8], [9]), inverted Topp–Leone (ITL) (Hassan et al. [10], Almetwally et al. [11], Hassan et al. [12] and Almetwally [13]) among others. Hassan et al. [10] proposed the ITL with CDF given by where is the shape parameter. The probability mass function (PMF) related to Eq. (2) is given by We utilize discrete distributions in countable data analysis since most existing continuous distributions do not produce appropriate results for modeling COVID-19 cases, and counts of deaths or daily new cases exhibit significant dispersion. The survival discretization method is the most often used method for generating discrete distributions, and it necessitates the presence of a cumulative distribution function (CDF). Time is divided into unit intervals, and the survival function should be continuous and non-negative. Roy [14] defines the discrete distribution PMF as follows: Where , where is a continuous distribution CDF and is a parameter vector. If the CDF of the random variable has , it is considered to have a discrete distribution. The hazard rate is given by . The discrete distribution’s reversed failure rate is given as . Discrete Burr type XII and discrete Lomax distributions were proposed by Para, and Jan [15]. Discrete data with heavy tails can be modeled using Discrete Lomax(DL) distribution. Nakagawa and Osaki [16] proposed the discrete Weibull (DW) model, Krishna, and Pundir [17] introduced the discrete Buur (DB) model, Gómez-Déniz and Calderín-Ojeda [18] introduced discrete Lindley (DL), Nekoukhou et al. [19] suggested discrete generalized exponential (DGEx), Al-Babtain et al. [20] introduced the natural discrete Lindley (NDL), and Eliwa et al. [21] introduced the discrete Gompertz Exponential (DGzEx). Gillariose et al. [22] introduced a discrete Weibull Marshall–Olkin exponential distribution. Almetwally et al. [23] introduced Discrete Marshall–Olkin generalized exponential distribution. Marshal and Olkin [24] introduced a novel technique for adding a new parameter to an existing distribution, resulting in a new distribution known as the Marshall–Olkin(MO) extended distribution. This new distribution includes the original distribution as a unique feature and gives the model more flexibility. Sankaran and Jayakumar [25] have presented a detailed analysis on the physical interpretation of the MO family. Let denote the survivor function of a continuous random variable X. The MO extended distribution has a survival function if is the density function connected to the cumulative distribution function (CDF) F(x). where is survival function, , indicate the survival function (S) of a baseline model. Because , , is a special case of . The probability mass function (PMF) for Eq. (4) has the following shape: Our aim is to introduce discrete Marshal Olkin inverted Topp–Leone (DMOITL) and use this distribution to model the Covid-19 data from different countries. We made point estimation of the unknown parameters by using the maximum likelihood estimation method and Bayesian estimation. The HPD Intervals are used to discuss credible confidence intervals of parameters of new discrete distribution for the results of the MCMC. We computed the confidence intervals (CI) for the DMOITL distribution’s unknown parameters using asymptotic confidence intervals (ACI) as well. The rest of this study is organized as follows. In Section ‘DMOITL distribution’. We define DMOITL distribution. In Section ‘Statistical Properties’, we introduce the statistical properties of DMOITL distribution. The Two parameters of the distribution were estimated by two classical and Bayesian point estimation methods in Section ‘Parameter estimation’. While Section ‘Confidence intervals’ is concerned with the interval estimation methods. In Section ‘Simulation analysis’ we made a simulation study to compare the performance of the estimating approaches. Three real data sets from COVID-19 in different countries, including Italy, Puerto Rico, and Singapore, are used in Section ‘Data analysis’ to prove the efficiency of the DMOITL distribution with respect to other distributions. Finally, conclusions and major findings are given in Section ‘Conclusion’.

DMOITL distribution

In this part, we introduced the Marshall–Olkin inverted Topp–Leone (MOITL) distribution and converted this new continuous distribution to discrete distribution as discrete MOITL (DMOITL) distribution. By using Eqs. (4), and Eq. (1), the survival function of MOITL distribution can obtained and written as follows: where is defined as a vector parameters of MOITL distribution , and . The DMOITL distribution is obtained based on survival discretization method. Eq. (6) is used as the survival function of a baseline MOITL model using the parameter vector . As a result, the CDF of the DMOITL distribution is: The corresponding PMF of Eq. (7) is defined by where is positive vector parameters.  DMOITL() indicates the random variable with PMF (8). Fig. 1 is a graphical representation for various shapes of the PMF of the DMOITL distribution. These figures show that the PMF of the DMOITL distribution can be right-skewed, symmetric, or decreasing curves. The DMOITL distribution, as seen in the application section, has a lot of versatility and can be used to simulate skewed data. Therefore it is extensively utilized in fields like biomedical studies, biology, dependability, physical engineering, and survival analysis.
Fig. 1

PMF of DMOITL distribution.

PMF of DMOITL distribution. Sub-models of the DMOITL model for selected values of the parameters are presented as: If , the DITL distribution with the PMF, and the CDF of the DITL distribution is given by:

Statistical properties

The DMOITL distribution’s reliability measures, moments, and moment generating function (MGF) are shown here.

Reliability measures

The hazard rate function (HR) of the DMOITL distribution are given by The survival functions of DMOITL is given as There are some important shapes of the HR of the DMOITL distribution in Figs. 2. The HR of the DMOITL distribution has some important shapes, containing decreasing, and upside down curve, which are appealing features for various count models.
Fig. 2

HRF of DMOITL distribution.

HRF of DMOITL distribution. The reverse hazard function of DMOITL is given as The second rate of failure (srf) of DMOITL is

th-moment function

The non-central th-moment of DMOITL distribution can be derived using Eq. (8) as follows: In particular, the mean of DMOITL distribution is The variance of DMOITL distribution is given as The dispersion index (DI) may be determined with the help of the following expression: The skewness value (SKV) for DMOITL distribution, can be positive, zero, negative, or undefined. It can be expressed in terms of the third raw moment: The kurtosis value (KTV) for DMOITL distribution can be expressed in terms of the four raw moment: From Table 1, it is apparent that the mean, , variance, DI, , , SKV, and KTV of the DMOITL distribution with different parameters and .
Table 1

Different measures by moment function of DMOITL distribution.

αϑMeanμ2VarDIμ3μ4SKVKTV
0.50.639.9003.94E+043.77E+04965.6295.26E+077.22E+106.54444.993
0.97.120555.720505.02672.37877002.9601.16E+075.80337.695
1.52.14021.02016.4407.839408.8209692.3004.40224.816
30.8402.0001.2941.5727.32034.6402.35310.157
50.4600.6600.4480.9951.1802.5801.5455.533

1.50.6233.4601.46E+061.40E+066160.9341.20E+101.01E+146.57045.251
0.922.8006193.0405673.2253.9032.90E+061.47E+095.85138.156
1.54.78096.66073.81115.7573905.260194472.1804.31724.135
31.5205.0402.7291.83226.720184.5602.38610.194
50.8801.4800.7050.8183.2809.1601.2415.407

30.6728.2601.46E+071.40E+071.97E+043.78E+111.00E+166.57745.315
0.948.3002.87E+042.64E+04557.7642.90E+073.18E+105.85338.174
1.57.760249.360189.1424.87116071.5601278495.8404.30624.007
32.1609.2404.5742.16163.480576.8402.42910.596
51.1602.2800.9340.8226.08020.7601.4036.328
Different measures by moment function of DMOITL distribution.

Parameter estimation

Point estimation is a very important and critical estimation method, in this section, we will apply both classical and non-classical methods of estimation. First, we will apply the maximum likelihood estimation (MLE), and then we will apply the Bayesian estimation method.

Maximum likelihood method

Now we are talking about the first classical method which is the MLE. Let be a random sample of size from the DMOITL distribution. The log-likelihood equation of the vector are given by By differentiating Eq. (17), we can acquire the non-linear likelihood equations with respect to the parameters , and , respectively: and where and . We use a nonlinear optimization algorithm like the Newton Raphson method because these equations are cannot be solved explicitly.

Bayesian estimation

Bayesian estimation is one of the most important and accurate methods of estimation. In Bayesian estimation the parameters is considered as a random variable that is distributed with a certain distribution. We assign a prior believe about the parameter by using a prior distribution for the two parameters. The capacity to integrate previous information into study helps make the Bayesian technique very valuable for reliability assessment, since one of the primary challenges involved with reliability analysis is data scarcity. For the and parameters of DMOITL distribution are distributed with gamma prior distributions, where and are non-negative values. The and parameters as independent joint prior density functions can be expressed as follows: The joint posterior density function of is derived from likelihood function of DMOITL distribution and joint prior density (20). Under the symmetric loss functions, most of the Bayesian inference procedures have been developed squared-error loss function is commonly symmetric loss function. The Bayes estimators of , say based on squared error loss function is given by and It is noticed that the integrals are given by (22), (23) are not possible to derive explicitly. As a consequence, we estimate the value of integrals in (22), (23) using the Markov Chain Monte Carlo (MCMC) approach. Many studies used MCMC techniques such as Almetwally et al. [26], [27], Basheer et al. [28], Almongy et al. [5], [29], and Bantan et al. [30]. For more reading about Covid papers see [31]. Gibbs sampling and the more generic Metropolis within Gibbs samplers are significant sub classes of Markov chain Monte Carlo (MCMC) techniques. The Metropolis–Hastings (MH) and Gibbs sampling techniques are the two most often used instances of the MCMC method. The MH method, like acceptance–rejection sampling, thinks that a candidate value from a proposal distribution can be produced for each iteration of the algorithm. The MH algorithm, similar to acceptance–rejection sampling, believes that for each iteration of the algorithm, a candidate value from a proposal distribution can be produced. To generate random samples of conditional posterior densities from the DMOITL distribution, we employ the MH within the Gibbs sampling steps: and

Confidence intervals

In this section, we introduce the construction of confidence intervals with two different methods to estimate the unknown parameters of the DMOITL distribution, which are asymptotic confidence interval (ACI) in MLE and credible confidence interval in MCMC of , and .

Asymptotic confidence intervals

Using the asymptotic normal distribution of the MLE is the most popular method to set confidence bounds for the parameters. Fisher information matrix is constructed of the negative second derivatives of the natural logarithm of the likelihood function evaluated at in connection to the asymptotic variance–covariance matrix of the MLE of the parameters. Suppose the asymptotic variance–covariance matrix of the parameter vector is where

Highest posterior density

This method is similar to the ACI for more information see Chen and Shao [32] The HPD intervals: Chen and Shao [32] discussed this technique to generate the HPD intervals of unknown parameters of the benefit distribution. In this study, samples drawn with the proposed MH algorithm should be used to generate time-lapse estimates. For example, using the MCMC sampling outputs and the percentile tail points, a HPD interval with two points for 2th parameters of the DMOITL distribution can be generated. According to [32], the BCIs of the parameters of DMOITL distribution can be obtained through the following steps: Arrange , and as and , where denotes the length of the generated of MH algorithm. The symmetric credible intervals of are obtained as: and .

Simulation analysis

In this part of the paper, we made a simulation study to assess the performance of the distribution by varying the values of the actual values for both parameters and observing the effect of this change on the accuracy of estimation for both methods. The Monte-Carlo simulation process is used in this section to compare the conventional estimation methods: MLE, and Bayesian estimation methods under the square error loss function based on MCMC, for the estimation of DMOITL distribution parameters by R software. Monte-Carlo experiments are carried out on the basis of 10000 randomly generated DMOITL distribution samples, where represents the DMOITL for various parameter actual values such as: Case 1: with different . Case 2: with different . Case 3: with different , and different sample sizes , and ).

Concluding remarks on the simulation results

The Table 2, Table 3, Table 4 summarize the simulation findings for the methodologies provided in this work for estimating parameters of the DMOITL distribution using point estimate and interval estimation. . It is necessary to compare the different point estimating approaches by calculating the Bias, MSE, and lower and higher confidence intervals. These tables let the following conclusions to be drawn:
Table 2

This table contains the simulation results when .

α=0.5
MLE
Bayesian
ϑnBiasMSELowerUpperCPBiasMSELowerUpper
0.320α0.86202.40170.01783.902097.53%0.29850.22430.28591.3868
ϑ
0.1935
0.1140
0.0522
1.0392
97.53%
0.0815
0.0207
0.1684
0.5787
50α0.40200.36880.00711.796895.35%0.23700.13580.42261.3368
ϑ
0.1023
0.0263
0.1546
0.6500
94.77%
0.0508
0.0084
0.2506
0.5054
100α0.29200.15760.25811.325897.56%0.14130.03920.49130.9333
ϑ0.08130.01510.19800.564797.56%0.03170.00300.26830.4231

0.620α1.38194.98140.05555.318894.40%0.39390.32260.31111.7934
ϑ
0.4590
0.5431
0.0717
2.1897
94.50%
0.1344
0.0733
0.3954
1.2206
50α0.79271.25550.12602.845695.50%0.42120.31860.43891.6293
ϑ
0.2909
0.1878
0.2608
1.5209
95.50%
0.1298
0.0488
0.4668
1.0960
100α0.64140.61380.25922.023795.70%0.37380.20570.48631.4197
ϑ0.25050.10730.43681.264295.60%0.11560.02890.51920.9901

1.520α1.81445.15850.03836.146294.82%0.54400.51690.35431.9763
ϑ
0.9485
3.0928
0.2920
5.6788
95.30%
0.2369
0.3062
0.9455
2.7790
50α1.02544.22010.07226.229495.00%0.72970.74110.51892.1631
ϑ
0.8153
2.0490
0.9912
4.3157
96.00%
0.3279
0.2524
1.2299
2.6062
100α0.86174.47980.38704.336395.30%0.82380.83360.70962.1376
ϑ0.70351.39411.41893.650394.40%0.38860.24021.34362.4744
Table 3

This table contains the simulation results when .

α=1.5
MLE
Bayesian
ϑnBiasMSELowerUpperCPBiasMSELowerUpper
0.320α0.33430.87970.11133.557297.58%0.28020.62641.11334.5505
ϑ
0.0330
0.0142
0.1083
0.5578
96.36%
0.0413
0.0106
0.2401
0.5078
50α0.31670.49930.56323.070297.56%0.27710.36960.83753.0974
ϑ
0.0152
0.0059
0.1653
0.4650
92.68%
0.0068
0.0052
0.2494
0.4362
100α0.20470.20580.90102.508397.50%0.13600.10840.98152.4394
ϑ0.01880.00250.22660.411092.50%0.01330.00230.26890.4068

0.620α1.39344.60940.13106.096496.50%0.86001.83001.06744.5782
ϑ
0.1828
0.1063
0.2533
1.3123
96.60%
0.0966
0.0353
0.4543
1.0085
50α0.84131.59680.49234.190395.60%0.66601.01221.21463.5180
ϑ
0.1052
0.0363
0.3940
1.0164
94.20%
0.0632
0.0128
0.5188
0.8513
100α0.68790.91460.88523.490794.70%0.48630.65771.31202.7546
ϑ0.08640.02000.46690.906094.90%0.04460.00600.53170.7638

1.520α2.46256.45640.174410.668895.60%1.30493.15131.11465.2440
ϑ
0.6681
0.9558
0.7683
3.5678
95.90%
0.2077
0.1706
1.1420
2.4016
50α1.79954.17350.21618.382994.50%1.46203.29651.45945.2826
ϑ
0.5615
0.5397
1.1325
2.9904
94.00%
0.2388
0.1269
1.3017
2.2931
100α1.15583.84341.36216.754195.80%1.41372.61331.76034.5595
ϑ0.53390.39261.39072.677095.50%0.23820.09651.40132.1529
Table 4

This table contains the simulation results when .

α=3
MLE
Bayesian
ϑnBiasMSELowerUpperCPBiasMSELowerUpper
0.320α0.18020.69611.57804.782593.06%0.14630.59372.01434.4308
ϑ
0.0134
0.0051
0.1750
0.4519
96.53%
0.0131
0.0042
0.2641
0.4025
50α0.01230.35281.82784.196893.10%0.01430.30972.03414.0315
ϑ
0.0117
0.0017
0.2324
0.3910
93.10%
−0.0006
0.0014
0.2802
0.3994
100α0.13600.11262.52473.747393.55%0.09350.10342.59113.7802
ϑ0.00320.00060.25550.351096.77%0.00170.00050.28500.3773

0.620α0.88313.53140.63037.135996.60%0.73102.79252.05056.8045
ϑ
0.0556
0.0296
0.3363
0.9750
96.50%
0.0550
0.0158
0.4897
0.8786
50α0.36960.75661.82554.913693.80%0.27950.58262.38025.7343
ϑ
0.0227
0.0084
0.4481
0.7973
95.00%
0.0313
0.0049
0.5146
0.7578
100α0.50620.76762.10394.908595.30%0.42800.44372.59584.8926
ϑ0.03210.00550.50050.763694.70%0.02540.00480.55560.7244

1.520α1.55414.03220.87038.237896.70%1.01662.74272.08727.7519
ϑ
0.3843
0.3551
0.9914
2.7773
95.80%
0.1754
0.1075
1.2135
2.2337
50α1.04853.80092.18356.786297.50%0.99162.04322.76036.0531
ϑ
0.2765
0.1541
1.2300
2.3230
94.80%
0.1581
0.0625
1.3345
2.0548
100α0.97401.55382.93665.544194.90%0.81260.96372.92864.0197
ϑ0.29350.13051.38042.206694.50%0.14270.03901.42261.9322
As rises, the Bias and MSE of the DMOITL distribution drop. Bias and MSE for and parameters grow as increases. As the value of grows, the Bias and MSE values for the and parameters decrease. Bayesian estimation is the best approach for estimating the parameters as it provides the smallest MSE and Bias and also has the shorties confidence interval Using Bayesian estimation, the MLE ACI confidence interval for parameters of the DMOITL distribution has the smallest confidence interval. This table contains the simulation results when . This table contains the simulation results when . This table contains the simulation results when .

Data analysis

In this part of the paper, we used two real data sets as an application on the superiority of the distribution The DMOITL distribution is fitted to more notable fields of Covid-19 with diverse countries such as Italy, Puerto Rico, and Singapore in this part. We compare the fits of the discrete Buur (DB) [Krishna and Pundir [17]] model, discrete Weibull (DW) [Nakagawa and Osaki [16]], discrete inverse Weibull (DIW) [Jazi et al. [33]], Poisson, negative binomial (NB), discrete alpha power inverse Lomax (DAPIL) [Almetwally and Ibrahim [34]], discrete Lindley (DLi) [Gómez-Déniz and Calderín-Ojeda [18]], and DITL models in Table 5, Table 6, Table 7.
Table 5

MLE, CvM, AD, KS and AIC for different alternative models of DMOITL distribution: Puerto Rico.

αϑλCvMADKSAIC
DMOITL116654.69682.51260.07110.40190.0758487.0636
DBuur16.52480.98860.12370.68210.4977607.7753
DW0.99991.587612.575976.09740.9937487.1050
DIW0.00000.83120.17450.96841.0000510.3790
NB0.13390.19510.59690.3400884.7561
Poisson245.84740.36110.94370.47134067.0956
DAPL1.38741.48943.145E−250.17710.98390.2582512.5820
DITL0.21750.12700.70010.4852596.0123
Table 6

MLE, CvM, AD, KS and AIC for different alternative models of DMOITL distribution: Italy.

αϑλCvMADKSAIC
DMOITL3065.82853.33960.06810.36800.0715475.1201
DBuur16.20050.97950.18211.04240.4526623.6174
DW0.99831.949719.9308121.83550.9900476.0156
DIW0.00001.45580.29911.73671.0000495.8519
NB0.72950.08460.38100.2351608.7627
Poisson22.623022.62300.09180.45350.2674700.5536
DAPL0.00201.45891.08E−130.21881.26470.1265486.1749
Dli0.92020.09400.42540.1464481.7783
DITL0.42030.15920.90480.4320604.7508
Table 7

MLE, CvM, AD, KS and AIC for different alternative models of DMOITL distribution: Singapore.

αϑλCvMADKSAIC
DMOITL1013.68343.04760.16070.90560.07921853.6278
DBuur93.74540.99630.30482.08310.43342376.1589
DW0.99581.731380.6484483.53140.9927441860.2878
DIW3.52E−181.459440.6313324.14162211921.321
NB0.92223660.326251.9481250.314062790.6446
Poisson20.405450.3565312.1580160.3231332921.4866
DAPL0.00372.48792.93E−070.38472.61990.08471879.7269
Dli0.91240.21740.99640.13241870.9463
DITL0.44210.22661.55890.41682311.8193
Table 5, Table 6, Table 7 provide values of Cramér–von Mises (CvM), Anderson–Darling (AD), Kolmogorov–Smirnov (KS) and Akaike information criterion (AIC) statistics for the all models fitted based on three real data sets. These tables also include the MLE of the parameters for the models under consideration. Fig. 3, Fig. 5, Fig. 7 show the fitted DMOITL, PMF, CDF, PP-plot, and QQ-plot of the three data sets, respectively. These statistics show that among all fitted models, the DMOITL distribution has the lowest CvM, AD, KS, and AIC values. Using alternative data, Table 8 presented MLE and Bayesian estimation methods for parameters of the DMOITL distribution. Figs. 4, 6, 8 show convergence plots of MCMC for parameter estimates of DMOITL distribution for different data set.
Fig. 3

Plots of estimated pmfs of distributions for Data set of Puerto Rico.

Fig. 5

Plots of estimated pmfs of distributions for Data set of Italy.

Fig. 7

Plots of estimated pmfs of distributions for Data set of Singapore.

Table 8

MLE and Bayesian estimation method for parameters of DMOITL distribution using different data.

MSE
Bayesian
estimateSEestimateSE
Puerto Ricoα168573.64940.0079168573.64900.0044
ϑ2.63210.06362.30890.0492

Italyα3065.82850.00113065.82810.0010
ϑ3.33960.08873.33420.0116

Singaporeα1013.68340.01881013.68170.0159
ϑ3.04760.04733.04550.0163
Fig. 4

Convergence plots of MCMC for parameter estimates of DMOITL distribution for data set of Puerto Rico.

Fig. 6

Convergence plots of MCMC for parameter estimates of DMOITL distribution for data set of Italy.

Fig. 8

Convergence plots of MCMC for parameter estimates of DMOITL distribution for data set of Singapore.

Firstly: This is a COVID-19 data set from Puerto Rico that spans 38 days, from February 26 to April 4, 2021. This data set is comprised of newly reported instances on a daily basis. The data are as follows: 100, 311, 114, 253, 287, 151, 30, 102, 199, 261, 305, 185, 120, 68, 46, 356, 160, 235, 193, 216, 67, 69, 332, 212, 330, 295, 227, 145, 78, 260, 399, 268, 595, 447, 170, 365, 510, 881. Secondly: This is a 61-day COVID-19 data set from Italy, recorded between 13 June and 12 August 2021. This data set is comprised of newly reported instances on a daily basis. The data are as follows: 52, 26, 36, 63, 52, 37, 35, 28, 17, 21, 31, 30, 10, 56, 40, 14, 28, 42, 24, 21, 28, 22, 12, 31, 24, 14, 13, 25, 12, 7, 13, 20, 23, 9, 11, 13, 3, 7, 10, 21, 15, 17, 5, 7, 22, 24, 15, 19, 18, 16, 5, 20, 27, 21, 27, 24, 22, 11, 22, 31, 31. MLE, CvM, AD, KS and AIC for different alternative models of DMOITL distribution: Puerto Rico. Plots of estimated pmfs of distributions for Data set of Puerto Rico. Convergence plots of MCMC for parameter estimates of DMOITL distribution for data set of Puerto Rico. Thirdly: This is a 242-day COVID-19 data set from Singapore, recorded between 20 November 2020 and 19 July 2021. This data set is comprised of newly reported instances on a daily basis. The data are as follows: 4, 4, 5, 12, 5, 18, 7, 5, 4, 6, 8, 5, 10, 2, 9, 3, 13, 5, 13, 12, 6, 6, 8, 8, 7, 5, 16, 12, 24, 9, 17, 19, 10, 29, 21, 13, 14, 10, 5, 5, 13, 27, 30, 30, 33, 35, 24, 28, 31, 33, 23, 29, 42, 22, 17, 38, 45, 30, 24, 30, 14, 30, 40, 38, 15, 10, 48, 44, 14, 25, 34, 24, 58, 29, 29, 19, 18, 22, 25, 26, 24, 22, 11, 15, 12, 18, 9, 14, 9, 1, 11, 11, 14, 12, 11, 10, 4, 7, 10, 13, 12, 11, 12, 8, 23, 19, 9, 13, 13, 13, 6, 10, 8, 10, 8, 17, 12, 11, 9, 15, 15, 17, 12, 12, 13, 15, 17, 12, 23, 12, 21, 26, 34, 26, 43, 18, 10, 17, 24, 35, 21, 26, 32, 20, 25, 14, 27, 16, 34, 39, 23, 20, 14, 15, 24, 39, 23, 40, 45, 12, 23, 35, 24, 34, 39, 17, 17, 16, 18, 25, 20, 28, 19, 25, 16, 34, 52, 31, 49, 28, 38, 38, 41, 40, 29, 25, 36, 30, 26, 24, 30, 33, 25, 23, 18, 31, 45, 13, 18, 20, 14, 9, 4, 13, 9, 18, 13, 25, 14, 24, 27, 16, 21, 11, 16, 18, 22, 23, 20, 17, 14, 9, 10, 16, 10, 10, 7, 11, 13, 10, 12, 16, 10, 6, 8, 26, 26, 60, 48, 61, 68, 92. MLE, CvM, AD, KS and AIC for different alternative models of DMOITL distribution: Italy. Plots of estimated pmfs of distributions for Data set of Italy. Convergence plots of MCMC for parameter estimates of DMOITL distribution for data set of Italy. MLE, CvM, AD, KS and AIC for different alternative models of DMOITL distribution: Singapore. Plots of estimated pmfs of distributions for Data set of Singapore. MLE and Bayesian estimation method for parameters of DMOITL distribution using different data. Convergence plots of MCMC for parameter estimates of DMOITL distribution for data set of Singapore.

Concluding remarks on the data analysis

After applying the three data sets on the proposed distribution and observing the results in Table 5, Table 6, Table 7 that provide values of Cramér–von Mises (CvM), Anderson–Darling (AD), Kolmogorov- Smirnov (KS) and Akaike information criterion (AIC) statistics for the all models fitted based on three real data sets we found that our proposed distribution is the best model as it has the lowest value of AIC and KS values. By referring to these values, we can make sure that our proposed distribution is superior among all its competitors

Existence and uniqueness for the log-likelihood

We sketched the log-likelihood for each parameter as shown in Fig. 9, Fig. 10, Fig. 11 by fixing one parameter and varying the other. The figures show that the three data sets behaves very well, as we can see that the two roots of the parameters are global maximum, and also by differentiating the log-likelihood with respect to each parameters, we found that the function is a decreasing function and it intersects the -axis in a single point which is the root of the parameter, and that assures that the roots are unique
Fig. 9

Existence and uniqueness for the log-likelihood for data set of Puerto Rico.

Fig. 10

Existence and uniqueness for the log-likelihood for data set of Italy.

Fig. 11

Existence and uniqueness for the log-likelihood for data set of Singapore.

Existence and uniqueness for the log-likelihood for data set of Puerto Rico. Existence and uniqueness for the log-likelihood for data set of Italy. Existence and uniqueness for the log-likelihood for data set of Singapore.

Conclusion

In this paper, we introduce Discrete Marshall–Olkin Inverted Topp- Leone distribution which is called DMOITL. We derived its statistical properties. We made the point and interval estimation by classical and Bayesian estimation methods for the DMOITL unknown parameters and . We conducted simulation analysis using the R package to differentiate the performance of different estimation methods. We deduced that the Bayesian method is very efficient than the classical method as it gets more efficient results through the values of the MSE and the length of the confidence interval as it is always shorter and the MSE is always smaller. In order to prove the superiority and applicability of the proposed distribution, we made a data analysis through the COVID-19 data. We used three data sets in three different countries thought different intervals of time, and by referring to the results in Table 5, Table 6, Table 7 that provide values of Cramér–von Mises (CvM), Anderson–Darling (AD), Kolmogorov- Smirnov (KS) and Akaike information criterion (AIC) statistics for the all models fitted based on three real data sets we found that our proposed distribution is the best model as it has the lowest value of AIC and KS values.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  4 in total

1.  Time series modelling to forecast the confirmed and recovered cases of COVID-19.

Authors:  Mohsen Maleki; Mohammad Reza Mahmoudi; Darren Wraith; Kim-Hung Pho
Journal:  Travel Med Infect Dis       Date:  2020-05-13       Impact factor: 6.211

2.  A New Discrete Analog of the Continuous Lindley Distribution, with Reliability Applications.

Authors:  Abdulhakim A Al-Babtain; Abdul Hadi N Ahmed; Ahmed Z Afify
Journal:  Entropy (Basel)       Date:  2020-05-28       Impact factor: 2.524

3.  A new extended rayleigh distribution with applications of COVID-19 data.

Authors:  Hisham M Almongy; Ehab M Almetwally; Hassan M Aljohani; Abdulaziz S Alghamdi; E H Hafez
Journal:  Results Phys       Date:  2021-03-12       Impact factor: 4.476

4.  Modeling the survival times of the COVID-19 patients with a new statistical model: A case study from China.

Authors:  Xiaofeng Liu; Zubair Ahmad; Ahmed M Gemeay; Alanazi Talal Abdulrahman; E H Hafez; N Khalil
Journal:  PLoS One       Date:  2021-07-26       Impact factor: 3.240

  4 in total
  2 in total

1.  Sampling Inspection Plan to Test Daily COVID-19 Cases Using Gamma Distribution under Indeterminacy Based on Multiple Dependent Scheme.

Authors:  Muhammad Aslam; Gadde Srinivasa Rao; Mohammed Albassam
Journal:  Int J Environ Res Public Health       Date:  2022-04-27       Impact factor: 4.614

2.  An Overview of Discrete Distributions in Modelling COVID-19 Data Sets.

Authors:  Ehab M Almetwally; Sanku Dey; Saralees Nadarajah
Journal:  Sankhya Ser A       Date:  2022-09-09
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.