Literature DB >> 32530965

A Gull Alpha Power Weibull distribution with applications to real and simulated data.

Muhammad Ijaz1, Syed Muhammad Asim1, Muhammad Farooq1, Sajjad Ahmad Khan2, Sadaf Manzoor2.   

Abstract

In this paper, we produced a new family of distribution called Gull Alpha Power Family of distributions (GAPF). A Special case of GAPF is derived by considering the Weibull distribution as a baseline distribution called Gull Alpha Power Weibull distribution (GAPW). The suitability of the proposed distribution derives from its ability to model both the monotonic and non-monotonic hazard rate functions which are a common practice in survival analysis and reliability engineering. Various statistical properties were derived in addition to their special cases. The unknown parameters of the model are estimated using the maximum likelihood method. Moreover, the usefulness of the proposed distribution is supported by using two real lifetime data sets as well as simulated data.

Entities:  

Mesh:

Year:  2020        PMID: 32530965      PMCID: PMC7292407          DOI: 10.1371/journal.pone.0233080

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

From last few years, researchers made a contribution to the theory of probability so as to remove some of the limitations of the existing probability distributions. For example, the Exponential distribution fails to model the monotonic and non-monotonic hazard rate functions, it can only model the constant hazard rate of an object; Gamma distribution can only model the data with monotonically increasing failure rate. But in practice, various data exists which follows a non-monotonic hazard rate function, for example, the lifetime of an electronic device or the accident rate follows a non-monotonic hazard rate function. It is usual practice to modify the existing probability models so as to model both the monotonic and non-monotonic hazard rate function and also to provide a suitable fit. One such modification is to produce a generator and then applied to the existing models so as to derive a new probability model. For example, Al-Aqtash et.al [1] produced a new family of distributions using the logit function and derived the special case named as Gumbel-Weibull distribution. Alzaatreh et.al [2] investigated the gamma-X family of distribution and explored the special case by employing the normal distribution as a baseline distribution. Abid, & Abdulrazak [3] Presented a truncated Frechet-G family of distribution. Korkmaz & Genç [4] defined a generalized two-sided class of probability distribution. Alzaghal et.al [5] worked on the T-X family of distributions. Aldeni et.al [6] explained a new family using the quantile functions of the generalized lambda distribution. For other generators, we refer to see [7-10]. The more recent modified Weibull distributions are [3], [4], [6-7], [10]. The scope of developing a new family of distribution is to produce a new probability distribution so as to remove some of the difficulties found in the existing probability distributions. The proposed distribution will not only model the monotonic and non-monotonic hazard rate function, but also increase flexibility and provided a better fit as compared to other probability distribution distributions provided in the literature. In this paper, a new family of distribution is proposed called Gull Alpha Power Family of distribution. The special case of this family is derived by employing the Weibull distribution called Gull Alpha Power Weibull distribution (GAPW). GAPW is a modified form of the Weibull distribution which can model the non-monotonic hazard rate shapes. Various statistical properties have been derived such as hazard rate function, survival function, and moments. Two real data sets and simulated data are used to check the versatility of the proposed model. The paper detailed discussion is as follows.

Gull Alpha Power Family (GAPF) of distributions

This section illustrates a new family of probability distributions called Gull Alpha Power Family of distributions or in short GAPF. Let Y is a continuous random variable then the cumulative distribution function (CDF) of Gull Alpha Power Family of distribution is defined as Where α is being the shape parameter and not be zero. The probability density function related to the above CDF takes the form

Gull Alpha Power Weibull distribution (GAPW)

This section illustrates the special form of GAPF by employing the CDF of the Weibull distribution. The CDF of the Weibull distribution [11] is given by where β is the scale parameter and γ is the shape parameter. The Weibull distribution [11] is one of the most important and has been widely used in many real-world problems. For example, in reliability applications, Keshavan et.al [12] used the Weibull distribution to analyze the fracture strength of glass data, Fok et.al [13] utilized to model the failure of brittle materials data, Li et.al [14] by working the failure probability of concrete components. In Geophysics, Al-Hasan & Nigmatullin [15] modeled the wind speed data, to model the data related to earthquake [16] considered the Weibull distribution, and the data concerned with environmental radioactivity was analyzed by Dahm et.al [17]. Tsumoto & Okiai [18] applied the Weibull distribution to the impulse breakdown of oil-filled cable data. The Weibull distribution can model only increasing, decreasing or a constant failure rate. This distribution fails to model a non-monotonic hazard rate function, for example, a failure rate of the electronic device, accident rate, and infant mortality rate. To achieve this goal, researchers have been constructing different modified versions of the Weibull distribution, for example, Almalki & Yuan [19] presented the New modified Weibull distribution, A new extension of Weibull distribution is presented by [20], a four parameter Weibull distribution is studied by Lemonte et.al [21], Gumbel-Weibull distribution was explored by Al-Aqtash et.al [22], Almheidat et.al [23] investigated the generalized form of the Weibull distribution, Almalki & Nadarajah [24] by working with the discrete form of the Weibull distribution, and a flexible Weibull extension was introduced by Bebbington et.al [25]. Let Y is a continuous random variable which follows GAPW distribution then the CDF and PDF are respectively given by The proposed distribution contains three parameters that is β>0 is the scale and α>0, γ>0 being the shape parameters. The graphical representations of the probability density function and cumulative distribution function with different values of parameters are given in Fig 1.
Fig 1

The Pdf and Cdf of GAPW.

The survival and hazard rate function

Let a random variable Y follows GAPW (α,β,γ). Then the survival function of GAPW is defined as Similarly, the failure or hazard rate function of GAPW is defined as , transforming Eqs (4) and (5), we obtain the result given below Fig 2 shows the behavior of the hazard rate function with different parameter values.
Fig 2

The hazard rate function of GAPW.

The quantile function and median

The quantile function is used to conduct a simulation study as well as to measure the median, quartile, octile, decile, and percentile. The quantile function is the real solution of a random variable Y to the expression given as Substituting (4), we obtain the result as The simplified form is then given by Using Mathematica software, we get the following result where the product Log function W (Z) is defined as For the median, put u = 0.5 in Eq (9).

The rth moments

Let a random variable Y has GAPW distribution with parameters α,β, and γ then the rth moments (about the origin), say is defined as Recalling (5) we get Solving the first part of the above integral form The above integral takes the following form Finally, we obtained the result as Now to solve the second part Using the same transformation as earlier, the integral we may write as We obtained the following result By combining (12) and (13), it’s observed that the rth moment does not exist in general.

Order statistics

Let Y1,Y2,Y3,…Y be ordered random variables from GAPW, then the PDF of the i order statistic is given by Using (4) and (5), the minimum and maximum order statistic of the Gull Alpha Power distribution are given by

Parameter estimation

Since the parameters of the probability model are unknown and it is to be estimated using information obtained from a sample. For a detailed discussion on maximum likelihood estimation, we refer to see [26-28]. In this section, the usual method maximum likelihood estimates are used to find out the estimate of the parameters. Let suppose an independent random sample of size n that is Y1,2Y,Y3…Y is selected from GAPW (α,β,γ). The Likelihood function is defined as Substituting (5) in the above expression, we get To derive the estimate of the parameters, we have to take the partial derivatives with respect to α,β,γ and then equate the result to zero Eqs (16)–(18) is not in closed form. Hence it is difficult to calculate the values of the parameters. However, one can use the iteration procedure used in mathematics that is the Bisection and Newton Raphson method to get the MLE.

Renyi entropy

By definition, the Renyi entropy of the random variable Y belong to GAPW (α,β,γ) is given by Using the expression given below Hence (20) takes the following form by solving the above expression, we get Replace (22) in (19), we obtained the following result for the Renyi entropy

Mode

The mode of the random variable Y is defined by the following equation. The more simplified form of the above expression may be written as Using Mathematica software, it has been observed that the density function will be maximum only at y = 0

Skewness and kurtosis

The mathematical form of the Galton Skewness and Moors kurtosis of the GAPW distribution with three parameters are defined by the following relationship Where Q describe different quartiles values. Table 1 illustrates the numerical description of the Skewness and Kurtosis for different values of parameters.
Table 1

Numerical values of skewness and kurtosis.

αβγSkewnessKurtosis
0.10.10.10.87436872.258266
0.10.20.30.26258820.9188042
0.10.40.50.0086976890.9147886
0.10.60.6-0.0597490.9656408
0.20.30.10.96967363.511172
0.30.30.20.9166722.373731
0.40.30.20.78121349.273318
0.80.50.6-0.21280081.200246
0.90.61-0.22716961.220157
111-0.26185951.30627

Mean Residual Life (MRL)

The mean residual life of the gull alpha power distribution with parameters α,β,γ is define as Plugging in the above two expressions in (27), we get By solving (28) finally, we obtained the following result

Special cases

This section illustrates two special cases of the Gull Alpha Power Weibull distribution. Case. γ = 1 When we put γ = 1 in (4) and (5), then it shall be referring to the CDF and PDF of the Gull Alpha Power Exponential distribution (GAPE). The mathematical forms are described as Case. γ = 2 If we replace γ = 2 in the expressions (4) and (5), the derived probability function will stand for the Gull Alpha Power Rayleigh distribution (GAPR). The CDF and PDF of GAPR are respectively given by

Applications

In this section, we provide two real life applications of the proposed model in which one data set follows a non-monotonic hazard rate shape and the second data follows a monotonic hazard rate shape so as to achieve the objectives and delineate efficiency of the proposed model. The performance of the model is judged by commonly used goodness of fit measures including Cramer-von mises (W), Anderson darling (A), Akaike information criteria (AIC), Consistent Akaike information criteria (CAIC), Hannan and quin information criteria (HQIC), and Bayesian information criteria (BIC). The mathematical form of these criteria is defined by where, is the maximized likelihood function and y is the given random sample, is the maximum likelihood estimator and p is the number of parameters in the model. As a general rule, a probability model with fewer values of these criteria should be considered the best fitted model among other probability distributions.

Data set 1: Remission time of Bladders cancer patients

The data set consists of the remission time of 128 bladder cancer patients. The data set is taken from Aldeni and Famoye [29] with the values are as follows 0.080, 0.200, 0.400, 0.500, 0.510, 0.810, 0.900, 1.050, 1.190, 1.260, 1.350, 1.400, 1.460, 1.760, 2.020, 2.020, 2.070, 2.090, 2.230, 2.260, 2.460, 2.540, 2.620, 2.640, 2.690, 2.690, 2.750, 2.830, 2.870, 3.020, 3.250, 3.310, 3.360, 3.360, 3.480, 3.520, 3.570, 3.640, 3.700, 3.820, 3.880, 4.180, 4.230, 4.260, 4.330, 4.340, 4.400, 4.500, 4.510, 4.870, 4.980, 5.060, 5.090, 5.170, 5.320, 5.320, 5.340, 5.410, 5.410, 5.490, 5.620, 5.710, 5.850, 6.250, 6.540, 6.760, 6.930, 6.940, 6.970, 7.090, 7.260, 7.280, 7.320, 7.390, 7.590, 7.620, 7.630, 7.660, 7.870, 7.930, 8.260, 8.370, 8.530, 8.650, 8.660, 9.020, 9.220, 9.470, 9.740, 10.06, 10.34, 10.66, 10.75, 11.25, 11.64, 11.79, 11.98, 12.02, 12.03, 12.07, 12.63, 13.11, 13.29, 13.80, 14.24, 14.76, 14.77, 14.83, 15.96, 16.62, 17.12, 17.14, 17.36, 18.10, 19.13, 20.28, 21.73, 22.69, 23.63, 25.74, 25.82, 26.31, 32.15, 34.26, 36.66, 43.01, 46.12, 79.05. Fig 3 shows the theoretical and empirical pdf and cdf of the GAPW distribution using the bladder cancer patient’s data and it is observed that the GAPW is the best-fitted line as compared to others. Fig 4 demonstrates the Q-Q and P-P plot of the bladder cancer patient data. The TTT plot in Fig 5 clearly shows that this data follows a non-monotonic hazard rate shapes. Table 2 reflects the maximum likelihood estimates, standard errors, and the log-likelihood values. Table 3 defines the goodness of fit measures for the bladder cancer data. It has been observed that the goodness of fit measures has fewer values for GAPW while analyzing the bladder cancer data. Hence the proposed distribution provides a better fit as compared Weibull exponential (W.E), exponential (E), Weibull (W), Rayleigh (R) and Algoharai inverse flexible Weibull (AIFW) distribution.
Fig 3

Theoretical and empirical Pdf and Cdf of GAPW.

Fig 4

Theoretical and empirical Pdf and Cdf with Q-Q plot and P-P plot for GAPW.

Fig 5

TTT plot of the bladder cancer patient data.

Table 2

Maximum likelihood estimates and their standard errors.

ModelMleStandard error-log(likelihood)
GAPW0.00590119 0.79751413 0.533557960.005280265 0.121546525 0.046158047
409.9908
W.E3.95810505 0.01796843 0.858191931.214089581 0.004666546 0.059280045
419.8998
W0.09438292 1.045764660.01912624 0.06742473
414.0874
Exp0.10676950.009436355414.3419
Rayleigh0.0050797730.0004307331491.2659
AIFW0.1677404 0.12319480.02508775 0.01045528451.0704
Table 3

Goodness of fit measures of the GAPW for bladder cancer data.

ModelsWAAICCAICBICHQIC
GAPW0.025334310.1608187825.9815826.1751834.5376829.4579
W.E0.21452761.282891845.7996845.9931854.3557849.276
W0.13081770.7832353832.1747832.2707837.8788834.4923
Exp0.11928930.7159703830.6838830.7155833.5358831.8426
Rayleigh0.46690782.732901984.5318984.5635987.3838985.6906
AIFW0.57357983.457475906.1409906.2369911.8449908.4585

Data set 2: Bank customers Data

The data set waiting time of 100 bank customers is taken from Ghitany et al. [30]. The data set values are given below 0.8,0.8,1.3,1.5,1.8,1.9,1.9,2.1,2.6,2.7,2.9,3.1,3.2,3.3,3.5,3.6,4,4.1,4.2,4.2,4.3,4.3,4.4,4.4,4.6,4.7,4.7,4.8,4.9,4.9,5.0,5.3,5.5,5.7,5.7,6.1,6.2,6.2,6.2,6.3,6.7,6.9,7.1,7.1,7.1,7.1,7.4,7.6,7.7,8,8.2,8.6,8.6,8.6,8.8,8.8,8.9,8.9,9.5,9.6,9.7,9.8,10.7,10.9,11.0,11.0,11.1,11.2,11.2,11.5,11.9,12.4,12.5,12.9,13.0,13.1,13.3,13.6,13.7,13.9,14.1,15.4,15.4,17.3,17.3,18.1,18.2,18.4,18.9,19.0,19.9,20.6,21.3,21.4,21.9,23,27,31.6,33.1,38.5. Fig 6 illustrates the theoretical and empirical Pdf and Cdf of the GAPW distribution using the 100 bank customer’s data. The graph clearly shows that the red line is the best-fitted line to theoretical data. Fig 7 displays the Q-Q and P-P plot of the bank customer’s data. The TTT plot in Fig 8 clearly shows that this data follows a monotonic hazard rate shapes. Table 4 gives the maximum likelihood estimates of the unknown parameters of GAPW including the standard errors, and the log-likelihood values. The values given in Table 5 declared that GAPW leads to a better fit than other versions of the Weibull distribution like Weibull exponential (W.E), exponential (E), Weibull (W), Rayleigh (R) and Algoharai inverse flexible Weibull (AIFW) distribution.
Fig 6

Theoretical and empirical Pdf and Cdf of GAPW.

Fig 7

Theoretical and empirical Pdf and Cdf with Q-Q plot and P-P plot for GAPW.

Fig 8

TTT plot of bank customers data.

Table 4

Maximum likelihood estimates and their standard errors.

ModelMleStandard error-log(likelihood)
GAPW0.004584114 0.540116500 0.6796962850.007821438 0.164012222 0.088528859317.4891
W.E3.97641055 0.02509669 1.243491144.16205665 0.01545783 0.15095850320.9662
W0.02971531 1.461442500.008598382 0.102215074318.745
Exp0.10124240.01012326329.0209
Rayleigh0.0066761240.0006522215329.2404
AIFW1.6423171 0.11531540.17122546 0.01167334330.7856
Table 5

Goodness of fit measures of the GAPW for bank customers data.

ModelWAAICCAICBICHQIC
GAPW0.019399830.1355293640.9783641.2283648.7938644.1413
W.E0.1124680.7070021647.9323648.1823655.7479651.0954
W0.062659890.3945532641.4899641.6136646.7003643.5986
Exp0.027038350.1790246660.0418660.0826662.6469661.0961
Rayleigh0.12658040.7863305660.4807660.5216663.0859661.5351
AIFW0.17034071.219407665.5711665.6948670.7815667.6798

Simulations

To perform simulations an expression (9) was used to generate artificial data from the GAPW distribution. The simulations are performed 100 times with a different set of parameters with different sample of size n. The maximum likelihood estimates and their standard errors are given in Table 6. The tabulated values clearly show as we increase the sample size, both the ml estimates and the standard errors are decreases. The general formula for computing the mean square error and bias are as follows
Table 6

Maximum likelihood Estimates and their standard errors.

Actual valuesML EstimateStandard deviations
α β γnα        β        γα        β        γ
3052.35656923.944706-1.4259069.22775957.27765540.1274975
5050.9512924.95536-1.439426.97044136.07082450.1017499
0.009, 0.007, 2.56050.66497526.246471-1.4665746.310806015.720636980.09179273
0.002, 0.03, 2.43048.05378130.3749842.5575618.507452811.24572620.2579958
4054.45522238.392647-2.6340478.322018210.06989060.1794368
5050.9512924.95536-1.439426.97044136.07082450.1017499
0.004, 0.04, 2.41543.06107627.780158-2.69099910.492196012.53210450.3456288
3044.81568732.610212-2.7160427.871181411.27853700.2546263
6058.71682326.033329-2.6514587.42684846.06261250.1738104
0.009, 0.05 3.53074.15470038.831954-4.72752213.562873814.87038350.4515338
4577.26235635.034728-4.70354911.569108910.72488960.3635661
6063.77173037.754394-4.7280078.132233110.19545030.3217287

Conclusion

In this paper, we produced a new generator called Gull Alpha Power Family of distributions or in short GAPF. A special case of this family was derived by employing the CDF of the Weibull distribution as a baseline distribution. The special case is known as Gull Alpha Power Weibull distribution (GAPW). Various statistical properties have been discussed in addition to the parameter estimation using the maximum likelihood method. In future, a researcher may conduct a study on estimation of the parameters of the proposed model under Bayesian paradigm by using informative and non-informative priors. For a detailed discussion on Bayesian estimation we refer to see [31-34]. Furthermore, we have explored the special cases of GAPW. The significance of the proposed model is justified by using two real data sets as well as the simulated data. The TTT plot of the bladder cancer patient’s data clearly demonstrates that this data follows a non-monotonic hazard rate shape. While the TTT plot of the bank customers data follows a monotonic hazard rate shape. It has been observed that the proposed model performs well in both the non-monotonic and a monotonic hazard rate shapes as compared to the Weibull (W), Weibull Exponential (W.E), Exponential (E), Rayleigh (R), and Algoharai Inverse Flexible Weibull (AIFW) distribution.

Numerical values of skewness and kurtosis.

(DOCX) Click here for additional data file.

Maximum likelihood estimates and their standard errors.

(DOCX) Click here for additional data file.

Goodness of fit measures of the GAPW for bladder cancer data.

(DOCX) Click here for additional data file. (DOCX) Click here for additional data file. (DOCX) Click here for additional data file.

Maximum likelihood Estimates and their standard errors.

(DOCX) Click here for additional data file.

Remission time of Bladders cancer patients [26].

(DOCX) Click here for additional data file.

Bank customers data [27].

(DOCX) Click here for additional data file.
  4 in total

1.  Application of the Weibull distribution to describe the vertical distribution of cesium-137 on a slope under permanent pasture in Luxembourg.

Authors:  H Dahm; J Niemeyer; D Schröder
Journal:  J Environ Radioact       Date:  2002       Impact factor: 2.674

2.  Analysis of safety factors for urban expressways considering the effect of congestion in Shanghai, China.

Authors:  Jian Sun; Tienan Li; Feng Li; Feng Chen
Journal:  Accid Anal Prev       Date:  2015-12-23

3.  Injury severities of truck drivers in single- and multi-vehicle accidents on rural highways.

Authors:  Feng Chen; Suren Chen
Journal:  Accid Anal Prev       Date:  2011-04-22

4.  Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data.

Authors:  Feng Chen; Suren Chen; Xiaoxiang Ma
Journal:  J Safety Res       Date:  2018-04-25
  4 in total
  2 in total

1.  A new versatile modification of the Rayleigh distribution for modeling COVID-19 mortality rates.

Authors:  Mutua Kilai; Gichuhi A Waititu; Wanjoya A Kibira; M M Abd El-Raouf; Tahani A Abushal
Journal:  Results Phys       Date:  2022-02-22       Impact factor: 4.476

2.  Statistical Analysis of the COVID-19 Mortality Rates with Probability Distributions: The Case of Pakistan and Afghanistan.

Authors:  Javid Gani Dar; Muhammad Ijaz; Ibrahim M Almanjahie; Muhammad Farooq; Mahmoud El-Morshedy
Journal:  Comput Math Methods Med       Date:  2022-07-25       Impact factor: 2.809

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.