Literature DB >> 33270673

Kumaraswamy inverse Gompertz distribution: Properties and engineering applications to complete, type-II right censored and upper record data.

M El-Morshedy1,2, Adel A El-Faheem3, M El-Dawoody1,3.   

Abstract

This article proposes and studies a new three-parameter generalized model of the inverse Gompertz distribution, in the so-called Kumaraswamy inverse Gompertz distribution. The main advantage of the new model is that it has "an upside down bathtub-shaped curve hazard rate function" depending upon the shape parameters. Several of its statistical and mathematical properties including quantiles, median, mode, moments, probability weighted moment, entropy function, skewness and kurtosis are derived. Moreover, the reliability and hazard rate functions, mean time to failure, mean residual and inactive lifetimes are also concluded. The maximum likelihood approach is done here to estimate the new model parameters. A simulation study is conducted to examine the performance of the estimators of this model. Finally, the usefulness of the proposed distribution is illustrated with different engineering applications to complete, type-II right censored, and upper record data and it is found that this model is more flexible when it is compared to well-known models in the statistical literature.

Entities:  

Year:  2020        PMID: 33270673      PMCID: PMC7714165          DOI: 10.1371/journal.pone.0241970

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


1. Introduction

The two-parameter Gompertz (G) distribution was offered by [1] and it can be displayed as an extension of the exponential distribution. It has an influential role in survival analysis for forming adequate actuarial and human mortality tables. Also, it is a beneficial model for survival distributions characterized by increasing hazard rate and also to describe the distribution of adult life spans by demographers and actuaries; see [2]. Several authors have contributed to the studies that accentuate the statistical characterization and methodology of the G distribution; like [3-10]. The inverse distributions were introduced in the modeling literature in demography, biological and actuarial surveys; look in [10-15]. The inverse Gompertz (IG) distribution was proposed by [16] and it was introduced as a lifetime model. Suppose that X is a random variable that has an IG distribution whose shape and scale parameters are λ > 0 and β > 0, respectively. Then, the cumulative distribution function (CDF) of X takes the formula Also, the probability density function (PDF) of X takes the formula [17] presented a new lifetime model having only one parameter and it is called A distribution which is characterized by an upside-down bathtub shaped hazard function. The random variable Z has an A distribution with scale parameter β > 0, if the CDF of Z takes the form The PDF of Z takes the form Setting λ = 1 in Eqs (1) and (2), we deduce the CDF and PDF of A distribution with the parameter β. That is, the A distribution is a particular case of IG distribution. In many workable circumstances, the classical distributions don’t give a sufficient fit to actual data. Therefore, various generators are proposed to produce a new models; see [18-28]. The Kumaraswamy-G (K-G) family is one of the essential generators that have an increased interest after the persuasive debate on the pitfalls of the beta-G family suggested by [29]. Cordeiro and de Castro (2011) clarified the CDF of the two-parameter K-G that takes the formula The corresponding PDF to Eq (5) will be where, g(x) and G(x) are the PDF and CDF of a baseline random variable X. Also, θ > 0 and γ > 0 are two extra shape parameters. The principal suggest in our paper is a generalization of the IG distribution called the Kumaraswamy inverse Gompertz distribution, abbreviated KuIG, depending on the Eqs (5) and (6). The failure rate function of the new model takes the form of "an upside down bathtub-shaped". Another important characteristic of the KuIG is that it suitable for testing the goodness of fit of some special sub-models, such as the IG and A distributions. The article is distributed as follows: Section 2 introduces the CDF and the corresponding PDF of the KuIG. Section 3 presents several fundamental statistical properties. Some essential functions used in reliability analysis are introduced in Section 4. The maximum likelihood approach is mentioned in Section 5 to appreciate the parameters of the KuIG. In Sections 6 and 7, we will obtain the maximum likelihood estimators for type-II right censored and upper record data, respectively. The performance of the KuIG estimators is appreciated in Section 8 using a simulation study. We will analyze five actual data sets are three complete, one type-II right censored, and one upper record data in Section 9 and the results are compared with different known distributions. Finally, in Section 10, a conclusion for the obtained results is presented.

2. Kumaraswamy inverse Gompertz distribution

2.1. Specifications of KuIG

The non-negative random variable X is said to have the KuIG with the vector of parameters Ω = (α, β, γ), say X ∼ KuIG(Ω), if its CDF is given by the formula where, α = λθ. We can easily see that, if we substitute the CDF of A distribution, instead of the CDF of IG in Eq (5), we will obtain the same result. So, the proposed model can be named KuIG or KuA. The PDF of the KuIG will be The two parameters α and γ are the shape parameters and β is the scale parameter. Setting γ = 1 in the Eqs (7) and (8), we will get the CDF and PDF of the IG with the two parameters α and β, respectively. Moreover, if we put α = γ = 1 in the above two equations, we will obtain the CDF and PDF of A distribution with the parameter β. This confirms the fact that the IG and A distributions are particular cases of our proposed KuIG. Fig 1 shows the graphical behavior of the PDF for KuIG with different values of α, β and γ.
Fig 1

The graphs of the PDF for KuIG.

3. Statistical characteristics of KuIG

3.1. The quantiles and median

An explicit formula for the quantile and the median of KuIG are derived in this subsection. The quantile x of the KuIG is given as follows The median of KuIG is found by putting in Eq (9) as follows

3.2. The mode

The mode of KuIG is obtained by solving the equation below with respect to x. This equation has no explicit solution in x. So, some numerical methods are used to solve it.

3.3. The r moment

If X ∼ KuIG(Ω), then the r moment of X is found using By substituting from Eq (8) in Eq (12), we get the r moment as follows

3.4. Moment generating function

The moment generating function of KuIG, say M(t), is found using Substituting from Eq (13) into Eq (14), we obtain

3.5. The probability weighted moment

The probability weighted moment (PWM) was initially offered and introduced by [30]. The PWM of X with CDF F(x), say ξ, is assigned by If X ∼ KuIG(Ω), then the PWM ξ of X is given by the formula

3.6. The entropy function and ρ–entropy

Entropy performs a pivotal role in engineering, information theory, computer science and probability theory. It can be used as a measure of dispersion for the uncertainty associated with a random variable X; see [31]. The Rényi entropy of X with PDF (x), say I(X), is expressed by If X ∼ KuIG(Ω), then I(X) is given by the formula The ρ–entropy of X, say H(X), is found by

3.7. Skewness and kurtosis

The effect of the shape parameters α and γ on skewness (S) and kurtosis (K) is investigated using the quantiles of KuIG given in Eq (9) [32] suggested skewness using quartiles called the Bowley skewness which is defined as follows Also [33], proposed kurtosis based on octiles called the Moors kurtosis which is defined as where, q(.) is the quantile function. Fig 2 gives the graphs of S and K for some values of α when γ = 0.6. This figure reveals that both measures S and K decreases when β increases for fixed α.
Fig 2

The graphs of the S (left panel) and the graphs of the K (right panel) for KuIG distribution.

4. Reliability analysis

4.1. The survival and failure rate functions

The survival (reliability) function of X ∼ KuIG(Ω) is found by the formula The failure (hazard) rate function (HRF) of X is found by the formula The graphic behavior of the HRF of KuIG with various choices of α, β and γ is offered in Fig 3.
Fig 3

The graphs of the HRF for KuIG.

If X ∼ KuIG(Ω), then the reversed failure rate function of X is

4.2. The mean time to failure

If X ∼ KuIG(Ω), then the mean time to failure (MTTF) of X is found by Substituting from Eq (13), when r = 1, we find that

4.3. The mean residual lifetime

In life testing situations and reliability theory, the mean residual lifetime (MRL), say M(t), is known as the anticipated remaining lifetime T − t, provided that the component has been survived until a time t. The M(t) is found using the formula If T ∼ KuIG(Ω), then the MRL of T is

4.4. The mean inactive lifetime

The mean waiting (inactive) lifetime (MWT), say M(t), measures the time elapsed since the component fails with lifetime T, provided that it has failed some time before t, t > 0. It is defined as If T ∼ KuIG(Ω), then the MWT of T is

5. Maximum likelihood estimators for complete data

In this section, we will discuss and study how to use the maximum likelihood approach to appreciate the unknown parameters (α, β, γ) of the KuIG. Suppose that x1, x2, …, x be a randomly selected sample with size n from the KuIG(Ω), thus the log-likelihood function L(Ω) for it is given by By deriving the first partial derivatives of L(Ω) with regard to α, β and γ and put it equal to zero, the normal equations of L(Ω) will take the forms and Eqs (30), (31) and (32) don’t have an explicit solutions to and . Therefore, we will solve the previous system of equations numerically to obtain the maximum likelihood estimators (MLEs) .

6. Maximum likelihood estimators for type-II right censored data

If a life testing experiment stopped over when a limited number of items are observed to be failed, then the remaining items are indicated to be a type-II right censored. The inference associated with this type of data is less efficient than the inference associated with the complete data because some information about the parameters under study will be missed with the type-II right censored data; see [34]. If x(1), x(2), ……, x(, k ≤ n denote the ordered values of a random sample x1, x2, ……, x (failure times) and observations terminate after the kth failure occurs, then the likelihood function take the form ([35]) If x1, x2, …, x be a random sample taken from KuIG(Ω), then the L(Ω) of x(1), x(2), ……, x(, k ≤ n is found by the formula The first partial derivatives of L(Ω) are obtained by differentiating Eq (34) for α, β and γ as and Equating Eqs (35), (36) and (37) to zero, we will get the normal equations of L which don’t have an explicit solutions to , and and must be solved numerically to find the maximum likelihood estimators (MLEs) .

7. Maximum likelihood estimators for upper record data

The study of record values has extensive applications to real world situations such as sporting events, meteorological and seismological sciences and life testing studies. The upper record value is that one which is larger than all watched values so far. Suppose that X = {X, X, …, X} is an upper record values taken from a random sample x1, x2, …, x that follow the KuIG, then the likelihood function of X can be expressed by [36] From Eqs (7) and (8), we can obtain the L(Ω) as follows Differentiating Eq (39), we will obtain the first partial derivatives of L(Ω) with regard to α, β, and γ as and By equal Eqs (40), (41) and (42) to zero, we will get the normal equations of L which don’t have an explicit solutions to , and and must be solved numerically to find the MLEs .

8. Simulation results

A simulation study will be accomplished, in this section, to appreciate the performance of the MLEs by using the bias estimates and the mean squared errors (MSEs); see [37]. This simulation is carried out by using "R" language. We examine the behavior of the MLEs for varied values of n, α, β and γ. Furthermore, the graphical comparison of these three parameters according to the bias estimates and MSEs for the KuIG distribution is displayed in Figs 4, 5 and 6 which demonstrates that both bias estimates and MSEs are decreased as n increases. That is, estimating the parameters of the KuIG by using the MLE technique implements quite well.
Fig 4

The bias estimates and MSEs of the KuIG for various values of n when (α, β, γ) = (0.1, 3.1, 3.5).

Fig 5

The bias estimates and MSEs of the KuIG for various values of n when (α, β, γ) = (0.3, 2.1, 3.5).

Fig 6

The bias estimates and MSEs of the KuIG for various values of n when (α, β, γ) = (0.7, 2.1, 3.5).

9. Data analysis

We will analyze here five real data sets, three of them are complete, one is type-II right censored and one is upper record data to clarify that the KuIG distribution is a good lifetime model, compared with many known models like inverse flexible Weibull (IFW), exponentiated inverse flexible Weibull (EIFW), inverse Weibull (IW), inverse exponential (IE), inverse Rayleigh (IR), A distribution (A), and inverse Gompertz (IG) distributions. The MLE method will be used to compare the goodness-of-fit of the KuIG with these distributions. All mentioned distributions will be fitted in each data set according to some different criteria, namely, the Kolmogorov Smirnov test statistic (KS) with its corresponding P-values. Also, the log-likelihood values (L), Akaike information criterion (AIC), correct Akaike information criterion (CAIC), Hannan-Quinn information criterion (HQIC), Bayesian information criterion (BIC), Cramér-von Mises (W*) statistic and Anderson-Darling (A*) statistic will be found; see [38-43].

9.1. Complete data set I

The first data mentioned by [44] and represents the strength of glass for aircraft window (see, A1 in S1 Appendix). The MLEs, KS and P-values are given in Table 1 for all eight studied models. It is obvious that the KuIG has the smallest KS value and the largest P-value through whole models elaborated. This emphasizes that the KuIG fits the first data better than IE, IR, IW, A, IFW, EIFW and IG models. On the other hand, for the eight mentioned models, the KuIG has the smallest values of–L, AIC, CAIC, HQIC, BIC, W* and A*. This confirms that the KuIG appears to be a very competitive model to data I better than the other seven models.
Table 1

The MLEs, KS, P-values, –L, AIC, CAIC, HQIC, BIC, A* and W* values for data I.

StatisticsModels
IFWEIFWIWIRIEAIGKuIG
α^61.1672.3764461827810.50429.215125.6621.24979.042
β^0.08590.1644.655––––––119.76218.694
γ^––81.512––––––––––26.554
KS0.1460.1360.1460.3250.4770.1620.1390.124
P-value0.4790.5670.4820.0026.15×10−70.3540.5380.681
L104.963104.141105.323118.201137.262107.950107.884103.988
AIC213.927214.282214.647238.401276.523217.901219.768213.976
CAIC214.355215.171215.075238.539276.661218.039220.196214.865
BIC216.795218.584217.515239.835277.957219.335222.636218.278
HQIC214.862215.684215.582238.869276.990218.368220.702215.379
W*0.0780.07420.0830.0750.0740.1220.1180.074
A*0.4670.3970.5030.4030.3920.8040.7780.394
In Figs 7 and 8, we show the estimated PDFs, estimated CDFs and P-P plots of all tested distributions using the estimators obtained in Table 1. From these figures, it is noticed that the KuIG fits the first data better than the other seven models. The likelihood ratio test (LRT) can be used here to test if the fit by KuIG model is statistically superior to the fit by A and IG models for data set I. Table 2 gives the values of the LRT, degree of freedom (d.f) and its P-values for data I. Based on the P-values, we will reject the null hypothesis (H0) at a level of significance = 0.05.
Fig 7

The estimated PDFs (left panel) and the estimated CDFs (right panel) of data I.

Fig 8

The P-P plots for data I.

Table 2

The LRT, degree of freedom and P-value for data I.

ModelsNull Hypothesis (H0)ΛD.FP-value
Aα = γ = 1 or x1, x2, …, xn ~ A(β)7.92520.019
IGγ = 1 or x1, x2, …, xn ~ IG(β)7.79210.005
The profile of L(Ω) for the parameters of KuIG in case of the first real set of data is given in Fig 9 which confirms that only one solution is existed for the likelihood equations. The total test time (TTT) plot which specifies some qualitative readings about the failure rate shape is also given in Fig 9.
Fig 9

The profile of L(Ω) for data set I (left panel) and the TTT plot (right panel).

9.2. Complete data set II

The second data introduced by [45] represents the tiredness lifetime of 101 6061-T6 aluminum coupons (see, A2 in S1 Appendix). Table 3 presents the MLEs, KS and the P-values for all compared distributions. Also, the values of–L, AIC, CAIC, HQIC, BIC, A* and W* are determined for these distributions. From Table 3, we find that the KuIG is the best model between all studied distributions. Figs 10 and 11 shows the estimated PDFs, estimated CDFs and P-P plots of the tested distributions using the estimators obtained in Table 3. These figures confirm that the KuIG fits data II better than all seven tested models. Fig 12 gives the profile of the L(Ω) of the parameters α, β and γ and the TTT-plot for the second real set of data. From the profile of the L(Ω), we conclude that one and only solution is existed for the likelihood equations. Table 4 presents the values of LRT, d.f with the P-values for data set II. We deduce that the null hypotheses (H0) are rejected at = 0.05.
Table 3

The MLEs, KS, P-values, –L, AIC, CAIC, HQIC, BIC, A* and W* values for data II.

StatisticsModels
IFWEIFWIWIRIEAIGKuIG
α^295.46678.7923.28×101016361.23129.933705.557.435599.595
β^0.02060.03865.051––––––501.77553.955
γ^––58.724––––––––––180.479
KS0.1390.1130.1330.4030.5060.3660.2060.067
P-value0.0390.1530.0551.21×10−140.03.77×10−120.000380.761
L476.101465.265475.186530.197595.547517.597494.448456.431
AIC956.202936.531954.3721062.3931193.0941037.194992.896918.862
CAIC956.325936.778954.4941062.4341193.1351037.234993.018919.109
BIC961.432944.376959.6021065.0091195.7091039.809998.126926.707
HQIC958.319939.707956.4891063.4521194.1531038.253995.013922.038
W*0.4370.2380.4320.1720.1211.2040.8030.056
A*2.5481.3492.4930.9750.6897.0254.7070.360
Fig 10

The estimated PDFs (left panel) and the estimated CDFs (right panel) of data II.

Fig 11

The P-P plots of data II.

Fig 12

The profile of L(Ω) for data II (left panel) and the TTT plot (right panel).

Table 4

The LRT, degree of freedom and P-value of data II.

ModelsNull Hypothesis (H0)ΛD.FP-value
Aα = γ = 1 or x1, x2, …, xn ~ A(β)122.33220
IGγ = 1 or x1, x2, …, xn ~ IG(β)76.03410

9.3. Complete data set III

The third complete data set symbolizes the simulated strengths of glass fibers presented by [46] (see, A3 in S1 Appendix). In Table 5, we present the MLEs, KS and the P-values for all tested distributions. Moreover, the values of–L, AIC, CAIC, HQIC, BIC, A* and W* are determined for these distributions. From Table 5, we find that the KuIG is the best model between all studied distributions. The estimated PDFs, estimated CDFs with the P-P plots of the tested distributions using the estimators obtained in Table 5 are presented in Figs 13 and 14. These figures confirm that the KuIG fits data set III better than all seven tested models. Fig 15 shows the profile of the L(Ω) of the parameters α, β and γ and the TTT-plot. It is appeared, from the profile of the L(Ω), that one and only solution exists for the likelihood equations. Table 6 describes the values of LRT, d.f with the P-values for data III. According to the P-values, the null hypothesis (H0) will be refused at = 0.05.
Table 5

The MLEs, KS, P-values, –L, AIC, CAIC, HQIC, BIC, A* and W* values of data III.

StatisticsModels
IFWEIFWIWIRIEAIGKuIG
α^3.7324.1696.4982.2331.5262.1110.0320.684
β^1.8691.6665.438––––––7.5834.223
γ^––0.544––––––––––3.452
KS0.0820.0840.0770.3600.4680.5210.1010.068
P-value0.7560.7390.8197.762×10−83.537×10−134.44×10−160.5080.917
L20.61820.59320.06453.38192.80563.32222.80919.719
AIC45.23747.18644.128108.762187.609128.64549.61745.439
CAIC45.43747.59344.328108.827187.675128.71049.81745.847
BIC49.52353.61548.414110.905189.753130.78853.90351.869
HQIC46.92349.71545.814109.604188.452129.48751.30347.969
W*0.0790.0810.0710.0870.1260.06290.1380.0610
A*0.6100.6160.5330.7090.9820.5140.9280.471
Fig 13

The estimated PDFs (left panel) and the estimated CDFs (right panel) for data III.

Fig 14

The P-P plots of data III.

Fig 15

The profile of L(Ω) for data set III (left panel) and the TTT plot (right panel).

Table 6

The LRT, degree of freedom and P-value for data set III.

ModelsNull Hypothesis (H0)ΛD.FP-value
Aα = γ = 1 or x1, x2, …, xn ~ A(β)87.20420
IGγ = 1 or x1, x2, …, xn ~ IG(β)6.17710.013

9.4. Data set IV (Type-II right censored data)

The censored data analyzed here was introduced by [47] which represents the fatigue life for 10 bearings of a specific type in hours and a sample of size k = 8 from this data is taken (see, A4 in S1 Appendix). Table 7 shows the MLEs, –L, and KS with the P-values for the A, IG and KuIG distributions. It is clear that, the KuIG has the lowest -L and KS value and the highest P-value and this emphasizes that KuIG fits the studied data here better than A and IG.
Table 7

The MLEs, K-S, P-values and –L values for type-II right censored data.

ModelsMLEs-LKSP-value
Aα^=1295.529.2240.3670.135
IGα^=2.211,β^=1.14424.3570.3490.175
KuIGα^=1.560,β^=1201.6,γ^=0.90922.9710.1590.961

9.5. Data set V (Upper record data)

The record data analyzed here are introduced by [48] which contains n = 11 lifetimes to breakdown of an electric isolating fluid exposed to thirty kilovolt (see, A5 in S1 Appendix). From this data, it is found that the upper record values are 2.836, 3.120, 5.169 and 5.272. Table 8 summarizes the MLEs, –L, and KS with the P-values for A, IG and KuIG distributions. It is clear that, the KuIG has the lowermost -L and KS value and the uppermost P-value and this emphasizes that the KuIG fits this type of data better than the A and IG models.
Table 8

The MLEs, –L, KS, and P-values for upper record data.

ModelsMLEs-LKSP-value
Aα^=7.1367.5920.6870.046
IGα^=2.82×10-3,β^=26.3993.250.5130.285
KuIGα^=0.204,β^=13.606,γ^=2.2352.9930.4790.317

10. Conclusions

In this article, we proposed a new model named KuIG which is considered as an extension and generalization to inverse Gompertz and A distributions. The studied model is characterized by an upside down bathtub-shaped curve hazard rate function depending upon the shape parameters. Also, the KuIG is appropriate for testing the goodness of fit of some special sub-models, such as the IG and A distributions. Some statistical and mathematical properties including the quantiles, median, mode, moments, PWMs, entropies, skewness and kurtosis of KuIG are derived and discussed. Also, some basic functions used in reliability theory are obtained such as the reliability function, HRF, reversed HRF, MTTF, MRL and MWT. The parameters of KuIG are appreciated using the MLE in case of complete, type-II right censored and upper record data. A simulation is done to determine the performance of MLEs according to biases and MSEs. Three complete data sets are analyzed using the KuIG and it is compared with seven other life time distributions. Also, we analyzed a type-II right censored and upper record data and it is compared with the IG and A distributions. It is found that the KuIG has more flexibility for fitting the various data in engineering applications than the mentioned models. Future works include (i) bivariate extension of the KuIG, (ii) KuIG-G family of distributions, (iii) different estimation methods of the KuIG and (iv) discrete case of the KuIG. (DOCX) Click here for additional data file.
  5 in total

1.  A multivariate random-parameters Tobit model for analyzing highway crash rates by injury severity.

Authors:  Qiang Zeng; Huiying Wen; Helai Huang; Xin Pei; S C Wong
Journal:  Accid Anal Prev       Date:  2016-11-30

2.  Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data.

Authors:  Feng Chen; Suren Chen; Xiaoxiang Ma
Journal:  J Safety Res       Date:  2018-04-25

Review 3.  Deciphering death: a commentary on Gompertz (1825) 'On the nature of the function expressive of the law of human mortality, and on a new mode of determining the value of life contingencies'.

Authors:  Thomas B L Kirkwood
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2015-04-19       Impact factor: 6.237

4.  Crash Frequency Modeling Using Real-Time Environmental and Traffic Data and Unbalanced Panel Data Models.

Authors:  Feng Chen; Suren Chen; Xiaoxiang Ma
Journal:  Int J Environ Res Public Health       Date:  2016-06-18       Impact factor: 3.390

5.  Investigation on the Injury Severity of Drivers in Rear-End Collisions Between Cars Using a Random Parameters Bivariate Ordered Probit Model.

Authors:  Feng Chen; Mingtao Song; Xiaoxiang Ma
Journal:  Int J Environ Res Public Health       Date:  2019-07-23       Impact factor: 3.390

  5 in total
  1 in total

1.  On the Burr XII-moment exponential distribution.

Authors:  Fiaz Ahmad Bhatti; G G Hamedani; Mustafa Ç Korkmaz; Wenhui Sheng; Azeem Ali
Journal:  PLoS One       Date:  2021-02-22       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.