Literature DB >> 33286223

Statistical Inference of the Half-Logistic Inverse Rayleigh Distribution.

Abdullah M Almarashi1, Majdah M Badr2, Mohammed Elgarhy3, Farrukh Jamal4, Christophe Chesneau5.   

Abstract

The inverse Rayleigh distribution finds applications in many lifetime studies, but has not enough overall flexibility to model lifetime phenomena where moderately right-skewed or near symmetrical data are observed. This paper proposes a solution by introducing a new two-parameter extension of this distribution through the use of the half-logistic transformation. The first contribution is theoretical: we provide a comprehensive account of its mathematical properties, specifically stochastic ordering results, a general linear representation for the exponentiated probability density function, raw/inverted moments, incomplete moments, skewness, kurtosis, and entropy measures. Evidences show that the related model can accommodate the treatment of lifetime data with different right-skewed features, so far beyond the possibility of the former inverse Rayleigh model. We illustrate this aspect by exploring the statistical inference of the new model. Five classical different methods for the estimation of the model parameters are employed, with a simulation study comparing the numerical behavior of the different estimates. The estimation of entropy measures is also discussed numerically. Finally, two practical data sets are used as application to attest of the usefulness of the new model, with favorable goodness-of-fit results in comparison to three recent extended inverse Rayleigh models.

Entities:  

Keywords:  entropy; half-logistic transformation; inverse Rayleigh distribution; moments; real data analysis; statistical inference

Year:  2020        PMID: 33286223      PMCID: PMC7516925          DOI: 10.3390/e22040449

Source DB:  PubMed          Journal:  Entropy (Basel)        ISSN: 1099-4300            Impact factor:   2.524


1. Introduction

In the seventies, [1] introduced what will be reveal as an important distribution for lifetime and reliability studies, known as inverse Rayleigh (IR) distribution. Specially, it provides an appropriate statistical model when dealing with unimodal highly right-skewed data. As mathematical basis, the corresponding cumulative distribution function (cdf) and probability density function (pdf) are given by respectively, where is a scale parameter. As notable features, the IR distribution has tractable and simple probability functions, is unimodal and right-skewed, and possesses a hazard rate function with a singular curvature: it increases at a certain value, then decreases until attain a kind of stabilization. The pioneer studies are [2] which presents some properties of the maximum likelihood estimator of , and [3] which provides closed-form expressions for the (standard) mean, harmonic mean, geometric mean, mode and median of the IR distribution. Also, among the amount of works investigating the statistical aspects of the IR distribution, the reader can be referred to [4,5,6,7,8,9,10,11,12]. In the recent years, several extensions of the IR distribution were developed, using different mathematical techniques, often at the basis of general families of distributions. Among them, there are the beta IR (BIR) distribution by [13], transmuted IR (TIR) distribution by [14], modified IR (MIR) distribution by [15], transmuted modified IR (TMIR) distribution by [16], transmuted exponentiated IR (TEIR) distribution by [17], Kumaraswamy exponentiated IR (KEIR) distribution by [18], weighted IR (WIR) distribution by [19], odd Fréchet IR (OFIR) distribution by [20], type II Topp-Leone IR (TIITLIR) distribution by [21], type II Topp-Leone generalized IR (TIITLGIR) distribution by [22] and exponentiated IR (EIR) distribution by [23]. However, to the best of our knowledge, the use of the half-logistic transformation to extend the IR distribution remains unexplored, despite recent success in this regard. This half-logistic transformation was pioneered by [24] in the context of the half logistic generated (HL-G) family of continuous distributions. As main functions, the cdf and pdf of the HL-G family are, respectively, given by and where is a shape parameter, is a cdf of a baseline/parent continuous distribution, with corresponding pdf , and  represents the baseline parameters (under a vector form, say ). Thus, many studies used the HL-G family to introduce new flexible continuous distributions with modelling perspectives, such as [25] with the HL Lomax (HLL) distribution, [26] in which different methods of estimation for the HLL distribution are proposed, ref. [27] with the HL power Lindley (HLPL) distribution, [28] with the HL generalized Weibull (HLGW) distribution, [29] with the HL Burr X (HLBX) distribution and [30] which studied different estimation methods for the HL Topp-Leone (HLTP) distribution. That is, attracted by the success of the above extensions, we investigate the half-logistic IR (HLIR) distribution, constituting a new lifetime distribution with two parameters, and a new extension of the IR distribution as well. As expressed later, the cdf and pdf of the HLIR distribution are obtained by inserting (1) into (2) and (3), respectively. In view of these functions, the HLIR can also be viewed as a special case of the HL Fréchet (HLF) distribution by [24], i.e., with parameter , case that not received a particular attention. The aim of this paper is to provide a solid and complete study on the HLIR distribution, with an emphasis on the statistical inference of the related model. The essential mathematical properties are provided, showing the overall flexibility of the HLIR distribution via various measures (central, dispersion, as/symmetrical, entropy…). Then, five different methods of estimation are developed for the HLIR model parameters, specifically the maximum likelihood, least square, weighted least square, percentile and Cramer-von Mises methods. The estimation of the Rényi entropy and q-entropy is also discussed by using the plugging and ML methods. Then, we show that the fits provided by the HLIR model can accommodate data with various features, and can demonstrate better goodness-of-fits than the three following extended IR two-parameter models: the TIITLIR model (by [21]), TIR model (by [14]) and OFIR model (by [20]), and than the former one-parameter IR model as well. Two practical data sets are analyzed in this regard. The following sections composed the paper. Section 2 is devoted to the main probability functions of the HLIR distribution. Section 3 introduces some mathematical properties of the HLIR distribution including stochastic ordering results, a general linear representation for the exponentiated probability density function, raw/inverted moments, incomplete moments, skewness and kurtosis features, and some entropy measures. Section 4 discusses the estimation of the model parameters and entropy. In Section 5, we reveal the potential of the HLIR model compared with some other models in a concrete statistical setting. The paper encloses with some concluding remarks in Section 6.

2. The HLIR Distribution

This section introduces the main functions on the HLIR distribution, along with some analytical properties.

2.1. Probability Functions

As described in the introduction, the cdf and pdf of the HLIR distribution with the vector of parameters is obtained by inserting Equation (1) into Equation (2) and Equation (3), i.e., and respectively, where is a scale parameter and is a shape parameter. Important reliability functions of the HLIR distribution are presented below. The survival function (sf), hazard rate function (hrf), reversed hazard rate function (rhrf) and cumulative hazard rate function (chrf) of the HLIR distribution are, respectively, given by and Implications of these functions in survival analysis can be found in [31]. For modelling purposes, the pdf and hrf are informative on the ability of the HLIR model to fit data. For this reason, in the next, we put a focus on these two functions.

2.2. Functions Analysis

The limit features of the cdf, pdf and hrf of the HLIR distribution, i.e., given by Equation (4), Equation (5) and Equation (6), are studied below. In the case where , we have Thus, both the pdf and hrf tend to 0 in this case, with the same polyno-exponential decay. Also, we see that the rate of convergence mainly depends on the parameter . On the other side, when , we get Hence, in this case, both the pdf and hrf tend to 0. The rate of convergence of the pdf mainly depends on the parameter , whereas the one of the hrf is fixed and of order . The mode(s) of the HLIR distribution is(are) given by critical point(s) of the corresponding pdf. Here, after some algebraic manipulations, it(they) is(are) given as solution(s) of the following non-linear equation: Closed-form(s) for the mode(s) is(are) not available, but a mathematical software must help for a numerical evaluation. On the other side, the critical point(s) is(are) given by the solution(s) of the following non-linear equation: This(these) critical point(s) is(are) not expressible in an easy manner, but can be evaluated numerically. A more direct analysis of the shapes comes from graphical investigations. In this regard, Figure 1 and Figure 2 plot the pdf and hrf, respectively, for selected values of .
Figure 1

Plots of the pdf of the HLIR distribution for (a) ; (b) and (c) , with varying .

Figure 2

Plots of the hrf of the HLIR distribution for (a) ; (b) and (c) , with varying .

We observe that the pdf of the HLIR distribution can be unimodal, and higlhy right-skewed (see Figure 1a), moderately right-skewed (see Figure 1b), or near symmetrical (see Figure 1c), with various heaviness on the tails. Moreover, we see that the hrf of the HLIR distribution can be decreasing with a reversed J-shape (see Figure 2a), increasing-decreasing with a reversed bathtub shape (see Figure 2b), or mainly increasing (see Figure 2c). In the curvature sense, these functions are significantly more flexible in comparison to those of the former IR distribution, motivating the consideration of the HLIR model for greatest statistical perspectives.

2.3. Quantile Function

The quantile function (qf) of the HLIR distribution, say , can be obtained by inverting the corresponding cdf; it satisfies for . After some algebraic manipulations, we arrive at Thanks to its closed-form, this qf is helpful for determining the quartiles of the HLIR distribution, generating values from the HLIR distribution for simulation purposes and defining various measures of skewness and kurtosis. All these aspects will be used later.

3. Mathematical Properties

In this section, some notable mathematical properties of the HLIR distribution are derived, specifically stochastic ordering results, a general linear representation for the exponentiated probability density function, raw/inverted moments, incomplete moments, skewness and kurtosis features, and some entropy measures, namely, the Rényi entropy and q-entropy.

3.1. Some Stochastic Ordering Results

The HLIR distribution enjoys tractable stochastic ordering results involving the corresponding cdf. From a statistical point of view, such results allow a better comprehension of the roles of the parameters in the fitting ability of the HLIR model. The most notable of them are presented below. The following inequalities holds: For any For any The proof is based on monotonic arguments with respect to the parameters. After some algebraic manipulations, we get implying that is strictly decreasing with respect to . Therefore, for  any and , we have . With the same methodology, we have implying that is strictly increasing with respect to . Therefore, for  any and , we have . This ends the proof of Proposition 1. □ The following result shows a simple relation between the HLIR distribution and two other distributions, including the EIR distribution by [23]. For any where The proof follows from the definition of , involving as numerator, and the following inequalities: since for any , This ends the proof of Proposition 2. □ In fact, the cdf in Proposition 2 is the cdf of the maximum of two independent random variables: one following the EIR distribution and the other following a special case of the power Lomax (PL) distribution introduced by [32].

3.2. Linear Representation

The following result introduces a useful linear representation for the exponentiated pdf of the HLIR distribution with power parameter . Let where and The generalized binomial series formula applied two times in a row yields After a rearrangement, we get the desired result, ending the proof of Proposition 3. □ By taking in Proposition 3, we get a useful series expansion for the pdf of the HLIR distribution, “useful” in the sense that we express a sophisticated function as sums of tractable functions, i.e., . In particular, we will use it in the next to provide measures and functions which are easy to handle from the analytical and numerical point of views.

3.3. Raw/Inverted Moments

Let r be an integer; the negative values are allowed. If X denotes a random variable following the HLIR distribution, then its moment (or th inverted moment if r is negative) exists if and only if , and it is given by . Thanks to Proposition 3 applied with and the calculus of the integral via the change of variable , assuming that , we obtain the following relation where and (the standard gamma function). For instance, the mean of X, say , can be derived by taking , and the following approximation remains acceptable: where M denotes a large integer and The variance of X does not exist, as well as the standard coefficients of skewness and kurtosis. This motivates the use of other measures of skewness and kurtosis based on quantiles, as performed in Section 3.5. However, we can express all the inverted moments, i.e., , , and so on.

3.4. Incomplete Moments

Let r be an integer; the negative values are allowed. Contrary to the moments, the incomplete moments of the HLIR distribution always exist. That is, if X denotes a random variable following the HLIR distribution, for a given , the  incomplete moment of X at t is given by . Proceeding as for the raw/inverted moments, the following formula holds: where and (the upper incomplete gamma function). We rediscover the relation . Also, we can derive the normalized incomplete moment as , which is key tool to define widely used measures of inequality, such as income quintiles, Lorenz curve, Pietra ratio and Gini coefficient. For instance, the Lorenz curve is the plot of and the Gini coefficient is defined by These measures are essential to go further with the HLIR distribution in applied settings. Further details and applications of them can be found in [33,34], and the references therein.

3.5. Skewness and Kurtosis Based on the qf

We now provide skewness and kurtosis analyzes of the HLIR distribution by using some measures involving the qf given by (7). Let us set with . Then, we consider the coefficient of skewness S by [35] and the coefficient of kurtosis K by [36] defined by respectively. Here, S measures the degree of asymmetry of the HLIR distribution, whereas K measures the degree of its tail heaviness; as K increases, the tail of the HLIR distribution becomes heavier. To better handle these measures and see the effects of and on them, Figure 3 displays the two-dimensional plots for S and K with respect to and , with  .
Figure 3

Plots for (a) the skewness S and (b) the kurtosis K for .

From Figure 3, we see that the parameter has a minor effect on S and K, contrary to ; as  increases, S and K increase. Also, we see that the HLIR distribution is mainly right-skewed, confirming the prime graphical investigations on the corresponding pdf.

3.6. Measures Of Entropy

The entropy of the HLIR distribution can be measured in different ways. Here, we focus our attention on the Rényi entropy by [37] and its twin sister: the q-entropy by [38]. For discussions and applications of these two entropy measures, we refer the reader to the survey of [39], and the references therein. Let and . Then, when , the Rényi entropy of the HLIR distribution exists, and it is given by Let us now investigate a practical series expansion of the main term. Owing to Proposition 3 applied with and the calculus of the integral via the change of variable , assuming that , we get where Therefore, one can express , along with an acceptable approximation, as follows: where M denotes a large integer. Now, let and . Then, the q-entropy of the HLIR distribution is defined by Therefore, owing to (9), with a similar approach than above, we get Other kinds of entropy can be expressed in a similar manner. In this regard, the book of [40] is suggested.

4. Estimation

By considering the HLIR distribution as a statistical model, this section investigates the estimation of and via the five different methods mentioned in the introduction, also providing the interval estimation of these parameters, and the estimation of the Rényi entropy and q-entropy as well.

4.1. Estimation of The Parameters

A myriad of estimation methods can be used to estimate and . Here, we focus on the most notable of them, namely the maximum likelihood (ML), least square (LS), weighted least square (WLS), percentile (PC) and Cramer-von Mises (CV) methods. We first describe their mathematical backgrounds and perform an adequate simulation study to check their efficiency.

4.1.1. ML Method

First of all, we investigate the ML estimates (MLEs) of and . Let be n observed values from the HLIR distribution. Then, the MLEs can be calculated by maximizing the following function: with respect to and . From the analytical point of view, the MLEs of and are the solutions of two non-linear equations using the first partial derivatives of , i.e.,  and Then, equating and to zeros and solving them simultaneously with respect to and , we obtain the MLEs (the same derivative approach can be developed in the next methods, but we omit it for the sake of conciseness). Under some regularity conditions, the random versions of the MLEs are known to be consistent, asymptotically normal, efficient and equivariant. Also, the formulas for the corresponding standard errors, asymptotic confidence intervals and likelihood ratio tests involving the MLEs are well-known. In this regard, we may refer to [41]. From the practical point of view, they can be determine numerically thanks to the use of any statistical software (R, SAS, Python, MATHCAD…).

4.1.2. Ordinary and Weighted LS Methods

Let be n observed values from the HLIR distribution and be their ordered values, i.e., , …and . Then, the LS estimates (LSEs) of and can be calculated by minimizing the following function: with respect to and . Similarly, by introducing a thorough weighted sequence, the WLS estimates (WLSEs) of and can be obtained by minimizing the following function: with respect to and . For the theoretical background of this method and applications as well, we refer the reader to [42].

4.1.3. PC Method

Let be n observed values from the HLIR distribution and be their ordered values. Then, the PC estimates (PCEs) of and are derived by minimizing the following function: with respect to and . The basics of this method can be found in [43,44].

4.1.4. CV Method

Let be n observed values from the HLIR distribution and be their ordered values. The CV estimates (CVEs) is a type of minimum distance estimates which is based on the difference between the estimated and empirical cdfs (see [45,46]). That is, the CVEs of and are obtained by minimizing the following function: with respect to and . Last but not least, we refer to [47] for empirical evidence of the efficiency of the CVEs.

4.1.5. Numerical Results

Here, we conduct a simulation study to evaluate and compare the efficiency of the above estimates, with respect to their mean squared errors (MSEs). In this regard, for each , 20, 30, 50, 100 and 200, we generate 1000 random samples of size n, i.e., of the form , from the HLIR distribution (the inversion method involving the qf given by (7) is used). The four following different sets of parameters are considered: Then, the ML, LS, WLS, PC and CV estimates of and are computed, as well as their MSEs. Simulated outcomes are listed in Table 1, Table 2, Table 3 and Table 4 for , , and , respectively. The statistical software MATHCAD(14) is used.
Table 1

Estimates and MSEs with the ML, LS, WLS, PC and CV methods for the HLIR model with .

n MLEsLSEsWLSEsPCEsCVEs
EsMSEEsMSEEsMSEEsMSEEsMSE
101.7130.3021.4780.3741.4710.2871.3780.3191.7441.379
0.9910.2870.8550.4470.8410.3951.0057.8101.1091.215
201.6270.1301.4960.1421.5110.1261.4020.1421.6230.843
0.9040.0790.8280.0780.8350.0710.7980.1450.9280.130
301.5670.0651.4920.0901.5100.0821.3730.0981.5750.699
0.8560.0340.8080.0430.8180.0400.7450.0750.8660.059
501.5450.0371.5020.0541.5140.0451.4080.0571.5530.623
0.8380.0200.8140.0260.8190.0220.7520.0450.8490.032
1001.5230.0181.5030.0261.5130.0221.4240.0311.5270.555
0.8178.129 *0.8050.0120.8109.531 *0.7470.0240.8210.013
2001.5078.193 *1.4978.467 *1.5009.781 *1.4460.0171.5060.511
0.8074.213 *0.8003.667 *0.8034.774 *0.7610.0140.8085.512 *

The symbol * indicate that the value multiply .

Table 2

Estimates and MSEs with the ML, LS, WLS, PC and CV methods for the HLIR model with .

n MLEsLSEsWLSEsPCEsCVEs
EsMSEEsMSEEsMSEEsMSEEsMSE
101.8220.6131.5210.6171.5100.5421.3980.4551.8682.750
0.6150.0880.5320.0870.5340.1170.5550.2750.6610.199
201.6820.2111.5080.2351.5590.2431.4070.1961.6731.673
0.5540.0210.5080.0230.5250.0300.4980.0440.5600.035
301.5900.1041.4860.1511.5110.1211.3780.1311.5911.360
0.5360.0140.5060.0140.5140.0140.4850.0320.5390.019
501.5570.0571.4850.0741.5110.0701.3950.0741.5471.176
0.5176.076 *0.4997.249 *0.5046.665 *0.4730.0140.5178.380 *
1001.5220.0251.5040.0381.5000.0301.4200.0411.5341.109
0.5072.733 *0.5013.607 *0.5022.987 *0.4767.316 *0.5103.905 *
2001.5160.0111.5030.0211.5080.0141.4450.0201.5181.058
0.5041.175 *0.5001.744 *0.5021.370 *0.4794.204 *0.5051.811 *

The symbol * indicates that the value multiply .

Table 3

Estimates and MSEs with the ML, LS, WLS, PC and CV methods for the HLIR model with .

n MLEsLSEsWLSEsPCEsCVEs
EsMSEEsMSEEsMSEEsMSEEsMSE
101.6960.2631.4660.2701.4770.2481.3830.2721.7060.842
1.3140.7051.1281.3091.1321.3971.2335.4991.4963.430
201.6160.1021.5130.1311.5260.1141.4080.1291.6300.545
1.1370.1241.0820.6061.070.2171.0030.3451.2331.352
301.5560.0551.4850.0721.4990.0621.3880.0921.560.391
1.0860.0741.0250.0851.0330.0730.9650.2951.1070.119
501.5370.0321.4890.0451.5020.0371.4060.0541.5330.332
1.0430.0331.0020.0431.0120.0360.9290.0791.0470.052
1001.5130.0141.4900.0201.4980.0161.4290.0301.5220.295
1.0170.0140.9990.0191.0050.0160.9380.0441.0270.022
2001.5116.475 *1.5000.0101.5057.969 *1.4520.0141.5110.271
1.0095.961 *1.0008.878 *1.0057.127 *0.9470.0241.0119.286 *

The symbol * indicates that the value multiply .

Table 4

Estimates and MSEs with the ML, LS, WLS, PC and CV methods for the HLIR model with .

n MLEsLSEsWLSEsPCEsCVEs
EsMSEEsMSEEsMSEEsMSEEsMSE
100.6060.0690.5020.0690.5060.0630.4660.0510.6210.126
0.6150.0880.5380.1450.5410.1430.5570.2840.6640.213
200.5610.0230.5150.0310.5200.0270.4690.0220.5540.032
0.5540.0210.5260.0520.5250.0300.4980.0440.5530.035
300.5300.0120.4980.0160.5040.0130.4590.0150.5420.020
0.5360.0140.5100.0150.5140.0140.4850.0320.5400.017
500.5196.332 *0.4989.623 *0.5047.787 *0.4658.251 *0.5210.011
0.5176.076 *0.4997.766 *0.5046.665 *0.4730.0140.5239.412 *
1000.5072.773 *0.4974.192 *0.5003.278 *0.4734.530 *0.5114.964 *
0.5072.733 *0.4993.481 *0.5022.987 *0.4767.316 *0.5124.168 *
2000.5051.222 *0.5012.110 *0.5031.600 *0.4822.182 *0.5041.996 *
0.5041.175 *0.5001.665 *0.5021.371 *0.4794.204 *0.5041.788 *

The symbol * indicates that the value multiply .

From these tables, it is clear that the MSEs decrease as sample sizes increase, for all the estimates. Also, in almost all of the cases, the MSEs of the MLEs take the smallest values among the corresponding MSEs of the other methods. In this sense, in our setting, the ML method can be considered as the best and will be naturally privileged in the next. We complete this part with an interval estimation study. More specifically, we use the ML method to calculate lower bound (LB), upper bound (UB) and average length (AL) of the (two-sided asymptotic) confidence interval estimation of the model parameters at the levels and . The obtained numerical results are mentioned in Table 5, Table 6, Table 7 and Table 8 for , , and , respectively.
Table 5

LBs, UBs and ALs of the confidence interval estimation with the ML method for the HLIR model with .

n 90%95%
LBUBALLBUBAL
100.98122.35951.37830.84932.49151.6422
0.41551.80721.39160.28231.94041.6581
201.02371.97310.94940.93282.06401.1311
0.45701.04060.58350.40121.09640.6953
301.18991.99270.80281.11312.06960.9565
0.56371.10460.54090.51191.15640.6445
501.33851.95140.61301.27982.01010.7303
0.70901.20180.49280.66181.24900.5872
1001.26811.68780.41971.22801.72800.5000
0.62760.89170.26410.60230.91700.3147
2001.34991.64830.29831.32141.67680.3554
0.68570.87930.19360.66710.89780.2307
Table 6

LBs, UBs and ALs of the confidence interval estimation with the ML method for the HLIR model with .

n 90%95%
LBUBALLBUBAL
100.82302.44331.62030.66792.59841.9305
0.29300.99580.70280.22571.06310.8374
201.13242.41751.28511.00932.54051.5312
0.34370.74530.40160.30530.78380.4785
301.22352.23791.01441.12642.33511.2086
0.38340.71410.33070.35170.74580.3941
501.17891.89350.71461.11051.96190.8515
0.39840.64010.24160.37530.66320.2879
1001.29261.80300.51041.24371.85190.6082
0.43200.60130.16930.41580.61750.2017
2001.30991.66110.35111.27631.69470.4184
0.43850.55230.11380.42760.56320.1356
Table 7

LBs, UBs and ALs of the confidence interval estimation with the ML method for the HLIR model with .

n 90%95%
LBUBALLBUBAL
101.09882.39471.29590.97472.51871.5440
0.51742.37971.86240.33902.55802.2190
201.21172.11730.90571.12492.20401.0791
0.71641.83401.11760.60941.94101.3316
301.30162.04380.74221.23052.11490.8844
0.79161.64490.85330.70991.72651.0167
501.39571.97560.58001.34012.03120.6910
0.90091.58060.67970.83581.64570.8099
1001.36141.75920.39781.32331.79730.4740
0.85091.23810.38720.81381.27520.4614
2001.38401.66210.27801.35741.68870.3313
0.87581.13750.26180.85071.16260.3119
Table 8

LBs, UBs and ALs of the confidence interval estimation with the ML method for the HLIR model with .

n 90%95%
LBUBALLBUBAL
100.35090.96910.61820.29171.02830.7365
0.33101.18970.85870.24881.27191.0232
200.36340.76140.39800.32530.79950.4742
0.36250.79540.43290.32110.83680.5157
300.42750.76590.33840.39510.79830.4032
0.43200.82540.39350.39430.86310.4688
500.35650.58300.22640.33490.60460.2698
0.36580.58330.21740.34500.60410.2591
1000.41490.58280.16800.39880.59890.2001
0.41470.57640.16170.39930.59190.1926
2000.46380.58580.12200.45210.59740.1454
0.45910.57950.12040.44750.59110.1435
From Table 5, Table 6, Table 7 and Table 8, it is clear that the ALs decrease as sample sizes increase.

4.2. Estimation of The Entropy

We now investigate the estimation of the Rényi entropy given by (8) and the q-entropy given by (10) by the means of a simulation study. We adopt the same setting than Section 4.1.5 (we generate 1000 random samples of sizes , 20…with the same sets of parameters). Then, we determine the MLEs of and , denoted by and , and for , and 2, we estimate and by and with , respectively. We measure the precision of these estimates by the relative biases (RBs); in the case of the Rényi entropy and a given , the RB is defined by Simulated outcomes about the Rényi entropy are presented in Table 9, Table 10, Table 11 and Table 12 for , , and , respectively, and those of the q-entropy are presented in Table 13, Table 14, Table 15 and Table 16 for , , and , respectively.
Table 9

Rényi entropy estimates and RBs for the HLIR model with .

n Exact Value δ=1.2 Exact Value δ=1.5 Exact Value δ=2
EstimateRBEstimateRBEstimateRB
100.9290.8360.1000.8580.7430.1340.7870.7080.101
200.9080.0230.8290.0340.7700.022
300.9100.0200.8320.0310.7790.011
500.9170.0130.8420.0180.7826.454 *
1000.9378.576 *0.8544.477 *0.7861.323 *
2000.9290.252 *0.8580.004 *0.7880.968 *

The symbol * indicates that the value multiply .

Table 10

Rényi entropy estimates and RBs for the HLIR model with .

n Exact Value δ=1.2 Exact Value δ=1.5 Exact Value δ=2
EstimateRBEstimateRBEstimateRB
101.2791.2130.0521.1671.0770.0771.0621.0010.058
201.2230.0441.1440.0201.0360.024
301.2650.0111.1490.0151.0380.022
501.2650.0111.1606.064 *1.0556.444 *
1001.2679.558 *1.1615.065 *1.0583.544 *
2001.2735.068 *1.1642.707 *1.0592.975 *

The symbol * indicates that the value multiply .

Table 11

Rényi entropy estimates and RBs for the HLIR model with .

n Exact Value δ=1.2 Exact Value δ=1.5 Exact Value δ=2
EstimateRBEstimateRBEstimateRB
100.7890.6680.1530.730.6400.1230.6710.6100.091
200.7140.0950.7050.0350.6370.050
300.7610.0350.7120.0260.6590.017
500.7760.0160.7130.0250.6630.012
1000.7790.0120.7220.0110.6693.100 *
2000.7881.497 *0.7256.639 *0.6710.357 *

The symbol * indicates that the value multiply .

Table 12

Rényi entropy estimates and RBs for the HLIR model with .

n Exact Value δ=1.2 Exact Value δ=1.5 Exact Value δ=2
EstimateRBEstimateRBEstimateRB
100.8020.7300.0890.690.6040.1250.5850.5220.108
200.7780.0300.6610.0420.5680.029
300.7910.0140.6690.0310.5760.016
500.7967.181 *0.6790.0150.5770.014
1000.7975.891 *0.6882.132 *0.5897.532 *
2000.8002.364 *0.6890.606 *0.5850.511 *

The symbol * indicates that the value multiply .

Table 13

q-entropy estimates and RBs for the HLIR model with .

n Exact Value q=1.2 Exact Value q=1.5 Exact Value q=2
EstimateRBEstimateRBEstimateRB
101.741.5840.0901.2551.1840.0570.8370.8120.030
201.6650.0431.2400.0120.8170.024
301.7140.0151.2420.0110.8270.012
501.7220.0101.2476.763 *0.8343.921 *
1001.7334.096 *1.2625.454 *0.8380.951 *
2001.7381.241 *1.2550.119 *0.8370.192 *

The symbol * indicates that the value multiply .

Table 14

q-entropy estimates and RBs for the HLIR model with .

n Exact Value q=1.2 Exact Value q=1.5 Exact Value q=2
EstimateRBEstimateRBEstimateRB
102.2262.1410.0381.4781.4470.0210.9130.9040.010
202.1660.0271.4560.0150.9068.183 *
302.1750.0231.4714.813 *0.9130.362 *
502.2049.500 *1.4724.358 *0.9130.171 *
1002.2173.877 *1.4733.469 *0.9130.131 *
2002.2183.458 *1.4752.095 *0.9130.111 *

The symbol * indicates that the value multiply .

Table 15

q-entropy estimates and RBs for the HLIR model with .

n Exact Value q=1.2 Exact Value q=1.5 Exact Value q=2
EstimateRBEstimateRBEstimateRB
101.5231.3740.0981.1371.0240.0990.7870.7540.041
201.4650.0381.0680.0610.7610.033
301.4840.0261.1130.0210.7799.278 *
501.4990.0161.1269.813 *0.7843.558 *
1001.5127.609 *1.1297.297 *0.7881.975 *
2001.5211.275 *1.1360.757 *0.7860.939 *

The symbol * indicates that the value multiply .

Table 16

q-entropy estimates and RBs for the HLIR model with .

n Exact Value q=1.2 Exact Value q=1.5 Exact Value q=2
EstimateRBEstimateRBEstimateRB
101.5441.4720.0471.0961.0390.0530.740.7280.016
201.4760.0441.0820.0130.7300.014
301.5180.0171.0905.887 *0.7357.336 *
501.5240.0131.0923.371 *0.7365.129 *
1001.5299.527 *1.0950.878 *0.7434.069 *
2001.5365.298 *1.0960.048 *0.7423.075 *

The symbol * indicates that the value multiply .

From these tables, we see that the RBs of the Rényi entropy and q-entropy decrease as the sample sizes increase, attesting the efficiency of the proposed estimates.

5. Applications to Real Data

In this section, we prove the flexibility of the HLIR model by analyzing two practical datasets, denoted by D1 and D2, given as follows: The first data set is extracted from ([48], Table 3). For the sake of transparency, the data are: 1.312, 1.314, 1.479, 1.552, 1.700, 1.803, 1.861, 1.865, 1.944, 1.958, 1.966, 1.997, 2.006, 2.021, 2.027, 2.055, 2.063, 2.098, 2.14, 2.179, 2.224, 2.240, 2.253, 2.270, 2.272, 2.274, 2.301, 2.301, 2.359, 2.382, 2.382, 2.426, 2.434, 2.435, 2.478, 2.490, 2.511, 2.514, 2.535, 2.554, 2.566, 2.57, 2.586, 2.629, 2.633, 2.642, 2.648, 2.684, 2.697, 2.726, 2.770, 2.773, 2.800, 2.809, 2.818, 2.821, 2.848, 2.88, 2.954, 3.012, 3.067, 3.084, 3.090, 3.096, 3.128, 3.233, 3.433, 3.585, 3.585. The second data set coming from [49]. The data consists of the monthly actual taxes revenue in Egypt from January 2006 to November 2010. The data (in 1000 million Egyptian pounds) are: 5.9, 20.4, 14.9, 16.2, 17.2, 7.8, 6.1, 9.2, 10.2, 9.6, 13.3, 8.5, 21.6, 18.5, 5.1, 6.7, 17, 8.6, 9.7, 39.2, 35.7, 15.7, 9.7, 10, 4.1, 36, 8.5, 8, 9.2, 26.2, 21.9, 16.7, 21.3, 35.4, 14.3, 8.5, 10.6, 19.1, 20.5, 7.1, 7.7, 18.1, 16.5, 11.9, 7, 8.6, 12.5, 10.3, 11.2, 6.1, 8.4, 11, 11.6, 11.9, 5.2, 6.8, 8.9, 7.1, 10.8. The corresponding histogram shows that the distribution of the data is highly right-skewed, motivating the use of the HLIR model for suitable fits. The fitting behavior of the HLIR model is compared to the one of the following well-known models: OFIR, TIITLIR, TIR and IR models. As common point, they extend the IR model and possess two parameters (excepted the former IR model). With the concern to make comparisons as fair as possible, we apply the following notorious criteria: Cramér Von-Mises (CVM), Anderson-Darling (AD), Kolmogorov-Smirnov (KS) with the corresponding p-value (KS p-value), as well as those based on the log-likelihood: minus estimated log-likelihood (-), Akaike information criterion (AIC), consistent Akaike information criterion (CAIC), Bayesian information criterion (BIC) and Hannan-Quinn information criterion (HQIC). Model with the minimum values for CVM, AD, KS, (-), AIC, BIC, CAIC and HQIC, and the maximum KS p-value, is considered to provide the best fits for the proposed data. For further details on these criteria, we refer the reader to [50]. In this part, the software R is used, along with the package AdequacyModel (see [51]). The MLEs and their standard errors (SEs) for the considered models, as well as CVM, AD, KS, and KS p-value are given in Table 17 and Table 18 for D1 and D2, respectively. Also, the goodness-of-fit measures , AIC, BIC, CAIC, and HQIC are provided in Table 19 and Table 20 for D1 and D2, respectively.
Table 17

Goodness-of-fit measures, MLEs and SEs (into parentheses) for D1.

ModelCVMADKSKS p-ValueMLEs and (SEs)
HLIR0.05130.38950.05960.96683.653810.2773
(α,λ) (0.2197)(2.5587)
TIITLIR0.09080.64210.07760.79932.796610.2992
(α,θ) (0.1574)(2.8538)
TIR0.17671.20000.25400.00027.50930.8891
(θ,λ) (19.4108)(0.0182)
OFIR0.59133.70260.18010.02272.95401.3910
(θ,α) (0.1862)(0.1231)
IR0.18751.27060.35490.00002.2827-
(α) (0.1374)-
Table 18

Goodness-of-fit measures, MLEs and SEs (into parentheses) for D2.

ModelCVMADKSKS p-ValueMLEs and (SEs)
HLIR0.04760.28210.06260.97459.05401.5057
(α,λ) (0.8159)(0.2381)
TIITLIR0.04870.29040.07260.91427.17021.2912
(α,θ) (0.5758)(0.2319)
TIR0.04960.28850.06590.9598105.97620.4105
(θ,λ) (19.4108)(0.3493)
OFIR0.09640.72050.14580.162347.05800.7820
(θ,α) (5.1027)(0.0838)
IR0.04880.29340.08210.82039.3593-
(α) (0.6092)-
Table 19

Goodness-of-fit measures based on the log-likelihood for D1.

Distribution ^ AICCAICBICHQIC
HLIR50.5018105.0030105.1856109.4720106.7764
TIITLIR52.0685108.1371108.3189112.6053109.9098
TIR71.9390145.8787145.9384148.1128146.7651
OFIR71.7113147.4228147.6046151.8910149.1955
IR88.4130178.8262178.8859181.0603179.7125
Table 20

Goodness-of-fit measures based on the log-likelihood for D2.

Distribution ^ AICCAICBICHQIC
HLIR188.4997380.9994381.2137385.1545382.6213
TIITLIR188.6142381.2283381.4426385.3834382.8503
TIR189.1310382.2620382.4762386.4170383.8839
OFIR193.7239391.4479391.6621395.6029393.0698
IR190.5877383.1754382.2455385.2529382.9863
From Table 17, Table 18, Table 19 and Table 20, we confirm that the HLIR model provides the best fits among the other models for D1 and D2 since it has the lowest values of CVM, AD, KS, , AIC, CAIC, BIC and HQIC, and the greatest values for the KS p-value. Also, one can notice that the KS p-values of the HLIR model are very closed to 1, making the HLIR model difficult to beat with the KS p-value benchmark for D1 and D2. The estimated pdf (epdf), estimated cdf (ecdf), estimated sf (esf) and probability-probability (P-P) plots of the HLIR model are displayed in Figure 4 and Figure 5 for D1 and D2, respectively.
Figure 4

Plots for the epdf, ecdf, esf and P-P plots of the HLIR model for D1.

Figure 5

Plots for the epdf, ecdf, esf and P-P plots of the HLIR model for D2.

From Figure 4 and Figure 5, nice fits are observed for the HLIR models; in the four plots, the green curves fit very well those based on the corresponding empirical ones, attesting the applicability of the HLIR model for D1 and D2. In order to complete this part, let us now investigate the entropy estimation of the HLIR model for the two data sets, adopting the methodology describes in Section 3.6. For D1, at , , and , the estimated Rényi entropy is equal to , and , respectively, and, at , and , the estimated q-entropy is equal to 5, 2 and 1, respectively. Thus, we can notice that the estimated Rényi entropy decreases as increases. Also, the estimated q-entropy decreases as q increases. For D2, at , and , the estimated Rényi entropy is equal to , and , respectively, and, at , and , the estimated q-entropy is equal to , and , respectively. Hence, the estimated Rényi entropy decreases as increases. Also, the estimated q-entropy decreases as q increases.

6. Concluding Remarks

In this paper, a new two-parameter lifetime distribution based on the half-logistic transformation and the IR distribution is introduced. It is called the HLIR distribution. Some of its mathematical properties as stochastic ordering results, a general linear representation for the exponentiated probability density function, raw/inverted moments, incomplete moments, skewness and kurtosis features, and Rényi entropy and q-entropy are derived. The estimation of the model parameters is discussed through the ML, LS, WLS, PC and CV methods. Simulation study is carried out to compare the performance of the five resulting estimates. It revealed that the ML method performs better than the others, in approximately, most of situations. The estimation of the Rényi entropy and q-entropy is also conducted with success. An application to two real data sets indicates that the HLIR model can produce better fits than other champion models, also based on the IR distribution. With only two parameters and such a high degree of performance, we hope that the HLIR model will attract the attention of some practitioners for further perspectives.
  2 in total

1.  The power Lomax distribution with an application to bladder cancer data.

Authors:  El-Houssainy A Rady; W A Hassanein; T A Elhaddad
Journal:  Springerplus       Date:  2016-10-21

2.  AdequacyModel: An R package for probability distributions and general purpose optimization.

Authors:  Pedro Rafael D Marinho; Rodrigo B Silva; Marcelo Bourguignon; Gauss M Cordeiro; Saralees Nadarajah
Journal:  PLoS One       Date:  2019-08-26       Impact factor: 3.240

  2 in total
  2 in total

1.  Statistical Analysis of COVID-19 Data for Three Different Regions in the Kingdom of Saudi Arabia: Using a New Two-Parameter Statistical Model.

Authors:  Ibrahim Al-Dayel; Mohammed N Alshahrani; Ibrahim Elbatal; Naif Alotaibi; A W Shawki; Mohammed Elgarhy
Journal:  Comput Math Methods Med       Date:  2022-07-09       Impact factor: 2.809

2.  A new one-parameter lifetime distribution and its regression model with applications.

Authors:  M S Eliwa; Emrah Altun; Ziyad Ali Alhussain; Essam A Ahmed; Mukhtar M Salah; Hanan Haj Ahmed; M El-Morshedy
Journal:  PLoS One       Date:  2021-02-19       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.