Literature DB >> 34249509

Confidence intervals for the difference between the coefficients of variation of Weibull distributions for analyzing wind speed dispersion.

Manussaya La-Ongkaew1, Sa-Aat Niwitpong1, Suparat Niwitpong1.   

Abstract

Wind energy is an important renewable energy source for generating electricity that has the potential to replace fossil fuels. Herein, we propose confidence intervals for the difference between the coefficients of variation of Weibull distributions constructed using the concepts of the generalized confidence interval (GCI), Bayesian methods, the method of variance estimates recovery (MOVER) based on Hendricks and Robey's confidence interval, a percentile bootstrap method, and a bootstrap method with standard errors. To analyze their performances, their coverage probabilities and expected lengths were evaluated via Monte Carlo simulation. The simulation results indicate that the coverage probabilities of GCI were greater than or sometimes close to the nominal confidence level. However, when the Weibull shape parameter was small, the Bayesian- highest posterior density interval was preferable. All of the proposed confidence intervals were applied to wind speed data measured at 90-meter wind energy potential stations at various regions in Thailand. ©2021 La-ongkaew et al.

Entities:  

Keywords:  Bayesian methods; Coefficients of variation; Dispersion; Generalized confidence interval; Gibbs algorithm; Markov chain Monte Carlo; Random walk Metropolis; The method of variance estimates recovery; Wind speed data

Year:  2021        PMID: 34249509      PMCID: PMC8256813          DOI: 10.7717/peerj.11676

Source DB:  PubMed          Journal:  PeerJ        ISSN: 2167-8359            Impact factor:   2.984


Introduction

Nowadays, Thailand is increasingly using alternative energy sources to oil and coal owing to the government pushing forward a policy concerning alternative energy development and efficiency. The alternative energy development in Thailand has primarily relied on energy production in domestic with emphasis on solar, wind, hydro energy, biomass, biogas, municipal Solid waste (MSW), geothermal power and biofuels including ethanol and biodiesel in term of electricity, heat and biofuels (https://www.dede.go.th/ewt_news.php?nid=47340). Wind energy is an important natural energy source for use in power generation because it has many advantages: it is clean and environmentally friendly, it reduces the level of carbon dioxide emissions that contribute toward global warming, and it offers unlimited renewable energy. In recent years, power generation from wind energy in Thailand has limitation about potential of wind speeds such as wind power density, wind speed, and the height of wind turbine (Chancham et al., 2009). In addition, there are problems about where to install wind turbines because wind energy is related to the topography of the installation area and wind strength variation throughout the year. Potential areas to harvest wind energy in Thailand are coastal areas and the northeastern region of Thailand. Since the coefficient of variation can be used to analyze areas with high wind speed variability, we are interested in analyzing wind datasets from the aforementioned areas in terms of the difference between coefficients of variation of Weibull distributions. The approach in this study could be useful in finding areas that are the most suitable for placing wind turbine systems to produce electricity. In statistics, the Weibull distribution, which is named after Swedish mathematician Waloddi Weibull (Weibull, 1951), is a continuous probability distribution commonly used to analyze wind speed data. It has also been widely applied in engineering, industry, and weather forecasting. For example, Chaichana et al. (2015) studied the potential of wind energy for high-tech agricultural projects as part of the Royal Initiative at Chiang Mai province, Thailand. Moreover, Pang, Forster & Troutt (2001) estimated the parameters for a Weibull distribution when applied to wind speed data obtained from an observatory in Hong Kong. Previously, many researchers have estimated the parameters and functions of parameters for Weibull distributions. For instance, Green et al. (1994) used the Markov chain Monte Carlo (MCMC) method to estimate three parameters of a Weibull distribution. Colosimo & Ho (1999) derived confidence intervals for the Weibull mean lifetime based on censored reliability datasets. Krishnamoorthy, Lin & Xia (2009) constructed confidence intervals for the mean of Weibull distribution based on the generalized variable approach and compared this with Wald confidence intervals. In another study, Krishnamoorthy & Lin (2010) presented the confidence interval for stress–strength reliability involving two Weibull distributions. Ibrahim (2010) compared the performance of Bayesian approaches using Jeffreys’ prior and an extension of Jeffreys’ prior with maximum likelihood estimation to estimate the parameters of a Weibull distribution. Kundu & Howlader (2010) used Bayesian inference to estimate the scale parameter of an inverted Weibull distribution based on Type-II censored data. Yalcinkaya & Birgoren (2017) compared a confidence interval of the lower percentiles in a Weibull distribution of small samples based on maximum likelihood estimation and the Bayesian-Weibull method. Saraiva & Suzuki (2017) compared methods to estimate the parameters of a Weibull distribution, which are a maximum likelihood method and a Bayesian method with the Metropolis–Hastings algorithm. The coefficient of variation is used for measuring variations in data as well as comparing the degree of variation between several datasets in which the measurement units are different. It has been used in many fields. Faber & Korn (1991) used the coefficient of variation to measure variation in the mean synaptic response of nerves in the central nervous system. Banik & Kibria (2011) used the coefficient of variation to analyze data on psychiatric disorders, while Yosboonruang, Niwitpong & Niwitpong (2019) used it to examine variation in rainfall data in Thailand. In this study, we focus on the coefficient of variation. Many researchers have provided confidence intervals for estimating the single and function of coefficients of variation. For example in a normal distribution: a normal coefficient of variation (Vangel, 1996), common coefficient of variation (Tian, 2005), coefficient of variation (Mahmoudvand & Hassani, 2009) and (Donner & Zou, 2010). Moreover, Pang et al. (2005) proposed confidence intervals for estimating the coefficient of variation of a three-parameter Weibull distribution via a simulation-based Bayesian approach. Niwitpong (2013) provided confidence intervals for the coefficient of variation of a lognormal distribution with a restricted parameter space. Buntao & Niwitpong (2013) constructed confidence intervals for the difference between the coefficients of variation of lognormal distributions. Sangnawakij, Niwitpong & Niwitpong (2015) presented confidence intervals for the ratio of coefficients of variation of gamma distributions. Yosboonruang, Niwitpong & Niwitpong (2020) proposed confidence intervals for the difference between the coefficients of variation of delta-lognormal distributions. La-ongkaew, Niwitpong & Niwitpong (2009) constructed confidence interval for single coefficient of variation of Weibull distribution. It could be seen that not many researches mentioned above focused on coefficient of variation or difference between the coefficients of variation of Weibull distributions. Consequently, we extended this study to construct confidence intervals for the difference between the coefficients of variation of Weibull distributions. Herein, we propose new confidence intervals for the difference between the coefficients of variation of Weibull distributions using the generalized confidence interval (GCI) method, Bayesian methods, the method of variance estimates recovery (MOVER) method, a percentile bootstrap method, and a bootstrap interval with standard errors, the derivations of which are given in this article. As an empirical application, we use wind speed data measured at 90-meter wind energy potential stations in Thailand. Finally, we discuss the results of our study and draw conclusions on them.

Methods

Let a random variable X follow the 2-Parameter Weibull distribution. The probability density function of X is given by where the positive constant a is called the scale parameter and the positive constant k is called the shape parameter. The mean and the variance of X are and , respectively. Therefore, the coefficient of variation of X can be written as Suppose that X =(X1, X2, …, X) be a random sample of size n from Weibull distribution with scale parameter a and shape parameter k, denoted as Weibull(a, k). To estimate the parameters a and k, the maximum likelihood estimation is applied. The maximum likelihood estimators (MLEs) can be obtained from Cohen (1965), they are as follows. The MLE of k is solution of the following equation and the MLE of a is given by For solving the Eq.(3), the Newton–Raphson iterative method can be applied. We obtained a starting value for the Newton–Raphson iterative method. Let x1, x2,…, x be a sample of observations from a Weibull(a, k) distribution. Let z = ln(x), i =1 , 2, …, n. Menon (1963) showed that the estimator is asymptotically unbiased estimator, which is the starting value. The following algorithm is used to compute the MLEs: Algorithm 1 1. Generate x1, x2,…, x, let z = ln(x). 2. Compute , and . 3. Compute = . 4. For l = 1 to number of iterations. 5. Compute , , and . 6. Compute . 7. Compute . 8. Compute + . 9. If (F ≤ error tolerance) return . 10. End l loop. 11. Compute from Eq. (4). Let X1, X2, …, X and Y1, Y2, …, Y be random samples from two independent Weibull distributions with scale parameters aa and shape parameters k, k, respectively. Moreover, let λ be the coefficient of variation of X. Similarly, let λ be the coefficient of variation of Y. Thus, the difference between the coefficients of variation is defined by And the estimator of δ is given by

The Generalized Confidence Interval

The generalized confidence interval (GCI) was introduced by Weerahandi (1993). The most important concept of GCI is the generalized pivotal quantity (GPQ) defined as Definition Suppose that X = (X1, X2, …, X) be a random variables from distribution, which depends on a parameter of interest φ, and a nuisance parameter γ. Furthermore, suppose that x =(x1, x2, …, x) be the observed value of X. To obtain a GCI for φ, a GPQ R(X; x, φ, γ) is also required to satisfy the following two conditions. (i) For a fixed x, the distribution of R(X; x, φ, γ) is free of unknown parameters. (ii) The observed value of R(X; x, φ, γ) at X = x, denoted as r(X; x, φ, γ) does not depend on nuisance parameters. Let R(α∕2) and R(1 − α∕2) are the 100(α/2)-th and 100(1 − α/2)-th percentile of R(X; x), respectively. The 100 (1 − α)% two-sided confidence interval based on GCI for parameter of interest is given by [R(α∕2), R(1 − α∕2)]. Note that 1 − α is the probability that population parameter will be in the interval. The 100 (1 − α)% confidence interval will include the true value of the population parameter with probability 1 − α, i.e., if α = 0.05, that probability is about 0.95 that the 95% confidence interval will include the true population parameter. For Weibull distribution, Thoman, Bain & Antle (1969) showed that the distributions of and do not depend on a and k, where and are the MLEs of a and k, respectively. Let and be the MLEs based on a sample of size n from a Weibull(1, 1). Then and . Both distributions do not depend on any parameters. Hence, they are pivotal quantity. Krishnamoorthy, Mukherjee & Guo (2007) presented the GPQs of scale and shape parameters from Weibull distribution. Let and be the observed values of and . The GPQs of the parameters are given by and From the random variables X and Y, the GPQs for k and k are given by and Thus, the GPQ for δ is defined by Therefore, the 100 (1 − α)% two-sided confidence interval for the difference between the coefficients of variation of Weibull distributions based on GCI is where R(α∕2) denotes the 100(α/2)-th percentile of R. The following algorithm is used to construct confidence interval based on GCI for the difference between the coefficients of variation of Weibull distributions: Algorithm 2 1. Compute , , and from Algorithm 1. 2. Generate , , …, and , , …, from Weibull(1, 1). 3. Compute , , and from Algorithm 1. 4. Compute R and R from Eqs. (10) and (11). 5. Compute R from Eq. (12). 6. Repeat steps 1-5 for q times, where q is the number of generalized computation. 7. Compute 95% confidence interval based on GCI, as given in Eq. (13).

The bootstrap confidence intervals

The bootstrap method was introduced by Efron & Tibshirani (1993). This is a resampling method to determine precision measures for statistical estimation. Let and be random samples of size n and m from Weibull distributions, and let and be bootstrapped samples drawn with replacement from the original data, using the same sample sizes. After resampling B bootstrap samples, the difference between the coefficients of variation are calculated in each bootstrap sample, as follows: b =1 , 2, …, B.

The percentile bootstrap confidence interval

The percentile bootstrap confidence interval is based on the percentile of the distribution of the bootstrapped replications. The value of the bootstrap statistic, δ∗ are ordered from the smallest to the largest. Therefore, the 100 (1 − α)% two-sided confidence interval for the difference between the coefficients of variation of Weibull distributions based on percentile bootstrap is given by where δ∗(α∕2) denotes the 100(α/2)-th percentile of δ∗. The following algorithm is used to construct confidence interval based on percentile bootstrap for the difference between the coefficients of variation of Weibull distributions: Algorithm 3 1. Generate X1, X2, …, X from Weibull(a, k) and Y1, Y2, …, Y from Weibull(a, k). 2. For b = 1. 3. Resampling bootstrap samples from X1, X2, …, X and compute . 4. Resampling bootstrap samples from Y1, Y2, …, Y and compute . 5. Compute . 6. Repeat steps 3-5 for B times, where B is the number of bootstrap sample. 7. Sort δ∗ from the smallest to the largest. 8. Compute 95% confidence interval based on percentile bootstrap, as given in Eq. (14).

The bootstrap confidence interval with standard errors

From the B bootstrap statistic denoted as δ∗, b =1 , 2, …, B, we can calculate the standard error of a statistic. They can be estimated using the standard deviation of the bootstrap distribution. Therefore, the 100 (1 − α)% two-sided confidence interval for the difference between the coefficients of variation of Weibull distributions based on bootstrap confidence interval with standard errors can be written as where SE is the standard error of a statistic. The following algorithm is used to construct confidence interval based on bootstrap confidence interval with standard errors for the difference between the coefficients of variation of Weibull distributions: Algorithm 4 1. Generate X1, X2, …, X from Weibull(a, k) and Y1, Y2, …, Y from Weibull(a, k). 2. For b = 1. 3. Resampling bootstrap samples from X1, X2, …, X and compute . 4. Resampling bootstrap samples from Y1, Y2, …, Y and compute . 5. Compute . 6. Repeat steps 3-5 for B times, where B is the number of bootstrap sample. 7. Compute SE. 8. Compute 95% confidence interval based on bootstrap confidence interval with standard errors, as given in Eq. (15).

The method of variance estimates recovery

According to Donner & Zou (2010), this approach can be used to construct a confidence interval for a function of two parameters, λ − λ, defined as where the lower limit and upper limit for - are given by Hendricks & Robey (1936) presented confidence intervals for λ and λ defined as and where t( and t( denote the 100(α/2)-th percentile of t-distribution with n − 1 and m − 1 degrees of freedom, respectively. To construct a confidence interval for the difference between the coefficients of variation of Weibull distributions based on MOVER with Hendricks and Robey’s confidence interval, we substitute l, u, l and u for l, u, l and u, respectively. Therefore, the 100 (1 − α)% two-sided confidence interval for the difference between the coefficients of variation of Weibull distributions based on MOVER with Hendricks and Robey’s confidence interval becomes where The following algorithm is used to construct confidence interval based on MOVER for the difference between the coefficients of variation of Weibull distributions: Algorithm 5 1. Generate X1, X2, …, X from Weibull(a, k) and Y1, Y2, …, Y from Weibull(a, k). 2. Compute the intervals for λ and λ from Eqs. (19) and (20). 3. Compute 95% confidence interval based on MOVER, as given in Eq. (21).

The Bayesian confidence intervals

Here, we derive Bayesian estimates of the parameters of a Weibull distribution. Once again, for random variable X = (X1, X2, …, X) from a Weibull distribution, we transform scale parameter , and so we can provide its probability density function in another form as follows: It is assumed that the prior distributions for the scale and shape parameters are gamma with hyperparameters (v1, v2, z1, z2): and Hence, using Bayes’ theorem, the joint posterior density function of a′ and k is given by where L(a′, k|x) is a likelihood function. However, the posterior distributions cannot be computed, and so we used the MCMC method with Gibbs sampling to provide them for the parameters. We used the Gibbs sampling procedure to generate samples from the joint posterior distribution, as given in Eq. (27). The conditional posterior distributions of parameters are and For Eq. (29), we used the Metropolis-Hasting (MH) algorithm to update shape parameter k. Moreover, the random walk Metropolis (RWM) and Gibbs sampling methods are applied as follows (Geman & Geman, 1984): Algorithm 6 The Gibbs algorithm 1. Take the initial value of parameter (a′(0), k(0)). 2. Generate a′( ∼ gamma(v2 + n, z2 + ∑x). 3. Using Random walk Metropolis (RWM) algorithm for calculate k(. 4. Repeat step 2–3 for T times, where T is the number of replications of MCMC. 5. Burn in 1000 samples and compute the parameter of interest. Algorithm 7 The random walk Metropolis (RWM) 1. Start with (a′(, k(). 2. Generate . 3. Compute k∗ = k( + ε. 4. Compute . 5. Generate u ∼ U(0, 1). 6. If u ≤ min(1, A) set k( = k∗ and If u > min(1, A) set k( = k(.

The Bayesian-MCMC

Once again, let X and Y be random variables from Weibull distributions. First, we used Algorithms 6 and 7 to calculate the difference between the coefficients of variation denoted as δ, t =1 , 2, …, T, based on the Bayesian-MCMC. Therefore, the 100 (1 − α)% two-sided confidence interval for the difference between the coefficients of variation of Weibull distributions based on the Bayesian-MCMC is given by where denotes the 100(α/2)-th percentile of δ. The following algorithm is used to construct confidence interval based on Bayesian-MCMC for the difference between the coefficients of variation of Weibull distributions: Algorithm 8 1. Calculate the initial value of parameter (a′(0), k(0)). 2. Calculate δ from Algorithms 6 and 7. 3. Compute 95% confidence interval based on Bayesian-MCMC, as given in Eq. (30).

The Bayesian-Highest Posterior Density (HPD) Interval

Here, the assumption is that the density of every point inside the interval is greater than that of every point outside the interval, and it is also the shortest interval (Box & Tiao, 2011). In this article, the HPD interval was computed by using the R package with HDInterval. Therefore, the 100 (1 − α)% two-sided confidence interval for the difference between the coefficients of variation of Weibull distributions based on the Bayesian-HPD interval can be written as The following algorithm is used to construct confidence interval based on Bayesian-HPD for the difference between the coefficients of variation of Weibull distributions: Algorithm 9 1. Calculate the initial value of parameter (a′(0), k(0)). 2. Calculate δ from Algorithms 6 and 7. 3. Compute 95% confidence interval based on Bayesian-HPD interval, as given in Eq. (31).

Results

A simulation study was conducted to compare the performances of the six confidence intervals for the difference between the coefficients of variation of Weibull distributions: GCI, the Bayesian-MCMC method, the Bayesian-HPD interval, MOVER based on Hendricks and Robey’s confidence interval, the percentile bootstrap method, and the bootstrap confidence interval with standard errors. The R program was used to estimate the coverage probabilities and expected lengths of the proposed confidence intervals using the following parameter settings: Scale parameter a = a = 0.5 and 2. Shape parameter k = 1 and k = 0.5, 1, 2, 2.5, 4 and 9, with the differences between the coefficients of variation of −1.2360, 0, 0.4772, 0.5720, 0.7194 and 0.8671, respectively. Sample sizes (n, m) = (10,10), (10,20), (20,20), (30,30), (30,50), (50,50), (50,100) and (100,100). Hyperparameters v1 = v2 = z1 = z2 = 0.1. The number of replications for each situation was 5,000, along with 2,500 pivotal quantities for GCI. Moreover, we used 500 bootstrap samples for the bootstrap methods and we generated 20,000 realizations of MCMC using the Gibbs and RWM algorithms with a burn-in of 1000. The following performance indicators were used to determine the best-performing method: a coverage probability of greater than or equal to 0.95 (the nominal confidence level) and the shortest expected length. The following algorithm is used to estimate the coverage probability and expected length: Algorithm 10 1. Set M, q, B, T, n, m, a, k, a and k. 2. Generate X1, X2, …, X from Weibull(a, k) and Y1, Y2, …, Y from Weibull(a, k), respectively. 3. Use Algorithm 2 to construct generalized confidence interval (CI). 4. Use Algorithm 3 to construct percentile bootstrap confidence interval (CI). 5. Use Algorithm 4 to construct bootstrap confidence interval with standard errors (CI). 6. Use Algorithm 5 to construct MOVER based on Hendricks and Robey’s confidence interval (CI). 7. Use Algorithm 8 to construct Baysian-MCMC confidence interval (CI). 8. Use Algorithm 9 to construct Baysian-HPD confidence interval (CI). 9. If (L ≤ δ ≤ U), then set P = 1, else set P = 0. 10. Compute (U − L). 11. Repeat steps 2-10 for M times. 12. Compute mean of P for the coverage probability. 13. Compute mean of (U − L) for the expected length. For a = 0.5 (Table 1 and Fig. 1), the simulation results indicate that the coverage probabilities of CI were greater than or sometimes close to the nominal confidence level of 0.95 in almost all cases. CI yielded coverage probabilities greater than 0.95 when k = 1 for (n, m) = (10, 10) and (20, 20). For the Bayesian methods, the coverage probabilities of CI and CI were greater than or sometimes close to the nominal confidence level, while the expected lengths of CI for (n, m) = (10, 10) and k = 0.5, 2, 4; (n, m) = (20, 20) and k = 0.5; (n, m) = (10, 20), (30, 30), (30, 50), (50, 50), (50, 100) and k = 0.5, 1; and (n, m) = (100, 100) and k = 1 were the shortest. Moreover, CI and CI yielded coverage probabilities under 0.95 for all cases.
Table 1

Comparison results of the 95% two-sided confidence intervals for the difference between the coefficients of variation of Weibull distributions for a = 0.5.

(nm)kCoverage probability (Expected length)
CIδ(gci)CIδ(pb)CIδ(bs)CIδ(m.HR)CIδ(MCMC)CIδ(HPD)
(10,10)0.50.95680.63240.69560.84300.93920.9514
(8.1452)(1.6874)(1.7114)(2.3678)(5.4003)(4.6368)
10.94940.96140.90320.96100.93880.9706
(2.3498)(1.1568)(1.1587)(1.3535)(1.8406)(1.7986)
20.95100.90460.88600.94100.94540.9578
(1.7089)(0.9219)(0.9248)(1.0787)(1.3696)(1.3018)
2.50.94920.87100.86100.91520.93700.9400
(1.6386)(0.8863)(0.8894)(1.0359)(1.3144)(1.2340)
40.95800.86040.85260.90260.93600.9510
(1.5526)(0.8366)(0.8404)(0.9827)(1.2451) (1.1469)
90.95100.83740.83340.88540.91200.8764
(1.5173)(0.8076)(0.8105)(0.9515)(1.2218)(1.1060)
(10,20)0.50.95640.70560.79240.85980.94760.9664
(3.6785)(1.6034)(1.6216)(1.7373)(3.1102)( 2.9756)
10.95040.93780.89480.93440.9416 0.9616
(1.8251)(1.0235)(1.0258)(1.1510)(1.5018) (1.4525)
2 0.95360.86200.85360.90180.94060.9362
(1.5737)(0.8642)(0.8664)(1.0037)(1.2677)(1.1743)
2.5 0.95430.84800.84480.89520.93820.9300
(1.5418)(0.8407)(0.8432)(0.9810)(1.2400)(1.1376)
40.94880.83800.83120.88360.93860.9154
(1.5419)(0.8264)(0.8302)(0.9668)(1.2290)(1.1124)
90.94820.82620.82500.87860.92040.8794
(1.5300)(0.8106)(0.8144)(0.9544)(1.2208)(1.0958)
(20,20)0.50.95240.62920.77780.82820.9480 0.9536
(3.2987)(1.4974)(1.5220)(1.5855)(2.8839) (2.6795)
10.9530 0.95640.91700.94500.94440.9656
(1.2350) (0.8758)(0.8814)(0.9149)(1.1264)(1.1166)
2 0.95220.90240.90100.93300.94340.9486
(0.9234)(0.6836)(0.6889)(0.7236)(0.8471)(0.8260)
2.5 0.95260.89000.89180.92760.94580.9470
(0.8915)(0.6566)(0.6615)(0.6969)(0.8178)(0.7917)
40.95340.87300.88700.91660.94300.9344
(0.8589)(0.6348)(0.6403)(0.6666)(0.7866)(0.7532)
9 0.95220.86520.87200.91140.91940.8866
(0.8395)(0.6149)(0.6225)(0.6479)(0.7729)(0.7340)
(30,30)0.50.95060.64280.81240.82280.94480.9501
(2.3298)(1.3805)(1.4109)(1.2717)(2.1510)(2.0452)
10.95220.94260.92440.93820.94800.9618
(0.9229)(0.7415)(0.7481)(0.7346)(0.8728) (0.8678)
20.95100.89720.90820.92980.94620.9482
(0.6990)(0.5770)(0.5819)(0.5820)(0.6621)(0.6506)
2.50.94820.89180.90640.92800.94520.9476
(0.6787)(0.5581)(0.5629)(0.5625)(0.6430)(0.6287)
40.95320.87760.90360.92120.94760.9394
(0.6526)(0.5364)(0.5429)(0.5370)(0.6171)(0.5991)
90.94460.86020.88140.90360.90800.8784
(0.6365)(0.5187)(0.5246)(0.5207)(0.6040)(0.5835)
(30,50)0.50.95740.68560.87600.84160.9530 0.9634
(1.7122)(1.2865)(1.3262)(1.0347)(1.6309) (1.5927)
10.94000.93560.92100.92400.9360 0.9500
(0.8011)(0.6768)(0.6822)(0.6526)(0.7652) (0.7591)
20.94940.88480.89960.92260.94520.9448
(0.6690)(0.5518)(0.5568)(0.5556)(0.6354)(0.6203)
2.50.94620.88120.89480.91600.94520.9418
(0.6581)(0.5399)(0.5455)(0.5440)(0.6246)(0.6074)
40.94500.86960.88800.90980.94140.9382
(0.6447)(0.5297)(0.5359)(0.5289)(0.6095)(0.5897)
90.94980.86080.88780.90560.92140.8936
(0.6306)(0.5144)(0.5208)(0.5170)(0.5981)(0.5769)
(50,50)0.50.95200.65020.85960.82020.9474 0.9504
(1.6289)(1.2294)(1.2703)(0.9739)(1.5586) (1.5103)
10.94740.93680.93300.92780.9456 0.9552
(0.6699)(0.6106)(0.6165)(0.5626)(0.6485) (0.6460)
20.94560.90580.92080.93040.94600.9480
(0.5171)(0.4724)(0.4766)(0.4489)(0.5010)(0.4951)
2.5 0.95120.89000.91780.92640.94480.9450
(0.4992)(0.4545)(0.4586)(0.4319)(0.4840)(0.4770)
4 0.95580.88440.91540.92280.94820.9412
(0.4796)(0.4384)(0.4432)(0.4119)(0.4640)(0.4552)
90.94580.87440.90820.91480.91320.8908
(0.4695)(0.4246)(0.4299)(0.4008)(0.4564)(0.4463)
(50,100)0.50.95320.71660.91720.83860.9518 0.9580
(1.1384)(1.1032)(1.1514)(0.7422)(1.1119) (1.0987)
10.95840.93640.94480.93460.9538 0.9610
(0.5672)(0.5404)(0.5448)(0.4843)(0.5522) (0.5486)
20.94800.88980.91760.92440.94600.9484
(0.4897)(0.4464)(0.4505)(0.4230)(0.4759)(0.4684)
2.5 0.95180.88200.90980.91160.94960.9442
(0.4814)(0.4380)(0.4425)(0.4145)(0.4668)(0.4581)
40.94940.87340.90820.91640.94660.9418
(0.4700)(0.4309)(0.4361)(0.4041)(0.4567)(0.4471)
90.94860.87040.90060.9074
(0.4648)(0.4206)(0.4253)(0.3974)
(100,100)0.50.94880.67680.91020.81980.94680.9472
(1.0705)(1.0617)(1.1132)(0.6829)(1.0486)(1.0308)
10.94960.93620.95480.92660.9496 0.9524
(0.4523)(0.4660)(0.4692)(0.3949)(0.4453) (0.4440)
20.94340.90760.93460.92560.94440.9460
(0.3514)(0.3558)(0.3584)(0.3152)(0.3461)(0.3437)
2.5 0.95140.90600.93860.92860.94860.9492
(0.3394)(0.3435)(0.3462)(0.3032)(0.3341)(0.3313)
4 0.95320.90040.93900.92700.95160.9484
(0.3269)(0.3344)(0.3376)(0.2896)(0.3214)(0.3179)
90.95140.89540.93360.9224
(0.3188)(0.3223)(0.3250)(0.2809)
Figure 1

Graphs to compare the performance of the methods with varying difference between CVs in terms of (A) coverage probability and (B) expected length.

From the simulation results for a = 2 (Table 2 and Fig. 2), CI performed well in terms of coverage probability in almost all cases, while those of CI, CI and CI were the same as for a = 0.5.CI yielded coverage probabilities of under 0.95 in almost all cases except when (n, m) = (10, 10), (20, 20) and k = 0.5. For the Bayesian methods, the coverage probabilities of CI were greater than 0.95 and its expected lengths were shortest for (n, m) = (10, 20), (30, 50), (50, 100) and k = 0.5, 1; (n, m) = (20, 20) and k = 2; (n, m) = (30, 30) and k = 1, 2, 2.5; (n, m) = (50, 50) and k = 0.5, 1, 2.5; and (n, m) = (100, 100) for all k except for k = 4 and 9.
Table 2

Comparison results of the 95% two-sided confidence intervals for the difference between the coefficients of variation of Weibull distributions for a = 2.

(nm)kCoverage probability (Expected length)
CIδ(gci)CIδ(pb)CIδ(bs)CIδ(m.HR)CIδ(MCMC)CIδ(HPD)
(10,10)0.5 0.95120.62420.68520.82660.93680.9466
(7.9720)(1.6763)(1.6970)(2.3414)(5.2853)(4.5489)
10.9514 0.96300.90340.96320.93380.9674
(2.3568) (1.1613)(1.1622)(1.3561)(1.8368)(1.7938)
20.94300.89920.87440.93300.93000.9482
(1.7036)(0.9226)(0.9261)(1.0758)(1.3570)(1.2881)
2.50.94760.87300.86220.91280.93480.9396
(1.6354)(0.8877)(0.8907)(1.0342)(1.3011)(1.2200)
40.94740.85320.84900.89580.92940.9290
(1.5569)(0.8388)(0.8411)(0.9825)(1.2356)(1.1332)
90.94720.85520.88300.90580.94020.9388
(0.6355)(0.5167)(0.5229)(0.5159)(0.5988)(0.5770)
(10,20)0.50.95000.71120.80020.86240.9398 0.9596
(3.6607)(1.5948)(1.1618)(1.7329)(3.0961) (2.9601)
10.95020.93660.90000.93580.9394 0.9576
(1.8261)(1.0249)(1.0273)(1.1529)(1.4955) (1.451)
2 0.95120.86260.85860.90880.93800.9370
(1.5731)(0.8627)(0.8655)(1.0039)(1.2633)(1.1696)
2.50.94700.84640.83900.89360.93300.9260
(1.5644)(0.8484)(0.8518)(0.9886)(1.2476)(1.1432)
4 0.96040.83260.83000.87820.93240.9186
(1.5208)(0.8193)(0.8219)(0.9596)(1.2102)(1.0956)
9 0.95440.83560.83260.88400.93820.9204
(1.5123)(0.8095)(0.8134)(0.9496)(1.1960)(1.0712)
(20,20)0.5 0.95380.63780.78080.82740.94400.9484
(3.2994)(1.5021)(1.5258)(1.5847)(2.8817)(2.6754)
10.9478 0.95100.91560.94200.94020.9598
(1.2330) (0.8735)(0.8781)(0.9134)(1.1231)(1.1132)
20.95140.90080.90380.93460.9420 0.9526
(0.9275)(0.6855)(0.6899)(0.7258)(0.8490) (0.8271)
2.50.95120.88460.89360.92200.94140.9460
(0.8922)(0.6606)(0.6657)(0.6971)(0.8163)(0.7900)
4 0.95060.87380.88440.91860.94300.9468
(0.8582)(0.6286)(0.6334)(0.6660)(0.7847)(0.7507)
90.94860.85640.86920.90580.93700.9314
(0.8336)(0.6079)(0.6133)(0.6444)(0.7584)(0.7186)
(30,30)0.50.95400.63840.81400.82400.94940.9486
(2.3309)(1.3775)(1.4077)(1.2727)(2.1542)(2.0485)
10.94820.94860.92780.93460.9458 0.9600
(0.9231)(0.7463)(0.7532)(0.7347)(0.8719)(0.8667)
20.95700.90560.91340.93160.95080.9520
(0.7005)(0.5752)(0.5798)(0.5827)(0.6629)(0.6512)
2.50.95680.89220.90740.93140.95300.9536
(0.6806)(0.5584)(0.5636)(0.5635)(0.6437) (0.6293)
40.94840.87000.89020.90900.94040.9376
(0.6537)(0.5371)(0.5435)(0.5374)(0.6176)(0.5992)
9 0.95280.86020.88760.90680.94360.9378
(0.6341)(0.5199)(0.5262)(0.5194)(0.5916)(0.5747)
(30,50)0.50.95320.66880.87680.84080.9496 0.9568
(1.7003)(1.2713)(1.3092)(1.0311)(1.6218) (1.5842)
10.95400.94160.93060.93520.9500 0.9576
(0.8051)(0.6830)(0.6899)(0.6549)(0.7671) (0.7609)
2 0.95040.88840.90360.92000.94640.9444
(0.6717)(0.5544)(0.5595)(0.5569)(0.6366)(0.6212)
2.5 0.95120.87380.89880.91680.94620.9412
(0.6576)(0.5405)(0.5464)(0.5436)(0.6227)(0.6055)
4 0.95280.87760.89140.91240.94740.9436
(0.6435)(0.5277)(0.5341)(0.5288)(0.6088)(0.5888)
90.94720.85520.88300.90580.94020.9388
(0.6355)(0.5167)(0.5229)(0.5159)(0.5988)(0.5770)
(50,50)0.50.95320.65060.85380.81440.9484 0.9532
(1.6366)(1.2485)(0.9732)(0.9766)(1.5637) (1.5147)
10.94980.93780.93800.93380.9470 0.9564
(0.6698)(0.6120)(0.6178)(0.5627)(0.6484) (0.6458)
2 0.95120.90120.91580.92880.94760.9486
(0.5152)(0.4690)(0.4729)(0.4476)(0.4990)(0.4931)
2.50.94980.88820.91980.93000.9478 0.9500
(0.4994)(0.4542)(0.4588)(0.4320)(0.4834) (0.4764)
40.94840.88520.91080.92120.94640.9436
(0.4791)(0.4367)(0.4416)(0.4116)(0.4637)(0.4549)
90.94960.88120.90720.91420.94460.9430
(0.4689)(0.4254)(0.4299)(0.4005)(0.4520)(0.4416)
(50,100)0.50.95460.70920.91820.84040.9522 0.9566
(1.1315)(1.1025)(1.1526)(0.7399)(1.1044) (1.0913)
10.95060.93140.93840.92720.9508 0.9562
(0.5698)(0.5402)(0.5444)(0.4860)(0.5548) (0.5511)
20.94500.89360.91020.91340.94140.9390
(0.4902)(0.4502)(0.4552)(0.4232)(0.4749)(0.4670)
2.5 0.95300.88560.91620.92540.95240.9490
(0.4802)(0.4380)(0.4432)(0.4141)(0.4659)(0.4572)
4 0.95120.88820.91380.91980.94700.9454
(0.4730)(0.4317)(0.4369)(0.4052)(0.4579)(0.4482)
9 0.95040.87500.90340.9094
(0.4658)(0.4256)(0.4319)(0.3979)
(100,100)0.50.94680.68180.90480.81580.9496 0.9468
(1.0733)(1.0585)(1.1081)(0.6835)(1.0506) (1.0325)
10.94980.93740.95240.93000.9496 0.9512
(0.4518)(0.4644)(0.4673)(0.3944)(0.4446) (0.4432)
20.95520.91420.94380.93640.9538 0.9544
(0.3505)(0.3527)(0.3548)(0.3145)(0.3449) (0.3424)
2.50.95300.90430.94240.92940.9504 0.9504
(0.3393)(0.3447)(0.3475)(0.3032)(0.3340) (0.3310)
40.94720.89160.93200.91660.94460.9428
(0.3259)(0.3323)(0.3351)(0.2889)(0.3206)(0.3170)
90.94840.89440.93040.9170
(0.3189)(0.3253)(0.3286)(0.2810)
Figure 2

Graphs to compare the performance of the methods with varying sample size in terms of (A) coverage probability and (B) expected length.

Empirical Application of the Derived Confidence Intervals

Example 1

Wind speed data set were collected at 90-meter wind energy potential stations in Trad and Chonburi provinces in 2016 by the Department of Alternative Energy Development and Efficiency, Ministry of Energy (https://www.dede.go.th/ewt_news.php?nid=47706). These are coastal provinces in eastern Thailand that have been identified as potential areas for harvesting wind energy. Figure 3 shows Q-Q plots of the Weibull distributions of the data from the two provinces. Furthermore, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) values reported in Table 3 indicate that these two datasets fit Weibull distributions because the latter have the smallest values for both criteria.
Figure 3

The Weibull Q-Q plot for the wind speed data of two provinces, Thailand: (A) Trad Province (B) Chonburi Province.

Table 3

The AIC and BIC values of the wind speed data of Trad province and Chonburi province.

MethodTradChonburi
AICBICAICBIC
Weibull37.027237.997022.805423.6011
Exponential37.267537.752456.247156.6450
Gamma37.573538.543324.078024.8737
Log-normal39.159440.129224.332725.1285
The statistical summary of the estimates of the parameters for these datasets is (n, m) = (12,11), , , , , and , while the actual difference between the coefficients of variation of the two Weibull distributions . The confidence intervals based on the proposed methods are given in Table 4. The results show that the Bayesian-HPD interval performed well in terms of the coverage probability and expected length when the sample sizes are small, which supports the simulation results. Finally, Fig. 4 shows a trace plot of the generated δ value.
Table 4

The 95% confidence intervals for the difference between the coefficients of variation of the wind speed data of Trad and Chonburi provinces, Thailand.

MethodConfidence intervals for δ
LowerUpperLength
GCI0.33161.10130.7697
MOVER0.23980.86890.6291
PB0.20810.80360.5155
BS0.25310.85550.6024
MCMC0.29121.00950.7183
HPD0.26000.92160.6616
Figure 4

Plot of generated δ of example 1 vs. iteration of the MCMC algorithm.

Example 2

Data on wind speeds measured at 90-meter wind energy potential stations in the southern and northeastern regions of Thailand were collected in April–May, 2019 by the Department of Alternative Energy Development and Efficiency, Ministry of Energy (https://www.dede.go.th/more_news.php?cid=501). The summary statistics for the southern region are n = 12, , and and the northeastern region are m = 20, , and . The actual difference between the coefficients of variation of the Weibull distributions of these datasets . Weibull Q-Q plots of these data are shown in Fig. 5 and the correctness of fitting the data to Weibull distributions in terms of the smallest AIC and BIC values are reported in Table 5. The 95% confidence intervals for δ by applying the six methods (Table 6) indicate that the Bayesian-HPD interval performed well in terms of the coverage probability and expected length when the difference between the coefficients of variation was small, which is once again consistent with the simulation study results. Finally, the trace plot of the generated δ value are presented in Fig. 6.
Figure 5

The Weibull Q-Q plot for the wind speed data of two Thailand’s regions: (A) Southern (B) Northeastern.

Table 5

The AIC and BIC values of the wind speed data of the southern and northeastern regions of Thailand.

MethodSouthernNortheastern
AICBICAICBIC
Weibull37.917638.887168.375870.3673
Exponential45.462345.947297.671198.6669
Gamma39.253575.579071.548273.5397
Log-normal41.543942.513773.587540.2233
Table 6

The 95% confidence intervals for the difference between the coefficients of variation of the wind speed data of the southern and northeastern regions of Thailand.

MethodConfidence intervals for δ
LowerUpperLength
GCI−0.02700.48070.5077
MOVER−0.06780.37550.4435
PB−0.04500.37630.4213
BS−0.05530.36300.4183
MCMC−0.02410.47270.4768
HPD−0.04450.42670.4712
Figure 6

Plot of generated δ of example 2 vs. iteration of the MCMC algorithm.

Discussion

It can be seen from the results of the study that GCI performed well in almost all cases since its coverage probability was greater than or close to the nominal confidence level. For the Bayesian methods, Bayesian-MCMC and Bayesian-HPD interval produced similar results and performed well when k (the difference between the coefficients of variation) was small, zero, or negative. For large sample sizes, the Bayesian methods were not suitable when k was large. Moreover, the expected length of the Bayesian-HPD interval was always shorter than Bayesian-MCMC. In relation to the expected lengths, they tended to decrease when the sample sizes and/or k increased.

Conclusions

This paper proposed six methods for deriving the confidence interval for the difference between the coefficients of variation of Weibull distributions: the GCI method, the Bayesian-MCMC method, the Bayesian-HPD interval, MOVER based on Hendricks and Robey’s confidence interval, the percentile bootstrap method, and the bootstrap confidence interval with standard errors. The performances of these methods were compared using the coverage probability and the expected length. The results of simulation studies showed that the GCI method and the Bayesian-HPD method were the best in different scenarios. The Bayesian-HPD was preferable when shape parameter k was small. The percentile bootstrap method can be used when the sample sizes are small and k = 1. However, MOVER based on Hendricks and Robey’s confidence interval and the bootstrap confidence interval with standard errors are not recommended since these methods yielded coverage probabilities under the nominal confidence level of 0.95 for almost all cases. Click here for additional data file.

The wind speeds measured at 90-meter wind energy potential stations in the southern and northeastern regions of Thailand

Collected April-May 2019. Click here for additional data file. Click here for additional data file.
  6 in total

1.  Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.

Authors:  S Geman; D Geman
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1984-06       Impact factor: 6.226

2.  Closed-form confidence intervals for functions of the normal mean and standard deviation.

Authors:  Allan Donner; G Y Zou
Journal:  Stat Methods Med Res       Date:  2010-09-08       Impact factor: 3.021

3.  Inferences on the common coefficient of variation.

Authors:  Lili Tian
Journal:  Stat Med       Date:  2005-07-30       Impact factor: 2.373

4.  Applicability of the coefficient of variation method for analyzing synaptic plasticity.

Authors:  D S Faber; H Korn
Journal:  Biophys J       Date:  1991-11       Impact factor: 4.033

5.  Measuring the dispersion of rainfall using Bayesian confidence intervals for coefficient of variation of delta-lognormal distribution: a study from Thailand.

Authors:  Noppadon Yosboonruang; Sa-Aat Niwitpong; Suparat Niwitpong
Journal:  PeerJ       Date:  2019-07-22       Impact factor: 2.984

  6 in total
  1 in total

1.  Measurement of dispersion of PM 2.5 in Thailand using confidence intervals for the coefficient of variation of an inverse Gaussian distribution.

Authors:  Wasana Chankham; Sa-Aat Niwitpong; Suparat Niwitpong
Journal:  PeerJ       Date:  2022-02-17       Impact factor: 2.984

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.