Literature DB >> 28124446

Beta-binomial model for meta-analysis of odds ratios.

Abstract

In meta-analysis of odds ratios (ORs), heterogeneity between the studies is usually modelled via the additive random effects model (REM). An alternative, multiplicative REM for ORs uses overdispersion. The multiplicative factor in this overdispersion model (ODM) can be interpreted as an intra-class correlation (ICC) parameter. This model naturally arises when the probabilities of an event in one or both arms of a comparative study are themselves beta-distributed, resulting in beta-binomial distributions. We propose two new estimators of the ICC for meta-analysis in this setting. One is based on the inverted Breslow-Day test, and the other on the improved gamma approximation by Kulinskaya and Dollinger (2015, p. 26) to the distribution of Cochran's Q. The performance of these and several other estimators of ICC on bias and coverage is studied by simulation. Additionally, the Mantel-Haenszel approach to estimation of ORs is extended to the beta-binomial model, and we study performance of various ICC estimators when used in the Mantel-Haenszel or the inverse-variance method to combine ORs in meta-analysis. The results of the simulations show that the improved gamma-based estimator of ICC is superior for small sample sizes, and the Breslow-Day-based estimator is the best for n⩾100. The Mantel-Haenszel-based estimator of OR is very biased and is not recommended. The inverse-variance approach is also somewhat biased for ORs≠1, but this bias is not very large in practical settings. Developed methods and R programs, provided in the Web Appendix, make the beta-binomial model a feasible alternative to the standard REM for meta-analysis of ORs. © 2017 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.

Entities: Chemical Disease Gene Species

Keywords: Intra-cluster correlation; beta-binomial distribution; fixed-effect model; heterogeneity; odds ratio; overdispersion; random-effects model

Mesh：

Year: 2017 PMID： 28124446 PMCID： PMC5434808 DOI： 10.1002/sim.7233

Source DB: PubMed Journal: Stat Med ISSN： 0277-6715 Impact factor: 2.373

Introduction

Meta‐analysis aims to combine effects estimated from a number of studies to achieve greater precision of the conclusions. The great majority of meta‐analyses use the odds ratio (OR) or its logarithm (LOR) as the effect measure of interest. The OR arises as an effect of interest when the aim is to compare binary outcomes in the treatment and control groups of K studies, or to quantify the relation between disease and exposure. Standard models of meta‐analysis are the fixed effect model (FEM) and the random effects model (REM). The former assumes that the LORs θ ,j=1,⋯,K, do not differ across the studies, that is, θ ≡θ; the latter assumes that the LORs themselves are a random sample from, usually, a normal distribution, θ ∼N(θ,τ 2) with the between‐studies variance τ 2. Further, for large sample sizes, estimated LORs are approximately normally distributed, . Therefore, the REM considers that , and the FEM follows for τ 2=0. Importantly, the variances are of order O(1/N ) for sample sizes N ,j=1,⋯,K of the studies. Standard inference concerns the combined effect , estimated as the weighted mean of the individual effects with weights equal to inverse estimated variances, in FEM, and in REM. The distribution of the combined effect is customarily approximated by a normal distribution, . Estimated within‐studies variances are often assumed to be known. Establishing an effect of treatment corresponds to testing the null hypothesis θ=0, and a confidence interval calculation in REM requires an estimate of the between‐studies variance τ 2, which is also of interest for quantifying heterogeneity. See 1 for further details on traditional meta‐analytic techniques. The shortcomings of the inverse‐variance method, as described earlier, in meta‐analysis in general and in its application to the LORs are well known. They include the bias in estimation of the combined effect, underestimation of its variance, and poor coverage of the obtained confidence intervals, especially for sparse data and/or small sample sizes, see 2 for discussion and further references. Under FEM, a considerably better way to combine ORs is the Mantel‐Haenzsel (1959) 3 method. Unfortunately, there is no analogue to this method under the REM. Further, the most popular method of estimating the between‐studies variance τ 2 in REM is the DerSimonian and Laird (1986) 4 method, based on the approximate moments of the Cochran's Q statistic, 5, but this method is not satisfactory, both in general and in application to the heterogeneity estimation of LORs, see 6, 7, 8. 7 recommend the use of the Breslow‐Day (BD) test 9 for testing the heterogeneity of ORs, and also provide a new gamma‐based approximation to the distribution of Q. Alternative approaches to REM include the use of fixed weights 10, 11 and the overdispersion model (ODM) introduced by 12. The ODM allows the interpretation of overdispersion through intra‐cluster correlation (ICC) ρ or its transformation. In this paper, we study the beta‐binomial (BB) model, an important particular case of the ODM for ORs. Both REM and the BB model are compound random‐effects models. Both models include binomial distributions for positive responses in both arms, conditional on the probabilities. The standard REM accounts for between‐study variation imposing a normal distribution on LORs, or on log‐odds in treatment and control arms 13. Similarly, the BB model includes beta‐distributed variation of the probabilities of events in one or both arms across the studies. For the log‐odds‐ratios from a pair of BB distributions, a normal approximation has been suggested by 14 and 15. To obtain the combined effect, we study the standard inverse‐variance method and a version of the Mantel‐Haenszel (MH) method adjusted for clustering, 16. Both methods require estimation of the ICC ρ. We study several methods of estimating ρ, including two new methods, one based on the profiling of the BD test, and another based on the gamma approximation to the distribution of Q by 7. The structure of this paper is as follows. In Section 2, we briefly provide some background for methods of meta‐analysis of LORs. The proposed BB model for meta‐analysis of ORs and the MH‐inspired estimation of the combined OR are introduced in Section 3. Five methods for estimation of an overdispersion parameter ρ, including a new method based on the BD test, are given in Section 4. An example is provided in Section 5. A large simulation study is described in Section 6. Discussion and conclusions are in Section 7.

Background on meta‐analysis of 2×2 tables

Fixed effect model

Consider K comparative studies reporting summary binary outcomes. The data from each study, j=1,⋯,K, constitutes a pair of independent binomial variables, X 1 and X 2, the numbers of events out of n 1 and n 2 subjects in the treatment and control arms where p for i=1,2 are the risks in the treatment and the control arms, respectively. In meta‐analysis of these data, the effect measure that we focus on is the logarithm of the OR, , where the OR for study j is The approximate variance of the log‐odds‐ratio derived by the delta method is For a study of sample size N =n 1+n 2, this variance is of order O(1/N ). The variance is estimated by substituting the estimates and in (2.2), 17, resulting in To reduce the bias of and its estimated variance, or in the case of zero entries, a correction a is usually added to each cell. The most common choice is a=1/2 18, but alternatives are available, 19. The FEM assumes homogeneity of ORs across the studies. The inverse‐variance‐weighted method for pooling LOR estimates from individual studies uses given by (1.1) with weights . An attractive alternative is the MH method, introduced by 3 to combine ORs from stratified 2×2 contingency tables. The MH estimator for pooling ORs from K tables is with weights for the ORs given by (2.1). The advantages of MH estimator are its robustness and its ability to handle empty cells without corrections, 20. The variance of the MH estimator, for both sparse data ( ) and large‐strata ( ) limiting models was derived by 21 and 22 and can be used to obtain confidence intervals for LOR.

Random effects models

The FEM is usually too restrictive. REMs take into account between‐studies heterogeneity, usually by introducing a mixing distribution for θ , or by directly inflating the within‐study variances. The standard REM described in Section 1 uses a normal mixing distribution. More generally, let individual where θ and ν are within‐study parameter of interest and (if relevant) a nuisance parameter, (for instance, location and scale parameters), θ is the overall parameter of interest, h(·) is a transformation, and τ 2 is an unknown variance parameter of the between‐study mixing distribution G, describing the variability and heterogeneity of the effect measures. The distributions F and G are commonly assumed to be normal. However, other combinations of distributions are possible. Examples of non‐normal G distributions include t, skewed normal or skewed t, whereas F can be binomial, 23. For a binomial distribution F, the transformation h can be a log or log‐odds transformation. The REM for ORs discussed by 13 combines an exact non‐central hypergeometric distribution for F with a normal mixing distribution for G. This hypergeometric‐normal model is a generalized linear mixed model.When both F and G are normal and , the standard REM is obtained. In this case, the marginal REM for an estimated effect measure in study j is For the estimated log‐odds‐ratio , the within‐study variance is approximated by (2.2). For the case of risk difference, 24 discuss the dangers of treating the within‐study variances as known. The main concern in the REM lies in estimating the unknown between‐studies variance τ 2. A variety of moment‐based or likelihood‐based estimators of τ 2 have been proposed, see 25, 26. Among the moment‐based estimators, the one derived by 4 is commonly used in practise. Among the likelihood based estimators, the restricted maximum likelihood (REML) estimator is popular because of its reduced bias. An alternative approach to REM, aimed at incorporating heterogeneity through overdispersion, was proposed by 27 and found advantageous by 28. 27 introduced a multiplicative REM in the form where φ is a multiplicative random effects parameter. This parameter allows deflation and inflation in the variance of . A new REM for meta‐analysis based on overdispersion is proposed by 12. In their model, the multiplicative parameter φ becomes study‐specific and is defined by φ =1+a γ, where a =a(N ) is a linear function of sample size N . The advantage of this model is an interpretation of the overdispersion parameter γ as an ICC or a transformation of it. For comparative studies, the within‐study variance can be written as , where R =n 1/n 2 is the allocation ratio of treatment to control group sizes and v is some function of R and relevant distribution parameters of order O(1). The ODM by 12 for two‐sample effect measures is The variance in this model consists of two parts: the FEM variance v (R )/N and the variance‐inflation term [v (R )a /N ]γ of order O(1). Therefore, this model is intermediate between the standard additive REM (2.6), which adds the constant term τ 2 of order O(1) to each within‐study variance, and the multiplicative REM (2.7), which, inflating variances by a constant factor φ, keeps them of order O(1/N ), similar to the FEM.

Overdispersed binary data

The beta‐binomial distribution

The BB distribution is obtained by mixing a binomial distribution Binom (n,p) by a beta distribution for its success probability p. When Y∼ Binom (n,p) and p∼ Beta (α,β), then unconditionally, Y follows a BB distribution with parameters α,β, and n. This distribution arises naturally in Bayesian statistics as the beta distribution is the conjugate prior distribution for the parameter p if the data are binomial. The expected value and variance of Y are It is more convenient to re‐parametrize this distribution as BetaBinom (n,π,ρ), where π=α/(α+β) and ρ=1/(α+β+1). Then which shows the BB distribution to be an overdispersed binomial distribution. A distribution with the same two moments can be obtained as the distribution of the sum of n Bernoulli(π) random variables with ICC ρ. Such distributions may be obtained through various generation mechanisms 29, 30 and differ in the higher moments. To guarantee approximate normality of LORs, we consider only the BB distribution. This distribution is widely used in combining overdispersed binomial data, see, for example, 31.

Odds‐ratios under beta‐binomial model

In the context of meta‐analysis, we assume that the observations in the treatment and control arms of each study are independent, and that within each arm, the observations follow a BB distribution with the same parameter ρ, that is, This is equivalent to requiring that the sum of parameters α+β of the mixing beta distributions not differ between the two arms. The ORs ψ and their estimates are defined as in (2.1), and the log‐odds‐ratios are approximately normally distributed. However, the variances of individual ORs and log‐odds‐ratios have to be adjusted for overdispersion. The adjusted approximate variance of LOR, obtained by using the delta method, is with restriction for ρ For ρ=0 this model is the standard FEM for binomially distributed data. In this case, the variance of LOR (2.2) can be written as with Substituting this expression into (3.2), the variance of LOR for overdispersed binomial data is This variance is clearly inflated in comparison with the standard variance (2.2). For large n 1 and n 2, the inflation term is , therefore it is of order O(1) and increases with ICC. It also may be large for probabilities in either arm close to 0 or 1. Defining it is clear that when both parts of (3.3) do not depend on j, the BB model results in the same first and second moments of the LORs as the standard REM, with the relation between ρ and τ 2 given by (3.3). This is the case when the probabilities in the treatment and control arms do not differ, p ≡p , and the sample sizes are all equal, or are large enough so that (n −1)/n ≈1. The variance of the sample log‐odds‐ratio can also be written in the ODM (2.8) form as for Thus, a is a linear function of N and has the same order as N . For balanced studies R =1, and a simplifies to a =N /2−1. An alternative model considers overdispersion only in the treatment arm: This is perhaps closer to the standard REM, which usually has a random effect only in the treatment arm 27. In this case, the variance for OR is which is still inflated, in comparison to the FEM variance, by the term [(n 1−1)/(n 1 p 1(1−p 1))]ρ . Subsequent methods are easily adapted to this version of the BB model, and we do not pursue this model further.

Adjusted Mantel‐Haenszel method for combining odds ratios

Applying the standard MH method for overdispersed binary data results in a bias and a wrong variance of the combined OR, and a wrong type I error of the associated test, 32. An adjusted version of the MH test and related estimator of the combined OR appropriate for the meta‐analysis of cluster‐randomized trials were studied by 16, 33, 32. In cluster‐randomized trials, arm i of trial j contains m clusters of size n , and an ICC ρ is common to all clusters in trial j. This can be adapted to a case of one cluster in each arm, which is equivalent to the ODM by 12. This adapted MH method is as follows. Introduce a set of correction factors These are referred to as design effects by 34. Next, define the adjusted weights for the ORs The corrected MH estimate of the OR for overdispersed binomial data is We propose to use this corrected version of the MH estimator for combining ORs in the BB model. When there is no overdispersion, ρ=0,C =1 and the expression (3.5) reduces to standard MH estimator (2.4) for the fixed effects model.When , which is the standard MH estimator for the subset of the studies of the largest sample size. The proof is provided in the Web Appendix.To obtain the asymptotic variance of , we adjusted for overdispersion the asymptotic variance of the MH estimator derived by 21 and 22: where and and .

Estimation of ρ

To evaluate the corrected MH estimator (3.5), an estimate of the ICC ρ is required. We consider two modifications of established methods, namely a moment estimator based on Cochran's Q statistic similar to the DerSimonian and Laird (1986) 4 estimator of τ 2, and a REML estimator. We also consider related confidence intervals: an interval based on profiling the Q statistic as in 35 and a REML‐based interval. According to 35, these two approaches perform the best for estimation of the between‐studies variance τ 2 in the additive REM. We also use a version of the Mandel and Paule (1970) 36 method to estimate ρ from the large‐sample approximation of Q by the χ 2 distribution. All three methods were proposed in 12 for estimation of ρ, but have not so far been explored by simulation. However, the chi‐square distribution is a poor approximation to the distribution of the Q statistic for LORs 7, and the modified Q test based on the improved gamma approximation developed by 7 and the Breslow and Day (1980) 9 (BD) test are attractive alternatives for testing the heterogeneity of ORs. In the section, we propose two new methods of point and interval estimation of ρ, based on inverting the modified Q test and the BD test. The point estimation is based on an adaptation of the Mandel and Paule (1970) 36 method, and the interval estimation is achieved through profiling the modified Q and BD tests.

Restricted maximum likelihood estimation of ρ

The restricted likelihood for the normal distribution with mean θ and variances v (1+a ρ)/n is for inverse‐variance weights w j∗=w /(1+a ρ). Following 12, the REML equation for ρ is where . The mean θ is obtained as , and an iterative procedure readily yields a solution, denoted by . The REML confidence intervals are given by all values of ρ that satisfy where is the (1−α) quantile of the chi‐square distribution with 1 degree of freedom. We will see in Section 6 that the REML‐based point and interval estimators of ρ are generally inferior to other methods, and we will not recommend them.

Q‐statistic‐based estimation of ρ

Cochran's statistic is , for the inverse‐variance weights and . Under the null hypothesis of no over‐ or underdispersion ρ=0, the Q‐statistic is approximately chi‐square distributed with K−1 degrees of freedom, so that E(Q)=K−1. This approximation is extremely conservative whenever either binomial parameter is far from 0.5, but its performance is reasonable when the binomial parameters are relatively close to 0.5, 7. Under the ODM (2.8), where , and , 12. The estimate of ρ from (4.4) should satisfy the condition . A moment (M) estimator in the spirit of DerSimonian and Laird (1986) 4 proposed by 12 is underdispersion may be present for Q When the correct weights w j∗=w (ρ)=w /(1+a ρ) are used, , and the corrected Q statistic given by has approximately the distribution. The Mandel and Paule (1970) 36 method of estimating between‐studies variance τ 2 in the standard REM was studied subsequently by 37 and 25. This method uses the approximate chi‐square distribution of the Cochran's Q statistic under homogeneity to find an estimate of τ 2. In our context, the estimating equation Q ∗(ρ)=K−1 provides another moment‐based estimator of ρ in the spirit of Mandel and Paule (1970) 36, also proposed by 12. We refer to this estimator as ρ . A related confidence interval is obtained by inverting the Q test. The confidence interval is constructed as where are the quantiles of the χ 2 distribution with K−1 degrees of freedom. 35 shows that the standard REM confidence intervals for τ 2 based on this approach, named Q‐profile, perform very well, better than the REML confidence intervals described in Section 4.1. However, we will see in Section 6 that the methods introduced in this section are generally inferior to the methods based on the corrected distribution of Q or on the BD test, described in Sections 4.3 and 4.4, respectively, and we will not recommend them.

Corrected Q‐statistic based estimation of ρ

According to 7, the distribution of the Q statistic can be well approximated by a gamma distribution with shape and scale parameters The expected value and variance of the Q statistic for log‐odds‐ratio based on this gamma approximation can be estimated from the relations and where E(Q) is the theoretical approximation to the mean of Q for log‐odds‐ ratio, 7. The Mandel‐Paule estimator of ρ based on the gamma approximation to the distribution of the Q statistic, denoted by , is found from Q ∗(ρ)=E(Q), given that a solution exists, where E(Q) is the solution of equation (4.7). The related confidence interval based on the gamma approximation to the distribution of Q statistic can be obtained from where Γ are the quantiles of the gamma distribution with r(ρ) and λ(ρ) as shape and scale parameters. We will see from the simulations described in Section 6 that this method performs well when sample sizes are small (up to 100).

Breslow‐Day‐based estimation of ρ

The BD test is based on the statistic where and denote the expected number and the asymptotic variance, respectively, of the number of cases in the treatment arm under the assumption of homogeneity of odds ratios, given the fitted MH OR . The expected value in (4.9) is obtained from the quadratic equation where m 1=X 1+X 2. Its asymptotic variance is a particular case, for ρ=0, of the variance of X 1 under overdispersion given by 38 where the C are given by equation (3.4). The asymptotic variance given in (4.11) is not defined when any of the cells of the j‐th table are empty. In these cases, a correction of 0.5 is added to each cell of the table. The BD statistic is now a function of ρ, and has a distribution under the homogeneity of ORs when the value of ρ is estimated correctly. Equating the BD statistic to its first moment, and solving this estimating equation for result in the Mandel‐Paule‐type estimator , which can be used for the calculation of given by (3.5). The range of values for the overdispersion parameter ρ is the interval . When , the variance in the denominator of converges to zero, and the BD statistic tends to infinity. When , the solution of equation (4.12) always exists and is negative. On the other hand, ρ=1 provides the lower limit for the BD statistic, so if this lower limit of , the equation (4.12) does not have a solution; in this case we set . The confidence interval for ρ can be obtained by profiling the BD test, similarly to confidence interval for τ 2 obtained by profiling Cochran's Q under REM by 35. The confidence interval with 95% coverage probability for the ICC parameter ρ based on the modified BD test is given by We will see from the simulations described in Section 6 that the methods developed in this Section are best utilized when sample sizes are large (greater than about 100).

Example: effects of diuretics on pre‐eclampsia

A well‐known meta‐analysis of nine trials which include the total of 6942 patients, evaluated the effect of diuretics on pre‐eclampsia 39. These data have been studied repeatedly, for example in 40, 41, 35, and 12. The basic data with ORs and their logs are provided in [12, Table 2a] and are reproduced in the Web Appendix. These data demonstrate considerable heterogeneity in incidence of pre‐eclampsia in both the treatment and the control groups, 12, suggesting that the BB model may be appropriate. There is also considerable heterogeneity in effect sizes. The overall incidence of pre‐eclampsia varies from 0.015 in study 6 to 0.412 in study 9. The ORs of effect of diuretics vary from 0.229 in study 4, a study with high incidence of 0.308, to 2.971 in study 8, a study with low incidence of 0.038. Cochran's Q‐statistic is Q=27.265, and the total sample size N=6942. Estimated values of τ 2 for standard REM, and of ρ assuming the BB model and using various estimating methods are provided in Table 1. The Der‐Simonian‐Laird estimate of the variance component in standard REM is , and . For τ 2 estimation in REM, Q‐profile confidence intervals 35 are given for the DL, and profile likelihood confidence intervals 40 for the REML method. In BB model, five methods of estimation provide estimates of varying from 0.008 for the moment estimator to 0.019 for the BD‐based estimator. The confidence interval for ρ is shortest for REML and longest for the BD estimator. These values are directly interpretable as the estimated ICCs and their confidence limits. To see the effect of these estimates of heterogeneity on the inference about the OR, we compare the corresponding estimates for LOR and OR, and their confidence intervals, in the same table. The OR is highest (0.672) in the FEM, and, not surprisingly, its confidence interval is the shortest. The OR is lowest (0.596) in the standard REM, and various estimators based on inverse variances or the MH method provide intermediate values of OR, the one based on MH and method‐of‐moments estimator ρ providing the highest value of OR, 0.652. For each estimator of ρ, the MH estimation of OR results in a somewhat higher value of OR than the inverse‐variance‐based estimation, with a somewhat shorter confidence interval for OR. The sample sizes are reasonably large in all included studies, and based on the results of simulations reported in Section 6, we recommend to use the estimated ICC and corresponding value of the pooled OR with confidence interval (0.389,0.994).In this example, the CI's for ρ and for τ 2 do not include 0 so the evidence for REM or BB is stronger compared with FEM. Unfortunately, it is very difficult to distinguish between the REM and BB (see Section D.4 in Web Appendix.)

Table 1

Values and confidence intervals for ρ, for log odds ratios and for odds ratios for diuretics on pre‐eclampsia example; FEM is the fixed effect, REM is the random effects, and BB is the beta‐binomial model. Heterogeneity parameter estimated is τ 2 in REM, and ρ in BB model. L and U are the lower and upper limits of the respective confidence intervals (CIs).

Model	Method	Hetero geneity	L	U	LOR	L	U	length of CI	OR	L	U
FEM		0.000			−0.398	−0.573	−0.223	0.530	0.672	0.564	0.800
REM	DL&IV	0.230	0.072	2.202	−0.517	−0.916	−0.117	0.799	0.596	0.400	0.889
REM	REML&IV	0.300	0.043	1.475	−0.518	−0.956	−0.080	0.876	0.596	0.384	0.923
BB	M&IV	0.008	0.002	0.095	−0.436	−0.792	−0.080	0.712	0.647	0.453	0.923
	M&MH				−0.427	−0.775	−0.080	0.695	0.652	0.461	0.923
BB	REML&IV	0.010	0.001	0.060	−0.447	−0.835	−0.059	0.776	0.640	0.434	0.942
	REML&MH				−0.431	−0.809	−0.053	0.756	0.650	0.445	0.949
BB	MP&IV	0.017	0.002	0.095	−0.469	−0.920	−0.018	0.902	0.626	0.399	0.982
	MP&MH				−0.459	−0.898	−0.020	0.879	0.632	0.407	0.981
BB	cMP&IV	0.018	0.003	0.094	−0.474	−0.942	−0.007	0.936	0.623	0.390	0.993
	cMP&MH				−0.472	−0.927	−0.016	0.911	0.624	0.396	0.984
BB	BD&IV	0.019	0.003	0.107	−0.475	−0.944	−0.006	0.938	0.622	0.389	0.994
	BD&MH				−0.463	−0.920	−0.021	0.899	0.630	0.399	0.980

Simulation study

In this section, we provide a simulation study to assess the performance of point and interval estimators of the overdispersion parameter ρ and the combined LOR θ in the BB model of meta‐analysis. We assess the bias of five point estimators of ρ: the moment method (4.5), the Mandel‐Paule‐inspired method ρ , the corrected Mandel‐Paule estimator based on the gamma approximation to Q distribution ρ , the REML method (4.2), and the BD‐based method (4.12). We also assess four related confidence intervals for ρ (4.6), (4.8), (4.3), and (4.13) for their coverage at the 95% confidence level. Additionally, we compare two estimation methods for obtaining point and interval estimators of the combined odds ratio or its log, the inverse‐variance method and the modified MH method (3.5). We combine the five above‐mentioned point estimators of ρ with these two methods of obtaining the combined effect , resulting in 10 possible combinations, and we assess these estimators of for bias and for coverage. Typically, small values of ρ, below 0.1, appear in bio‐medical applications, 42, 43. Overdispersion is mostly due to clustering by healthcare provider. However, our range of values of ρ up to 0.3 is comparable with τ 2 values of up to 5 in the standard REM for our choice of values of probabilities and LORs provided below. This correspondence between heterogeneity in the additive REM and BB model is given by equation (3.3).

Simulation design

Sizes of the control and treatment groups were taken equal n 1=n 2=n and were generated from a normal distribution with mean n and variance n/4 rounded to the nearest integer and left truncated at 5. For a given probability p 2, the number of events in the control group X 2 was simulated from a BB (n ,p 2,ρ) distribution using the R package emdbook 44. The number of events in the treatment group X 1 was generated from a BB (n ,p 1,ρ) distribution with for a given LOR value of θ. When ρ=0, the numbers of events for treatment and control arm X were generated from binomial distributions with sample size n and probabilities p , preserving the relation between the probabilities in the treatment and control arms. The following configurations of parameters were included in the simulations. The number of studies K=(5,10,20,30,50,80); average sample sizes in each arm are n=(10,20,40,50,80,100,160,250,640,1000); overdispersion parameter ρ varies between 0 and 0.1 in steps of 0.01, and between 0.1 and 0.3 in steps of 0.05. The values of LOR θ vary from 0 to 3 in steps of 1. The probability in the control group p 2 takes values 0.1,0.2,0.4. A total of 10000 simulations were produced for each combination.

Simulation results

Figures 1 and 2 show the bias and coverage of ρ estimated by the five methods mentioned earlier for 12 combinations of K and n for the case of p 2≡0.1 and θ=0 and varying values of . The bias and coverage of true log‐odds‐ratio θ estimated by the inverse‐variance (θ ) for values of θ=0,1,2, are shown in Figures 3, 4, 5, 6, respectively. Similar figures for bias and coverage of θ by the modified MH method ( ) are given in the Web Appendix, Figure B25–Figure B28. A number of figures show fewer than five lines, because the standard methods including moment based estimation, REML and the Mandel‐Paule method perform similarly. The differences between the standard methods and two new methods, the corrected Mandel‐Paule method and the BD based method, are clearly visible.

Figure 1

Figure 2

Coverage at the nominal confidence level of 0.95 of the intra‐cluster correlation ρ estimated from K studies in the beta‐binomial model for p 2=0.1, θ=0 and for average sample sizes n=20,50,100, and 1000. Interval estimation methods: circles – Q‐profile confidence interval for ρ based on χ 2 distribution, squares – Q‐profile confidence interval for ρ based on Γ distribution), diamonds – Profile likelihood confidence intervals, triangles – Breslow‐Day‐Profile confidence intervals. Light grey line at 0.95.

Figure 3

Bias of overall odds ratio obtained from K studies by the inverse‐variance method with the moment estimator in the weights, for p 2=0.1, and for average sample sizes n=20,50,100, and 1000. The biases are given for θ=0 (circles), θ=1 (circle plus), and θ=2 (circle cross). Light grey line at 0.

Figure 4

Coverage at the nominal confidence level of 0.95 of the overall odds ratio ψ obtained from K studies by the inverse‐variance method, for p 2=0.1, θ=0 and . The inverse‐variance weights use the following estimators of ρ: circles – , squares – Corrected Mandel‐Paule estimator , diamonds – restricted maximum likelihood estimator , triangles – Breslow‐Day estimator and reverse‐triangles – Mandel‐Paule estimator . Light grey line at 0.95.

Figure 5

Coverage at the nominal confidence level of 0.95 of the overall odds ratio ψ obtained from K studies by the inverse‐variance method, for p 2=0.1, θ=1 and . The inverse‐variance weights use the following estimators of ρ: circles – , squares – Corrected Mandel‐Paule estimator , diamonds – restricted maximum likelihood estimator , triangles – Breslow‐Day estimator and reverse‐triangles – Mandel‐Paule estimator . Light grey line at 0.95.

Figure 6

Coverage at the nominal confidence level of 0.95 of the overall odds ratio ψ obtained from K studies by the inverse‐variance method, for p 2=0.1, θ=2 and . The inverse‐variance weights use the following estimators of ρ: circles – , squares – Corrected Mandel‐Paule estimator , diamonds – restricted maximum likelihood estimator , triangles – Breslow‐Day estimator and reverse‐triangles – Mandel‐Paule estimator ) Light grey line at 0.95.

Bias estimated from K studies of the intra‐cluster correlation ρ in the beta‐binomial model for p 2=0.1, θ=0 and for average sample sizes n=20,50,100, and 1000. Estimation methods: circles – Moment estimator , squares – Corrected Mandel‐Paule estimator ), diamonds – , triangles‐ Breslow‐Day based estimator , reverse‐triangles – Mandel‐Paule estimator . Light grey line at 0. Coverage at the nominal confidence level of 0.95 of the intra‐cluster correlation ρ estimated from K studies in the beta‐binomial model for p 2=0.1, θ=0 and for average sample sizes n=20,50,100, and 1000. Interval estimation methods: circles – Q‐profile confidence interval for ρ based on χ 2 distribution, squares – Q‐profile confidence interval for ρ based on Γ distribution), diamonds – Profile likelihood confidence intervals, triangles – Breslow‐Day‐Profile confidence intervals. Light grey line at 0.95. Bias of overall odds ratio obtained from K studies by the inverse‐variance method with the moment estimator in the weights, for p 2=0.1, and for average sample sizes n=20,50,100, and 1000. The biases are given for θ=0 (circles), θ=1 (circle plus), and θ=2 (circle cross). Light grey line at 0. Coverage at the nominal confidence level of 0.95 of the overall odds ratio ψ obtained from K studies by the inverse‐variance method, for p 2=0.1, θ=0 and . The inverse‐variance weights use the following estimators of ρ: circles – , squares – Corrected Mandel‐Paule estimator , diamonds – restricted maximum likelihood estimator , triangles – Breslow‐Day estimator and reverse‐triangles – Mandel‐Paule estimator . Light grey line at 0.95. Coverage at the nominal confidence level of 0.95 of the overall odds ratio ψ obtained from K studies by the inverse‐variance method, for p 2=0.1, θ=1 and . The inverse‐variance weights use the following estimators of ρ: circles – , squares – Corrected Mandel‐Paule estimator , diamonds – restricted maximum likelihood estimator , triangles – Breslow‐Day estimator and reverse‐triangles – Mandel‐Paule estimator . Light grey line at 0.95. Coverage at the nominal confidence level of 0.95 of the overall odds ratio ψ obtained from K studies by the inverse‐variance method, for p 2=0.1, θ=2 and . The inverse‐variance weights use the following estimators of ρ: circles – , squares – Corrected Mandel‐Paule estimator , diamonds – restricted maximum likelihood estimator , triangles – Breslow‐Day estimator and reverse‐triangles – Mandel‐Paule estimator ) Light grey line at 0.95.

Bias and coverage in estimation of intra‐cluster correlation ρ

Bias of estimated ICC ρ is negative, and its magnitude clearly increases in ρ, Figure 1 for p 2=0.1, Figure B1 and Figure B2 in Web Appendix for p 2=0.2 and 0.4. For small numbers of studies K combined with small sample sizes ( ), estimation appears to be the best option. However, for larger sample sizes ( ), the BD‐based estimator is the clear winner. Still, its negative bias increases almost linearly with ρ and is acceptable only for ρ<0.1. Coming to coverage of ρ (Figure 2 and Figure B3, Figure B4 in the Web Appendix), once more, the BD‐based estimator appears to be the safest option, apart from the case of very small sample sizes , where the gamma‐based approximation appears to provide better coverage for . Figure B5 and Figure B6 in the Web Appendix show the bias and coverage in estimation of ρ for four values of θ and increasing sample size n, keeping ρ=0.1 fixed. Similar plots of bias and coverage of ρ for p 2≡0.2 and p 2≡0.4 are given in Figure B7–Figure B10 in the Web Appendix. BD‐based estimator ρ remains the best estimator of ρ for all scenarios for , although it acquires a small positive bias when p 2=0.4 and θ=3, the case corresponding to p 1=0.93. Similarly, in Figure B11–Figure B16 in the Web Appendix, the results are presented for three values of p 2 and increasing sample size n, keeping the value of θ fixed. Both bias and coverage improve when the probabilities in both arms are farther from the edges.

Bias in estimation of odds ratio ψ

Bias of the estimated OR was practically the same regardless of the method used for estimation of ICC ρ. This may be due to similarity of sample sizes across studies in our simulations, as the inflation terms (1+(n −1)ρ) in the normalized individual weights ‘almost’ cancel. Without loss of generality, we plotted the results for bias of obtained when using the moment estimator in Figure 3 for values of log‐odds θ=0,1 and 2. There is no bias when θ=0, that is, when the probabilities of an event in two arms are the same, but the bias clearly increases with increasing values of θ and/or ρ. The bias for the inverse‐variance weights is within 10% for or , which would cover the major part of values of these parameters in practice, as θ=2 corresponds to an OR of 7.39, and the values of ICC ρ are usually small. An explanation of this bias and a possible remedy are provided in Section 6.2.3. Unfortunately, the bias is substantially higher for the modified MH method, especially for small numbers of studies K and large values of ρ and n, and the coverage deteriorates accordingly, see Figure B25–Figure B34 in the Web Appendix, and therefore we do not pursue this estimator further.

Bias of sample log‐odds‐ratio under beta‐binomial model

The bias of a number of popular effect measures used for binary data under REMs is discussed in 45. For log‐odds, it is well known that the sample log‐odds‐ratio where the probabilities of events p 1 and p 2 in treatment and control groups are estimated by sample proportions , has a bias of order 1/n under the FEM ρ=0. The standard bias correction studied by Gart et al. (1985) adds 1/2 to X and to n −X , that is, uses when estimating the log‐odds to eliminate the 1/n bias term at the null model ρ=0. Expanding the log‐odds by Taylor series for a general ρ, and taking expectations, (see 45 for details of the derivation) where, importantly, the second term includes a bias of order O(1) when ρ≠0. Therefore, the bias of the sample log‐odds‐ratio is When log‐odds‐ratio θ=0, that is, when the probabilities in the two arms are equal, the biases for sample log‐odds in the two arms cancel out. Thus, the estimate is unbiased to order 1/n. However, when θ≠0 and the probabilities in the two arms are not equal, the sample OR is biased to order O(1), and this bias is not ameliorated by the correction. For example, when p 1=0.1 and p 2=0.4, that is, θ=−1.791, the main bias term is (−4.444+0.417)ρ, increasing linearly with the ICC ρ. Figure C35 in the Web Appendix illustrates the quality of this linear approximation to bias. It works well for small values of ρ, but the bias increases and higher‐order terms become more important for larger values of ρ. In meta‐analysis with fixed weights, it would be possible to correct the resulting bias of the overall effect measure for small values of ρ, but the use of inverse‐variance weights also affects the bias and makes such a correction much more difficult. Luckily, the resulting bias is not very large, as we have seen in Section 6.2.2. We believe that the origin of the higher bias in the corrected MH method is similar, but its consequences are graver.

Coverage of odds‐ratio ψ

The method used for estimation of ICC ρ is of utmost importance for correct estimation of variance, and therefore the coverage of the OR ψ. Plots of coverage for p 2 j=0.1 are is presented in Figures 4, 5, 6 for θ=0,1 and 2, respectively. Overall, as with bias, the modified Mandel‐Paule estimator results in the best coverage for small sample sizes up to 50, and ρ provides superior coverage for . All other estimators of ρ result in inferior coverage, especially for large values of ρ. However, there are important differences in coverage when using the best estimators of ρ because of differences in true value of the OR. For the small number of studies K=5, the coverage is too low for all values of θ, but it drifts from about 90% to about 87% even when the best estimator of ρ is used. Starting from K=10, the coverage is good for θ=0, but becomes lower than nominal when θ increases. It is still reasonable, at about 93%, for θ=1, but reaches 90% or even somewhat lower for used with large sample sizes n=1000. This is due to the increasing biases in the estimation of ψ combined with the ‘improved’ precision for larger sample sizes. Similar plots of coverage for p 2=0.2 and 0.4 when θ=0 are given in the Web Appendix (Figure B17, Figure B18). Figure B19 ‐ Figure B24 in the Web Appendix present the bias and coverage when estimating θ by for different values of p 2 and increasing sample size n, keeping the value of θ fixed. These figures clearly show the biases and reduced coverage of OR due to transformation bias discussed in the previous section. Coverage achieved when is used in the weights is superior for moderate to large sample sizes.

Discussion

In this paper, we developed theory of meta‐analysis of ORs based on the BB model. This model is a natural alternative to the standard REM based on normality of random effects. Of course, other combinations of distributions are possible for meta‐analysis of binomially distributed data. 13 suggest using exact hypergeometric likelihood for individual studies combined with normally distributed random effect for log‐odds. 46 discuss a family of compound binomial distributions obtained by using mixing distributions from the generalized inverse gaussian family of distributions, but these distributions have not been used so far in meta‐analysis. We have concentrated on the case of two independent BB distributions in two arms of each study. We have proposed two new methods of estimation of the ICC ρ in meta‐analysis based on this model. Both our methods work considerably better than other, more traditional methods suggested by 12, and they complement each other by being applicable to meta‐analyses of smaller or larger studies. This model is similar to bivariate binomial‐normal REM for log‐odds‐ratios discussed by [13, p.3056]. The latter model can also incorporate a correlation between the two arms of the same study. However, a similar extension of the BB model is not straightforward. A version of a bivariate BB distribution was proposed by 47, but this distribution has a strictly positive lower bound for correlation between the marginals, so it does not include the case of independent BB distributions. Moreover, 47 show that ‘independence cannot be obtained as a limit in the parameters without sacrificing the overdispersion’. They also discuss other, previously suggested, versions of a bivariate BB distribution, and possible extensions aimed at resolving this problem, but none are satisfactory. However, a different version that allows a range of correlation values, including zero correlation, was applied to meta‐analysis in 48. A new bivariate beta distribution was recently proposed by 49, but so far it has not been used for mixing binomial distributions. An important question is how to differentiate between possible REMs, and how robust are the standard REM methods for meta‐analysis of LORs when the BB model is true. A good summary of existing diagnostic methods to differentiate between BB and logistic‐normal model is provided by 50, however this is a difficult task, 51. In Section D of Web Appendix, we provide a simulation study to answer the second question. Briefly, the heterogeneity is best assessed by the moment‐based DerSimonian‐Laird method, in agreement with 52, who do not recommend the use of likelihood based methods for the dependent binary data, because these methods are not robust against model misspecification. The bias of LOR θ is very considerable for θ=1 and 2, Figure D42, but it is larger than and in the opposite direction to that of θ estimated from the true BB model, Figure 3. This bias does not visibly depend on the method of estimation of τ 2. We also briefly considered a model with a BB distribution in the treatment arm only. This model is analogous to a version of unconditional random effects logistic regression by 53. In this model, the study‐specific log‐odds of the control groups constitute K additional parameters, and this model is not appropriate when , 13. We also proposed a variant of the MH method for meta‐analysis of ORs. Unfortunately, in simulations this method was very biased, especially for ORs greater than 1. Elimination of this bias will be pursued elsewhere. The traditional inverse‐variance approach to combining LORs using ICC ρ in the weights estimated by one of our methods results in reasonable, although somewhat low coverage for a realistic range of ORs and ICCs. Developed methods and R programs, provided in the Web Appendix, make the BB model a feasible alternative to the standard REM for meta‐analysis of ORs. Supporting info item Click here for additional data file. Supporting info item Click here for additional data file. Supporting info item Click here for additional data file. Supporting info item Click here for additional data file.

32 in total

1. Statistical aspects of the analysis of data from retrospective studies of disease.

Authors: N MANTEL; W HAENSZEL
Journal: J Natl Cancer Inst Date: 1959-04 Impact factor: 13.506

2. Random effects meta-analysis of event outcome in the framework of the generalized linear mixed model with applications in sparse data.

Authors: Theo Stijnen; Taye H Hamza; Pinar Ozdemir
Journal: Stat Med Date: 2010-12-20 Impact factor: 2.373

3. Misunderstandings about Q and 'Cochran's Q test' in meta-analysis.

Authors: David C Hoaglin
Journal: Stat Med Date: 2015-08-24 Impact factor: 2.373

4. Intraclass correlation coefficient and outcome prevalence are associated in clustered binary data.

Authors: M C Gulliford; G Adams; O C Ukoumunne; R Latinovic; S Chinn; M J Campbell
Journal: J Clin Epidemiol Date: 2005-03 Impact factor: 6.437

5. Random-effects model for meta-analysis of clinical trials: an update.

Authors: Rebecca DerSimonian; Raghu Kacker
Journal: Contemp Clin Trials Date: 2006-05-12 Impact factor: 2.226

6. Confidence intervals for the amount of heterogeneity in meta-analysis.

Authors: Wolfgang Viechtbauer
Journal: Stat Med Date: 2007-01-15 Impact factor: 2.373

7. Bivariate random effects models for meta-analysis of comparative studies with binary outcomes: methods for the absolute risk difference and relative risk.

Authors: Haitao Chu; Lei Nie; Yong Chen; Yi Huang; Wei Sun
Journal: Stat Methods Med Res Date: 2010-12-21 Impact factor: 3.021

8. Flexible parametric models for random-effects distributions.

Authors: Katherine J Lee; Simon G Thompson
Journal: Stat Med Date: 2008-02-10 Impact factor: 2.373

9. Intra-cluster correlation coefficients in adults with diabetes in primary care practices: the Vermont Diabetes Information System field survey.

Authors: Benjamin Littenberg; Charles D MacLean
Journal: BMC Med Res Methodol Date: 2006-05-03 Impact factor: 4.615

10. An accurate test for homogeneity of odds ratios based on Cochran's Q-statistic.

Authors: Elena Kulinskaya; Michael B Dollinger
Journal: BMC Med Res Methodol Date: 2015-06-10 Impact factor: 4.615

7 in total

1. Overdispersion models for correlated multinomial data: Applications to blinding assessment.

Authors: V Landsman; D Landsman; C S Li; H Bang
Journal: Stat Med Date: 2019-08-28 Impact factor: 2.373

2. Discovery of Synergistic Drug Combinations for Colorectal Cancer Driven by Tumor Barcode Derived from Metabolomics "Big Data".

Authors: Bo Lv; Ruijie Xu; Xinrui Xing; Chuyao Liao; Zunjian Zhang; Pei Zhang; Fengguo Xu
Journal: Metabolites Date: 2022-05-30

Review 3. CYP2D6 polymorphism and its impact on the clinical response to metoprolol: A systematic review and meta-analysis.

Authors: Maxime Meloche; Michael Khazaka; Imad Kassem; Amina Barhdadi; Marie-Pierre Dubé; Simon de Denus
Journal: Br J Clin Pharmacol Date: 2020-04-05 Impact factor: 4.335