Literature DB >> 35151340

Empirical comparisons of heterogeneity magnitudes of the risk difference, relative risk, and odds ratio.

Yuxi Zhao¹, Elizabeth H Slate¹, Chang Xu^2,3, Haitao Chu⁴, Lifeng Lin⁵.

Abstract

Entities: Chemical

Mesh：
Humans
Odds Ratio
Risk

Year: 2022 PMID： 35151340 PMCID： PMC8840324 DOI： 10.1186/s13643-022-01895-7

Source DB: PubMed Journal: Syst Rev ISSN： 2046-4053

× No keyword cloud information.

Introduction

In epidemiology and medical research, the choices of effect measures for binary outcomes have been long debated. Common choices include the risk difference (RD), relative risk (RR), and odds ratio (OR). The RD is often considered more heterogeneous than the ratio measures, RR and OR [1, 2]. Nevertheless, the arguments supporting this claim have been challenged [3]. For example, more rejections of homogeneity in hypothesis testing of RDs are expected than those of ORs. This article empirically compares the heterogeneity magnitudes between the RD, RR, and OR.

Methods

We applied heterogeneity measures to a large Cochrane database of meta-analyses [4]. The Cochrane Library publishes systematic reviews on a wide range of healthcare-related topics. We searched for all Cochrane reviews available online from issue 1 in 2003 to issue 1 in 2020. The search strategy for an older version of the Cochrane database was used in our earlier work [5-7]. In the Cochrane Library, each issue was published monthly, and it included systematic reviews on new topics with formal meta-analyses as well as protocols without formal analyses. An issue may also publish notices to withdraw outdated or flawed reviews and protocols. In this study, we iteratively included all published reviews that reported statistical data in each issue and excluded all withdrawn reviews. In total, we identified 64,929 meta-analyses. In addition, a Cochrane review could investigate multiple disease outcomes and/or multiple intervention comparisons. Therefore, the meta-analyses within the review may not be independent due to the correlations between outcomes or intervention comparisons. For removing the impact of such potential correlations on heterogeneity, we also conducted sensitivity analyses, which were restricted to the meta-analyses with the largest number of studies from each Cochrane review. A total of 3125 meta-analyses were included in the sensitivity analyses. We focused on the heterogeneity measure I 2 and also considered the CVB statistic as a supplemental measure. We reanalyzed each Cochrane meta-analysis and obtained the heterogeneity measures using each effect measure. The RR and OR were analyzed on the logarithmic scale. The I 2 is widely used and is interpreted as a percentage of total variation due to heterogeneity rather than sampling error [4]. The CVB is the between-study coefficient of variation used for providing further insight into heterogeneity magnitudes; it is calculated as the ratio of the between-study standard deviation τ over the absolute value of the overall effect size [8]. In this article, we estimated the between-study variance τ 2 using both the DerSimonian–Laird (DL) and restricted maximum likelihood (REML) methods; the former is the most popular while the latter is recommended with better statistical performance.

Results

Figure 1 and Fig. S1 present the histograms of on a logarithmic scale for the RD, RR, and OR based on the REML and DL estimation methods. Because τ that truly equals 0 may not be exactly estimated as 0, depending on the tolerance of the REML algorithm’s convergence, the histograms in Fig. 1 shows small peaks at very small values. As the RD, RR, and OR are on different scales, the magnitudes of their corresponding may not be directly comparable. In general, the RR and OR led to < 0.01 in more meta-analyses than the RD (Table S1).

Fig. 1

Histograms of between-study standard deviations on a logarithmic scale based on the restricted maximum likelihood method for the RD, RR, and OR. The histograms are restricted to the range from −10 to 2 for Among the 64,929 Cochrane meta-analyses, 48.09% of RDs had I 2 = 0% based on the DL method, while about 56% of RRs and ORs had I 2 = 0%. The REML algorithm failed to converge in a few meta-analyses (≤ 0.22%) and I 2 was not calculable; for the remainder, 43.56% of RDs had I 2 = 0%, while about 50% of RRs and ORs had I 2 = 0%. About 6% of RDs, RRs, and ORs had 0% 0% for all three measures and both the DL and REML methods to avoid the impact of many I 2 = 0%. The RDs’ descriptive statistics of I 2 were noticeably larger than the RRs’ and ORs’.
Fig. 2
Histograms of I 2 (A) and CVB on a logarithmic scale (B) based on the restricted maximum likelihood method for the RD, RR, and OR. A Restricted to I 2 > 1%. B Restricted to the range from −10 to 10 for better visualizations
Histograms of I 2 (A) and CVB on a logarithmic scale (B) based on the restricted maximum likelihood method for the RD, RR, and OR. A Restricted to I 2 > 1%. B Restricted to the range from −10 to 10 for better visualizations Categorized by the number of studies, the average study size, and the total number of events in a meta-analysis, RDs continued to have larger I 2 than RRs and ORs in each category (Fig. S3). The I 2 slightly decreased as the number of studies increased, consistent with previous findings [9]. It remained nearly unchanged as the average study size increased and noticeably increased as the total number of events increased. Similar to the trends of I 2, the histograms in Fig. 2B and S4 indicate that RDs generally had greater CVB values than RRs and ORs. The conclusions regarding CVB by categories of number of studies, average study size, and the total number of events in a meta-analysis were also consistent with those regarding I 2 (Figure S5). In sensitivity analyses using the 3125 meta-analyses with the largest number of studies from each review, the histograms’ overall trends were similar to those based on the complete datasets (Figs. S6 and S7).
Discussion
Our findings consistently supported that the RD seems more heterogeneous than the RR and OR. Yet, large uncertainties in I 2 may confound these findings. The accuracy of I 2 may also be questionable in meta-analyses with few studies and/or rare events [10]. In addition, I 2 has several limitations; for example, it increases as sample sizes increase for the same τ 2. The CVB overcomes this drawback, while it is also subject to some disadvantages, as it increases rapidly for the overall effect size approaching 0. Nevertheless, they are arguably the appropriate tools with intuitive interpretations available in the current research synthesis literature to compare heterogeneity of measures across different scales. We intend our findings as supporting evidence rather than an assertion about heterogeneity magnitudes. Additional file 1: Table S1. Summary of situations where is not calculable or takes very small values (<0.01) among the 64,929 meta-analyses. Table S2. Summary of situations where I2 is not calculable, equals 0%, or takes very small values (≤1%) among the 64,929 meta-analyses. Table S3. Comparisons between I2 of the RD, RR, and OR within the 64,929 meta-analyses. Table S4. Q test results (with the significance level at 0.05) among the pairs of RD, RR, and OR within the 64,929 meta-analyses. Table S5. Summary of descriptive statistics of I2 (%) among the 23,966 meta-analyses with I2>0% for all three effect measures based on both the DL and REML methods. Figure S1. Histograms of between-study standard deviations on a logarithmic scale based on the DerSimonian–Laird method for the RD, RR, and OR. The histograms are restricted to the range from −8 to 2 for log. Figure S2. Histogram of I2 based on the DerSimonian–Laird method for the RD, RR, and OR, restricted to I2>1% for better visualizations. Figure S3. Boxplots of I2 for the RD, RR, and OR categorized by the number of studies (panels a and b), average study size (panels c and d), and total number of events (panels e and f), restricted to I2>1%. The left panels a, c, and e are based on the DerSimonian–Laird method, and the right panels b, d, and f are based on the restricted maximum likelihood (REML) method. Figure S4. Histogram of CVB on a logarithmic scale based on the DerSimonian–Laird method for the RD, RR, and OR. Figure S5. Boxplots of CVB on a logarithmic scale for the RD, RR, and OR categorized by the number of studies (panels a and b), average study size (panels c and d), and total number of events (panels e and f). The left panels a, c, and e are based on the DerSimonian–Laird method, and the right panels b, d, and f are based on the restricted maximum likelihood (REML) method. Figure S6. Histograms of I2 for the RD, RR, and OR, restricted to I2>1% for better visualizations, among the meta-analyses with the largest number of studies from each Cochrane review. Panel a is based on the DerSimonian–Laird method, and panel b is based on the restricted maximum like-lihood (REML) method. Figure S7. Histograms of CVB on a logarithmic scale for the RD, RR, and OR among the meta-analyses with the largest number of studies from each Cochrane review. Panel a is based on the DerSimonian–Laird method, and panel b is based on the restricted maximum likelihood (REML) method.

10 in total

Review 1. Heterogeneity and statistical significance in meta-analysis: an empirical study of 125 meta-analyses.
Authors: E A Engels; C H Schmid; N Terrin; I Olkin; J Lau
Journal: Stat Med       Date: 2000-07-15       Impact factor: 2.373
2. Issues in the selection of a summary statistic for meta-analysis of clinical trials with binary outcomes.
Authors: Jonathan J Deeks
Journal: Stat Med       Date: 2002-06-15       Impact factor: 2.373
Review 3. Measuring inconsistency in meta-analyses.
Authors: Julian P T Higgins; Simon G Thompson; Jonathan J Deeks; Douglas G Altman
Journal: BMJ       Date: 2003-09-06
4. Small studies are more heterogeneous than large ones: a meta-meta-analysis.
Authors: Joanna IntHout; John P A Ioannidis; George F Borm; Jelle J Goeman
Journal: J Clin Epidemiol       Date: 2015-04-02       Impact factor: 6.437
5. Confidence intervals for heterogeneity measures in meta-analysis.
Authors: Bahi Takkouche; Polyna Khudyakov; Julián Costa-Bouzas; Donna Spiegelman
Journal: Am J Epidemiol       Date: 2013-08-06       Impact factor: 4.897
6. Is the Risk Difference Really a More Heterogeneous Measure?
Authors: Charlie Poole; Ian Shrier; Tyler J VanderWeele
Journal: Epidemiology       Date: 2015-09       Impact factor: 4.822
7. A proposed framework to guide evidence synthesis practice for meta-analysis with zero-events studies.
Authors: Chang Xu; Luis Furuya-Kanamori; Liliane Zorzela; Lifeng Lin; Sunita Vohra
Journal: J Clin Epidemiol       Date: 2021-02-13       Impact factor: 6.437
8. Empirical Comparison of Publication Bias Tests in Meta-Analysis.
Authors: Lifeng Lin; Haitao Chu; Mohammad Hassan Murad; Chuan Hong; Zhiyong Qu; Stephen R Cole; Yong Chen
Journal: J Gen Intern Med       Date: 2018-04-16       Impact factor: 5.128
9. Performance of Between-study Heterogeneity Measures in the Cochrane Library.
Authors: Xiaoyue Ma; Lifeng Lin; Zhiyong Qu; Motao Zhu; Haitao Chu
Journal: Epidemiology       Date: 2018-11       Impact factor: 4.822
10. The magnitude of small-study effects in the Cochrane Database of Systematic Reviews: an empirical study of nearly 30 000 meta-analyses.
Authors: Lifeng Lin; Linyu Shi; Haitao Chu; Mohammad Hassan Murad
Journal: BMJ Evid Based Med       Date: 2019-07-04

10 in total