| Literature DB >> 34505983 |
Lifeng Lin1, Chang Xu2, Haitao Chu3.
Abstract
BACKGROUND: Meta-analysis is increasingly used to synthesize proportions (e.g., disease prevalence). It can be implemented with widely used two-step methods or one-step methods, such as generalized linear mixed models (GLMMs). Existing simulation studies have shown that GLMMs outperform the two-step methods in some settings. It is, however, unclear whether these simulation settings are common in the real world. We aim to compare the real-world performance of various meta-analysis methods for synthesizing proportions.Entities:
Keywords: Cochrane review; data transformation; generalized linear mixed model; meta-analysis; proportion
Mesh:
Year: 2021 PMID: 34505983 PMCID: PMC8432281 DOI: 10.1007/s11606-021-07098-5
Source DB: PubMed Journal: J Gen Intern Med ISSN: 0884-8734 Impact factor: 5.128
Results Produced by the Various Methods for Synthesizing Proportions Among Cochrane Datasets (n = 43,644)
| 260 (0.6%) | 0.9% (0.2%, 2.4%) | 1.10 (1.03, 1.46) | |
| 215 (0.5%) | 0.6% (0.2%, 1.8%) | 1.07 (1.02, 1.42) | |
| 113 (0.3%) | 0.4% (0.0%, 1.3%) | 1.05 (1.01, 1.26) | |
| 125 (0.3%) | 0.3% (−0.1%, 1.2%) | 1.06 (1.01, 1.25) | |
| 125 (0.3%) | 0.4% (−0.1%, 1.4%) | 1.06 (1.01, 1.26) | |
| 125 (0.3%) | 0.5% (0.0%, 1.5%) | 1.06 (1.01, 1.30) | |
| 125 (0.3%) | 0.6% (0.1%, 1.6%) | 1.06 (1.01, 1.36) | |
| 3822 (8.8%) | 0.0% (−0.4%, 0.0%) | 1.01 (1.00, 1.03) | |
| 2766 (6.3%) | 0.0% (0.0%, 0.2%) | 1.00 (1.00, 1.02) | |
| 5708 (13.1%) | 0.0% (−0.5%, 0.0%) | 1.02 (1.00, 1.10) | |
| 2818 (6.5%) | 0.0% (−0.2%, 0.0%) | 1.00 (1.00, 1.01) | |
| 2131 (4.9%) | Reference | Reference |
*The column of “failure” gives the counts of datasets that led to computational issues when using the various methods. The corresponding proportions are given in parentheses
†The columns of “absolute difference” and “fold change” give the medians of absolute differences and fold changes of the overall proportion estimates produced by the various methods compared with the reference method. The corresponding interquartile ranges are given in parentheses. The fold change is the ratio of a larger estimate of the overall proportion divided by a smaller estimate
Abbreviations: Two-step (log, logit, or arcsine), two-step method with the log, logit, or arcsine-square-root transformation; Two-step (DAS-H, DAS-G, DAS-A, or DAS-IV), two-step method with the Freeman–Tukey double-arcsine (DAS) transformation, using the harmonic (H), geometric (G), or arithmetic (A) mean of study-specific sample sizes, or using the inverse of the variance (IV) of the synthesized result, as the overall sample size; GLMM (log, logit, probit, cauchit, or cloglog), generalized linear mixed model with the log, logit, probit, cauchit, or complementary log-log link
Figure 1Proportions (with 95% confidence intervals) of Cochrane datasets that led to computational issues when using various meta-analysis methods, categorized by the total sample size within a meta-analysis. The two-step method (DAS-H, DAS-G, DAS-A, or DAS-IV) corresponds to the Freeman–Tukey double-arcsine (DAS) transformation, using the harmonic (H), geometric (G), or arithmetic (A) mean of study-specific sample sizes, or using the inverse of the variance (IV) of the synthesized result, as the overall sample size.
Figure 2Scatter plots of overall proportion estimates (in percentage) produced by various meta-analysis methods against those by the generalized linear mixed model with the logit link among Cochrane datasets. In each panel, the diagonal dashed line represents the identity line, and Spearman’s rank correlation coefficient ρ between the corresponding two meta-analysis methods is displayed. The two-step method (DAS-H, DAS-G, DAS-A, or DAS-IV) corresponds to the Freeman–Tukey double-arcsine (DAS) transformation, using the harmonic (H), geometric (G), or arithmetic (A) mean of study-specific sample sizes, or using the inverse of the variance (IV) of the synthesized result, as the overall sample size.
Figure 3Bland–Altman plots of agreement between overall proportion estimates (in percentage) produced by various meta-analysis methods and those by the generalized linear mixed model with the logit link among Cochrane datasets. In each panel, the horizontal solid line represents the mean difference, and the horizontal dashed lines represent 95% limits of agreement. The two-step method (DAS-H, DAS-G, DAS-A, or DAS-IV) corresponds to the Freeman–Tukey double-arcsine (DAS) transformation, using the harmonic (H), geometric (G), or arithmetic (A) mean of study-specific sample sizes, or using the inverse of the variance (IV) of the synthesized result, as the overall sample size.
Figure 4Box plots of fold changes of overall proportion estimates produced by various meta-analysis methods compared with those by the generalized linear mixed model with the logit link among Cochrane datasets, categorized by the total event count within a meta-analysis. Because many datasets led to extreme values of fold changes, the box plots do not present outliers, and the vertical axis only presents the range from 1 to 2. The two-step method (DAS-H, DAS-G, DAS-A, or DAS-IV) corresponds to the Freeman–Tukey double-arcsine (DAS) transformation, using the harmonic (H), geometric (G), or arithmetic (A) mean of study-specific sample sizes, or using the inverse of the variance (IV) of the synthesized result, as the overall sample size.
Figure 5Box plots of absolute differences between overall proportion estimates (in percentage) produced by various meta-analysis methods and those by the generalized linear mixed model with the logit link among Cochrane datasets, categorized by the total sample size within a meta-analysis. In each panel, the horizontal dashed line represents no difference. Because many datasets led to extreme values of absolute differences, the box plots do not present outliers, and the vertical axis only presents the range from − 6 to 6%. The two-step method (DAS-H, DAS-G, DAS-A, or DAS-IV) corresponds to the Freeman–Tukey double-arcsine (DAS) transformation, using the harmonic (H), geometric (G), or arithmetic (A) mean of study-specific sample sizes, or using the inverse of the variance (IV) of the synthesized result, as the overall sample size.