| Literature DB >> 26196398 |
Binod Neupane1, Joseph Beyene2.
Abstract
In a meta-analysis with multiple end points of interests that are correlated between or within studies, multivariate approach to meta-analysis has a potential to produce more precise estimates of effects by exploiting the correlation structure between end points. However, under random-effects assumption the multivariate estimation is more complex (as it involves estimation of more parameters simultaneously) than univariate estimation, and sometimes can produce unrealistic parameter estimates. Usefulness of multivariate approach to meta-analysis of the effects of a genetic variant on two or more correlated traits is not well understood in the area of genetic association studies. In such studies, genetic variants are expected to roughly maintain Hardy-Weinberg equilibrium within studies, and also their effects on complex traits are generally very small to modest and could be heterogeneous across studies for genuine reasons. We carried out extensive simulation to explore the comparative performance of multivariate approach with most commonly used univariate inverse-variance weighted approach under random-effects assumption in various realistic meta-analytic scenarios of genetic association studies of correlated end points. We evaluated the performance with respect to relative mean bias percentage, and root mean square error (RMSE) of the estimate and coverage probability of corresponding 95% confidence interval of the effect for each end point. Our simulation results suggest that multivariate approach performs similarly or better than univariate method when correlations between end points within or between studies are at least moderate and between-study variation is similar or larger than average within-study variation for meta-analyses of 10 or more genetic studies. Multivariate approach produces estimates with smaller bias and RMSE especially for the end point that has randomly or informatively missing summary data in some individual studies, when the missing data in the endpoint are imputed with null effects and quite large variance.Entities:
Mesh:
Year: 2015 PMID: 26196398 PMCID: PMC4509672 DOI: 10.1371/journal.pone.0133243
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Simulation scenarios and methods for IPD data generation.
| Parameters | Assumed values |
|---|---|
| Suffix |
|
| No. of replications |
|
| No. of end points |
|
| No. of studies ( |
|
| MAF |
|
| Size of study |
|
| MAF in study |
|
| No. of minor allele |
|
| Genotype distribution | In study |
| Heritability |
|
| SNP effects |
|
| SNP joint effect |
|
| Between-study correlation |
|
| Heterogeneity | I2 = 25%,50%,75% (low, moderate, and high heterogeneity) |
| Between study variance, | First, an average of within-study variance |
| Study-wise effects |
|
| Within-Study correlation |
|
| Baseline effect (when |
|
| Residual variance matrix | Diagonal element of |
| IPD data generation |
|
| Summary data in study |
|
Abbreviations: IPD, individual participant data; MAF, minor allele frequency; HWE, Hardy-Weinberg Equilibrium; SNP, single nucleotide polymorphism.
Relative mean bias percentage, RMSE and coverage probability when N = 10000, m = 10, β 1 = 0.1, β 2 = 0.1, ρ = 0.5, ρ = 0.3.
| Summary | Effects | Heterogeneity | Correlation | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| data | % Bias | RMSE | Coverage | % Bias | RMSE | % Bias | %( | |||||||||
| Method | scenario |
|
|
|
|
|
|
|
|
|
|
| ( | -1 | +1 | |1| |
| MV | COM | 50% | 0 | 1 | 0 | 0 | 95.7 | 96.2 | 6 | 3 | -3 | -3 | -17 | 11.8 | 30.5 | 42.3 |
| UV | COM | 0 | 1 | 0 | 0 | 95.3 | 95.8 | 3 | 0 | 0 | 0 | — | — | — | — | |
| MV | COM | 0 | 1 | 0 | 0 | 95.9 | 96.2 | 8 | 8 | -3 | -3 | 31 | 6.1 | 49.3 | 55.4 | |
| MV | MAR | 0 | 0 | 0 | 0 | 95.9 | 95.0 | 6 | 8 | -3 | 0 | -22 | 15.7 | 38.6 | 54.3 | |
| UV | MAR | 0 | 0 | 0 | 0 | 95.3 | 96.3 | 3 | 3 | 0 | 0 | — | — | — | — | |
| MV | MIF | 0 | 27 | 0 | -5 | 95.7 | 80.7 | 6 | -19 | -3 | -3 | -34 | 21.3 | 41.9 | 63.3 | |
| UV | MIF | 0 | 31 | 0 | 0 | 95.3 | 81.5 | 3 | -25 | 0 | 0 | — | — | — | — | |
| MV | COM | 75% | 0 | 0 | 0 | 0 | 95.0 | 94.9 | 0 | -1 | 0 | 0 | -9 | 2.1 | 7.7 | 9.8 |
| UV | COM | 0 | 0 | 0 | 0 | 95.0 | 94.8 | 0 | -1 | 0 | 0 | — | — | — | — | |
| MV | COM | 0 | 0 | 0 | 0 | 95.0 | 94.8 | 1 | 0 | 0 | 0 | 14 | 1.3 | 15.6 | 16.9 | |
| MV | MAR | 0 | 0 | 0 | 0 | 95.0 | 93.2 | 1 | 1 | 0 | 1 | -12 | 4.8 | 15.9 | 20.7 | |
| UV | MAR | 0 | 0 | 0 | 0 | 95.0 | 95.4 | 0 | 0 | 0 | 0 | — | — | — | — | |
| MV | MIF | 0 | 29 | 0 | -4 | 95.1 | 79.1 | 1 | 6 | 0 | 0 | -14 | 7 | 18.1 | 25.1 | |
| UV | MIF | 0 | 35 | 0 | 0 | 95.0 | 79.9 | 0 | 5 | 0 | 0 | — | — | — | — | |
Abbreviation: RMSE, root mean square error; MV, multivariate meta-analysis; UV, Univariate meta-analysis; COM, complete data scenario; MAR, end point 2 missing at random for 30% studies; MAR, end point 2 missing informatively for 30% studies.
aThe between-study variances for both end-points are: = 0.0036 for I 2 = 50%, = 0.0108 for I 2 = 75%.
bRMSE of estimates by MV method are expressed as % smaller (-) or larger (+) of corresponding estimates by UV method.
c ignored.
Relative mean bias percentage, RMSE, and coverage probability when N = 10000, m = 10, β 1 = 0.2, β 2 = 0.3, β 3 = 0.3, β = 0.6, β = 0.7, ρ = 0.6, ρ = ρ = ρ = 0.
| Summary | Effects | Heterogeneity | Correlation | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| data | % Bias | RMSE | Coverage | % Bias | RMSE | % Bias | %( | ||||||||||||||||
| Method | scenario |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| MV | COM | 50% | 0 | 0 | 0 | 0 | 0 | 0 | 96.6 | 96.1 | 96.7 | 11 | 8 | 13 | -3 | -3 | -3 | -14 | -17 | -12 | 16.7 | 16.9 | 16.8 |
| UV | COM | 0 | 0 | 0 | 0 | 0 | 0 | 95.8 | 95.3 | 95.4 | 3 | 0 | 3 | 0 | 0 | 0 | — | — | — | — | — | — | |
| MV | COM | 0 | 0 | 0 | 0 | 0 | 0 | 96.7 | 96.1 | 96.6 | 11 | 8 | 11 | -3 | -3 | -3 | -14 | -17 | -12 | 16.7 | 16.8 | 16.7 | |
| MV | MAR | 0 | 0 | 0 | 0 | 1 | 0 | 96.7 | 95.8 | 96.6 | 11 | 18 | 13 | -3 | 3 | 0 | -22 | -18 | -20 | 21.8 | 21.8 | 21.9 | |
| UV | MAR | 0 | 0 | 0 | 0 | 0 | 0 | 95.8 | 96.6 | 95.4 | 3 | 3 | 3 | 0 | 0 | 0 | — | — | — | — | — | — | |
| MV | MIF | 0 | 9 | 0 | 0 | -3 | 0 | 96.5 | 86.1 | 96.4 | 11 | -32 | 11 | -3 | -7 | 0 | -29 | -17 | -28 | 26.1 | 25.8 | 26.2 | |
| UV | MIF | 0 | 10 | 0 | 0 | 0 | 0 | 95.8 | 87.2 | 95.4 | 3 | -47 | 3 | 0 | 0 | 0 | — | — | — | — | — | — | |
| MV | COM | 75% | 0 | 0 | 0 | 0 | 0 | 0 | 95.2 | 94.7 | 95.6 | 0 | 1 | 2 | -1 | 0 | -1 | -4 | -2 | -3 | 2.9 | 3.5 | 2.9 |
| UV | COM | 0 | 0 | 0 | 0 | 0 | 0 | 95.0 | 94.7 | 95.1 | -2 | -1 | 0 | 0 | 0 | 0 | — | — | — | — | — | — | |
| MV | COM | 0 | 0 | 0 | 0 | 0 | 0 | 95.3 | 94.7 | 95.5 | 0 | 0 | 2 | -1 | 0 | -1 | -4 | -2 | -3 | 2.7 | 3.3 | 2.9 | |
| MV | MAR | 0 | 0 | 0 | 0 | 0 | 0 | 95.3 | 93.3 | 95.6 | 0 | 6 | 2 | -1 | 5 | -1 | -9 | -2 | -8 | 5.1 | 4.9 | 5.1 | |
| UV | MAR | 0 | 0 | 0 | 0 | 0 | 0 | 95.0 | 94.5 | 95.1 | -2 | 1 | 0 | 0 | 0 | 0 | — | — | — | — | — | — | |
| MV | MIF | 0 | 13 | 0 | 0 | -7 | 0 | 95.3 | 78.3 | 95.7 | 0 | -39 | 2 | -1 | 1 | -1 | -18 | -2 | -19 | 7.5 | 6.4 | 6.8 | |
| UV | MIF | 0 | 16 | 0 | 0 | 0 | 0 | 95.0 | 78.1 | 95.1 | -2 | -48 | 0 | 0 | 0 | 0 | — | — | — | — | — | — | |
Abbreviation: RMSE, root mean square error; MV, multivariate meta-analysis; UV, Univariate meta-analysis; COM, complete data scenario; MAR, end point 2 missing at random for 30% studies; MAR, end point 2 missing informatively for 30% studies.
aThe between-study variances for both end-points are: = 0.0036 for I 2 = 50%, = 0.0108 for I 2 = 75%.
bRMSE of estimates by MV method are expressed as % smaller (-) or larger (+) of corresponding estimates by UV method.
c ignored.
Fig 1Biases in the estimates of τ2, and biases and SEs of the pooled estimates of β 2 from multivariate vs. univariate approaches by whether or not ρ is estimated at parameter boundary in 5000 replications in complete summary data scenario.
Scenario: , ρ = 0.75, ρ = 0.5. Symbols and abbreviations: N, total subjects; m, number of studies, β 2 and τ 2, average effect and between-study standard deviation of true study-wise effects for end point 2, respectively; I 2 = degree of between-study heterogeneity; ρ and ρ , true between-and within-study correlations, respectively; MAF, minor allele frequency; SE, standard error; MV, multivariate approach; UV, univariate approach.
Relative mean bias percentage, RMSE and coverage probability when N = 20000, m = 15, β 1 = 0.1, β 2 = 0.1, ρ = 0.6, ρ = 0.3.
| Summary | Effects | Heterogeneity | Correlation | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| data | % Bias | RMSE | Coverage | % Bias | RMSE | % Bias | %( | |||||||||
| Method | scenario |
|
|
|
|
|
|
|
|
|
|
| ( | -1 | +1 | |1| |
| MV | COM | 50% | 0 | 0 | 0 | 0 | 94.8 | 95.2 | 0 | 4 | 0 | -5 | -5 | 5 | 27 | 32 |
| UV | COM | 0 | 0 | 0 | 0 | 94.7 | 95.1 | 0 | 4 | 0 | 0 | — | — | — | — | |
| MV | COM | 0 | 0 | 0 | 0 | 95.2 | 95.4 | 7 | 7 | 0 | -5 | 34 | 1.6 | 52.4 | 53.9 | |
| MV | MAR | 0 | 0 | 0 | -3 | 95.0 | 94.7 | 4 | 7 | 0 | 0 | -12 | 8.4 | 35.7 | 44.1 | |
| UV | MAR | 0 | 0 | 0 | 0 | 94.7 | 95.6 | 0 | 4 | 0 | 0 | — | — | — | — | |
| MV | MIF | 0 | 27 | 0 | -9 | 94.9 | 66.2 | 4 | -41 | 0 | -4 | -26 | 16.2 | 45.2 | 61.4 | |
| UV | MIF | 0 | 32 | 0 | 0 | 94.7 | 63.1 | 0 | -48 | 0 | 0 | — | — | — | — | |
| MV | COM | 75% | 0 | 0 | 0 | 0 | 94.7 | 94.5 | 0 | 1 | 0 | 0 | -3 | 0.3 | 3.2 | 3.5 |
| UV | COM | 0 | 0 | 0 | 0 | 94.8 | 94.5 | 0 | 1 | 0 | 0 | — | — | — | — | |
| MV | COM | 0 | 0 | 0 | 0 | 94.8 | 94.5 | 1 | 2 | 0 | 0 | 15 | 0.1 | 10 | 10.1 | |
| MV | MAR | 0 | 0 | 0 | -3 | 94.7 | 93.4 | 1 | 2 | 0 | 0 | -4 | 1.1 | 10 | 11.1 | |
| UV | MAR | 0 | 0 | 0 | 0 | 94.8 | 95.0 | 0 | 2 | 0 | 0 | — | — | — | — | |
| MV | MIF | 0 | 30 | 0 | -11 | 94.7 | 71.4 | 1 | -5 | 0 | -3 | -11 | 3.7 | 13.2 | 16.9 | |
| UV | MIF | 0 | 39 | 0 | 0 | 94.8 | 69.4 | 0 | -5 | 0 | 0 | — | — | — | — | |
Abbreviation: RMSE, root mean square error; MV, multivariate meta-analysis; UV, Univariate meta-analysis; COM, complete data scenario; MAR, end point 2 missing at random for 30% studies; MAR, end point 2 missing informatively for 30% studies.
aThe between-study variances for both end-points are: = 0.0027 for I 2 = 50%, = 0.0082 for I 2 = 75%.
bRMSE of estimates by MV method are expressed as % smaller (-) or larger (+) of corresponding estimates by UV method.
c ignored.
Relative mean bias percentage, RMSE and coverage probability when N = 30000, m = 30, β 1 = 0.1, β 2 = 0.2, ρ b = 0.6, ρ w = 0.3.
| Summary | Effects | Heterogeneity | Correlation | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| data | % Bias | RMSE | Coverage | % Bias | RMSE | % Bias | %( | |||||||||
| Method | scenario |
|
|
|
|
|
|
|
|
|
|
| ( | -1 | +1 | |1| |
| MV | COM | 50% | 0 | 0 | 0 | 0 | 95.1 | 95.2 | 0 | 3 | 0 | 0 | 1 | 0.7 | 9.4 | 10.1 |
| UV | COM | 0 | 0 | 0 | 0 | 95.1 | 95.3 | 0 | 3 | 0 | 0 | — | — | — | — | |
| MV | COM | 0 | 0 | 0 | 0 | 95.3 | 95.4 | 6 | 6 | 0 | 0 | 40 | 0.1 | 39.5 | 39.6 | |
| MV | MAR | 0 | 0 | 0 | -2 | 95.1 | 94.8 | 0 | 3 | 0 | 0 | -2 | 1.9 | 16.3 | 18.2 | |
| UV | MAR | 0 | 0 | 0 | 0 | 95.1 | 95.3 | 0 | 3 | 0 | 0 | — | — | — | — | |
| MV | MIF | 0 | 27 | 0 | -12 | 95.2 | 57.8 | 0 | -31 | 0 | -4 | -9 | 6.4 | 26.8 | 33.2 | |
| UV | MIF | 0 | 32 | 0 | 0 | 95.1 | 49.7 | 0 | -33 | 0 | 0 | — | — | — | — | |
| MV | COM | 75% | 0 | 0 | 0 | 0 | 94.8 | 94.5 | 1 | 0 | 0 | 0 | -2 | 0 | 0.1 | 0.1 |
| UV | COM | 0 | 0 | 0 | 0 | 94.9 | 94.4 | 1 | 0 | 0 | 0 | — | — | — | — | |
| MV | COM | 0 | 0 | 0 | 0 | 94.9 | 94.4 | 1 | 1 | 0 | 0 | 15 | 0 | 1.7 | 1.7 | |
| MV | MAR | 0 | 0 | 0 | -3 | 94.8 | 94.1 | 1 | 1 | 0 | 2 | -2 | 0 | 0.7 | 0.7 | |
| UV | MAR | 0 | 0 | 0 | 0 | 94.9 | 94.5 | 1 | 1 | 0 | 0 | — | — | — | — | |
| MV | MIF | 0 | 26 | 0 | -16 | 94.8 | 76.1 | 1 | 6 | 0 | -4 | -1 | 0.1 | 1 | 1.1 | |
| UV | MIF | 0 | 36 | 0 | 0 | 94.9 | 69.9 | 1 | 6 | 0 | 0 | — | — | — | — | |
Abbreviation: RMSE, root mean square error; CI, confidence interval; MV, multivariate meta-analysis; UV, Univariate meta-analysis; COM, complete data scenario; MAR, end point 2 missing at random for 30% studies; MAR, end point 2 missing informatively for 30% studies.
aThe between-study variances for both end-points are: = 0.0036 for I 2 = 50%, = 0.0109 for I 2 = 75%.
bRMSE of estimates by MV method are expressed as % smaller (-) or larger (+) of corresponding estimates by UV method.
c ignored.