| Literature DB >> 18800342 |
Julian P T Higgins1, Ian R White, Judith Anzures-Cabrera.
Abstract
When literature-based meta-analyses involve outcomes with skewed distributions, the best available data can sometimes be a mixture of results presented on the raw scale and results presented on the logarithmic scale. We review and develop methods for transforming between these results for two-group studies, such as clinical trials and prospective or cross-sectional epidemiological studies. These allow meta-analyses to be conducted using all studies and on a common scale. The methods can also be used to produce a meta-analysis of ratios of geometric means when skewed data are reported on the raw scale for every study. We compare three methods, two of which have alternative standard error formulae, in an application and in a series of simulation studies. We conclude that an approach based on a log-normal assumption for the raw data is reasonably robust to different true distributions; and we provide new standard error approximations for this method. An assumption can be made that the standard deviations in the two groups are equal. This increases precision of the estimates, but if incorrect can lead to very misleading results.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18800342 PMCID: PMC2978323 DOI: 10.1002/sim.3427
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Data available for D9N polymorphism in the lipoprotein lipase gene and triglyceride levels.
| Carriers | Non-carriers | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Raw | Log | Raw | Log | |||||||
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | |||
| Boer 2003b | 34 | — | — | 0.31 | 0.58 | 1002 | — | — | 0.33 | 0.53 |
| Copenhagen | 241 | 2.10 | 1.46 | 1.05 | 0.37 | 8429 | 1.85 | 1.54 | 0.98 | 0.34 |
| CDRFMP | 14 | 2.05 | 1.21 | — | — | 364 | 1.57 | 1.11 | — | — |
| EARS I & II | 71 | 1.12 | 0.34 | — | — | 1608 | 0.99 | 0.80 | — | — |
| ECTIM | 22 | 1.82 | 1.46 | — | — | 784 | 1.84 | 1.47 | — | — |
| Ehrenborg 1997 | 15 | 1.01 | 0.36 | — | — | 77 | 0.99 | 0.53 | — | — |
| Ferencak 2003 | 5 | 2.04 | 0.92 | — | — | 195 | 1.81 | 0.84 | — | — |
| FOS | 58 | 1.61 | 0.72 | — | — | 2200 | 1.38 | 1.16 | — | — |
| Glisic 2003b | 4 | 2.42 | 1.53 | 0.74 | 0.60 | 129 | 1.64 | 0.94 | 0.37 | 0.49 |
| Reykjavik | 10 | 1.64 | 1.64 | 0.20 | 0.74 | 274 | 1.04 | 0.49 | −0.05 | 0.42 |
| Rios 2003 | 10 | 1.60 | 0.70 | 0.39 | 0.41 | 187 | 1.75 | 0.92 | 0.43 | 0.50 |
| Schulte 1996 | 17 | 1.96 | 0.82 | — | — | 644 | 1.56 | 0.82 | — | — |
| Talmud 1998 | 12 | 1.35 | 0.52 | — | — | 96 | 1.27 | 0.52 | — | — |
| Yang 2004 | 235 | 2.39 | 1.46 | 0.74 | 0.50 | 1275 | 2.34 | 1.26 | 0.73 | 0.49 |
Results of meta-analyses on raw and logarithmic scales, for example, using three transformation methods.
| Available data | Method 1 | Method 2 | Method 3 | |||||
|---|---|---|---|---|---|---|---|---|
| Difference in means | SE | Difference in means | SE | Difference in means | SE | Difference in means | SE | |
| Analysis on raw scale | ||||||||
| Boer 2003b | — | — | 0.009 | 0.175 (0.176) | −0.034 | 0.145 (0.156) | −0.030 | 0.138 |
| Copenhagen | 0.249 | 0.096 | 0.245 | 0.076 (0.076) | 0.213 | 0.068 (0.070) | 0.200 | 0.067 |
| CDRFMP | 0.478 | 0.329 | ||||||
| EARS I & II | 0.130 | 0.045 | ||||||
| ECTIM | −0.027 | 0.315 | − | − | − | |||
| Ehrenborg 1997 | 0.020 | 0.111 | ||||||
| Ferencak 2003 | 0.230 | 0.414 | ||||||
| FOS | 0.236 | 0.097 | ||||||
| Glisic 2003b | 0.780 | 0.769 | 0.877 | 0.821 (0.829) | 0.732 | 0.588 (0.625) | 0.644 | 0.527 |
| Reykjavik | 0.600 | 0.519 | 0.567 | 0.425 (0.435) | 0.297 | 0.186 (0.195) | 0.270 | 0.254 |
| Rios 2003 | −0.152 | 0.232 | −0.136 | 0.228 (0.228) | −0.068 | 0.270 (0.288) | −0.060 | 0.203 |
| Schulte 1996 | 0.393 | 0.200 | ||||||
| Talmud 1998 | 0.080 | 0.159 | ||||||
| Yang 2004 | 0.050 | 0.102 | 0.027 | 0.089 (0.090) | 0.012 | 0.082 (0.088) | 0.011 | 0.074 |
| Meta-analysis (random-effects, using Taylor SEs) | 0.142 | 0.031 | 0.139 | 0.0310 | 0.135 | 0.028 | 0.125 | 0.028 |
| Analysis on logarithmic scale | ||||||||
| Boer 2003b | −0.022 | 0.100 | − | − | − | |||
| Copenhagen | 0.073 | 0.024 | 0.192 | 0.046 (0.041) | 0.126 | 0.054 (0.047) | 0.127 | 0.048 |
| CDRFMP | — | — | 0.319 | 0.159 (0.150) | 0.266 | 0.191 (0.173) | 0.264 | 0.182 |
| EARS I & II | — | — | 0.332 | 0.041 (0.039) | 0.123 | 0.096 (0.085) | 0.123 | 0.042 |
| ECTIM | — | — | −0.017 | 0.187 (0.153) | −0.015 | 0.173 (0.152) | −0.015 | 0.172 |
| Ehrenborg 1997 | — | — | 0.086 | 0.108 (0.106) | 0.020 | 0.144 (0.136) | 0.020 | 0.111 |
| Ferencak 2003 | — | — | 0.125 | 0.198 (0.194) | 0.1250 | 0.210 (0.199) | 0.119 | 0.215 |
| FOS | — | — | 0.335 | 0.060 (0.058) | 0.158 | 0.111 (0.096) | 0.159 | 0.065 |
| Glisic 2003b | 0.370 | 0.303 | 0.363 | 0.316 (0.294) | 0.389 | 0.292 (0.271) | 0.384 | 0.379 |
| Reykjavik | 0.250 | 0.235 | 0.209 | 0.419 (0.265) | 0.15 | 0.158 (0.150) | 0.388 | 0.388 |
| Rios 2003 | −0.040 | 0.135 | −0.057 | 0.140 (0.137) | −0.091 | 0.169 (0.159) | −0.090 | 0.138 |
| Schulte 1996 | — | — | 0.266 | 0.100 (0.099) | 0.224 | 0.128 (0.121) | 0.223 | 0.114 |
| Talmud 1998 | — | — | 0.071 | 0.115 (0.114) | 0.1 | 0.125 (0.120) | 0.1 | 0.121 |
| Yang 2004 | 0.005 | 0.036 | −0.011 | 0.042 (0.039) | 0.021 | 0.039 (0.037) | 0.021 | 0.043 |
| Meta-analysis (random-effects, using Taylor SEs) | 0.047 | 0.022 | 0.154 | 0.050 | 0.091 | 0.030 | 0.091 | 0.021 |
Phet = P-value from a test for heterogeneity among effect estimates in the meta-analysis.
Results in italic are copied across from the available data when no transformations are possible. Main standard errors are calculated using the Taylor approximations; standard errors in parentheses are calculated using the ‘t-test’ methods.
Figure 1Epidemiological studies of D9N polymorphism in the lipoprotein lipase gene and triglyceride levels: meta-analyses on the raw triglyceride scale, using various conversions from the logarithmic to the raw scale. Where conversions are made, methods are ordered as follows: Method 1 (separate variances, t-test SE); Method 1 (separate variances, Taylor SE); Method 2 (equal variances, t-test SE); Method 2 (equal variances, Taylor SE); Method 3.
Figure 2Epidemiological studies of D9N polymorphism in the lipoprotein lipase gene and triglyceride levels: meta-analyses on the log triglyceride scale using various conversions from the raw to the logarithmic scale. Where conversions are made, methods are ordered as follows: Method 1 (separate variances, t-test SE); Method 1 (separate variances, Taylor SE); Method 2 (equal variances, t-test SE); Method 2 (equal variances, Taylor SE); Method 3.
Simulation results for sample size imbalance (log-normal distribution); 10 participants in group 1, 100 participants in group 2; 10 000 simulations in each set.
| Transformation raw to log scale (μ | Transformation log to raw scale (μ | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Set 9 | Set 10 | Set 11 | Set 12 | Set 9 | Set 10 | Set 11 | Set 12 | ||
| True means (group 1, group 2) | (0, 0) | (0, 0) | (0, 0.2) | (0, 0.2) | (1.65, 1.65) | (1.02, 1.02) | (1.65, 2.01) | (1.02, 1.25) | |
| True SDs (group 1, group 2) | (1, 1) | (0.2, 0.2) | (1, 1) | (0.2, 0.2) | (2.16, 2.16) | (0.21, 0.21) | (2.16, 2.64) | (0.21, 0.25) | |
| Bias | |||||||||
| Method 1 | −0.051 | 0.000 | −0.051 | 0.001 | −0.115 | −0.003 | −0.113 | −0.002 | |
| Method 2 | 0.072 | 0.001 | 0.070 | 0.002 | −0.067 | −0.003 | −0.069 | −0.002 | |
| Method 3 | 0.070 | 0.001 | 0.060 | 0.000 | −0.041 | −0.003 | −0.188 | −0.007 | |
| Standard error | |||||||||
| Method 1 | 0.269 | 0.064 | 0.269 | 0.064 | 0.878 | 0.067 | 0.897 | 0.069 | |
| Taylor | 0.688 | 0.064 | 0.676 | 0.064 | 0.776 | 0.067 | 0.794 | 0.069 | |
| Empirical | 0.356 | 0.066 | 0.357 | 0.067 | 0.791 | 0.069 | 0.800 | 0.070 | |
| Method 2 | 0.310 | 0.066 | 0.310 | 0.066 | 0.759 | 0.068 | 0.780 | 0.070 | |
| Taylor | 0.399 | 0.067 | 0.398 | 0.067 | 0.577 | 0.068 | 0.593 | 0.069 | |
| Empirical | 0.401 | 0.067 | 0.400 | 0.067 | 0.586 | 0.068 | 0.602 | 0.070 | |
| Method 3 | Taylor | 0.341 | 0.065 | 0.319 | 0.060 | 0.328 | 0.065 | 0.363 | 0.072 |
| Empirical | 0.385 | 0.067 | 0.380 | 0.066 | 0.349 | 0.066 | 0.358 | 0.068 | |
| Coverage | |||||||||
| Method 1 | 84.3 | 92.1 | 84.1 | 92.2 | 91.7 | 92.3 | 92.8 | 92.7 | |
| Taylor | 96.2 | 92.2 | 95.9 | 92.2 | 90.4 | 92.3 | 91.7 | 92.7 | |
| Method 2 | 86.0 | 94.5 | 86.5 | 94.3 | 97.7 | 95.1 | 97.7 | 95.0 | |
| Taylor | 93.8 | 94.8 | 93.6 | 94.5 | 93.9 | 94.8 | 94.7 | 94.7 | |
| Method 3 | Taylor | 85.9 | 92.0 | 85.1 | 90.4 | 92.5 | 92.4 | 93.3 | 94.6 |
Maximum Monte Carlo errors (raw to log) bias: 0.004; t-test standard error: 0.0005; Taylor standard error: 0.009; t-test coverage: 0.4 per cent; Taylor coverage: 0.4 per cent.
Maximum Monte Carlo errors (log to raw) bias: 0.008; t-test standard error: 0.007; Taylor standard error: 0.009; t-test coverage: 0.3 per cent; Taylor coverage: 0.3 per cent.
Simulation results for basic set: assumptions hold (log-normal distribution); 100 participants in each group; 10 000 simulations in each set.
| Transformation raw to log scale (μ | Transformation log to raw scale (μ | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Set 1 | Set 2 | Set 3 | Set 4 | Set 1 | Set 2 | Set 3 | Set 4 | ||
| True means (group 1, group 2) | (0, 0) | (0, 0) | (0, 0.2) | (0, 0.2) | (1.65, 1.65) | (1.02, 1.02) | (1.65, 2.01) | (1.02, 1.25) | |
| True SDs (group 1, group 2) | (1, 1) | (0.2, 0.2) | (1, 1) | (0.2, 0.2) | (2.16, 2.16) | (0.21, 0.21) | (2.16, 2.64) | (0.21, 0.25) | |
| Bias | |||||||||
| Method 1 | 0.000 | 0.000 | 0.002 | 0.000 | 0.004 | 0.000 | 0.007 | 0.000 | |
| Method 2 | 0.004 | 0.000 | 0.002 | 0.000 | 0.004 | 0.000 | 0.004 | 0.000 | |
| Method 3 | 0.004 | 0.000 | 0.000 | 0.000 | 0.003 | 0.000 | −0.143 | −0.004 | |
| Standard error | |||||||||
| Method 1 | 0.134 | 0.028 | 0.134 | 0.028 | 0.314 | 0.029 | 0.351 | 0.032 | |
| Taylor | 0.446 | 0.028 | 0.432 | 0.028 | 0.292 | 0.029 | 0.326 | 0.032 | |
| Empirical | 0.177 | 0.028 | 0.177 | 0.028 | 0.293 | 0.029 | 0.323 | 0.032 | |
| Method 2 | 0.134 | 0.028 | 0.134 | 0.028 | 0.310 | 0.029 | 0.346 | 0.032 | |
| Taylor | 0.172 | 0.028 | 0.172 | 0.028 | 0.236 | 0.029 | 0.264 | 0.032 | |
| Empirical | 0.185 | 0.029 | 0.182 | 0.028 | 0.235 | 0.029 | 0.265 | 0.032 | |
| Method 3 | Taylor | 0.178 | 0.029 | 0.178 | 0.029 | 0.142 | 0.028 | 0.156 | 0.031 |
| Empirical | 0.183 | 0.029 | 0.179 | 0.028 | 0.142 | 0.028 | 0.159 | 0.031 | |
| Coverage | |||||||||
| Method 1 | 87.2 | 95.1 | 86.8 | 94.6 | 97.4 | 94.9 | 97.4 | 94.8 | |
| Taylor | 99.7 | 95.1 | 99.8 | 94.6 | 96.1 | 94.9 | 96.0 | 94.8 | |
| Method 2 | 84.9 | 94.6 | 85.3 | 94.6 | 99.2 | 95.3 | 98.9 | 95.0 | |
| Taylor | 93.2 | 94.8 | 93.3 | 94.8 | 95.5 | 95.1 | 95.2 | 94.8 | |
| Method 3 | Taylor | 95.3 | 94.8 | 95.6 | 95.3 | 95.1 | 95.1 | 84.1 | 94.4 |
Maximum Monte Carlo errors (raw to log) bias: 0.002; t-test standard error: 0.0001; Taylor standard error: 0.007; t-test coverage: 0.4 per cent; Taylor coverage: 0.3 per cent.
Maximum Monte Carlo errors (log to raw) bias: 0.003; t-test standard error: 0.0006; Taylor standard error: 0.0004; t-test coverage: 0.2 per cent; Taylor coverage: 0.4 per cent.
Data generation and parameter values for all simulations. Parameter values for Z in sets 17–24 (in italics) are derived empirically.
| Log scale (Z) | Raw scale (X) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Group 1 | Group 2 | Group 1 | Group 2 | |||||||||
| Basic set: assumptions hold (log-normal distribution) | 1 | 100 | 100 | 0 | 1 | 0 | 1 | 1.65 | 2.16 | 1.65 | 2.16 | |
| 2 | 100 | 100 | 0 | 0.2 | 0 | 0.2 | 1.02 | 0.21 | 1.02 | 0.21 | ||
| equivalent to | 3 | 100 | 100 | 0 | 1 | 0.2 | 1 | 1.65 | 2.16 | 2.01 | 2.64 | |
| 4 | 100 | 100 | 0 | 0.2 | 0.2 | 0.2 | 0.2 | 0.21 | 1.25 | 0.25 | ||
| Small sample size | As above | 5 | 10 | 10 | As basic set | |||||||
| 6 | 10 | 10 | ||||||||||
| 7 | 10 | 10 | ||||||||||
| 8 | 10 | 10 | ||||||||||
| Sample size imbalance | As above | 9 | 10 | 100 | As basic set | |||||||
| 10 | 10 | 100 | ||||||||||
| 11 | 10 | 100 | ||||||||||
| 12 | 10 | 100 | ||||||||||
| Different standard deviations | As above | 13 | 100 | 100 | 0 | 1 | 0 | 0.5 | 1.65 | 2.16 | 1.13 | 0.60 |
| 14 | 100 | 100 | 0 | 0.2 | 0 | 0.1 | 1.02 | 0.21 | 1.01 | 0.10 | ||
| 15 | 100 | 100 | 0 | 1 | 0.2 | 0.5 | 1.65 | 2.16 | 1.38 | 0.74 | ||
| 16 | 100 | 100 | 0 | 0.2 | 0.2 | 0.1 | 0.2 | 0.21 | 1.23 | 0.12 | ||
| Alternative skew (gamma distribution) | 17 | 100 | 100 | − | − | As basic set | ||||||
| 18 | 100 | 100 | − | − | ||||||||
| 19 | 100 | 100 | − | − | ||||||||
| 20 | 100 | 100 | − | |||||||||
| No skew (normal distribution, positive values only) | 21 | 100 | 100 | As basic set | ||||||||
| rejected and replaced if | 22 | 100 | 100 | − | − | |||||||
| 23 | 100 | 100 | ||||||||||
| 24 | 100 | 100 | − | |||||||||
Figure 3Probability densities of all distributions used in the simulation study. Solid lines represent group 1 and dotted lines group 2.
Simulation results for different standard deviations (log-normal distribution); 100 participants in each group; 10 000 simulations in each set.
| Transformation raw to log scale (μ | Transformation log to raw scale (μ | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Set 13 | Set 14 | Set 15 | Set 16 | Set 13 | Set 14 | Set 15 | Set 16 | ||
| True means (group 1, group 2) | (0, 0) | (0, 0) | (0, 0.2) | (0, 0.2) | (1.65, 1.13) | (1.02, 1.01) | (1.65, 1.38) | (1.02, 1.23) | |
| True SDs (group 1, group 2) | (1, 0.5) | (0.2, 0.1) | (1, 0.5) | (0.2, 0.1) | (2.16, 0.60) | (0.21, 0.10) | (2.16, 0.74) | (0.21, 0.12) | |
| Bias | |||||||||
| Method 1 | −0.037 | −0.000 | −0.037 | 0.001 | −0.008 | −0.000 | −0.009 | 0.001 | |
| Method 2 | −0.366 | −0.014 | −0.367 | −0.014 | 0.513 | 0.015 | 0.565 | 0.018 | |
| Method 3 | −0.360 | −0.014 | −0.366 | −0.014 | 0.513 | 0.015 | 0.483 | 0.015 | |
| Standard error | |||||||||
| Method 1 | 0.107 | 0.022 | 0.107 | 0.022 | 0.228 | 0.023 | 0.232 | 0.024 | |
| Taylor | 0.284 | 0.022 | 0.288 | 0.022 | 0.213 | 0.023 | 0.218 | 0.024 | |
| Empirical | 0.135 | 0.022 | 0.135 | 0.023 | 0.215 | 0.023 | 0.218 | 0.024 | |
| Method 2 | 0.107 | 0.022 | 0.107 | 0.022 | 0.181 | 0.023 | 0.203 | 0.025 | |
| Taylor | 0.125 | 0.022 | 0.125 | 0.022 | 0.154 | 0.023 | 0.172 | 0.025 | |
| Empirical | 0.140 | 0.023 | 0.140 | 0.023 | 0.154 | 0.023 | 0.162 | 0.024 | |
| Method 3 | Taylor | 0.152 | 0.023 | 0.142 | 0.021 | 0.112 | 0.022 | 0.123 | 0.025 |
| Empirical | 0.135 | 0.023 | 0.138 | 0.022 | 0.112 | 0.022 | 0.117 | 0.024 | |
| Coverage | |||||||||
| Method 1 | 86.5 | 95.0 | 86.5 | 94.4 | 95.5 | 95.0 | 95.8 | 94.4 | |
| Taylor | 98.0 | 95.0 | 98.1 | 94.4 | 94.5 | 95.0 | 94.9 | 94.4 | |
| Method 2 | 11.7 | 89.4 | 11.0 | 90.2 | 17.9 | 89.4 | 16.7 | 90.1 | |
| Taylor | 17.5 | 89.6 | 16.7 | 90.4 | 10.9 | 89.2 | 9.8 | 89.8 | |
| Method 3 | Taylor | 28.3 | 90.1 | 21.6 | 88.6 | 1.0 | 89.1 | 3.4 | 91.4 |
Maximum Monte Carlo errors (raw to log) bias: 0.001; t-test standard error: 0.0001; Taylor standard error: 0.004; t-test coverage: 0.3 per cent; Taylor coverage: 0.5 per cent.
Maximum Monte Carlo errors (log to raw) bias: 0.002; t-test standard error: 0.0005; Taylor standard error: 0.0004; t-test coverage: 0.4 per cent; Taylor coverage: 0.3 per cent.