| Literature DB >> 33932323 |
Ozan Cinar1, James Umbanhowar2, Jason D Hoeksema3, Wolfgang Viechtbauer1.
Abstract
Meta-regression can be used to examine the association between effect size estimates and the characteristics of the studies included in a meta-analysis using regression-type methods. By searching for those characteristics (i.e., moderators) that are related to the effect sizes, we seek to identify a model that represents the best approximation to the underlying data generating mechanism. Model selection via testing, either through a series of univariate models or a model including all moderators, is the most commonly used approach for this purpose. Here, we describe alternative model selection methods based on information criteria, multimodel inference, and relative variable importance. We demonstrate their application using an illustrative example and present results from a simulation study to compare the performance of the various model selection methods for identifying the true model across a wide variety of conditions. Whether information-theoretic approaches can also be used not only in combination with maximum likelihood (ML) but also restricted maximum likelihood (REML) estimation was also examined. The results indicate that the conventional methods for model selection may be outperformed by information-theoretic approaches. The latter are more often among the set of best methods across all of the conditions simulated and can have higher probabilities for identifying the true model under particular scenarios. Moreover, their performance based on REML estimation was either very similar to that from ML estimation or at times even better depending on how exactly the REML likelihood was computed. These results suggest that alternative model selection methods should be more widely applied in meta-regression.Entities:
Keywords: information criteria; meta-analysis; meta-regression; model selection; multimodel inference
Mesh:
Year: 2021 PMID: 33932323 PMCID: PMC8359854 DOI: 10.1002/jrsm.1489
Source DB: PubMed Journal: Res Synth Methods ISSN: 1759-2879 Impact factor: 5.273
Correlation matrix (phi coefficients) of the four dichotomous moderator variables
| FUN | FP | FN | STER | |
|---|---|---|---|---|
| FUN | 1 | −0.14 | 0.53 | 0.27 |
| FP | −0.14 | 1 | 0.31 | 0.01 |
| FN | 0.53 | 0.31 | 1 | 0.53 |
| STER | 0.27 | 0.01 | 0.53 | 1 |
Value of the AICc (based on ll REMLr) for the 16 models fitted to the data examining the influence of mycorrhizal inoculation on plant biomass
| Model | Moderator(s) | AICc |
|
| |
|---|---|---|---|---|---|
| 1 | FN + FP | 203.210 | 0.47 | −1.00 | 0.207 |
| 2 | STER + FN + FP | 205.161 | 0.18 | −0.97 | 0.209 |
| 3 | FUN + FN + FP | 205.507 | 0.15 | −0.92 | 0.224 |
| 4 | FUN + STER + FP | 207.540 | 0.05 | −0.71 | 0.206 |
| 5 | FUN + FP | 207.566 | 0.05 | −0.66 | 0.208 |
| 6 | FUN + STER + FN + FP | 207.611 | 0.05 | −0.90 | 0.226 |
| 7 | STER + FP | 208.534 | 0.03 | −0.75 | 0.209 |
| 8 | FP | 212.524 | 0.00 | −0.71 | 0.220 |
| 9 | FUN | 213.047 | 0.00 | 0.00 | 0.000 |
| 10 | FUN + STER | 214.157 | 0.00 | 0.00 | 0.000 |
| 11 | FUN + FN | 215.739 | 0.00 | 0.00 | 0.000 |
| 12 | STER | 216.281 | 0.00 | 0.00 | 0.000 |
| 13 | FUN + STER + FN | 216.765 | 0.00 | 0.00 | 0.000 |
| 14 | FN | 217.803 | 0.00 | 0.00 | 0.000 |
| 15 | STER + FN | 218.329 | 0.00 | 0.00 | 0.000 |
| 16 | – | 218.485 | 0.00 | 0.00 | 0.000 |
| Avg | −0.93 | 0.248 |
Estimated model coefficient for moderator FP from the model including all moderators simultaneously.
Estimated coefficient for moderator FP from the univariate model.
Model‐averaged parameter estimate (and corresponding standard error) for moderator FP.
Results for testing each moderator in a series of univariate models, in the full model, when using multimodel inference, and their relative variance importance (RVI) values based on all 16 models
| Univariate testing | Full model testing | Multimodel inference | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| RVI | |
| FUN | 0.68 | 0.222 |
| 0.23 | 0.270 | 0.84 | 0.11 | 0.236 | 0.47 | 0.31 |
| FP | −0.71 | 0.220 |
| −0.90 | 0.226 |
| −0.93 | 0.248 |
|
|
| FN | 0.49 | 0.248 |
| 0.57 | 0.342 | 1.66 | 0.68 | 0.388 | 1.76 |
|
| STER | 0.64 | 0.277 |
| 0.28 | 0.298 | 0.93 | 0.12 | 0.248 | 0.47 | 0.32 |
Note: Test statistics that exceed ±1.96 and RVI values that exceed 0.5 are shown in bold.
FIGURE 1Kernel density plots of the probabilities of identifying the true model across all 12,000 conditions for each of the methods. Gray lines represent conditions where the true model was the empty model (i.e., β = 0), whereas black lines represent conditions where there was a non‐zero association between the moderators and the effect sizes (i.e., β > 0). The triangles (gray triangle and black triangle) indicate the corresponding median probabilities
FIGURE 2Proportion of conditions where each method was among the best methods (i.e., no worse than five percentage points than the best method). Gray lines represent conditions where the true model was the empty model (i.e., β = 0), whereas black lines represent conditions where there was a non‐zero association between the moderators and the effect sizes (i.e., β > 0)
η2 values derived from the two‐way ANOVA predicting the probabilities of selecting the true model based on the model selection method, all design factors, and their two‐way interactions
| Method | 0.27 | |||||||
|
| 0.10 | 0.47 | ||||||
|
| 0.07 | 0.01 | 0.42 | |||||
|
| 0.05 | 0.00 | 0.00 | 0.18 | ||||
|
| 0.51 | 0.23 | 0.17 | 0.10 | 0.85 | |||
|
| 0.02 | 0.00 | 0.07 | 0.00 | 0.06 | 0.30 | ||
|
| 0.04 | 0.00 | 0.00 | 0.00 | 0.02 | 0.00 | 0.24 | |
|
| 0.01 | 0.01 | 0.00 | 0.00 | 0.02 | 0.00 | 0.02 | 0.05 |
| Method |
|
|
|
|
|
|
|
Note: The diagonal and off‐diagonal values display the η 2 values of the main effects and two‐way interactions, respectively.
FIGURE 3Probabilities of identifying the true model of six model selection methods as a function of β and k. The methods shown are selection via univariate and full model testing and the information‐theoretic approaches using the AICc criterion combined with the ll REMLr function. Each line represents the probabilities of a model selection method averaged over the remaining factors. (a) k = 20. (b) k = 40. (c) k = 60. (d) k = 80 [Colour figure can be viewed at wileyonlinelibrary.com]