| Literature DB >> 28388646 |
Robbie C M van Aert1, Marcel A L M van Assen1,2.
Abstract
The vast majority of published results in the literature is statistically significant, which raises concerns about their reliability. The Reproducibility Project Psychology (RPP) and Experimental Economics Replication Project (EE-RP) both replicated a large number of published studies in psychology and economics. The original study and replication were statistically significant in 36.1% in RPP and 68.8% in EE-RP suggesting many null effects among the replicated studies. However, evidence in favor of the null hypothesis cannot be examined with null hypothesis significance testing. We developed a Bayesian meta-analysis method called snapshot hybrid that is easy to use and understand and quantifies the amount of evidence in favor of a zero, small, medium and large effect. The method computes posterior model probabilities for a zero, small, medium, and large effect and adjusts for publication bias by taking into account that the original study is statistically significant. We first analytically approximate the methods performance, and demonstrate the necessity to control for the original study's significance to enable the accumulation of evidence for a true zero effect. Then we applied the method to the data of RPP and EE-RP, showing that the underlying effect sizes of the included studies in EE-RP are generally larger than in RPP, but that the sample sizes of especially the included studies in RPP are often too small to draw definite conclusions about the true effect size. We also illustrate how snapshot hybrid can be used to determine the required sample size of the replication akin to power analysis in null hypothesis significance testing and present an easy to use web application (https://rvanaert.shinyapps.io/snapshot/) and R code for applying the method.Entities:
Mesh:
Year: 2017 PMID: 28388646 PMCID: PMC5384677 DOI: 10.1371/journal.pone.0175302
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Probability density functions of the replication (panel a) and transformed original effect size when statistical significance is taken into account (panel b).
The four hypothesized effect sizes (zero, small, medium, and large) are denoted by θ = 0 (0, red distribution), θ = 0.1 (S, blue distribution), θ = 0.31 (M, yellow distribution), and θ = 0.549 (L, green distribution). The dashed vertical line refers to the observed effect sizes in the hypothetical example of Maxwell et al. [8] for the replication (panel a) and original study (panel b). The dots on the vertical dashed line refer to densities for no, small, medium, and large effect in the population.
Likelihoods and posterior model probabilities for zero, small, medium, and large hypothesized correlations for the example of Maxwell et al. [8].
| Prior | Method | Zero (ρS = 0) | Small (ρS = 0.1) | Medium (ρS = 0.3) | Large (ρS = 0.5) | |
|---|---|---|---|---|---|---|
| Likelihood | 22.594 | 55.304 | 0.819 | 0 | ||
| Posterior model probabilities | Uniform | Snapshot hybrid | .287 | .703 | .010 | 0 |
| Snapshot naïve | .063 | .866 | .071 | 0 | ||
| Snapshot hybrid | .446 | .546 | .008 | 0 | ||
| Snapshot hybrid | .707 | .289 | .004 | 0 | ||
| Snapshot hybrid | .288 | .702 | .010 | 0 |
Posterior model probabilities of one snapshot (ρS) relative to the others are calculated with the snapshot hybrid (second row and last three rows) and without correcting for statistical significance (snapshot naïve, third row). For snapshot hybrid, posterior model probabilities are calculated for four different sets of prior model probabilities; equal prior model probabilities (i.e., uniform encompassing model), prior model probabilities where the hypothesized zero effect gets a weight (p) 2 or 6 times higher than the other hypothesized effects, and prior model probabilities when a normal distribution with mean and variance equal to 0 and 1 is the encompassing model, respectively.
Expected values of the posterior model probabilities of the snapshot hybrid, snapshot naïve, and traditional fixed-effect meta-analysis (FE).
| Snapshot Hybrid | Snapshot Naïve | FE | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| ρS = 0 | ρS = 0.1 | ρS = 0.3 | ρS = 0.5 | ρS = 0 | ρS = 0.1 | ρS = 0.3 | ρS = 0.5 | |||
| ρ = 0 | 31 | 0.36 | 0.151 | 0.023 | 0.336 | 0.411 | 0.076 | 0.215 | ||
| 55 | 0.375 | 0.089 | 0.002 | 0.479 | 0.304 | 0.005 | 0.16 | |||
| 96 | 0.368 | 0.03 | 0 | 0.648 | 0.112 | 0 | 0.12 | |||
| 300 | 0.243 | 0 | 0 | 0.662 | 0 | 0 | 0.068 | |||
| 1,000 | 0.052 | 0 | 0 | 0.242 | 0 | 0 | 0.037 | |||
| ρ = 0.1 | 31 | 0.36 | 0.231 | 0.057 | 0.11 | 0.485 | 0.147 | 0.268 | ||
| 55 | 0.375 | 0.211 | 0.011 | 0.111 | 0.501 | 0.021 | 0.215 | |||
| 96 | 0.368 | 0.15 | 0 | 0.101 | 0.347 | 0 | 0.177 | |||
| 300 | 0.243 | 0.012 | 0 | 0.04 | 0.02 | 0 | 0.127 | |||
| 1,000 | 0.052 | 0 | 0 | 0.007 | 0 | 0 | 0.103 | |||
| ρ = 0.3 | 31 | 0.151 | 0.231 | 0.25 | 0.027 | 0.099 | 0.417 | 0.381 | ||
| 55 | 0.089 | 0.211 | 0.178 | 0.012 | 0.089 | 0.236 | 0.337 | |||
| 96 | 0.03 | 0.15 | 0.082 | 0.003 | 0.066 | 0.089 | 0.312 | |||
| 300 | 0 | 0.012 | 0.003 | 0 | 0.008 | 0.003 | 0.3 | |||
| 1,000 | 0 | 0 | 0 | 0 | 0 | 0 | 0.3 | |||
| ρ = 0.5 | 31 | 0.023 | 0.058 | 0.25 | 0.002 | 0.014 | 0.188 | 0.516 | ||
| 55 | 0.002 | 0.011 | 0.178 | 0 | 0.002 | 0.146 | 0.499 | |||
| 96 | 0 | 0 | 0.082 | 0 | 0 | 0.075 | 0.498 | |||
| 300 | 0 | 0 | 0.003 | 0 | 0 | 0.003 | 0.499 | |||
| 1,000 | 0 | 0 | 0 | 0 | 0 | 0 | 0.5 | |||
ρ denotes the effect size in the population, N is the sample size in the original study and replication, and ρS refers to the snapshots of effect size.
Probability of posterior model probability larger than .25 of snapshot hybrid and snapshot naïve.
| Snapshot Hybrid | Snapshot Naïve | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| ρS = 0 | ρS = 0.1 | ρS = 0.3 | ρS = 0.5 | ρS = 0 | ρS = 0.1 | ρS = 0.3 | ρS = 0.5 | ||
| ρ = 0 | 31 | 0.914 | 0.22 | 0.017 | 0.725 | 0.757 | 0.082 | ||
| 55 | 0.84 | 0.107 | 0.001 | 0.891 | 0.473 | 0.001 | |||
| 96 | 0.705 | 0.03 | 0 | 0.97 | 0.146 | 0 | |||
| 300 | 0.359 | 0 | 0 | 0.882 | 0 | 0 | |||
| 1,000 | 0.065 | 0 | 0 | 0.321 | 0 | 0 | |||
| ρ = 0.1 | 31 | 0.687 | 0.417 | 0.063 | 0.123 | 0.885 | 0.206 | ||
| 55 | 0.686 | 0.322 | 0.007 | 0.132 | 0.758 | 0.015 | |||
| 96 | 0.654 | 0.206 | 0 | 0.103 | 0.495 | 0 | |||
| 300 | 0.36 | 0.015 | 0 | 0.033 | 0.024 | 0 | |||
| 1,000 | 0.067 | 0 | 0 | 0.007 | 0 | 0 | |||
| ρ = 0.3 | 31 | 0.233 | 0.483 | 0.385 | 0.009 | 0.119 | 0.656 | ||
| 55 | 0.114 | 0.391 | 0.252 | 0.003 | 0.109 | 0.338 | |||
| 96 | 0.027 | 0.234 | 0.104 | 0 | 0.081 | 0.114 | |||
| 300 | 0 | 0.015 | 0.003 | 0 | 0.009 | 0.003 | |||
| 1,000 | 0 | 0 | 0 | 0 | 0 | 0 | |||
| ρ = 0.5 | 31 | 0.014 | 0.062 | 0.471 | 0 | 0.004 | 0.292 | ||
| 55 | 0 | 0.007 | 0.265 | 0 | 0 | 0.204 | |||
| 96 | 0 | 0 | 0.105 | 0 | 0 | 0.096 | |||
| 300 | 0 | 0 | 0.003 | 0 | 0 | 0.003 | |||
| 1,000 | 0 | 0 | 0 | 0 | 0 | 0 | |||
ρ denotes the effect size in the population, N is the sample size in the original study and replication, and ρS refers to the snapshots of effect size. Note that the sum of probabilities across four snapshots is sometimes larger than 1, because posterior model probabilities can be larger than .25 for more than snapshot.
Probability of posterior model probability larger than .75 of snapshot hybrid and snapshot naïve.
| Snapshot Hybrid | Snapshot Naïve | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| ρS = 0 | ρS = 0.1 | ρS = 0.3 | ρS = 0.5 | Inconcl. | ρS = 0 | ρS = 0.1 | ρS = 0.3 | ρS = 0.5 | Inconcl. | ||
| ρ = 0 | 31 | 0 | 0 | 0 | 0.96 | 0 | 0 | 0.002 | 0.998 | ||
| 55 | 0 | 0.003 | 0 | 0.855 | 0 | 0.085 | 0 | 0.914 | |||
| 96 | 0 | 0.003 | 0 | 0.706 | 0.343 | 0.021 | 0 | 0.628 | |||
| 300 | 0.061 | 0 | 0 | 0.298 | 0.487 | 0 | 0 | 0.395 | |||
| 1,000 | 0.016 | 0 | 0 | 0.049 | 0.132 | 0 | 0 | 0.189 | |||
| ρ = 0.1 | 31 | 0.01 | 0 | 0.002 | 0.988 | 0 | 0 | 0.012 | 0.988 | ||
| 55 | 0.033 | 0.017 | 0 | 0.95 | 0 | 0.263 | 0.001 | 0.736 | |||
| 96 | 0.057 | 0.043 | 0 | 0.9 | 0 | 0.165 | 0 | 0.547 | |||
| 300 | 0.065 | 0.004 | 0 | 0.306 | 0.001 | 0.007 | 0 | 0.049 | |||
| 1,000 | 0.018 | 0 | 0 | 0.049 | 0.001 | 0 | 0 | 0.006 | |||
| ρ = 0.3 | 31 | 0 | 0 | 0.06 | 0.94 | 0 | 0 | 0.157 | 0.843 | ||
| 55 | 0 | 0 | 0.05 | 0.835 | 0 | 0 | 0.077 | 0.41 | |||
| 96 | 0 | 0 | 0.025 | 0.33 | 0 | 0.006 | 0.029 | 0.163 | |||
| 300 | 0 | 0.005 | 0.001 | 0.012 | 0 | 0.003 | 0.001 | 0.008 | |||
| 1,000 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |||
| ρ = 0.5 | 31 | 0 | 0 | 0 | 0.502 | 0 | 0 | 0 | 0.301 | ||
| 55 | 0 | 0 | 0.018 | 0.25 | 0 | 0 | 0.034 | 0.17 | |||
| 96 | 0 | 0 | 0.027 | 0.078 | 0 | 0 | 0.024 | 0.072 | |||
| 300 | 0 | 0 | 0.001 | 0.002 | 0 | 0 | 0.001 | 0.002 | |||
| 1,000 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |||
ρ denotes the effect size in the population, N is the sample size in the original study and replication, and ρS refers to snapshots of effect size. The columns “Inconcl.” indicate the probability of observing inconclusive results (i.e., none of the posterior model probabilities for the hypothesized effect sizes was larger than .75)
Average posterior model probabilities for the study-pairs in EE-RP and RPP for snapshot hybrid and snapshot naïve at four different snapshots (ρS = 0; 0.1; 0.3; 0.5).
| ρS | |||||
|---|---|---|---|---|---|
| 0 | 0.1 | 0.3 | 0.5 | ||
| Snapshot Hybrid | EE-RP | 0.084 | 0.137 | 0.34 | 0.44 |
| RPP | 0.293 | 0.234 | 0.217 | 0.256 | |
| Snapshot Naïve | EE-RP | 0.03 | 0.165 | 0.361 | 0.444 |
| RPP | 0.126 | 0.285 | 0.267 | 0.321 | |
Proportions of how often the posterior model probability is larger than .25 for the study-pairs in EE-RP and RPP for snapshot hybrid and snapshot naïve at seven different snapshots (ρS = 0; 0–0.1; 0.1; 0.1–0.3; 0.3; 0.3–0.5; 0.5).
| ρS | ||||||||
|---|---|---|---|---|---|---|---|---|
| 0 | 0–0.1 | 0.1 | 0.1–0.3 | 0.3 | 0.3–0.5 | 0.5 | ||
| Snapshot Hybrid | EE-RP | 0 | 0.125 | 0.062 | 0.062 | 0.312 | 0 | 0.438 |
| RPP | 0.134 | 0.284 | 0.045 | 0.119 | 0.06 | 0.164 | 0.194 | |
| Snapshot Naïve | EE-RP | 0 | 0.062 | 0.125 | 0 | 0.375 | 0 | 0.438 |
| RPP | 0.015 | 0.194 | 0.149 | 0.06 | 0.164 | 0.179 | 0.239 | |
Proportions of how often the posterior model probability is larger than .75 for the study-pairs in EE-RP and RPP for snapshot hybrid and snapshot naïve at four different snapshots (ρS = 0; 0.1; 0.3; 0.5) and how often none of the posterior model probabilities is larger than .75 (Inconclusive results, final column).
| ρS | ||||||
|---|---|---|---|---|---|---|
| 0 | 0.1 | 0.3 | 0.5 | Inconcl. | ||
| Snapshot Hybrid | EE-RP | 0 | 0.062 | 0.312 | 0.438 | 0.188 |
| RPP | 0.134 | 0.030 | 0.045 | 0.164 | 0.627 | |
| Snapshot Naïve | EE-RP | 0 | 0.125 | 0.375 | 0.438 | 0.062 |
| RPP | 0.015 | 0.119 | 0.104 | 0.239 | 0.522 | |
Proportions of how often the posterior model probability is larger than .75 for the study-pairs in RPP grouped by social and cognitive psychological studies for snapshot hybrid and snapshot naïve at four different snapshots (ρS = 0; 0.1; 0.3; 0.5) and how often none of the posterior model probabilities is larger than .75 (Inconclusive results, final column).
| ρS | ||||||
|---|---|---|---|---|---|---|
| 0 | 0.1 | 0.3 | 0.5 | Inconcl. | ||
| Snapshot Hybrid | Social | 0.235 | 0.059 | 0 | 0.118 | 0.588 |
| Cognitive | 0.030 | 0 | 0.091 | 0.212 | 0.667 | |
| Snapshot Naïve | Social | 0.029 | 0.176 | 0.059 | 0.118 | 0.618 |
| Cognitive | 0 | 0.061 | 0.152 | 0.364 | 0.424 | |
Required sample size computed with snapshot hybrid based on characteristics of the original study as described in Maxwell et al. [8]; r = .243 and N = 80.
| With original study | Without original study | |
|---|---|---|
| ρS = 0 | 587 | 645 |
| ρS = 0.1 | 709 | 664 |
| ρS = 0.3 | 223 | 215 |
| ρS = 0.5 | 284 | 116 |
Sample size was computed with snapshot hybrid for a desired posterior model probability of a = 0.75 and the desired probability of observing a posterior model probability larger than a was b = 0.8. The hypothesized effect size was equal to ρS = 0 (no effect), 0.1 (small), 0.3 (medium), and 0.5 (large). The penultimate column refers to the required sample size where information of the original study is included and the last column where this information is excluded.