| Literature DB >> 35396714 |
Erik W van Zwet1, Steven N Goodman2,3,4.
Abstract
We use information derived from over 40K trials in the Cochrane Collaboration database of systematic reviews (CDSR) to compute the replication probability, or predictive power of an experiment given its observed (two-sided) P $$ P $$ -value. We find that an exact replication of a marginally significant result with P = . 05 $$ P=.05 $$ has less than 30% chance of again reaching significance. Moreover, the replication of a result with P = . 005 $$ P=.005 $$ still has only 50% chance of significance. We also compute the probability that the direction (sign) of the estimated effect is correct, which is closely related to the type S error of Gelman and Tuerlinckx. We find that if an estimated effect has P = . 05 $$ P=.05 $$ , there is a 93% probability that its sign is correct. If P = . 005 $$ P=.005 $$ , then that probability is 99%. Finally, we compute the required sample size for a replication study to achieve some specified power conditional on the p $$ p $$ -value of the original study. We find that the replication of a result with P = . 05 $$ P=.05 $$ requires a sample size more than 16 times larger than the original study to achieve 80% power, while P = . 005 $$ P=.005 $$ requires at least 3.5 times larger sample size. These findings confirm that failure to replicate the statistical significance of a trial does not necessarily indicate that the original result was a fluke.Entities:
Keywords: Cochrane Review; actual power; clinical trial; predictive power; type S error
Mesh:
Year: 2022 PMID: 35396714 PMCID: PMC9325423 DOI: 10.1002/sim.9406
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.497
FIGURE 1Left panel: The empirical distribution of 45 955 ‐values from the Cochrane Database of Systematic Reviews (CDSR) together with the fitted mixture distribution of four normal components. Also shown (in grey) the distribution of the signal‐to‐noise ratio which was obtained by deconvolution. Right panel: Histogram representing the distribution of the actual power across the CDSR
Mixture distributions of the ‐value and the signal‐to‐noise ratio (SNR) estimated from 45 955 ‐values from the Cochrane Database of Systematic Reviews (CDSR)
| comp.1 | comp.2 | comp.3 | comp.4 | |
|---|---|---|---|---|
| Proportions | 0.33 | 0.31 | 0.30 | 0.06 |
| Means |
|
|
|
|
| SD of the | 1.27 | 1.60 | 2.57 | 5.94 |
| SD of the SNR | 0.78 | 1.25 | 2.37 | 5.85 |
Columns 1 and 2: The (two‐sided) ‐value and (absolute) ‐value of the original study. Columns 3 and 4: Predictive power, that is, the probability that an exact replication study is significant and the sign of the estimated effect is the same as in the original study. Columns 5 and 6: Probability that the signs of the estimated effect in the original study and the replication study are the same. Columns 7 and 8: Probability that the sign of the estimated effect of the original study is correct. All probabilities are reported assuming either the uniform prior or the prior estimated from the Cochrane data
| Predictive power | Replication of the sign | Original sign correct | |||||
|---|---|---|---|---|---|---|---|
|
|
| uniform prior | Cochrane | uniform prior | Cochrane | uniform prior | Cochrane |
| .5 | 0.67 | 0.18 | 0.11 | 0.68 | 0.62 | 0.75 | 0.69 |
| .3 | 1.04 | 0.26 | 0.15 | 0.77 | 0.68 | 0.85 | 0.78 |
| .1 | 1.64 | 0.41 | 0.23 | 0.88 | 0.78 | 0.95 | 0.90 |
| .05 | 1.96 | 0.50 | 0.29 | 0.92 | 0.83 | 0.98 | 0.93 |
| .03 | 2.17 | 0.56 | 0.34 | 0.94 | 0.86 | 0.98 | 0.95 |
| .01 | 2.58 | 0.67 | 0.44 | 0.97 | 0.90 | 1.00 | 0.98 |
| .005 | 2.81 | 0.73 | 0.50 | 0.98 | 0.92 | 1.00 | 0.99 |
| .001 | 3.29 | 0.83 | 0.64 | 0.99 | 0.96 | 1.00 | 1.00 |
FIGURE 2Predictive power based on the uniform prior and the Cochrane prior. We show the post hoc power for comparison only
FIGURE 3Left panel: Probability that the sign of the original estimate is the same as the replication estimate, given the absolute ‐value of the original trial. Right panel: Probability that the sign of the original estimate is correct, given the absolute ‐value. All probabilities are shown assuming either the uniform prior or the prior estimated from the Cochrane data
FIGURE 4Sample size multiplier that is needed to reach a particular probability of a statistically significant replication (in the same direction), given the absolute ‐value of the original study. Note the logarithmic scale of the ‐axis
Multiplier for the sample size of the original study, to have 50%, 80%, or 90% actual power of the replication study. It is not always possible to reach the required power, which is then indicated by “np”
|
|
| Multiplier 50% | Multiplier 80% | Multiplier 90% |
|---|---|---|---|---|
| .500 | 0.67 | 26.8 | np | np |
| .300 | 1.04 | 10.9 | np | np |
| .100 | 1.64 | 3.9 | 41.7 | np |
| .050 | 1.96 | 2.6 | 16.3 | 133.7 |
| .030 | 2.17 | 2.0 | 10.2 | 45.0 |
| .010 | 2.58 | 1.3 | 5.0 | 13.4 |
| .005 | 2.81 | 1.0 | 3.6 | 8.3 |
| .001 | 3.29 | 0.6 | 1.9 | 3.8 |