| Literature DB >> 29204324 |
Abstract
BACKGROUND: Publication bias is a form of scientific misconduct. It threatens the validity of research results and the credibility of science. Although several tests on publication bias exist, no in-depth evaluations are available that examine which test performs best for different research settings.Entities:
Keywords: Caliper test; Egger’s test; FAT; Monte carlo simulation; Publication bias; Statistics; Test for excess significance; p-uniform
Year: 2017 PMID: 29204324 PMCID: PMC5712469 DOI: 10.7717/peerj.4115
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Publication bias tests in comparison.
compares the four evaluated publication bias tests in respect to four criteria, the measurement level, the sample used by the test, its underlying assumptions and its limitations.
| Test | Measurement level | Sample | Assumption | Limitation |
|---|---|---|---|---|
| FAT | Continuous [ −∞, ∞] | All | Cov(es, se) = 0 | Only one-sided publication bias (PB) detectable |
| PU | Continuous [0, 1] | Uniform or right skewed | Only one-sided PB detectable | |
| TES | Dichotomous [0, 1] | All | E = O | Only on prespecified levels |
| CT | Dichotomous [0, 1] | Threshold ± caliper width | P(UC) = P(OC) | Only on prespecified levels |
Data generating process (DGP) of Monte Carlo simulation.
The 100 conditions of the Monte Carlo simulations are described. Two different aspects were varied: the underlying data and the publication bias behaviour of the actors. For the underlying data the true effect size, the number of observations (N) and the number of studies included in the meta-analysis (K) were varied. The behavioural component altered the proportions of authors who are willing to commit publication bias and its actual form as either p-hacking or file-drawer.
| Conditions | Values | Functional form | |
|---|---|---|---|
| 1. True effects | |||
| 2. Number of observations | μ | | | |
| 3. Number of studies | |||
| 4. Publication bias (PB) | |||
| 4.1. | Draw new sample size | ||
| 4.2. | Run new analyses with same dataset | ||
Risk factors for publication and its impact on bias in the simulated data (OLS regression).
The first two columns in Table 3 show that actual committed publication bias behaviour depended largely on the opportunity structure of the underlying data. Despite the defined 50% or 100% willingness of the authors to commit publication bias, only those actors who face insignificant effects (caused by small effects and sample sizes) engaged in publication bias practices. The success of publication bias in terms of significant results is shown in column three, dependent on the opportunity structure and form of publication bias. Conditions under p-hacking were slightly less effective in obtaining significant results than conditions under file-drawer publication bias. Column four shows the deflating impact of publication bias on meta-analytic p-values. For an average publication bias this p-values halved or even quartered.
| Publication bias committed (50% intention) | Publication bias committed (100% intention) | Publication bias successful (in relation to committed) | Deflation of | |
|---|---|---|---|---|
| −0.105 | −0.211 | 0.179 | ||
| ( | (0.000) | (0.001) | (0.001) | |
| −0.196 | −0.391 | 0.471 | ||
| ( | (0.001) | (0.001) | (0.001) | |
| −0.389 | −0.777 | 0.513 | ||
| (0.001) | (0.001) | (0.001) | ||
| −0.451 | −0.899 | 0.503 | ||
| (0.001) | (0.001) | (0.002) | ||
| −0.319 | −0.636 | 0.358 | ||
| (0.001) | (0.001) | (0.001) | ||
| p-hacking | −0.100 | 0.197 | ||
| (0.001) | (0.002) | |||
| Committed PB [ +10ppts] | −0.018 | |||
| ( | (0.001) | |||
| Successful PB [+10ppts] | −0.077 | |||
| ( | (0.001) | |||
| Constant | 0.541 | 1.080 | 0.313 | 0.225 |
| (0.000) | (0.001) | (0.001) | (0.002) | |
| Observations | 61,760 | 61,760 | 115,843 | 123,520 |
| 0.939 | 0.958 | 0.648 | 0.168 |
Notes.
Standard errors in parentheses.
p < 0.05.
p < 0.01.
p < 0.001.
Conditional false positive rates of the publication bias tests (OLS regression).
Table 4 displays the false positive rates of the publication bias tests conditional on the number of studies included in the meta-analysis (K) as well as the between study heterogeneity (I2). The FAT had the most consistent false positive rate. The 15% CT missed the 5%-level clearly while the 10% CT showed a large variability and gets close to it. The 10% and 15% CT are therefore problematic because they may suffer from inflated false positive rates.
| PU | FAT | TES | 3% CT | 5% CT | 10% CT | 15% CT | |
|---|---|---|---|---|---|---|---|
| −0.005 | 0.000 | 0.006 | 0.026 | 0.023 | 0.026 | 0.050 | |
| (ref. | (0.001) | (0.002) | (0.001) | (0.001) | (0.001) | (0.001) | (0.002) |
| I2 [ +10 percentage points] | −0.003 | −0.001 | −0.001 | 0.000 | 0.000 | −0.001 | −0.003 |
| (0.000) | (0.000) | (0.000) | (0.000) | (0.000) | (0.000) | (0.000) | |
| Constant | 0.023 | 0.049 | 0.010 | 0.003 | 0.008 | 0.019 | 0.028 |
| (0.001) | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) | |
| Observations | 73,960 | 74,560 | 74,560 | 62,644 | 66,546 | 69,718 | 70,936 |
| 0.005 | 0.000 | 0.002 | 0.010 | 0.007 | 0.006 | 0.017 |
Notes.
Test H0: constant = 0.05.
Standard errors in parentheses.
p < 0.05.
p < 0.01.
p < 0.001.
Conditional statistical power of the publication bias tests (OLS regression).
shows the statistical power of the publication bias tests conditional on the number of studies included in the meta-analysis (K) and the between study heterogeneity (I2). In contrast to Table 4, also the share of committed as well as successful publication bias and its form as either file-drawer or p-hacking was controlled. Overall the FAT had the largest power but was not able to detect p-hacking as good as the TES. The CTs were underpowered if a low number of studies was included in a meta-analysis but performed well in studies with large Ks. Both, PU and the TES, were not able to detect publication bias under effect heterogeneity.
| PU | FAT | TES | 3% CT | 5% CT | 10% CT | 15% CT | |
|---|---|---|---|---|---|---|---|
| 0.165 | 0.244 | 0.238 | 0.573 | 0.513 | 0.382 | 0.307 | |
| ( | (0.002) | (0.002) | (0.002) | (0.002) | (0.002) | (0.002) | (0.002) |
| I2 [+10 percentage points] | −0.065 | 0.001 | −0.064 | 0.006 | 0.005 | 0.008 | 0.010 |
| (0.000) | (0.000) | (0.000) | (0.000) | (0.000) | (0.000) | (0.000) | |
| 0.048 | −0.110 | 0.075 | 0.179 | 0.187 | 0.186 | 0.177 | |
| (0.002) | (0.002) | (0.002) | (0.002) | (0.002) | (0.002) | (0.002) | |
| Comitted PB [+10ppts] | 0.051 | 0.030 | −0.065 | −0.035 | −0.053 | −0.073 | −0.084 |
| ( | (0.000 | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) |
| Successful PB [+10ppts] | 0.103 | 0.099 | 0.221 | 0.162 | 0.193 | 0.224 | 0.234 |
| ( | (0.001 | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) | (0.001) |
| Constant | 0.483 | 0.569 | 0.515 | −0.002 | 0.125 | 0.300 | 0.386 |
| (0.002) | (0.002) | (0.002) | (0.002) | (0.002) | (0.002) | (0.002) | |
| Observations | 123,520 | 123,520 | 123,520 | 107,736 | 111,315 | 115,243 | 117,207 |
| 0.572 | 0.306 | 0.473 | 0.497 | 0.483 | 0.457 | 0.446 |
Notes.
Test H0: constant = 0.8.
Standard errors in parentheses.
p < 0.05.
p < 0.01.
p < 0.001.