| Literature DB >> 26629998 |
Martin E Héroux1,2, Janet L Taylor1,2, Simon C Gandevia1,2.
Abstract
The magnitude and direction of reported physiological effects induced using transcranial magnetic stimulation (TMS) to modulate human motor cortical excitability have proven difficult to replicate routinely. We conducted an online survey on the prevalence and possible causes of these reproducibility issues. A total of 153 researchers were identified via their publications and invited to complete an anonymous internet-based survey that asked about their experience trying to reproduce published findings for various TMS protocols. The prevalence of questionable research practices known to contribute to low reproducibility was also determined. We received 47 completed surveys from researchers with an average of 16.4 published papers (95% CI 10.8-22.0) that used TMS to modulate motor cortical excitability. Respondents also had a mean of 4.0 (2.5-5.7) relevant completed studies that would never be published. Across a range of TMS protocols, 45-60% of respondents found similar results to those in the original publications; the other respondents were able to reproduce the original effects only sometimes or not at all. Only 20% of respondents used formal power calculations to determine study sample sizes. Others relied on previously published studies (25%), personal experience (24%) or flexible post-hoc criteria (41%). Approximately 44% of respondents knew researchers who engaged in questionable research practices (range 30–81%), yet only 18% admitted to engaging in them (range 6–38%) [corrected]. These practices included screening subjects to find those that respond in a desired way to a TMS protocol, selectively reporting results and rejecting data based on a gut feeling. In a sample of 56 published papers that were inspected, not a single questionable research practice was reported. Our survey revealed that approximately 50% of researchers are unable to reproduce published TMS effects. Researchers need to start increasing study sample size and eliminating--or at least reporting--questionable research practices in order to make the outcomes of TMS research reproducible.Entities:
Mesh:
Year: 2015 PMID: 26629998 PMCID: PMC4668054 DOI: 10.1371/journal.pone.0144151
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Prevalence of questionable research practices.
|
|
|
|
|---|---|---|
| Screen for ‘responders’ to a TMS protocol | 38 (68) | 18 (38) |
| Drop data points based on a gut feeling | 18 (38) | 6 (13) |
| Exclude data after looking at impact on results | 14 (30) | 3 (6) |
| Not report all experimental conditions | 19 (40) | 10 (21) |
| Selectively report outcomes | 23 (49) | 5 (11) |
| Selectively report time points | 14 (30) | 5 (11) |
| Selectively report sub-groups of subjects | 18 (38) | 8 (17) |
| Reject ‘outliers’ without statistical support | 19 (30) | 10 (21) |
See S1 File for the exact wording used in the online survey.
Fig 1Factors contributing to irreproducible TMS results.
When planning and executing a research study, the size of the investigated effect and the size of the sample directly influence a study’s statistical power—probability of correctly rejecting the null hypothesis when the null hypothesis is false—and the certainty of reported researcher results. Selecting sample size based on previous experience, published reports or power calculations based on inflated effect sizes from the literature often results in too few subjects being tested. In a study with low statistical power, significant results (i.e. p < 0.05) are biased towards extreme values(i.e. a large effect; study B). Independently, questionable research practices will also increase the rate of false discoveries and exaggerated effect sizes. Because these results meet the traditional level of statistical significance, they will likely become part of published literature. For the unlucky scientist who did not find statistically significant results (study A), the study may never be written up or it will be rejected by publishers because it presents uncertain, negative results. These studies become part of the cemetery of unpublished scientific research, the file-drawer.