| Literature DB >> 28596617 |
Fiona Fidler1, Yung En Chee1, Bonnie C Wintle1, Mark A Burgman1, Michael A McCarthy1, Ascelin Gordon1.
Abstract
Recent replication projects in other disciplines have uncovered disturbingly low levels of reproducibility, suggesting that those research literatures may contain unverifiable claims. The conditions contributing to irreproducibility in other disciplines are also present in ecology. These include a large discrepancy between the proportion of "positive" or "significant" results and the average statistical power of empirical research, incomplete reporting of sampling stopping rules and results, journal policies that discourage replication studies, and a prevailing publish-or-perish research culture that encourages questionable research practices. We argue that these conditions constitute sufficient reason to systematically evaluate the reproducibility of the evidence base in ecology and evolution. In some cases, the direct replication of ecological research is difficult because of strong temporal and spatial dependencies, so here, we propose metaresearch projects that will provide proxy measures of reproducibility.Entities:
Keywords: metaresearch; open science; publication bias; reproducibility; transparency
Year: 2017 PMID: 28596617 PMCID: PMC5384162 DOI: 10.1093/biosci/biw159
Source DB: PubMed Journal: Bioscience ISSN: 0006-3568 Impact factor: 8.589
Existing estimates of the statistical power of ecology research.
| Power estimate for effect sizes (ES) | ||||
|---|---|---|---|---|
| Source | Research field | Small ES | Medium ES | Large ES |
| Parris and McCarthy ( | Effects of toe-clipping frogs | 6%–10% | 8%–21% | 15%–60% |
| Jennions and Møller ( | Behavioural Ecology | 13%–16% | 40%–47% | 65%–72% |
| Smith et al. ( | Animal Behaviour | 7%–8% | 23%–26% | – |
Questionable Research Practices (QRPs) that can inflate the false positive rate in the literature and result in less reproducible research (adapted from John et al. 2012).
| · Checking the statistical significance of results before deciding whether to collect more data | |
| · Stopping data collection early because results reached statistical significance | |
| · Deciding whether to exclude data points (e.g., outliers) only after foreshadowing the impact on statistical significance and not reporting the impact of the data exclusion | |
| Rounding off a | |
| Cherry-picking | · Failing to report dependent or response variables or relationships that did not reach statistical significance or other threshold |
| · Failing to report conditions or treatments that did not reach statistical significance or other threshold | |
| HARKing (hypothesizing after the results are known) | Presenting a |