| Literature DB >> 33960637 |
Rose E O'Dea1, Malgorzata Lagisz1, Michael D Jennions2, Julia Koricheva3, Daniel W A Noble1,2, Timothy H Parker4, Jessica Gurevitch5, Matthew J Page6, Gavin Stewart7, David Moher8, Shinichi Nakagawa1.
Abstract
Since the early 1990s, ecologists and evolutionary biologists have aggregated primary research using meta-analytic methods to understand ecological and evolutionary phenomena. Meta-analyses can resolve long-standing disputes, dispel spurious claims, and generate new research questions. At their worst, however, meta-analysis publications are wolves in sheep's clothing: subjective with biased conclusions, hidden under coats of objective authority. Conclusions can be rendered unreliable by inappropriate statistical methods, problems with the methods used to select primary research, or problems within the primary research itself. Because of these risks, meta-analyses are increasingly conducted as part of systematic reviews, which use structured, transparent, and reproducible methods to collate and summarise evidence. For readers to determine whether the conclusions from a systematic review or meta-analysis should be trusted - and to be able to build upon the review - authors need to report what they did, why they did it, and what they found. Complete, transparent, and reproducible reporting is measured by 'reporting quality'. To assess perceptions and standards of reporting quality of systematic reviews and meta-analyses published in ecology and evolutionary biology, we surveyed 208 researchers with relevant experience (as authors, reviewers, or editors), and conducted detailed evaluations of 102 systematic review and meta-analysis papers published between 2010 and 2019. Reporting quality was far below optimal and approximately normally distributed. Measured reporting quality was lower than what the community perceived, particularly for the systematic review methods required to measure trustworthiness. The minority of assessed papers that referenced a guideline (~16%) showed substantially higher reporting quality than average, and surveyed researchers showed interest in using a reporting guideline to improve reporting quality. The leading guideline for improving reporting quality of systematic reviews is the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) statement. Here we unveil an extension of PRISMA to serve the meta-analysis community in ecology and evolutionary biology: PRISMA-EcoEvo (version 1.0). PRISMA-EcoEvo is a checklist of 27 main items that, when applicable, should be reported in systematic review and meta-analysis publications summarising primary research in ecology and evolutionary biology. In this explanation and elaboration document, we provide guidance for authors, reviewers, and editors, with explanations for each item on the checklist, including supplementary examples from published papers. Authors can consult this PRISMA-EcoEvo guideline both in the planning and writing stages of a systematic review and meta-analysis, to increase reporting quality of submitted manuscripts. Reviewers and editors can use the checklist to assess reporting quality in the manuscripts they review. Overall, PRISMA-EcoEvo is a resource for the ecology and evolutionary biology community to facilitate transparent and comprehensively reported systematic reviews and meta-analyses.Entities:
Keywords: comparative analysis; critical appraisal; evidence synthesis; non-independence; open science; pre-registration; registration; study quality
Mesh:
Year: 2021 PMID: 33960637 PMCID: PMC8518748 DOI: 10.1111/brv.12721
Source DB: PubMed Journal: Biol Rev Camb Philos Soc ISSN: 0006-3231
Fig 1Results from our assessment of reporting quality of systematic reviews and meta‐analyses published between 2010 and 2019, in ecology and evolutionary biology (n = 102). For each paper, the reporting score represents the mean ‘average item % score’ across all applicable items. Full details are provided in the Supporting Information and supplementary code. Red columns indicate the minority of papers that cited a reporting guideline (n = 15 cited PRISMA, and n = 1 cited Koricheva & Gurevitch, 2014). The subset of papers that referenced a reporting guideline tended to have higher reporting scores (note that these observational data cannot distinguish between checklists causing better reporting, or authors with better reporting practices being more likely to report using checklists). Welch's t‐test: t‐value = 5.21; df = 25.65; P < 0.001.
Fig 2PRISMA‐EcoEvo for authors, peer‐reviewers, and editors. Planning and protocols are shown in grey because, while PRISMA‐EcoEvo can point authors in the right direction, authors should seek additional resources for detailed conduct guidance. Authors can use PRISMA‐EcoEvo as a reporting guideline for both registered reports (Primer C) and completed manuscripts. Reviewers and editors can use PRISMA‐EcoEvo to assess reporting quality of the systematic review and meta‐analysis manuscripts they read. Editors can promote high reporting quality by asking submitting authors to complete the PRISMA‐EcoEvo checklist, either by downloading a static file at https://osf.io/t8qd2/, or by using an interactive web application at https://prisma‐ecoevo.shinyapps.io/checklist/.
PRISMA‐EcoEvo v1.0. Checklist of preferred reporting items for systematic reviews and meta‐analyses in ecology and evolutionary biology, alongside an assessment of recent reporting practices (based on a representative sample of 102 meta‐analyses published between 2010 and 2019; references to all assessed papers are provided in the reference list, while the Supporting Information presents details of the assessment). The proportion of papers meeting each sub‐item is presented as a percentage. While all papers were assessed for each item, there was a set of reasons why some items might not be applicable (e.g. no previous reviews on the topic would make sub‐item 2.2 not applicable). Only applicable sub‐items contributed to reporting scores; sample sizes for each sub‐item are shown in the column on the right. Asterisks (*) indicate sub‐items that are identical, or very close, to items from the 2009 PRISMA checklist. In the wording of each Sub‐item, ‘review’ encompasses all forms of evidence syntheses (including systematic reviews), while ‘meta‐analysis’ and ‘meta‐regression’ refer to statistical methods for analysing data collected in the review (definitions are discussed further in Primer A)
| Checklist item | Sub‐item number | Sub‐item | Papers meeting component (%) | No. papers applicable |
|---|---|---|---|---|
| Title and abstract | 1.1 | Identify the review as a systematic review, meta‐analysis, or both* | 100 | 102 |
| 1.2 | Summarise the aims and scope of the review | 97 | 102 | |
| 1.3 | Describe the data set | 74 | 102 | |
| 1.4 | State the results of the primary outcome | 96 | 102 | |
| 1.5 | State conclusions* | 94 | 102 | |
| 1.6 | State limitations* | 17 | 96 | |
| Aims and questions | 2.1 | Provide a rationale for the review* | 100 | 102 |
| 2.2 | Reference any previous reviews or meta‐analyses on the topic | 93 | 75 | |
| 2.3 | State the aims and scope of the review (including its generality) | 91 | 102 | |
| 2.4 | State the primary questions the review addresses (e.g. which moderators were tested) | 96 | 102 | |
| 2.5 | Describe whether effect sizes were derived from experimental and/or observational comparisons | 57 | 76 | |
| Review registration | 3.1 | Register review aims, hypotheses (if applicable), and methods in a time‐stamped and publicly accessible archive and provide a link to the registration in the methods section of the manuscript. Ideally registration occurs before the search, but it can be done at any stage before data analysis | 3 | 102 |
| 3.2 | Describe deviations from the registered aims and methods | 0 | 3 | |
| 3.3 | Justify deviations from the registered aims and methods | 0 | 3 | |
| Eligibility criteria | 4.1 | Report the specific criteria used for including or excluding studies when screening titles and/or abstracts, and full texts, according to the aims of the systematic review (e.g. study design, taxa, data availability) | 84 | 102 |
| 4.2 | Justify criteria, if necessary (i.e. not obvious from aims and scope) | 54 | 67 | |
| Finding studies | 5.1 | Define the type of search (e.g. comprehensive search, representative sample) | 25 | 102 |
| 5.2 | State what sources of information were sought (e.g. published and unpublished studies, personal communications)* | 89 | 102 | |
| 5.3 | Include, for each database searched, the exact search strings used, with keyword combinations and Boolean operators | 49 | 102 | |
| 5.4 | Provide enough information to repeat the equivalent search (if possible), including the timespan covered (start and end dates) | 14 | 102 | |
| Study selection | 6.1 | Describe how studies were selected for inclusion at each stage of the screening process (e.g. use of decision trees, screening software) | 13 | 102 |
| 6.2 | Report the number of people involved and how they contributed (e.g. independent parallel screening) | 3 | 102 | |
| Data collection process | 7.1 | Describe where in the reports data were collected from (e.g. text or figures) | 44 | 102 |
| 7.2 | Describe how data were collected (e.g. software used to digitize figures, external data sources) | 42 | 102 | |
| 7.3 | Describe moderator variables that were constructed from collected data (e.g. number of generations calculated from years and average generation time) | 56 | 41 | |
| 7.4 | Report how missing or ambiguous information was dealt with during data collection (e.g. authors of original studies were contacted for missing descriptive statistics, and/or effect sizes were calculated from test statistics) | 47 | 102 | |
| 7.5 | Report who collected data | 10 | 102 | |
| 7.6 | State the number of extractions that were checked for accuracy by co‐authors | 1 | 102 | |
| Data items | 8.1 | Describe the key data sought from each study | 96 | 102 |
| 8.2 | Describe items that do not appear in the main results, or which could not be extracted due to insufficient information | 42 | 53 | |
| 8.3 | Describe main assumptions or simplifications that were made (e.g. categorising both ‘length’ and ‘mass’ as ‘morphology’) | 62 | 86 | |
| 8.4 | Describe the type of replication unit (e.g. individuals, broods, study sites) | 73 | 102 | |
| Assessment of individual study quality | 9.1 | Describe whether the quality of studies included in the systematic review or meta‐analysis was assessed (e.g. blinded data collection, reporting quality, experimental | 7 | 102 |
| 9.2 | Describe how information about study quality was incorporated into analyses (e.g. meta‐regression and/or sensitivity analysis) | 6 | 102 | |
| Effect size measures | 10.1 | Describe effect size(s) used | 97 | 102 |
| 10.2 | Provide a reference to the equation of each calculated effect size (e.g. standardised mean difference, log response ratio) and (if applicable) its sampling variance | 63 | 91 | |
| 10.3 | If no reference exists, derive the equations for each effect size and state the assumed sampling distribution(s) | 7 | 28 | |
| Missing data | 11.1 | Describe any steps taken to deal with missing data during analysis (e.g. imputation, complete case, subset analysis) | 37 | 57 |
| 11.2 | Justify the decisions made to deal with missing data | 21 | 57 | |
| Meta‐analytic model description | 12.1 | Describe the models used for synthesis of effect sizes | 97 | 102 |
| 12.2 | The most common approach in ecology and evolution will be a random‐effects model, often with a hierarchical/multilevel structure. If other types of models are chosen (e.g. common/fixed effects model, unweighted model), provide justification for this choice | 50 | 40 | |
| Software | 13.1 | Describe the statistical platform used for inference (e.g. | 92 | 102 |
| 13.2 | Describe the packages used to run models | 74 | 80 | |
| 13.3 | Describe the functions used to run models | 22 | 69 | |
| 13.4 | Describe any arguments that differed from the default settings | 29 | 75 | |
| 13.5 | Describe the version numbers of all software used | 33 | 102 | |
| Non‐independence | 14.1 | Describe the types of non‐independence encountered (e.g. phylogenetic, spatial, multiple measurements over time) | 32 | 102 |
| 14.2 | Describe how non‐independence has been handled | 74 | 102 | |
| 14.3 | Justify decisions made | 47 | 102 | |
| Meta‐regression and model selection | 15.1 | Provide a rationale for the inclusion of moderators (covariates) that were evaluated in meta‐regression models | 81 | 94 |
| 15.2 | Justify the number of parameters estimated in models, in relation to the number of effect sizes and studies (e.g. interaction terms were not included due to insufficient sample sizes) | 20 | 94 | |
| 15.3 | Describe any process of model selection | 80 | 40 | |
| Publication bias and sensitivity analyses | 16.1 | Describe assessments of the risk of bias due to missing results (e.g. publication, time‐lag, and taxonomic biases) | 65 | 102 |
| 16.2 | Describe any steps taken to investigate the effects of such biases (if present) | 47 | 30 | |
| 16.3 | Describe any other analyses of robustness of the results, e.g. due to effect size choice, weighting or analytical model assumptions, inclusion or exclusion of subsets of the data, or the inclusion of alternative moderator variables in meta‐regressions | 35 | 102 | |
| Clarification of | 17.1 | When hypotheses were formulated after data analysis, this should be acknowledged | 14 | 28 |
| Metadata, data, and code | 18.1 | Share metadata (i.e. data descriptions) | 44 | 102 |
| 18.2 | Share data required to reproduce the results presented in the manuscript | 77 | 102 | |
| 18.3 | Share additional data, including information that was not presented in the manuscript (e.g. raw data used to calculate effect sizes, descriptions of where data were located in papers) | 39 | 102 | |
| 18.4 | Share analysis scripts (or, if a software package with graphical user interface (GUI) was used, then describe full model specification and fully specify choices) | 11 | 102 | |
| Results of study selection process | 19.1 | Report the number of studies screened* | 37 | 102 |
| 19.2 | Report the number of studies excluded at each stage of screening | 22 | 102 | |
| 19.3 | Report brief reasons for exclusion from the full‐text stage | 27 | 102 | |
| 19.4 | Present a Preferred Reporting Items for Systematic Reviews and Meta‐Analyses (PRISMA)‐like flowchart ( | 19 | 102 | |
| Sample sizes and study characteristics | 20.1 | Report the number of studies and effect sizes for data included in meta‐analyses | 96 | 91 |
| 20.2 | Report the number of studies and effect sizes for subsets of data included in meta‐regressions | 57 | 93 | |
| 20.3 | Provide a summary of key characteristics for reported outcomes (either in text or figures; e.g. one quarter of effect sizes reported for vertebrates and the rest invertebrates) | 62 | 102 | |
| 20.4 | Provide a summary of limitations of included moderators (e.g. collinearity and overlap between moderators) | 22 | 87 | |
| 20.5 | Provide a summary of characteristics related to individual study quality (risk of bias) | 60 | 5 | |
| Meta‐analysis | 21.1 | Provide a quantitative synthesis of results across studies, including estimates for the mean effect size, with confidence/credible intervals | 94 | 87 |
| Heterogeneity | 22.1 | Report indicators of heterogeneity in the estimated effect (e.g. | 52 | 84 |
| Meta‐regression | 23.1 | Provide estimates of meta‐regression slopes (i.e. regression coefficients) and confidence/credible intervals | 78 | 94 |
| 23.2 | Include estimates and confidence/credible intervals for all moderator variables that were assessed (i.e. complete reporting) | 59 | 94 | |
| 23.3 | Report interactions, if they were included | 59 | 27 | |
| 23.4 | Describe outcomes from model selection, if done (e.g. | 81 | 36 | |
| Outcomes of publication bias and sensitivity analyses | 24.1 | Provide results for the assessments of the risks of bias (e.g. Egger's regression, funnel plots | 60 | 102 |
| 24.2 | Provide results for the robustness of the review's results (e.g. subgroup analyses, meta‐regression of study quality, results from alternative methods of analysis, and temporal trends) | 44 | 102 | |
| Discussion | 25.1 | Summarise the main findings in terms of the magnitude of effect | 73 | 102 |
| 25.2 | Summarise the main findings in terms of the precision of effects (e.g. size of confidence intervals, statistical significance) | 57 | 102 | |
| 25.3 | Summarise the main findings in terms of their heterogeneity | 47 | 89 | |
| 25.4 | Summarise the main findings in terms of their biological/practical relevance | 98 | 102 | |
| 25.5 | Compare results with previous reviews on the topic, if available | 88 | 72 | |
| 25.6 | Consider limitations and their influence on the generality of conclusions, such as gaps in the available evidence (e.g. taxonomic and geographical research biases) | 72 | 100 | |
| Contributions and funding | 26.1 | Provide names, affiliations, and funding sources of all co‐authors | 92 | 102 |
| 26.2 | List the contributions of each co‐author | 31 | 102 | |
| 26.3 | Provide contact details for the corresponding author | 100 | 102 | |
| 26.4 | Disclose any conflicts of interest | 0 | 8 | |
| References | 27.1 | Provide a reference list of all studies included in the systematic review or meta‐analysis | 92 | 102 |
| 27.2 | List included studies as referenced sources (e.g. rather than listing them in a table or supplement) | 18 | 102 |
Fig 3PRISMA‐style flowcharts and some variations. (A) The classic flow‐chart: all searches are conducted around the same date, and screening occurs after de‐duplication. (B) Records are obtained from different databases (or other sources, e.g. personal archives or requests) and screened separately. De‐duplication occurs after at least one stage of screening. (C) The studies included after a classic search are then used as the ‘seed’ for a new search, based on citation information. Authors can retrieve all papers cited in included articles (backwards search), and all papers that cite the included articles (forwards search). A second round of de‐duplication and screening then occurs. (D) When a systematic review is an update of an already existing one, the newly found papers are added to the existing (old) set of included papers. As a further extension, it would be beneficial to record, and report, how many of the included articles originate from each source. For example, if one database contributed none of the included articles, then updates of the review could save time by not screening articles from that database.