| Literature DB >> 29540417 |
Matthew J Page1,2, Joanne E McKenzie1, Julian P T Higgins2.
Abstract
BACKGROUND: Several scales, checklists and domain-based tools for assessing risk of reporting biases exist, but it is unclear how much they vary in content and guidance. We conducted a systematic review of the content and measurement properties of such tools.Entities:
Keywords: bias (epidemiology); checklist; publication bias; review literature as topic
Mesh:
Year: 2018 PMID: 29540417 PMCID: PMC5857645 DOI: 10.1136/bmjopen-2017-019703
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Flow diagram of identification, screening and inclusion of studies. aRecords identified from Ovid MEDLINE, Ovid Embase, Ovid PsycINFO and Google Scholar. bRecords identified from screening references of included articles. SR, systematic review.
List of included tools
| Article ID | Tool | Scope of tool | Types of reporting biases assessed | Level of assessment* | ||
| Selective publication | Selective non-reporting | Selection of the reported result | ||||
| Balshem | Agency for Healthcare Research and Quality (AHRQ) outcome and analysis reporting bias framework | Reporting bias only | ✓ | ✓ | Specific outcome/result in a study | |
| Berkman | AHRQ tool for evaluating the risk of reporting bias | Reporting bias only | ✓ | ✓ | Specific synthesis of studies | |
| Downes | Appraisal tool for Cross-Sectional Studies (AXIS) tool | Multiple sources of bias | ✓ | Study | ||
| Downs and Black | Downs-Black tool | Multiple sources of bias | ✓ | Study | ||
| Guyatt | Grading of Recommendations Assessment, Development and Evaluation (GRADE) | Multiple sources of bias | ✓ | ✓ | Specific synthesis of studies | |
| Hayden | Quality In Prognosis Studies (QUIPS) tool | Multiple sources of bias | ✓ | Study | ||
| Higgins | Cochrane risk of bias tool for randomised trials (RoB 1.0) | Multiple sources of bias | ✓ | ✓ | Study | |
| Higgins | RoB 2.0 revised tool for assessing risk of bias in randomised trials | Multiple sources of bias | ✓ | Specific outcome/result in a study | ||
| Hoojimans | SYstematic Review Centre for Laboratory animal Experimentation (SYRCLE) RoB tool | Multiple sources of bias | ✓ | ✓ | Study | |
| Kim | Risk of Bias Assessment Tool for Nonrandomized Studies (RoBANS) | Multiple sources of bias | ✓ | ✓ | Study | |
| Kirkham | Outcome Reporting Bias In Trials I (ORBIT-I) classification system for benefit outcomes | Reporting bias only | ✓ | Specific outcome/result in a study | ||
| Meader | Semi-Automated Quality Assessment Tool (SAQAT) | Multiple sources of bias | ✓ | ✓ | Specific synthesis of studies | |
| Reid | Selective reporting bias algorithm | Reporting bias only | ✓ | ✓ | Study | |
| Saini | ORBIT-II classification system for harm outcomes | Reporting bias only | ✓ | Specific outcome/result in a study | ||
| Salanti | Framework for evaluating the quality of evidence from a network meta-analysis | Multiple sources of bias | ✓ | ✓ | Specific synthesis of studies | |
| Sterne | Risk Of Bias In Non-randomized Studies of Interventions I (ROBINS-I) tool | Multiple sources of bias | ✓ | Specific outcome/result in a study | ||
| Viswanathan and Berkman | Research Triangle Institute (RTI) item bank for assessment of risk of bias and precision for observational studies of interventions or exposures | Multiple sources of bias | ✓ | Study | ||
| Viswanathan et al | RTI item bank for assessing risk of bias and confounding for observational studies of interventions or exposures | Multiple sources of bias | ✓ | Study | ||
*Level of assessment classified as: ‘study’ when assessments are directed at a study as a whole (eg, tool used to assess whether any outcomes in a study were not reported); ‘specific outcome/result in a study’ when assessments are directed at a specific outcome or result within a study (eg, tools used to assess whether a particular outcome, such as pain, was not reported) or ‘specific synthesis of studies’ when assessments are directed at a specific synthesis (eg, tool used to assess whether a particular synthesis, such as a meta-analysis of pain, is missing unpublished studies).
Summary of general characteristics of included tools
| Characteristic | Summary data (n=18 tools) |
| Type of tool | |
| Domain-based | 16 (89%) |
| Checklist | 1 (6%) |
| Scale | 1 (6%) |
| Scope of tool | |
| Assessment of reporting bias only | 5 (28%) |
| Assessment of multiple sources of bias/quality | 13 (72%) |
| Types of reporting bias assessed | |
| Bias due to selective publication only | 0 (0%) |
| Bias due to selective non-reporting only | 6 (33%) |
| Bias in selection of the reported result only | 3 (17%) |
| Bias due to selective publication and bias due to selective non-reporting | 4 (22%) |
| Bias due to selective non-reporting and bias in selection of the reported result | 5 (28%) |
| Total number of items in the tool | 7 (5–13) |
| Number of items relevant to risk of reporting bias | 1 (1–2) |
| Number of response options for risk of reporting bias judgement | 3 (3–3) |
| Types of study designs to which the tool applies | |
| Randomised trials only | 5 (28%) |
| Systematic reviews only | 3 (17%) |
| Non-randomised studies of interventions only | 2 (11%) |
| Randomised trials and non-randomised studies of interventions | 2 (11%) |
| Non-randomised studies of interventions or exposures | 2 (11%) |
| Other (cross-sectional studies, animal studies, network meta-analyses, prognosis studies) | 4 (22%) |
| Level of assessment of risk of reporting bias | |
| Study as a whole | 9 (50%) |
| Specific outcome/result in a study | 5 (28%) |
| Specific synthesis of studies | 4 (22%) |
| Data sources used to inform tool content* | |
| Literature review (eg, of items in existing tools or empirical evidence) | 9 (50%) |
| Ideas generated at expert consensus meeting | 8 (44%) |
| Pilot feedback on preliminary version of the tool | 7 (39%) |
| Data from psychometric or cognitive testing† | 5 (28%) |
| Other (eg, adaptation of existing tool) | 5 (28%) |
| Delphi study responses | 2 (11%) |
| No methods stated | 2 (11%) |
| Guidance available | |
| Brief annotation per item/response option | 9 (50%) |
| Detailed guidance manual | 4 (22%) |
| Worked example for each response option | 2 (11%) |
| Detailed annotation per item/response option | 1 (6%) |
| None | 2 (11%) |
Summary data given as number (%) or median (IQR).
*The percentages in this category do not sum to 100% since the development of some tools was informed by multiple data sources.
†Psychometric testing includes any evaluation of the measurement properties (eg, construct validity, inter-rater reliability, test–retest reliability) of a draft version of the tool. Cognitive testing includes use of qualitative methods (eg, interview) to explore whether assessors who are using the tool for the first time were interpreting the tool and guidance as intended.
Criteria used in existing tools to inform a judgement of ‘high’ risk of bias due to selective publication
| ‘High’ risk of bias criteria proposed in existing tools | AHRQ RRB | GRADE | SAQAT | NMA-Quality | Total, n (%) |
| Assessment directed at a specific synthesis (eg, meta-analysis) | |||||
| Evidence of funnel plot asymmetry (based on visual inspection of funnel plot or statistical test for funnel plot asymmetry) | ✓ | ✓ | ✓ | ✓ | 4 (100) |
| Smaller studies tend to demonstrate more favourable results (based on visual assessment, without funnel plot) | ✓ | 1 (25) | |||
| Clinical decision would differ for estimates from a fixed-effect versus a random-effects model because the findings from a fixed-effect model are closer to the null | ✓ | 1 (25) | |||
| Substantial heterogeneity in the meta-analysis cannot be explained by some clinical or methodological factor | ✓ | 1 (25) | |||
| At least one study is affected by non-publication or non-accessibility | ✓ | 1 (25) | |||
| Presence of small (often ‘positive’) studies with for-profit interest in the synthesis | ✓ | ✓ | 2 (50) | ||
| Presence of early studies (ie, set of small, ‘positive’ trials addressing a novel therapy) in the synthesis | ✓ | ✓ | 2 (50) | ||
| Discrepancy in findings between published and unpublished trials | ✓ | ✓ | ✓ | 3 (75) | |
| Search strategies were not comprehensive | ✓ | ✓ | ✓ | 3 (75) | |
| Methods to identify all available evidence were not comprehensive | ✓ | ✓ | 2 (50) | ||
| Grey literature were not searched | ✓ | 1 (25) | |||
| Restrictions to study selection on the basis of language were applied | ✓ | 1 (25) | |||
| Industry influence may apply to studies included in the synthesis | ✓ | 1 (25) |
AHRQ RRB, AHRQ tool for evaluating the risk of reporting bias29; GRADE, GRADE rating of quality of evidence34–37; NMA-Quality, Framework for evaluating the quality of evidence from a network meta-analysis49; SAQAT, Semi-Automated Quality Assessment Tool.45 46
Criteria used in existing tools to inform a judgement of ‘high’ risk of bias due to selective non-reporting
| ‘High’ risk of bias criteria proposed in existing tools | AHRQ ORB | AHRQ RRB | AXIS | GRADE | QUIPS | RoB 1.0 | SYRCLE RoB | RoBANS | ORBIT-I | SAQAT | Reid | ORBIT-II | NMA-Quality | RTI 2012 | RTI 2013 | Total, n (%) |
| Assessment directed at study as a whole | ||||||||||||||||
| One or more outcomes of interest were clearly measured, but no results were reported | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | 8 (53) | |||||||
| One or more outcomes of interest were reported incompletely so that they could not be entered in a meta-analysis | ✓ | ✓ | 2 (13) | |||||||||||||
| The study report fails to include results for a key outcome that would be expected to have been reported for such a study | ✓ | ✓ | ✓ | ✓ | ✓ | 5 (33) | ||||||||||
| Assessment directed at a specific outcome | ||||||||||||||||
| Particular outcome clearly measured but no results were reported | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | 6 (40) | |||||||||
| Particular outcome of interest is reported incompletely so that it cannot be entered in a meta-analysis (typically stating only that P>0.05) | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | 6 (40) | |||||||||
| Judgement says particular outcome is likely to have been measured and analysed but not reported on the basis of its results | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | 6 (40) | |||||||||
| Composite outcomes are presented without the individual component outcomes | ✓ | ✓ | 2 (13) | |||||||||||||
| Result reported globally across all groups | ✓ | 1 (7) | ||||||||||||||
| Result reported for some groups only | ✓ | 1 (7) | ||||||||||||||
| Data were not reported consistently for the outcome of interest | ✓ | 1 (7) | ||||||||||||||
| Assessment directed at a specific synthesis | ||||||||||||||||
| Selective non-reporting suspected in a number of included studies | ✓ | ✓ | ✓ | ✓ | 4 (27) | |||||||||||
AHRQ ORB, AHRQ outcome and analysis reporting bias framework28; AHRQ RRB, AHRQ tool for evaluating the risk of reporting bias29; AXIS, Appraisal tool for Cross-Sectional Studies30; GRADE, GRADE rating of quality of evidence34–37; NMA-Quality, Framework for evaluating the quality of evidence from a network meta-analysis49; ORBIT-I, Outcome Reporting Bias In Trials classification system for benefit outcomes3 32; ORBIT-II, Outcome Reporting Bias In Trials classification system for harm outcomes48; QUIPS, Quality In Prognosis Studies tool38; Reid, Reid et al selective reporting bias algorithm47; RoB 1.0, Cochrane risk of bias tool for randomised trials21 39 40; RoBANS, Risk of Bias Assessment Tool for Nonrandomized Studies44; RTI 2012, RTI Item Bank for Assessment of Risk of Bias and Precision for Observational Studies of Interventions or Exposures51; RTI 2013, RTI Item Bank for Assessing Risk of Bias and Confounding for Observational Studies of Interventions or Exposures52; SAQAT, Semi-Automated Quality Assessment Tool45 46; SYRCLE RoB, SYstematic Review Centre for Laboratory animal Experimentation risk of bias tool.43
Criteria used in existing tools to inform a judgement of ‘high’ risk of bias in selection of the reported result
| ‘High’ risk of bias criteria proposed in existing tools | AHRQ ORB | Downs-Black | RoB 1.0 | RoB 2.0 | SYRCLE RoB | RoBANS | Reid | ROBINS-I | Total, n (%) |
| Assessment directed at study as a whole | |||||||||
| One or more reported outcomes were not prespecified (unless clear justification for their reporting is provided, such as an unexpected adverse event) | ✓ | ✓ | ✓ | ✓ | 4 (50) | ||||
| One or more outcomes were reported using measurements, analysis methods or subsets of the data (eg, subscales) that were not prespecified | ✓ | ✓ | 2 (15) | ||||||
| One or more retrospective, unplanned, subgroup analyses were reported | ✓ | 1 (13) | |||||||
| Any analyses that had not been planned at the outset of the study were not clearly indicated | ✓ | 1 (13) | |||||||
| Assessment directed at a specific outcome/result | |||||||||
| Particular outcome was not prespecified but results were reported | ✓ | 1 (13) | |||||||
| Reported result for a particular outcome is likely to have been selected, on the basis of the findings, from multiple outcome measurements (eg, scales, definitions, time points) within the outcome domain | ✓ | ✓ | 2 (25) | ||||||
| Reported result for a particular outcome is likely to have been selected, on the basis of the findings, from multiple analyses of the data | ✓ | ✓ | 2 (25) | ||||||
| Reported result for a particular outcome is likely to have been selected, on the basis of the findings, from different subgroups | ✓ | 1 (13) | |||||||
AHRQ ORB, AHRQ outcome and analysis reporting bias framework28; Downs-Black, Downs Black tool31; Reid, Reid et al selective reporting bias algorithm47; RoB 1.0, Cochrane risk of bias tool for randomised trials21 39 40; RoB 2.0, Revised tool for assessing risk of bias in randomised trials41 42; RoBANS, Risk of Bias Assessment Tool for Nonrandomized Studies44; ROBINS-I, Risk Of Bias In Non-randomized Studies of Interventions tool4; SYRCLE RoB, SYstematic Review Centre for Laboratory animal Experimentation risk of bias tool.43
Reported measurement properties of tools with an assessment of the risk of reporting bias
| Study ID | Tool | Measurement property | Sample size | Areas of healthcare addressed | Weighted kappa (95% CI) | Weighting scheme | Interpretation of kappa* |
| Armijo-Olivo | RoB 1.0 | Inter-rater agreement of assessments of risk of bias due to selective non-reporting (between two external reviewers) | 87 | Musculoskeletal, cardiorespiratory, neurological and gynaecological conditions. | 0.5 (CI not reported) | Not described | Moderate agreement |
| Armijo-Olivo | RoB 1.0 | Inter-rater agreement of assessments of risk of bias due to selective non-reporting (between two external reviewers and Cochrane reviewers) | 87 | See above | 0.13 (CI not reported) | Not described | Slight agreement |
| Hartling | RoB 1.0 | Inter-rater agreement of assessments of risk of bias due to selective non-reporting | 163 | Child health | 0.13 (95% CI −0.05 to 0.31) | Not described | Slight agreement |
| Hartling | RoB 1.0 | Inter-rater agreement of assessments of risk of bias due to selective non-reporting | 107 | Asthma | 0.4 (95% CI 0.14 to 0.67) | Not described | Fair agreement |
| Hartling | RoB 1.0 | Inter-rater agreement of assessments of risk of bias due to selective non-reporting (between two reviewers, all trials) | 124 | Varied | 0.27 (95% CI 0.06 to 0.49) | Not described | Fair agreement |
| Hartling | RoB 1.0 | Inter-rater agreement of assessments of risk of bias due to selective non-reporting (between pairs of reviewers across different centres, all trials) | 30 | Varied | 0.08 (95% CI −0.09 to 0.26) | Not described | Slight agreement |
| Jordan | RoB 1.0 | Inter-rater agreement of assessments of risk of bias due to selective non-reporting (between judgements of trials appearing in two SRs) | 28 | Subfertility | Not reported† | Not applicable | Not applicable |
| Vale | RoB 1.0 | Agreement between selective non-reporting assessments performed using published article only versus published article and data collected during the individual participant data process | 95 | Cancer pain | Not reported† | Not applicable | Not applicable |
| Hoojimans | SYRCLE RoB | Inter-rater agreement of assessments of risk of bias due to selective non-reporting | 32 | Animal studies (not specified) | 0.62 (CI not reported) | Not described | Substantial agreement |
| Kim | RoBANS | Inter-rater agreement of assessments of risk of bias due to selective non-reporting | 39 | Depression, myocardial infarction, postpartum haemorrhage, chronic non-cancer pain | 0 (CI not reported) | Not described | Poor agreement |
| Llewellyn | SAQAT | Inter-rater agreement of assessments of risk of bias due to selective publication (between two SAQAT raters) | 29 | Varied | 0.63 (95% CI 0.17 to 1) | Quadratic | Substantial agreement |
| Llewellyn | SAQAT | Inter-rater agreement of assessments of risk of bias due to selective publication (between one rater using SAQAT and one using the standard GRADE approach) | 15 | Varied | Not reported† | Not applicable | Not applicable |
| Norris | ORBIT-I | Inter-rater agreement of ORBIT-I classifications of risk of bias due to selective non-reporting | 40 | Varied | Not calculated, as too little variation in judgements | Not applicable | Not applicable |
| Bilandzic | ROBINS-I | Inter-rater agreement of assessments of risk of bias in selection of the reported result | 16 | Thiazolidinediones and cardiovascular events | 0.78 (CI not reported) | Linear | Substantial agreement |
| Bilandzic | ROBINS-I | Inter-rater agreement of assessments of risk of bias in selection of the reported result | 21 | COX-2 inhibitors and cardiovascular events | 0.45 (CI not reported) | Linear | Moderate agreement |
*Interpretation of kappa based on categorisation system defined by Landis et al.27
†Data presented as per cent agreement, not weighted kappa.
COX-2, cyclooxygenase-2; ORBIT-I, Outcome Reporting Bias In Trials classification system for benefit outcomes3 32; RoB 1.0, Cochrane risk of bias tool for randomised trials21 39 40; RoBANS, Risk of Bias Assessment Tool for Nonrandomized Studies44; ROBINS-I, Risk Of Bias In Non-randomized Studies of Interventions tool4; SAQAT, Semi-Automated Quality Assessment Tool45 46; SRs, systematic reviews; SYRCLE RoB, SYstematic Review Centre for Laboratory animal Experimentation risk of bias tool.43