| Literature DB >> 35560730 |
Michal Shimonovich1, Anna Pearce1, Hilary Thomson1, Srinivasa Vittal Katikireddi1.
Abstract
In fields (such as population health) where randomised trials are often lacking, systematic reviews (SRs) can harness diversity in study design, settings and populations to assess the evidence for a putative causal relationship. SRs may incorporate causal assessment approaches (CAAs), sometimes called 'causal reviews', but there is currently no consensus on how these should be conducted. We conducted a methodological review of self-identifying 'causal reviews' within the field of population health to establish: (1) which CAAs are used; (2) differences in how CAAs are implemented; (3) how methods were modified to incorporate causal assessment in SRs. Three databases were searched and two independent reviewers selected reviews for inclusion. Data were extracted using a standardised form and summarised using tabulation and narratively. Fifty-three reviews incorporated CAAs: 46/53 applied Bradford Hill (BH) viewpoints/criteria, with the remainder taking alternative approaches: Medical Research Council guidance on natural experiments (2/53, 3.8%); realist reviews (2/53, 3.8%); horizontal SRs (1/53, 1.9%); 'sign test' of causal mechanisms (1/53, 1.9%); and a causal cascade model (1/53, 1.9%). Though most SRs incorporated BH, there was variation in application and transparency. There was considerable overlap across the CAAs, with a trade-off between breadth (BH viewpoints considered a greater range of causal characteristics) and depth (many alternative CAAs focused on one viewpoint). Improved transparency in the implementation of CAA in SRs in needed to ensure their validity and allow robust assessments of causality within evidence synthesis.Entities:
Keywords: causal assessment; causality; population health; systematic review
Mesh:
Year: 2022 PMID: 35560730 PMCID: PMC9543433 DOI: 10.1002/jrsm.1569
Source DB: PubMed Journal: Res Synth Methods ISSN: 1759-2879 Impact factor: 9.308
Inclusion and exclusion criteria for reviews adapted from sample, phenomenon of interest, design, evaluation, and research type (SPIDER) protocol for mixed method studies ,
| Protocol category | Explanation | Criteria | ||
|---|---|---|---|---|
| Explanation based on (34, 36) | Operationalised in methodological review | Include | Exclude | |
| Sample | Similar to ‘population’ in PICO, sample refers to the participants included in the studies | The sample in this methodological review refers to the remit of the reviews, rather than the samples of the studies within those reviews. The remit is population or public health. There are no restrictions on criteria related to the study design, conditions, characteristics or settings within the reviews | Population health research (including public health interventions, health policy interventions, exposures such as risk factors and determinants). We focus on population health due to the importance of causal assessment in understanding both complex health interventions as well as potential health risks | Clinical interventions (including pharmaceutical, surgical or psychological interventions) |
| Phenomenon of interest | Phenomenon of interest relates to the aim or focus of the included reviews | The phenomena of interest are reviews that explicitly stated their aim was to identify a causal relationship, including those that identified putative pathways for a causal relationship | Reviews that explicitly stated they aimed to identify a putative causal relationship between exposure and population health outcome. We focus on explicit evaluations of causality largely due to time and resource constraints | Explicit mention that likelihood of causal relationship was not considered |
| Design | Design refers to the study design (including any theoretical frameworks) used to inform the research methods | The review design will be limited to systematic reviews (SRs) and reviews of SRs (RoRs) | SRs and RoRs. RoRs are also included because they use similar methods and often aspire to achieve the transparency of SRs17. Because of the variation in how SRs/RoRs are defined, we defined inclusion by self‐identification. Determining how closely a review followed SR/RoR principles and methods was beyond the scope of this methodological review | Non‐systematic reviews including methodological review |
| Evaluation | The term ‘evaluation’ is comparable to ‘outcomes’ in PICO. To accommodate qualitative research, evaluation includes unmeasurable findings | For the purposes of this methodological review, this refers to what approaches are incorporated into the reviews | Reviews that have incorporated approach(es) to causal assessment. We limited SRs/RoRs to those that explicitly incorporated one or more causal assessment approach (CAA) because of resources and time constraints | No explicit mention approach has been incorporated to support causal assessment |
| Research type | The research type refers to either quantitative, qualitative or mixed methods | This protocol category was not utilised as we did not have restrictions related to criteria for study design of the review | Not applicable | Not applicable |
Abbreviation: PICO, population, intervention, comparison and outcome.
Overview of how Bradford Hill (BH) viewpoints were applied, categorised by five domains
| Domain | Description | Summary of results |
|---|---|---|
| Viewpoints used | The viewpoints used in each review |
Strength of association: 44/46, 95.7% Temporality: 44/46, 95.7% Dose–response: 43/46, 93.5% Consistency: 41/46, 89.1% Plausibility: 38/46, 82.6% Experiment: 32/46, 69.6% Coherence: 21/46, 45.7% Specificity: 18/46, 39.1% Analogy: 15/46, 32.6% |
| Viewpoint definition | Whether a description, interpretation or definition of each viewpoint is provided | Description,/interpretation/definition provided: 15/46, 32.6% |
| Viewpoint indicators | Criteria to determine if viewpoint had been met are reported |
Indicators used and reported: 19/46, 41.3% Example of indicators include, but are not limited to, quantitative ranges (e.g., risk ratio (RR) or odds ratio (OR) between 3.0 and 8.0 for strong association) or qualitative thresholds (e.g., at least one credible mechanism to explain association for plausibility) |
| Overall support for viewpoints | Report level or degree of support for each viewpoint (e.g., strong, moderate, and weak) | 44/46 (95.7%) |
| Viewpoint application | Viewpoints were applied before or after evidence was synthesised and could be applied to all studies as a collective, studies individually or groups of studies (e.g., by study design, by exposure/outcome relationship). Some reviews applied viewpoints in more than one way |
36/46 (78.3%) 13/46 (28.3%) 12/46 (26.1%) 4/46 (8.7%) 1/46 (2.2%) review It was unclear how 2/46 (4.3%) reviews |
Note: The domain descriptions and corresponding reviews are summarised.
Impact of causal assessment approaches (CAAs) o conduct of systematic review (SR) stages
| Review stage | Number of reviews ( | CAA incorporated into SR stage |
|---|---|---|
| Review aims and objective | 39/53, 73.6% | Most reviews explicitly stated that one of their review aims and objectives was to assess (statistically and/or narratively) evidence for causal relationship. The nature of how causality was assessed varied across reviews, where reviews might focus on statistically analysing the evidence and/or on narratively assessing the evidence |
| Review design | 41/53, 77.4% | Most reviews included their CAA as a specific part of their overall review design for identifying, synthesising, and analysing evidence. Including CAA in a SR study design suggests that causal assessment was an a priori consideration for the SR, an important fact to consider when critically appraising SRs |
| Inclusion/exclusion criteria | 14/53, 26.4% |
Of the 14 reviews that designed criteria to reflect incorporating CAAs, 10 reviews (all of which utilised BH viewpoints except for one realist review) included studies that considered potential causal pathways Another four reviews |
| Search terms | 3/53, 5.7% | A few reviews designed their search strategies to specifically identify studies that support their CAA. Common terms included: causality, experiment, instrumental variable, regression discontinuity, and mechanism |
| Data extraction | 14/53, 26.4% | All 14 reviews |
| Synthesis | 51/53, 96.2% | 44 |
| Conclusion | 53/53, 100% | All reviews considered causality in their conclusions but it was unclear in three reviews whether a conclusion regarding a causal relationship was drawn. |
FIGURE 1PRISMA flow diagram with primary reasons for excluding full text reviews [Colour figure can be viewed at wileyonlinelibrary.com]
Overview of description of causal assessment approaches (CAAs) and how they were incorporated into systematic reviews (SRs) and reviews of reviews (RoRs)
| CAA | Number of reviews | Description of CAA | How CAA was incorporated into SRs |
|---|---|---|---|
| Bradford Hill (BH) viewpoints | 46 | BH viewpoints, also known as criteria, are a set of nine characteristics to consider when assessing a causal relationship. | The most commonly used CAA, there was considerable variation in which BH viewpoints were used and how they were operationalised. There was also variation in transparency and clarity about how the viewpoints were incorporated and used in causal assessment |
| Medical research council (MRC) guidance on natural experiments | 2 | The MRC guidance on natural experiments posits that certain study designs and analytic methods are more suitable to assess causality than others, and suggests that results from different studies be compared. | Two reviews |
| Realist reviews | 2 | Realist reviews are an established CAA with an existing set of guidelines for incorporating realist synthesis principles into SRs. | Both realist reviews narratively assessed causal mechanisms that may explain the relationship under study, and both determined that further evidence is needed to understand possible mechanisms |
| Horizontal systematic review | 1 | This CAA was developed by the review authors to collate evidence of causal effects across a range of study designs and risk (identified for having varying properties, such as threat of confounding, measurement error or proximity to the outcome on the causal pathway | The authors considered evidence from observational studies (that accounted for confounding and reverse causation), genetic studies using Mendelian randomisation, and RCTs for four risk factors Separate meta‐analyses were conducted for each risk factor and by study design. The meta‐analysis results were compared across risk factors, considering the differing sources and level of bias across the different methods |
| Sign test hypotheses | 1 | This approach, interrogates the evidence for reverse causation (such that the outcome is in fact the cause of the exposure) | The authors |
| Causal cascade method | 1 | Based on logic model developed to illustrate the ‘framework of causal relationships’, the authors conducted a Bayesian meta‐analyses on the heterogeneity across RCTs |
Authors hypothesised reasons for heterogeneity found in RCTs evaluating breast cancer screening on mortality—including attendance rates, the accuracy of screening tests, and social class. The logic model in Figure The authors then considered the trial evidence across these different inter‐related factors to consider whether heterogeneity in the evidence base could be explained by these factors. Based on the assumptions in the logic model and the included studies, the review estimated the relative risk of advanced‐stage breast cancer and breast cancer mortality by three different attendance rates and sensitivity in trials (a total of nine scenarios). Overall, they found that attendance rate and sensitivity may explain statistical heterogeneity across trials |
Note: The review topics, in terms of exposures, varied: sixteen (16/53, 30.2%) reviews focused on occupational health , , , , , , , , , , , , , , ; eleven (11/53, 20.8%) on environmental health , , , , , , , , , ; nine (9/53, 17.0%) on nutritional health , , , , , , , , ; four (4/53, 7.5%) on smoking , , , ; four (4/53, 7.5%) on mental health , , , ; three (3/53, 5.7%) on alcohol consumption , , ; two (2/53, 3.8%) on child health , ; two (2/53, 3.8%) on health inequalities , ; one (1/53, 1.9%) on diagnostics ; and one (1/53, 1.9%) on respiratory diseases.
Abbreviation: RCTs, randomised controlled trials.