| Literature DB >> 35022173 |
Matthew Dodd1,2, Katherine Fielding3, James R Carpenter4,5, Jennifer A Thompson3, Diana Elbourne4,2.
Abstract
BACKGROUND: In non-inferiority trials with non-adherence to interventions (or non-compliance), intention-to-treat and per-protocol analyses are often performed; however, non-random non-adherence generally biases these estimates of efficacy.Entities:
Keywords: general medicine (see internal medicine); public health; statistics & research methods
Mesh:
Year: 2022 PMID: 35022173 PMCID: PMC8756274 DOI: 10.1136/bmjopen-2021-052656
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Flow chart showing the eligibility of papers reviewed (uploaded separately).
Characteristics of eligible analyses (n=26)
| Characteristics | n (%) |
| Type of publication (n=24) | |
| Results | 13 (54) |
| Methodology | 7 (29) |
| Protocol | 4 (17) |
| Year of publication (n=24) | |
| 2006–2010 | 5 (21) |
| 2011–2015 | 4 (17) |
| 2016–2020 | 15 (63) |
| Disease area or patient population | |
| Mental health | 4 (15) |
| Appendicitis | 2 (8) |
| Cancer | 2 (8) |
| Respiratory infection/disease | 2 (8) |
| Ulcerative colitis | 2 (8) |
| Anaemia | 1 (4) |
| Ear infection | 1 (4) |
| General surgery patients | 1 (4) |
| Heart disease | 1 (4) |
| HIV | 1 (4) |
| Individuals receiving life-sustaining therapies | 1 (4) |
| Renal disease | 1 (4) |
| Smoking cessation | 1 (4) |
| Throat infection | 1 (4) |
| Urinary incontinence | 1 (4) |
| Simulation study | 4 (15) |
| Unit of randomisation | |
| Individual | 19 (73) |
| Cluster | 3 (12) |
| Simulation study | 4 (15) |
| Type of experimental intervention | |
| Drug | 9 (35) |
| Method of treatment delivery | 3 (12) |
| Additional patient examination | 2 (8) |
| Nutritional | 2 (8) |
| Surgical | 1 (4) |
| Simulation study | 4 (15) |
| Other | 5 (19) |
| Type of outcome | |
| Binary | 13 (50) |
| Continuous | 8 (31) |
| Time to event | 4 (15) |
| Count | 1 (4) |
| Composite outcome | 3 (12) |
Estimates of non-adherence to interventions reported in methodology and results papers, combined across trial arms unless reported (n=13)
| Estimate of non-adherence | Binary measure of adherence (n=11) | Continuous* measure of adherence (n=2) | Binary or continuous* measure of adherence (n=13) |
| ≤5% | 4 (36) | 0 (0) | 4 (31) |
| 6%–10% | 4 (36) | 0 (0) | 4 (31) |
| 11%–25% | 0 (0) | 1 (50) | 1 (8) |
| 26%–50% | 2 (18)† | 1 (50) | 3 (23) |
| >50% | 1 (9)† | 0 (0) | 1 (8) |
Data presented as n (%).
*Mean level of non-adherence.
†Two papers provided an estimate of non-adherence in only one arm of the trial.
Statistical methods that were identified as attempting to account for non-adherence to interventions
| Method (estimand*) | Brief description | n (%) | Advantages† | Disadvantages† |
| IV approaches (CACE) | CACE estimated using the conventional IV estimator, the generalised method of moments IV estimator or 2SLS regression | 9 (35) |
Straightforward to compute. Preserves the balance in patient characteristics from randomisation. Although the validity of the IV estimator depends on several key assumptions, these assumptions are likely to hold in most double-blinded studies. Correctly adjusts for missing outcome data, assuming that these are MAR. Can account for unknown confounders. Recent methods using doubly robust procedures have been developed to boost power when using IV estimation |
Accurate data on compliance behaviour must also be available. The IV method … does increase the sample size requirements of the study as the expected proportion of non-compliers increases. Requires the ‘exclusion restriction’ to be fulfilled (ie, treatment allocation only influences the outcome through the treatment and not through any other pathways). This assumption is unverifiable and we are only likely to be confident that it holds in a double-blinded study |
| Adjustment for observed adherence (estimand unclear) | Observed adherence included as a covariate within a regression model | 3 (12) | None stated | None stated |
| Adherence modelled as a time-varying covariate in a time-to-event analysis (estimand unclear) | Attributes the time at risk between observations and the outcomes occurring during the same period to the concurrent value of adherence | 3 (12) | None stated |
Results … (from the) time-dependent analysis could be the consequence of a selection bias. |
| Tipping point approach (estimates the probability of reversing the trial conclusions under a range of assumptions about the outcome data following non-adherence) | Outcome data following non-adherence is treated as missing. Assesses how sensitive the trial results are to the values of these missing outcomes | 2 (8)‡ |
A model or mechanism for the missing outcomes does not have to be assumed. All randomised individuals are included in the analysis |
Require(s) assumptions that are often difficult or not possible to verify |
| Rank preserving structural failure time model and G-estimation (ATE) | Untreated survival times (those that would occur if no EXP were received) are assumed to be a function of both time on/off EXP and the effect of EXP compared with CON. The value for the effect of EXP that results in equal untreated survival times in the randomised groups is identified (via G-estimation) and used to calculate adjusted survival times that would have been observed had no switching occurred | 2 (8) |
Takes compliance history into account. Maintains the original randomised group. G-estimation can be performed for other types of outcomes, such as continuous, binary or count responses, using structural nested mean models |
Can only be used under a specific set of assumptions. The rank preserving structural failure time model … incorporates a strong non-interaction assumption with respect to the treatment effect |
| Inverse-probability-of-treatment weighting (ATE) | A pseudo-population is created by re-weighting participants’ outcomes according to the probability of adherence at each visit, given previous values of the intervention received and confounders. The causal effect of EXP is estimated by performing an unadjusted analysis in the pseudo-population, which is equivalent to a weighted analysis in the original cohort | 2 (8) |
Ensure(s) that the reweighted arms are similar and comparable. Sensitivity analysis methods are available to address unobserved confounding and covariate measurement errors |
It eliminates bias if all confounders can be appropriately adjusted for, but in general this will not be possible |
| Structural mean models (CACE) | Baseline variables that predict adherence differentially in each arm of the trial and are also conditionally independent of outcome are identified. Enables the estimation of two distinct causal parameters, from which a contrast can be made | 1 (4) |
Straightforward to implement using standard statistical software |
This paper highlights the increase in variance experienced when fitting these models, something that can only be reduced when the models include strong predictors of adherence and outcome. Extension of the approach to handle non-linear outcome variables is also required |
| CACE analysis using propensity score approach (CACE) | A propensity score is developed in the EXP group in order to predict the probability that those in the CON group would have been fully adherent if assigned to EXP. CON group outcomes are re-weighted using these probabilities and compared with the outcomes of those who were fully adherent in the EXP group | 1 (4) | None stated | None stated |
| CACE analysis using a mixture modelling approach (CACE) | A mixture model is used to identify those in the CON group that are likely to have been fully adherent had they been assigned to EXP. Outcomes in this subgroup are compared with outcomes of those who were fully adherent in the EXP group | 1 (4) | None stated | None stated |
| Test statistic based on the OR in CRTs (estimand unclear) | A test statistic for assessing non-inferiority based on the OR in CRTs under the Dirichlet multinomial model | 1 (4) | None stated | None stated |
| CACE analysis (CACE) | Exact method not stated | 1 (4) | None stated | None stated |
*Intercurrent event component of the estimand.
†As stated by the authors.
‡Both applications of the tipping point approach were reported in the same methodology paper.
ATE, average treatment effect in the population; CACE, complier average causal effect; CON, control; CRT, cluster randomised trial; EXP, experimental intervention; IV, instrumental variable; MAR, missing at random; OR, odds ratio; 2SLS, two-stage least squares.
Details of the statistical methods reported on more than one occasion
| Method | Description | Key assumptions | Advantages | Disadvantages |
| IV approaches | With two randomised groups (Z; Z=0 for those in the CON group and Z=1 for those in the EXP group), a binary measure of the intervention received (X; X=1 when EXP received and X=0 when CON received), and a continuous outcome (Y), the conventional IV estimator of CACE is: | The instrument (randomisation): Affects the outcome only through the intervention received (the exclusion restriction). Does not share common causes with the outcome (the exchangeability assumption). Causes some participants to receive their assigned intervention (the relevance assumption). Additional assumption required to estimate CACE: There are no participants who would always receive the opposite of their random allocation (the monotonicity assumption) |
Preserves randomised comparison. Assumptions well suited to double-blinded trials (typically assumptions 1 and 4 are satisfied by effective double blinding and use of objective outcomes, and assumption 2 valid due to randomisation). Under assumptions 1–4, able to estimate CACE even when unmeasured confounding is present. Inclusion of confounders in 2SLS regression can improve precision. Can be extended to allow for partial adherence, binary or time-to-event outcomes, and clustering, though additional assumptions may be required |
Requires untestable assumptions (1 and 4) that may be violated. When adherence is binary, the exclusion restriction implies no effect of EXP in those who are non-adherent. Only appropriate when crossovers occur and cannot be used when non-trial interventions are received. The sample size required to maintain statistical power to detect non-inferiority increases as the quantity of non-adherence increases. Simple approaches described involve a single measure of the intervention received (eg, ≥80% of sessions attended versus <80%, or the proportion of sessions attended) and are susceptible to time-varying confounding (where predictors of both adherence and outcome vary over time) |
| Adjustment for observed adherence | Observed adherence at a fixed timepoint (eg, whether surgery was performed) or multiple timepoints (eg, whether medication was taken adequately between follow-up visits) included as a covariate within a regression model |
Individuals in the EXP and CON groups with the same level of observed adherence are comparable. Within each trial arm, the effect of adherence on the outcome is the same. The functional form of adherence is correctly specified. Other model assumptions are not violated |
Relatively straightforward to implement |
Susceptible to selection bias and should be avoided. Different factors may lead to non-adherence in the two groups, meaning those in the EXP group with an observed level of adherence may differ from those in the CON group with the same level of adherence. Also, some of those in the CON group that were non-adherent may have been fully adherent if assigned to the EXP group (and vice versa). Does not account for time-varying confounding |
| Adherence modelled as a time-varying covariate in a time-to-event analysis | An extension of the Cox PH model that allows the intervention received to vary over time. The model takes the form: |
Only the current value of the intervention (at time t) affects the hazard. The effect of receiving EXP is the same for all participants regardless of when it is received. Other assumptions of the Cox PH model (including uninformative censoring) are not violated |
Allows for time-varying confounding that is not influenced by previous intervention received. Can be extended to allow for more flexible measures of the intervention received, such as cumulative exposure to the intervention. Non-trial interventions may be incorporated |
Assumptions 1 and 2 are difficult to verify and may be violated Susceptible to selection bias if switching is related to prognostic factors Does not account for time-varying confounding that is influenced by previous intervention received |
| Tipping point approach | Outcome data following non-adherence is treated as missing. A range of assumptions about these outcomes are explored to assess how sensitive the trial results are to the missing values | Assumptions made about the values of missing outcomes following non-adherence, for example, all missing outcomes are (1) failures, (2) successes in the CON group and failures in the EXP group (worst case scenario) or (3) failures in the CON group and successes in the EXP group (best case scenario) |
All randomised individuals are included in the analysis. A model or mechanism for the missing outcomes does not have to be assumed |
While a range of assumptions about the missing values can be explored, these assumptions are often difficult or not possible to verify |
| Rank preserving structural failure time model and G-estimation | Let Ti denote the observed survival time for the ith participant and Ui their survival time that would have been observed if they received no EXP. Ti is assumed to be a function of time on ( |
If no participants received any EXP, the average survival time in the two groups would be equal (due to randomisation). The effect of receiving EXP is the same for all participants regardless of when it is received. If participant i experiences the event of interest before participant j when both are treated, then participant i would also experience the event before participant j when both are untreated (rank preserving) |
Preserves randomised comparison. Takes adherence history into account. Allows for time-varying confounding, including when these confounders are affected by previous intervention received. Does not require information on potential confounders (only randomised group, observed event times and intervention history) |
Assumptions 2 and 3 are difficult to verify and may be violated. Additional assumptions required when the CON group receive an active intervention. Only appropriate when crossovers occur and cannot be used when non-trial interventions are received. G-estimation may not work well if the number of participants or events is small. Computationally intensive |
| Inverse-probability-of-treatment weighting | Confounding is accounted for by re-weighting participants’ outcomes. Typically, logistic regression is used to predict probabilities of adherence at each visit given previous values of the intervention received and confounders. The inverse of these probabilities (the weights) are used to create a pseudo-population in which time-varying confounders are not associated with the intervention. The causal effect of EXP is estimated by performing an unadjusted analysis in the pseudo-population, which is equivalent to a weighted analysis in the original cohort. |
The absence of unmeasured confounding (exchangeability). Participants have a non-zero probability of receiving each intervention (positivity). The observed outcome equals the counterfactual outcome of the intervention actually received (consistency). No misspecification of the model used to estimate the weights |
Preserves randomised comparison. Takes adherence history into account. Allows for time-varying confounding, including when these confounders are affected by previous intervention received. Can be extended to allow for non-trial interventions, but may result in large weights and estimates very sensitive to model specification |
Eliminates bias if all confounders can be appropriately adjusted for, but in general this will not be possible. Cannot be used if covariates perfectly predict adherence. Unstable in the presence of extreme weights |
CACE, complier average causal effect; CON, control; EXP, experimental intervention; HR, hazard ratio; IV, instrumental variable; PH, proportional hazards; 2SLS, two-stage least squares.