| Literature DB >> 35998928 |
Suzie Cro1, Brennan C Kahan2, Sunita Rehal3, Anca Chis Ster4, James R Carpenter2,5, Ian R White2, Victoria R Cornelius1.
Abstract
OBJECTIVES: To evaluate how often the precise research question being addressed about an intervention (the estimand) is stated or can be determined from reported methods, and to identify what types of questions are being investigated in phase 2-4 randomised trials.Entities:
Mesh:
Year: 2022 PMID: 35998928 PMCID: PMC9396446 DOI: 10.1136/bmj-2022-070146
Source DB: PubMed Journal: BMJ ISSN: 0959-8138
Deciphering the research question being investigated in example trial11*
| Research question (estimand†) attribute | Definition of attribute | Method of statistical analysis used in example trial* (estimator) | Inferable attribute from reported statistical analysis (estimator) in atopic dermatitis trial‡ |
|---|---|---|---|
| Population | Population of patients targeted by clinical question (ie, who the treatment effect is for) | Analysis included all randomised participants | Adult patients with atopic dermatitis, as defined by the trial inclusion and exclusion criteria |
| Treatment conditions | Treatment strategies being compared | All randomised participants were included in a repeated measure analysis using a mixed model. Data after rescue therapy or treatment discontinuation was set missing (see handling of intercurrent events); model included fixed effects for treatment and treatment-by-visit and data up to time of first rescue therapy or treatment discontinuation | Baricitinib (4 mg) or placebo daily plus topical corticosteroids continued through to week 16, and without rescue therapy |
| Outcome variable | Endpoint or measure collected for each patient | The model outcome was the change in Work Productivity and Activity Impairment-AD absenteeism domain score between baseline and 16 weeks | Change in Work Productivity and Activity Impairment-AD absenteeism domain score from baseline at 16 weeks |
| Handling of intercurrent events | Specification of how to account for intercurrent events§ | Data after treatment discontinuation and rescue therapy use were set to missing; model implicitly imputed missing data using the data from participants who did not discontinue or require rescue therapy | Treatment discontinuation was handled using a hypothetical strategy, as if all treatment was adhered to (even if discontinued due to adverse event); rescue therapy was handled using a hypothetical strategy, as if no rescue therapy was received |
| Population level summary measure for outcome | Targeted summary measure for outcome variable, used to compare treatment conditions (eg, mean difference, risk ratio, odds ratio) | Model estimated mean change from baseline and included a covariate for treatment group | Mean treatment group difference in outcome |
|
| |||
| What is the mean change from baseline in Work Productivity and Activity Impairment-AD absenteeism domain score at 16 weeks for baricitinib 4 mg daily plus topical corticosteroids versus placebo plus topical corticosteroids, continued through to week 16, and without rescue therapy for adults with atopic dermatitis, as defined by trial’s inclusion and exclusion criteria | |||
Example trial is placebo controlled trial investigating baricitinib plus topical corticosteroids versus placebo plus tropical corticosteroids in atopic dermatitis (Wollenberg A, Nakahara T, Maari C, et al).11
Table 2 provides definitions and another example of an estimand, estimator, and estimate.
Inferred from the statistical methods (ie, the estimator) for Work Productivity and Activity Impairment-AD outcomes; absenteeism domain and 4 mg dose used as example.
Strategies for handling intercurrent events summarised in ICH E9(R1) are described in table 2.
Definitions of estimands, estimators, and estimates, using an example trial13*
| Definition | Example* |
|---|---|
|
| |
| An estimand is a precise description of the question being investigated in a clinical trial. It defines what treatment effect researchers want (or demand) to find out and includes five attributes (targeted population, treatment conditions, outcome variable, handling of intercurrent events, and population level summary measure for the outcome).† | Estimand description: |
|
| |
| An estimator is the statistical machinery or method that researchers use to get from what they want to know (the estimand) to knowing the answer (the estimate); it is how you get to what you want to know. | Analysis was based on the intention-to-treat principle and included all randomised participants with at least one follow-up in the group to which they were randomised, regardless of subsequent treatment received. A linear mixed effect model estimated the mean difference in PPPASI at eight weeks between groups, adjusted for baseline and using all observed data (including data observed after treatment discontinuation, rescue therapy, prohibited therapy, or other topical therapy). |
|
| |
| An estimate is a numerical result. The estimand describes what the numerical result represents. | –1.65, 95% confidence interval –4.77 to 1.47; P=0.30 |
PPPASI=Palmoplantar pustulosis area and severity score.
Example trial looks at effect of anakinra on Palmoplantar pustulosis (Cro S, Cornelius VR, Pink AE, et al).13
First three attributes might be recognisable from the PICO framework (patient/population, intervention, comparison, and outcomes),14 but alone are insufficient, and require careful consideration because they are typically affected by intercurrent events.
Fig 1Description of primary estimand reported in 255 eligible randomised controlled trials, by estimand attribute. Table 1 provides definitions of estimand attributes. IE=intercurrent events (eg, intervention discontinuation or use of rescue therapy)
Attribute details of primary estimands in eligible randomised controlled trials
| Attribute detail of primary estimand (No of trials) | No of trials | Proportion (%) of trials where attribute is stated or inferable | Proportion (%) of total No of trials (n=255) |
|---|---|---|---|
|
| |||
| All eligible participants | 169 | 98 | 66 |
| All eligible participants with a prespecified baseline characteristic | 4 | 2 | 2 |
| Not stated or inferable | 82 | NA | 32 |
|
| |||
| Treatment offer regardless of any intercurrent event (treatment policy) | 175 | 77 | 69 |
| Treatment offer, given a specified surgery or procedure was received | 3 | 1 | 1 |
| Treatment offer, with no use of another specified treatment | 1 | 0 | 0 |
| Initiating treatment | 36 | 16 | 14 |
| Initiating treatment but with no rescue therapy | 1 | 0 | 0 |
| Receiving all treatment | 8 | 4 | 3 |
| Receiving all treatment but with no use of another specified treatment | 2 | 1 | 1 |
| Receiving a specific amount of treatment | 1 | 0 | 0 |
| Not stated or inferable | 28 | NA | 11 |
|
| |||
| Composite | 5 | 4 | 2 |
| Hypothetical | 2 | 2 | 1 |
| Treatment policy | 96 | 76 | 38 |
| Treatment policy and composite | 7* | 6 | 3 |
| Treatment policy and hypothetical | 15† | 12 | 6 |
| Not stated or inferable | 117 | NA | 46 |
|
| |||
| No | 2 | 15 | 1 |
| Treatment policy inferable (intention-to-treat analysis) | 11 | 85 | 4 |
|
| |||
| Binary outcome: | |||
| Odds ratio | 17 | 8 | 7 |
| Risk difference | 26 | 12 | 10 |
| Risk ratio | 24 | 11 | 9 |
| 1−risk ratio (vaccine efficacy) | 2 | 1 | 1 |
| Continuous outcome: | |||
| Standardised mean difference | 2 | 1 | 1 |
| Geometric mean ratio | 1 | 0 | 0 |
| Mean difference | 63 | 28 | 25 |
| Mean difference for area under curve | 1 | 0 | 0 |
| Median difference | 5 | 2 | 2 |
| Median ratio | 1 | 0 | 0 |
| Count/rate: | |||
| Incidence rate ratio | 7 | 3 | 3 |
| Survival/time-to-event: | |||
| 1−risk ratio (vaccine efficacy) | 1 | 0 | 0 |
| Hazard ratio | 66 | 29 | 26 |
| Ordinal: | |||
| Proportional odds ratio | 8 | 4 | 3 |
| Not stated or inferable | 31 | NA | 12 |
NA=not applicable; intercurrent events=events that occur after randomised and affect the interpretation or existence of patient outcomes (eg, intervention discontinuation or use of rescue therapy); treatment policy=planned treatment course but not necessarily received. Percentages might not add up to 100% owing to rounding.
One strategy (n=4 trials) classed as inferable from the reported statistical methods was considered difficult to infer and consisted of an analysis of a time-to-event outcome following the intention-to-treat principle where participants who died were censored at last observation day; this analytical approach includes deaths by infinite time to events18 (see eTable 9).
Two strategies (n=3 trials) classed as inferable from the reported statistical methods were considered difficult to infer and included the following: one trial that used a joint model for a continuous outcome (change in the estimated glomerular filtration rate from baseline modelled using linear mixed model) and time to trial discontinuation due to death or end stage kidney disease before end of 104 week follow-up (Weibull parametric survival model), and that followed the intention-to-treat principle; and two trials that used marginal models on competing risk for recurrent events to handle terminal competing events, and that followed the intention-to-treat principle. Consensus was reached that these two approaches inferred a hypothetical estimand strategy with respect to the competing terminal intercurrent events since the outcome was not collected after the occurrence of the competing terminal intercurrent events, and the resulting treatment effect is estimated conditionally on a patient specific frailty (random effect) that models the correlation between the outcome and occurrence of the competing terminal intercurrent events19 (see eTable 9).
Statistical methods used for handling intercurrent events in eligible randomised controlled trials, by strategy (excluding treatment policy strategies)
| Statistical method | No of primary estimands | No of supplementary estimands (n=35 estimands, n=28 trials) |
|---|---|---|
|
| ||
| Participant with IE treated as non-responder or treatment failure | 6* | — |
| Participant with IE treated given poor outcome (change in outcome=0) | 1 | — |
| Participant with terminal IE (death) given worst outcome (score 0 on continuous scale) | 1 | — |
| Participant with deaths at any time point during follow-up censored at last observation day—time-to-event analysis (ie, those who died treated as an infinite time to event) | 4 | 4 |
| Competing risk model—Fine and Gray (ie, those individuals who died are treated as an infinite time to event) | — | 9 |
| Joint rank model including all survival events (ranked participants by time to death and then by change in outcome; ranked score then analysed as outcome) | — | 1 |
| Imputed as having been administered antibiotics (primary count outcome) for the remainder of time after death (a composite strategy) | — | 1§ |
|
| ||
| Time-to-event data censored at the time of IE (time-to-event analysis) | 10 | 4 |
| Data after IE set to missing—mixed model | 2 | 1¶ |
| Data after IE set to missing—multiple imputation | — | 4** |
| Data after IE set to missing—inverse probability weighting | — | 2 |
| Data after IE not collected as does not exist (terminal IE)—multiple imputation | 2†‡ | — |
| Data after IE not collected as does not exist (terminal IE)—competing risk (terminal IE) marginal model for recurrent events | 2 | — |
| Data after IE not collected as does not exist (terminal IE)—joint model for outcome and time to terminal IE | 1 | — |
| Data after IE not collected as does not exist (terminal IE)—inverse probability weighting | — | 1 |
| Data after IE not collected as does not exist (terminal IE)—imputation with lower value than previous score for participants that died | — | 1 |
|
| ||
| Complier average causal effect (CACE) analysis (implemented using latent growth mixture model (n=1), instrumental variable regression (n=3), structural mean model (n=1), or stated a CACE model (n=1)) | — | 6†† |
| Instrumental variables analysis that estimated the change in treatment effect per unit change in compliance | — | 1 |
IE=intercurrent event.
One trial stated assumption of an unfavourable value after intercurrent event.
One trial stated that “Group B IC [intercurrent] events were assumed to follow a hypothetical scenario, in which iGFR [measured glomerular filtration rate] values after developing ESKD [end stage kidney disease] take on biologically plausible values that are not confounded by the IC event, i.e., by ESKD treatments such as dialysis or kidney transplant. Group C IC events were assumed to conform to a hypothetical scenario, in which post-IC iGFR values have a similar distribution to other non-ESKD subjects with similar characteristics and pre-IC iGFR values.”20
One trial used referenced based multiple imputation to impute data after death; the other trial used a combination of missing-at-random (MAR) and missing-not-at-random (MNAR) multiple imputation for two different intercurrent events (MAR multiple imputation after death, and MNAR multiple imputation after diagnosis or treatment for end stage kidney disease).
Trial assumed “any resident who died due to infection will have been taking antibiotics on all missing diary days.”21 eTable 9 summarises analysis methods for which the inferring of the strategy for handling intercurrent events was difficult.
Trial assumed that “withdrawn subjects, had they completed the trial, would not have behaved differently than completing subjects from the same treatment arm with the same baseline characteristics and change in body weight at time of withdrawal.”22
Two trials used MAR multiple imputation, one trial used delta based multiple imputation with a tipping point analysis, and one trial used placebo based MNAR multiple imputation post IE.
Three trials stated the following assumptions: “that a participant’s treatment was protocol compliant or non-compliant, when compliance status was indeterminable. (2 CACE [complier average causal effect] analyses done). (CACE1 assumes compliance where compliance status is indeterminable, CACE2 assumes non-compliance where compliance status is indeterminable)”23; “that members of the control group have the same probability of non-compliance as members of the intervention group; that members of the intervention group have the same probability of contamination as members of the control group; and that being offered the treatment has no effect on the outcome”24; “the treatment has no effect in the non-compliant subset.”25
Fig 2Intercurrent events occurring in eligible randomised controlled trials (n=255). Unclear if occurred=intercurrent event described in the introduction or methods but no frequency data reported, therefore was potentially an intercurrent event but not possible to ascertain whether actually occurred in the trial. *Non-adherence=treatment non-adherence or discontinuation for the given reason. †Additional=not part of usual care (eg, rescue or prohibited treatment). ‡An intercurrent event is defined as an event which occurs after randomisation and effects the existence of interpretation of trial outcomes; where death was the primary trial outcome, by definition this excludes the possibility of death being an intercurrent event. §Other terminal events observed include graft failure, termination of pregnancy, miscarriage or medical termination of pregnancy <20 weeks, pregnancy loss <22 weeks, cancelled surgery (outcome pain use within first 24 hours post-surgery). ¶Other intercurrent events are listed in eTable 4; 33 (13%) trials had one other intercurrent event, nine (4%) trials had two other intercurrent events, and two (1%) trials had three other intercurrent events