Literature DB >> 35098576

Sensitivity to missing not at random dropout in clinical trials: Use and interpretation of the trimmed means estimator.

Audinga-Dea Hazewinkel^1,2, Jack Bowden^2,3, Kaitlin H Wade^1,2, Tom Palmer^1,2, Nicola J Wiles⁴, Kate Tilling^1,2.

Abstract

Outcome values in randomized controlled trials (RCTs) may be missing not at random (MNAR), if patients with extreme outcome values are more likely to drop out (eg, due to perceived ineffectiveness of treatment, or adverse effects). In such scenarios, estimates from complete case analysis (CCA) and multiple imputation (MI) will be biased. We investigate the use of the trimmed means (TM) estimator for the case of univariable missingness in one continuous outcome. The TM estimator operates by setting missing values to the most extreme value, and then "trimming" away equal fractions of both groups, estimating the treatment effect using the remaining data. The TM estimator relies on two assumptions, which we term the "strong MNAR" and "location shift" assumptions. We derive formulae for the TM estimator bias resulting from the violation of these assumptions for normally distributed outcomes. We propose an adjusted TM estimator, which relaxes the location shift assumption and detail how our bias formulae can be used to establish the direction of bias of CCA and TM estimates, to inform sensitivity analyses. The TM approach is illustrated in a sensitivity analysis of the CoBalT RCT of cognitive behavioral therapy (CBT) in 469 individuals with 46 months follow-up. Results were consistent with a beneficial CBT treatment effect, with MI estimates closer to the null and TM estimates further from the null than the CCA estimate. We propose using the TM estimator as a sensitivity analysis for data where extreme outcome value dropout is plausible.

Entities: Chemical

Keywords: bias quantification; dropout; missing not at random; randomized controlled trials; sensitivity analyses; trimmed means

Mesh：

Year: 2022 PMID： 35098576 PMCID： PMC9303448 DOI： 10.1002/sim.9299

Source DB: PubMed Journal: Stat Med ISSN： 0277-6715 Impact factor: 2.497

INTRODUCTION

Randomized controlled trials (RCT) are considered the gold standard for assessing causality, because randomization balances observed and unobserved variables across the treatment groups, on average. However, RCTs remain vulnerable to other sources of bias, including from missing data due to dropout. In this article, we consider the case of an RCT with an incomplete continuous outcome, measured at a single time‐point. The impact of missing data depends on the missingness mechanism and the analysis model. Three missingness mechanisms can be distinguished: missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR). With MCAR, missingness is unrelated to any measured or unmeasured characteristics and the observed sample is a representative subset of the unobserved full data. MAR means the missingness can be explained by observed data and, with MNAR, missingness is a function of the unobserved data (in our case, the outcome) itself. , , There are four general ways of dealing with incomplete data: a complete case analysis (CCA), inverse probability weighting (IPW), multiple imputation (MI), and maximum likelihood (ML) based inference. A CCA is the analysis model intended to be applied to the trial data at its outset, restricted only to individuals with observed outcomes. Assuming correct model specification, CCA in our scenario is unbiased if the outcome is MCAR or MAR, conditional on the covariates in the analysis model. , , With IPW, the CCA model is fitted, but each individual is now weighted by the inverse of its probability of being observed. , While CCA and IPW discard any incomplete cases, ML and MI use all available information. With ML, parameters are estimated by maximizing the probability density function (the likelihood) of the observed data, with missing data values removed from the likelihood through a process of summation or integration. , , With MI, the observed data are used to repeatedly predict—“impute”—the missing values. MAR can be intuitively understood as the assumption that the distributions of the missing and observed values (here, outcome values) are the same given the fully observed variables. In MI, this distribution is estimated, conditional on the observed model covariates and optional auxiliary variables. The uncertainty in the imputation process is dealt with by generating multiple complete datasets, to which the analysis model is applied, , , , with the resulting estimates pooled using Rubin's rules. Broadly, IPW, MI, and ML will be valid if the models are correctly specified and if the data are MAR, conditional on the covariates in the IPW model, imputation model, and analysis model, respectively. When data are MNAR, however, the treatment effect estimate will be biased in general for any of these approaches. , , Observed data cannot be used to determine whether data are MAR or MNAR, and, consequently, if estimates from CCA, MI, ML, or IPW analyses are likely to be biased. Guidelines published by the National Research Council (US) and the European Medical Agency strongly recommend examining sensitivity to departures from the missing data assumptions made in the primary analysis. , , Two approaches for dealing with incomplete data, which can be adapted to sensitivity analyses, are selection models (SM) and pattern mixture models (PMM). SM specify the relationship between outcome and the probability of being missing, while PMMs define the outcome distribution for unobserved individuals across a range of missingness patterns. Such approaches require the specification of a missingness model or numerical sensitivity parameters, to characterize the unknown missingness mechanism. Choosing plausible distributions and values, however, is often challenging, requiring elicitation of expert opinion. , , We propose using the trimmed means (TM) estimator in addition or as a simpler alternative to SM and PMM sensitivity analyses. Permutt and Li suggested a TM estimator for RCTs where patients who drop out have more extreme (unobserved) outcome values than those who do not drop out (eg, lower value dropout, when comparator group patients, not experiencing a benefit of treatment, leave the study early). Observations are ordered within each treatment group, with the missing observations assigned the lowest rank. Equal proportions of data are trimmed away from the lower end of both treatment group distributions and the TM treatment effect estimate is obtained from a linear regression of the exposure and outcome variables, using the remaining “trimmed” data. The TM estimator requires the specification of only a single sensitivity parameter—the size of the trimming fraction, which is bounded from below by the highest observed proportion of dropout. The TM estimator will give an unbiased estimate of the true treatment effect, given two main assumptions—the location shift assumption and the strong MNAR assumption. The first specifies identical distributions of the outcome in the treatment groups, with the only difference being a mean shift. The second restricts all dropout to the fraction that is trimmed away. , Previous studies have performed a variety of simulations investigating the type 1 error and power of the TM estimator across a range of clinical scenarios and for various MNAR/MAR dropout patterns. However, the biases that arise from violations of the strict TM assumptions and how to correct for these biases have yet to be established. In this article, we derive formulae for the bias resulting from the violation of the location shift and strong MNAR assumptions for normally distributed outcomes, and illustrate the theoretical results with simple simulations of various MNAR/MAR mechanisms. Additionally, we propose an adjustment to the estimator which relaxes the location shift assumption and leaves the estimator reliant only on the strong MNAR assumption and normality. For the purpose of this article, consistent with previous publications, , , we primarily consider worst value dropout. The principle, however, is equally applicable to higher value dropout, which may occur when patients leave the study perceiving themselves to be recovered, or when higher values indicate a worse response. We also primarily consider the case of 50% fixed trimming, but provide more general formulae for alternate trimming fractions. The TM approach is illustrated in an application to the CoBalT RCT (registration ISRCTN38231611), which compares the effectiveness of cognitive behavioral therapy (CBT) as an adjunct to pharmacotherapy vs usual care in patients with treatment resistant depression. We use the TM estimator and two MI models in a sensitivity analysis of the long‐term CCA treatment effect, estimated at 46 months. We have developed an R package “tmsens” for performing a TM regression and conducting a TM estimator sensitivity analysis, available from https://github.com/dea‐hazewinkel/tmsens.

METHODS

Notation

Consider a clinical trial with n subjects, with a continuous outcome, Y, with patients randomized to receive an active treatment () or comparator (). Let denote the observed outcome for patient i randomized to treatment group j, where and . Let be the population mean for arm j, with , and its estimate, , obtained from the corresponding sample mean: When there are no missing outcomes in the trial, each is an unbiased estimate of . Let denote the true treatment effect, given by the difference in treatment group means: where is estimated by : When the trial outcome is incomplete, we define a missing indicator M, which equals 0 if Y is observed and 1 if Y is not observed. Let be the mean of all observed values for group j, given by , with and being subject i's missingness indicator. Then estimates the population complete case mean and the complete case estimand of the treatment effect is given by with estimated by

TM estimator

We first define the population trimmed mean, , in the absence of dropout ( i). For treatment group j, is given by the expected value of all observations exceeding the quantile : where p is the proportion of outcomes trimmed away from the lower end of the distribution of each group j (eg, for , the bottom 25% of the distribution would be removed). Then is estimated by with being the sample size after trimming (, with the ceiling function). Let denote the TM effect estimand, given by the difference in population trimmed means (), estimated by . If the outcomes within each treatment group are normally distributed with underlying mean and variance , then . If the treatment group SDs are equal (), is common across treatment groups and the population TM effect, , is identical to the population mean difference , since This property will also hold in absence of normality, if the outcome distributions for each group are identical in shape and differ only by a mean shift. We now consider the TM estimator in the presence of dropout. As the estimator assumes worst value dropout, all missing values () are assigned a value smaller than the worst observed outcome: The precise value is unimportant as long as the above inequality holds. The trimmed mean is obtained by taking the average of all observations exceeding the quantile of the trimming proportion, p, as in (2), where p must now be equal to or exceed the largest observed dropout proportion across the groups: When the trimming fraction, p, is chosen based on the observed dropout proportions (so that ), this is known as adaptive trimming. A more common practice is that of fixed trimming, where p is chosen a priori, with 50% trimming () a customary choice. The TM estimator assumes lower value dropout, so that all missing values are contained in the fraction, p, that is trimmed away, with no missing values in the trimmed fraction used for estimation (the “strong MNAR assumption”): with q a probability in the interval . When (4) holds, is an unbiased estimator for the TM effect, . In addition, if the group variances are equal (the “location shift assumption”), is unbiased for (3). Previously, it has been stressed that the TM estimator estimates an unique estimand—the mean difference of the best X% patients in each treatment group, making it hard to compare with other approaches. , In order to identify this TM‐specific estimand, only the strong MNAR assumption is required. Under the additional constraint of the location shift assumption, the TM effect becomes an unbiased estimator of the population mean difference as shown in (3). We now examine the two sources of bias in the TM estimator. Let be the total bias for the TM estimator, representing the deviation of the TM estimate, , from the population mean difference, : The first component results from the violation of the strong MNAR assumption, the second from violation of the location shift assumption. Supplementary Figure S1 (Appendix B) illustrates TM and CCA estimator behavior across four scenarios with varying dropout patterns and equal and unequal treatment arm SDs. Box 1 summarizes the terminology used throughout this article.

The location shift assumption

We now derive expressions for the bias resulting from the violation of the location shift assumption, , for the case of 50% trimming, with results for a more general trimming fraction, p, derived in Appendix C. We assume that the strong MNAR assumption is satisfied and the outcomes are normally distributed for each treatment arm. Then, is equal to and the total bias (5) is given by We define the 50% trimmed left‐truncated mean (Appendix C): Then, the 50% trimmed population mean difference is given by with the bias, , resulting from unequal SDs: The TM estimator bias, (7), can alternatively be expressed as a function of the trimmed fraction SDs, rather than of the unobserved full sample SDs (Appendix D): From both bias formulae, (7) and (8), it is apparent that the estimator is unbiased given equal treatment group SDs, with the latter notation (8) making it explicit that equality of trimmed fraction SDs suffices. This property underlies the adjustment to the TM estimator outlined in Section 2.5, which relaxes the location shift assumption, allowing for different treatment group SDs.

The strong MNAR assumption

We now derive expressions for the bias resulting from the violation of the strong MNAR assumption, , for 50% lower value trimming and dropout in the comparator group, with results for a given trimming fraction, p, and dropout in both groups derived in Appendices D and E, respectively. The strong MNAR assumption assumes that all missing outcomes are restricted to the fraction p that is trimmed away, so that all values exceeding the quantile are observed: with q a probability in the interval . We assume that the location shift assumption is satisfied and the outcomes are normally distributed in each treatment arm, prior to dropout. Then, is an unbiased estimator for and the total bias (5) reduces to We define a new population parameter, , which gives the population TM effect for a scenario with a normally distributed outcome in the treatment group and a comparator group where the outcome is no longer normally distributed due to dropout. Then, unbiasedly estimates , the TM effect, and the bias, (9), can be written as and, with and , simplified to Here, is the trimmed mean of a comparator group with normally distributed outcomes (6), with for 50% trimming: and the 50% trimmed mean of a comparator group with some non‐normal outcome distribution: We cannot quantify (12) without defining the underlying distribution. If we assume the outcomes are normally distributed prior to dropout, it is sufficient to specify the distribution of the dropout values. In Section 2.4.1 we derive the strong MNAR bias under assumption of a homogeneous dropout mechanism, where, in a given part of the distribution, any observation is equally likely to be selected for dropout. When the homogeneous dropout is confined to the fraction that is trimmed away, the strong MNAR assumption is satisfied. We derive the strong MNAR bias for scenarios where the dropout extends beyond this cutoff. When dropout is homogeneous across the entirety of the distribution, this is equivalent to assuming dropout is MCAR. Section 2.4.2 derives equivalent expressions for bias of the CCA estimator in these scenarios. Section 2.4.3 derives the maximum strong MNAR bias of the TM estimator, which occurs when the missing outcomes are located on the opposite side of the distribution to the one that is trimmed away (eg, if there is 20% dropout and we use the TM estimator under the assumption of 20% lowest value dropout, maximum bias would occur when the 20% highest values drop out).

TM estimator strong MNAR bias under assumption of homogeneous dropout

Consider a truncated normal distribution with lower bound, and upper bound, , and let denote the fraction of this distribution, so that for and . Let be the trimmed mean of a population with normally distributed outcomes, which are no longer normal due to dropout, and let us assume a homogeneous dropout mechanism. Let denote the fraction affected by homogeneous dropout, with , and its upper bound, so that each value in is equally likely to drop out, with a given probability a: with a a probability in the interval Further, let denote the fraction of the distribution that is trimmed away under 50% lower value trimming, the trimmed fraction used for estimation, and the fraction of the distribution for which the trimmed fraction, , is affected by dropout (in violation of the strong MNAR assumption). When the strong MNAR assumption is satisfied, the 50% trimmed mean (6) is given by the top half of the distribution, . When this assumption is violated, dropout is present in the fraction , which now contains less than 50% of the outcomes. Consequently, when estimating the 50% trimmed mean, a larger fraction of the distribution, , is taken, with , and the trimmed mean given by . The shift from 0.5 to b is a function of the dropout proportion, , the dropout spread, , the trimming proportion, , and , with We define the 50% trimmed mean in presence of homogeneous dropout, : with and the means of truncated normal distributions, with, for example, given by The bias of the TM estimator under strong MNAR violation in the comparator group (11) can then be expressed as (14) is a function of the size of the trimmed fraction (), the fraction affected by dropout (), the SD of the affected group (), and the magnitude of the shift from 0.5 to b (13), which is affected by the dropout proportion (). When the strong MNAR assumption is satisfied, no shift occurs, since , so that , and reduces to 0. The full derivation of (14) is given in Appendix E, with the equivalent for dropout in both treatment groups given in Appendix F. The bias resulting from the violation of the strong MNAR assumption is determined by the full sample distribution and the dropout mechanism. In (14), we define the bias for normally distributed outcomes and a homogeneous dropout mechanism. When calculating the bias, departures from the strong MNAR assumption are accounted for by increasing the dropout spread. We do this by allowing missing outcomes to be spread over more of the outcome distribution. For example, for 50% lower value trimming, the TM estimator assumes that all dropout occurs in the lower half of the distribution. The analyst may examine sensitivity to this assumption by specifying a dropout spread across the lower 75% of the distribution () and calculating the resulting bias with (14). This bias is calculated under the assumption that dropout is homogeneous across the specified dropout spread. Under this assumption, the greatest bias occurs when dropout is spread across the entire distribution (c=1), which is equivalent to assuming that the outcomes are MCAR. When dropout does depend on a covariate related to the outcome, this will result in violation of the strong MNAR assumption and a non‐homogeneous dropout distribution. Exact quantification of the bias in the latter scenario, however, requires specification of the covariate‐dependent dropout mechanism and the covariate‐outcome relationship and is beyond the scope of this article. Instead, we propose using (14) to calculate an upper bound for the strong MNAR bias. This formula calculates the bias of the TM estimator, which assumes directional dropout, under the assumption that the dropout in truth is random within a given dropout spread. When there is greater confidence in the strong MNAR assumption holding, the analyst can reflect this by choosing smaller values for c in (14), which reduces the assumed dropout spread, resulting in a smaller upper bound for the bias. Supplementary Figure S3 (Appendix G) shows the maximum bias calculated for a range of dropout spreads. In Section 2.4.3, we derive the bias of the TM estimator when outcomes are MNAR contrary to the strong MNAR assumption and located on the opposite side of the distribution to the one being trimmed.

CCA estimator bias under the strong MNAR assumption

Unlike the TM estimator, the CCA estimator will be biased under the strong MNAR assumption. We define this bias, , for the case of dropout in the comparator group, with results for dropout in both groups derived in Appendix H. As for the TM estimator, we derive the bias under the assumption of a homogeneous dropout mechanism and normally distributed outcomes, with given by and with the dropout spread, c its upper bound, and the dropout proportion. We fully derive (16) in Appendix H. In the absence of dropout (), and for dropout spread homogeneously across the entire outcome distribution (), is 0. The CCA estimate will be maximally biased for dropout restricted to the very lowest or highest values of the distribution, the former resulting in overestimation of the complete case mean, , the latter in its underestimation. We can write the maximum bias for a given treatment group j under high or low value dropout as The full derivation of (17) is given in Appendix I. The dropout mechanism under which maximum CCA bias is achieved can be defined in terms of a threshold SM. Consider dropout of the highest values in the comparator group, so that all observations exceeding the 'th quantile of the outcome distribution are unobserved, with an overall selection probability . For normally distributed outcomes, this threshold is given by so that the probability of selecting a given outcome value is In the broader context of SMs in general, Copas and Jackson defined a bias limit, which is equivalent to the maximum CCA estimator bias that we derive here (17).

Maximum bias of the TM estimator

We extend Copas and Jackson's bias limit to the TM estimator, and define the maximum possible TM bias, which, for the case of lower value trimming, occurs under highest value dropout. For the comparator group, the maximum bias resulting from dropout is then given by with the total maximum bias, , accounting for the potential violation of the location shift assumption, given by Full derivations of ((19), (20)) are given in Appendix J, alongside expressions for maximum bias under dropout in the treatment group. Appendix K describes a simple simulation illustrating the maximum CCA and TM estimator biases under dropout assumption violations.

Adjusted estimator

Here, we define an adjusted estimator which relaxes the location shift assumption. Under the assumption of normally distributed outcomes, the 50% trimmed fractions will have half‐normal distributions. The SD of a given fraction can be adjusted by mirroring this half‐normal distribution, and rescaling this now complete, if artificial, normal distribution, using the full sample SD of the other group. The latter SD will either be observed, in the absence of dropout, or can alternatively be extrapolated from the observed fraction SD, using the underlying properties of normality. The adjustment can be performed on either treatment group, but we illustrate the procedure for the comparator group. Let be the unadjusted 50% comparator trimmed mean, with and the adjusted comparator trimmed mean, with Substituting (21) for in (22) then gives the comparator group trimmed mean under the treatment group SD, . From the adjusted comparator trimmed mean, (22), and the unadjusted treatment trimmed mean, (6), we obtain the population adjusted TM estimate, under comparator group rescaling, , as Equivalently, we can obtain the population adjusted TM estimate, under treatment group rescaling, , with defined analogously to (22). As for the unadjusted estimator, violation of the strong MNAR assumption will bias the estimate, with violation in the comparator group resulting in an overestimation of the treatment effect, and the converse true for the treatment group. We show in simulation (Section 3, Table 1) that violation of the strong MNAR assumption results in greater bias for the adjusted estimators than for the unadjusted estimator, with the greatest bias observed when rescaling the group for which the strong MNAR assumption is most violated, and recommend that the adjustment be applied to the group for which the assumption is most plausible. In Appendix L, expressions are derived for the adjusted estimator bias resulting from strong MNAR assumption violation in either or both treatment groups, under comparator group rescaling and under treatment group rescaling.

TABLE 1

Estimated treatment effects () for the CCA (), TM (), and adjusted TM () estimators, with the latter obtained by performing the adjustment on the comparator group () and the treatment group ()

Dropout spread		β^c	βt^	β^at1	β^at0	B^tLSa	B^tSMb	B^tc	B^at1d	B^at0e
(a) σ0=σ1=1
1) h0,c=0.2	β^/B^	0.15	0.50	0.50	0.50	0.00	0.00	0.00	0.00	0.00
	SD^	0.02	0.02	0.03	0.03	0.02	0.02	0.02	0.03	0.03
2) h0,c=0.5	β^/B^	0.30	0.50	0.50	0.50	0.00	0.00	0.00	0.00	0.00
	SD^	0.02	0.02	0.03	0.03	0.02	0.02	0.02	0.03	0.03
3) h0,c=0.75	β^/B^	0.40	0.56	0.64	0.70	0.00	0.06	0.06	0.14	0.20
	SD^	0.02	0.02	0.03	0.03	0.02	0.02	0.02	0.03	0.03
4) h0,c=1	β^/B^	0.50	0.69	0.77	0.81	0.00	0.19	0.19	0.27	0.31
	SD^	0.02	0.02	0.03	0.03	0.02	0.02	0.02	0.03	0.03
(b) σ0=1.5, σ1=1
1) h0,c=0.2	β^/B^	−0.03	0.10	0.50	0.50	−0.40	0.00	−0.40	0.00	0.00
	SD^	0.03	0.03	0.03	0.03	0.03	0.03	0.03	0.03	0.03
2) h0,c=0.5	β^/B^	0.20	0.10	0.50	0.50	−0.40	0.00	−0.40	0.00	0.00
	SD^	0.03	0.03	0.03	0.03	0.03	0.03	0.03	0.03	0.03
3) h0,c=0.75	β^/B^	0.34	0.19	0.70	0.82	−0.40	0.09	−0.31	0.20	0.32
	SD^	0.03	0.03	0.04	0.04	0.03	0.03	0.03	0.04	0.04
4) h0,c=1	β^/B^	0.50	0.39	0.91	0.96	−0.40	0.29	−0.11	0.41	0.46
	SD^	0.03	0.03	0.03	0.03	0.03	0.03	0.03	0.03	0.03

Note: TM estimator bias when violating the location shift () and strong MNAR assumptions () are shown, alongside the total bias (), and the adjusted TM estimator biases (, ). Mean and SD estimates for simulations are reported. The TM estimands are estimated under 50% trimming of normally distributed outcomes (), for equal (a) and unequal (b) treatment group SDs, true treatment effect , and 20% comparator group dropout. Four scenarios are considered: dropouts restricted to (1) the lowest 20% of the distribution (), (2) the lowest 50%, (3) the lowest 75%, and (4) left unrestricted.

is defined in (7), Section 2.3.

is defined in (14), Section 2.4.

and equivalently .

, with further defined in (L14), Appendix L.3.

, with further defined in (L7), Appendix L.1.

Estimated treatment effects () for the CCA (), TM (), and adjusted TM () estimators, with the latter obtained by performing the adjustment on the comparator group () and the treatment group () Note: TM estimator bias when violating the location shift () and strong MNAR assumptions () are shown, alongside the total bias (), and the adjusted TM estimator biases (, ). Mean and SD estimates for simulations are reported. The TM estimands are estimated under 50% trimming of normally distributed outcomes (), for equal (a) and unequal (b) treatment group SDs, true treatment effect , and 20% comparator group dropout. Four scenarios are considered: dropouts restricted to (1) the lowest 20% of the distribution (), (2) the lowest 50%, (3) the lowest 75%, and (4) left unrestricted. is defined in (7), Section 2.3. is defined in (14), Section 2.4. and equivalently . , with further defined in (L14), Appendix L.3. , with further defined in (L7), Appendix L.1.

A SIMULATION STUDY

Consider a clinical trial on subjects randomized to either a treatment () or comparator () arm, with . We assume the outcomes are normally distributed, with a true treatment effect , and 20% dropout in the comparator group. Table 1 gives the mean treatment effect estimates and bias across simulations for the CCA estimator and the TM estimator under 50% trimming, for four scenarios. In the first three, dropout is restricted to the lowest 20%, 50%, and 75%, respectively, of the comparator arm outcome distribution, and MCAR in the fourth scenario. In the first two scenarios, the strong MNAR assumption is satisfied and we observe in Table 1a, for equal treatment SDs (), that the TM estimate, , is unbiased, while the CCA estimate, , is biased towards the null. On violating the strong MNAR assumption, in Scenarios 3 and 4, this CCA estimator bias decreases, with being unbiased under random dropout (). We observe the same pattern for and (Table 1b), with the CCA bias now larger due to the higher , which acts multiplicatively on the bias (16). Under unequal SDs, the location shift assumption is violated, and the TM estimates are biased towards the null. For the first two scenarios, the total bias, , is equal to , while in Scenarios 3 and 4, is given by the sum of and , with the latter biasing the TM estimates away from the null. As for the CCA estimator, the increased comparator group SD results in a larger bias in Table 1b. The adjusted TM estimates are given by and , for rescaling applied to the treatment and comparator group, respectively. In the first two scenarios, the strong MNAR assumption is satisfied, and both estimators estimate an unbiased treatment effect, , under unequal SDs (Table 1b). For Scenarios 3 and 4, however, both are biased away from the null, with bias components, and , bigger than the unadjusted estimator bias, . The greatest bias is observed for , a consequence of performing the adjustment on the group for which the strong MNAR assumption is violated. Examples of bias calculations for the CCA and (un)adjusted TM estimators are given in Appendix M. Appendix N describes an additional simulation, for trimming fractions, p, other than 50%, illustrating the bias trade‐off between the and bias components. Appendix O provides R code for obtaining adjusted and unadjusted TM estimates for a simulated dataset. Table 2 shows the results for a set of more complicated simulations, in which an additional continuous covariate, U, acts on the outcome (Scenario I). Once again, we impose 20% lower value dropout in the comparator group, but now also allow for covariate‐dependent dropout (Scenario II). Given equal treatment group SDs, the CCA estimate, , is biased, while the TM estimate, , is unbiased in Scenarios I and II, for normally and log‐normally distributed outcomes, both for the naive regression of outcome on treatment, and when including U (adj) in the regression. Given unequal SDs, the adjusted TM estimate, , obtained by rescaling the group with the least dropout, is unbiased for normally distributed outcomes, when dropout is restricted to the trimming fraction (Scenario I). Unlike the regular TM estimator, the adjusted TM estimator does not allow for additional sources of dropout and adjusting for covariates within the regression will introduce a small bias.

TABLE 2

Estimated treatment effects for the CCA (), TM (), and adjusted TM () estimator, with the latter obtained when performing the adjustment on the treatment group

		Equal SDs			Unequal SDs
		βc^	βt^	β^at1	βc^	βt^	β^at1	pd1	pd0
Normally distributed outcomes
I	β^	0.30	1.00	1.00	0.13	0.61	1.00	0.00	0.20
	SD	0.13	0.15	0.14	0.15	0.16	0.16	0.00	0.00
I (adj)	β^	0.45	1.00	1.00	0.25	0.58	0.97	0.00	0.20
	SD	0.11	0.14	0.14	0.13	0.15	0.15	0.00	0.00
II	β^	0.24	1.00	0.83	0.07	0.69	0.92	0.18	0.36
	SD	0.15	0.17	0.19	0.16	0.18	0.19	0.02	0.02
II (adj)	β^	0.39	1.00	0.83	0.18	0.66	0.89	0.18	0.36
	SD	0.13	0.16	0.18	0.15	0.17	0.18	0.02	0.02
Log‐normally distributed outcomes
I	β^	0.30	1.00	0.73	0.20	0.63	0.89	0.00	0.20
	SD	0.14	0.19	0.15	0.07	0.10	0.10	0.00	0.00
I (adj)	β^	0.46	1.00	0.73	0.33	0.61	0.86	0.00	0.20
	SD	0.13	0.19	0.15	0.03	0.09	0.09	0.00	0.00
II	β^	0.35	1.00	0.63	0.14	0.72	0.84	0.18	0.36
	SD	0.15	0.19	0.16	0.09	0.14	0.16	0.02	0.02
II (adj)	β^	0.54	1.00	0.63	0.27	0.70	0.82	0.18	0.36
	SD	0.14	0.19	0.16	0.07	0.12	0.14	0.02	0.02

Note: Mean and SD estimates for simulations are reported. The TM estimands are estimated under 50% trimming of normally and log‐normally distributed outcomes (), for equal () and unequal (, ) treatment group SDs, true treatment effect , and 20% lower value comparator group dropout (). Two scenarios are considered: (I) the outcome, Y, is a function of treatment group and an additional continuous covariate, U; (II) the outcome, Y, is a function of treatment group and U, with U also affecting dropout, simulated via a logit missingness mechanism. Dropout proportions are shown for the treatment () and comparator () groups. Unadjusted CCA and TM estimates are obtained, alongside estimates adjusted (adj) for U in the regression.

Estimated treatment effects for the CCA (), TM (), and adjusted TM () estimator, with the latter obtained when performing the adjustment on the treatment group Note: Mean and SD estimates for simulations are reported. The TM estimands are estimated under 50% trimming of normally and log‐normally distributed outcomes (), for equal () and unequal (, ) treatment group SDs, true treatment effect , and 20% lower value comparator group dropout (). Two scenarios are considered: (I) the outcome, Y, is a function of treatment group and an additional continuous covariate, U; (II) the outcome, Y, is a function of treatment group and U, with U also affecting dropout, simulated via a logit missingness mechanism. Dropout proportions are shown for the treatment () and comparator () groups. Unadjusted CCA and TM estimates are obtained, alongside estimates adjusted (adj) for U in the regression. Table 3 considers a scenario in which both outcome and covariate values are missing. In Scenario I, we have 20% lowest value dropout, and treatment dependent missingness in U. In Scenario II, we additionally have U‐dependent dropout of outcomes. Missingness in U results in a reduced sample size for both the CCA estimator () and TM estimator (). In both scenarios, imputing the missing U values in the trimmed fraction will give an unbiased treatment effect estimate.

TABLE 3

Estimated treatment effects for the CCA () and TM () estimators in observed and imputed data

		Observed data						Imputed data
		βc^	SE	Nc	βt^	SE	Nt	βt^	SE	NtI	pd1	pd0
I	Est	0.43	0.13	631	1.00	0.13	358	1.00	0.11	500	0.00	0.20
I	SD	0.13	0.00	13.34	0.15	0.01	9.79	0.14	0.01	0.00	0.00	0.00
II	Est	0.39	0.14	510	1.00	0.14	358	1.00	0.12	500	0.18	0.36
II	SD	0.15	0.01	14.89	0.17	0.01	10.22	0.15	0.01	0.00	0.02	0.02

Note: Mean and SD estimates for simulations are reported. The TM estimand is estimated under 50% trimming of normally distributed outcomes (), for equal () treatment group SDs, true treatment effect , and 20% lower value comparator group dropout (). In Scenario I, the outcome, Y, is a function of treatment group and an additional continuous covariate, U, with approx. 30% missingness in U as a function of treatment, simulated via a logit missingness mechanism. Scenario II is equivalent to I, but now U also affects dropout, simulated via a logit missingness mechanism. Estimated treatment effects are shown alongside SEs, and the number of cases used in estimation. and give the number of complete cases (U and Y not missing) in the observed and trimmed data, respectively. All estimates are adjusted for U, and data is imputed after 50% trimming.

Estimated treatment effects for the CCA () and TM () estimators in observed and imputed data Note: Mean and SD estimates for simulations are reported. The TM estimand is estimated under 50% trimming of normally distributed outcomes (), for equal () treatment group SDs, true treatment effect , and 20% lower value comparator group dropout (). In Scenario I, the outcome, Y, is a function of treatment group and an additional continuous covariate, U, with approx. 30% missingness in U as a function of treatment, simulated via a logit missingness mechanism. Scenario II is equivalent to I, but now U also affects dropout, simulated via a logit missingness mechanism. Estimated treatment effects are shown alongside SEs, and the number of cases used in estimation. and give the number of complete cases (U and Y not missing) in the observed and trimmed data, respectively. All estimates are adjusted for U, and data is imputed after 50% trimming.

AN APPLICATION TO THE COBALT RANDOMIZED CLINICAL TRIAL

The CoBalT trial , was a multicenter trial investigating the effect of CBT as an adjunct to pharmacotherapy vs usual care (UC) in 469 patients aged 18‐75 with treatment‐resistant depression. The primary outcome was the BDI‐II score—a self‐completed measure of depressive symptoms, with higher values indicating greater depression. Long‐term treatment effect was assessed by a repeated measures intention‐to‐treat analysis (), using outcomes at 6 months, 12 months, and 3‐5 years (average 46 months), and by the adjusted mean difference at 46 months. Both estimates were adjusted for BDI‐II at baseline, treatment center, and minimization variables (previously prescribed antidepressants, presence of a counselor at the general practice, and duration of current depressive episode at baseline). The repeated measures analysis showed a beneficial effect of CBT vs UC of 4.7 (95% CI: 6.4,3.0). At 46 months, the adjusted mean difference was 3.6 (95% CI: 6.6,0.6) for 136 and 112 patients in the CBT and UC group, respectively, with an average dropout rate of 47.1%. In this section, we apply the TM estimator as a sensitivity analysis for the treatment effect estimated at 46 months.

Missing data

Just over half of the initially recruited patients were observed at final follow‐up, with greater dropout in the UC group than the CBT group (Table 4). Dropout in the UC group appears to be unrelated to the most recent depression score, whereas in the CBT group, dropout was more common in those with higher recent depression scores (Table 5). The CBT group SDs are higher, most noticeably for patients missing at 46 months, suggesting that the unobserved full sample CBT SD at 46 months may be higher than the UC group SD. In Supplementary Figure S5 (Appendix P), histograms of the BDI‐II score distributions are given across time, for the UC group, the CBT group and their combined total. We observe that the distributions are moderately right‐skewed and non‐normal. No missingness was present in the minimization variables included in the model.

TABLE 4

Counts (N) and percentages (%) of patients with an observed BDI‐II score at baseline, 6, 12, and 46 months, for the usual care group (UC), the CBT treatment group (CBT), and both groups

	UC		CBT		Total
	N	%	N	%	N	%
Base	235	100	234	100	469	100
6 months	213	90.6	206	88.0	419	89.3
12 months	198	84.3	197	84.2	395	84.2
46 months	112	47.7	136	58.1	248	52.9

TABLE 5

Mean BDI‐II measurements and SDs at baseline, 6, 12, and 46 months for the usual care group (UC) and the CBT treatment group

	Usual care (UC)			CBT
	Observed^a	Missing^a	Difference^b	Observed^a	Missing^a	Difference^b
Base	30.9 (10.3)	32.7 (11.3)	−1.8 (−4.6,1.0)	30.4 (9.3)	33.6 (11.7)	−3.2 (−5.9,−0.5)
6 months	24.6 (13.1)	24.4 (13.2)	0.2 (−3.3,3.8)	17.4 (13.6)	21.8 (14.9)	−4.5 (−8.5,−0.4)
12 months	21.8 (13.3)	21.5 (12.4)	0.2 (−3.4,3.9)	15.8 (12.9)	19.5 (15.8)	−3.8 (−7.9,0.4)
46 months	23.4 (13.2)			19.2 (13.8)

Note: Patients missing and observed at 46 months are compared for all time points, with mean differences and 95% confidence intervals provided.

Given are mean and SD.

Given is the mean difference (observed‐missing) with 95% confidence interval.

Counts (N) and percentages (%) of patients with an observed BDI‐II score at baseline, 6, 12, and 46 months, for the usual care group (UC), the CBT treatment group (CBT), and both groups Mean BDI‐II measurements and SDs at baseline, 6, 12, and 46 months for the usual care group (UC) and the CBT treatment group Note: Patients missing and observed at 46 months are compared for all time points, with mean differences and 95% confidence intervals provided. Given are mean and SD. Given is the mean difference (observed‐missing) with 95% confidence interval.

Analysis details

Using the maximum CCA bias formula (17) from Section 2.4, we would expect the treatment effect to either under‐ or over‐estimate the treatment effect by approximately 10 units of the BDI‐II score. This is a wide bound, obtained without taking into account information available from the data and motivating the use of a more precise sensitivity analysis. To this end, we employed the TM approach and two MI models as a sensitivity analysis for the treatment effect at 46 months, which was estimated using linear regression adjusting for treatment center, BDI‐II at baseline and various minimization variables (previously prescribed antidepressants, presence of a counselor at the general practice, duration of current depressive episode at baseline). At 46 months, there was 41.9% and 52.3% dropout in the CBT and UC groups, respectively. Due to the high dropout proportion, adaptive trimming was employed, with 52.3% of the highest values trimmed away in both groups. Confidence intervals were obtained using a permutation‐based approach. We considered only the unadjusted TM estimator, as the adjusted estimator (Section 2.5), which adjusts for unequal treatment group SDs, is strongly reliant on normality, whereas for the unadjusted TM estimator it is sufficient that outcomes have the same distribution across treatment groups. The imputation models included auxiliary variables: baseline variables associated with missing BDI‐II at any time of follow‐up, and various measures of depression and anxiety (BDI‐II, PHQ‐7, GAD‐7, SF‐12). Two MI models were considered, with the first using only depression/anxiety information available at baseline (MI‐baseline), the second including intermediate outcome measurements at all follow‐up times. The MI models were fitted using the R software package “mice”. To interpret the sensitivity analysis results, we used the bias formulae derived in Sections 2.3 and 2.4 to establish CCA and TM estimator bias directions under different dropout scenarios (Section 4.3). Additionally, we calculated the expected TM estimator bias of the CBT treatment effect in the CoBalT data, for three variations of the most plausible dropout scenario. The location shift assumption bias and strong MNAR bias are both functions of the full sample SDs, which remain unobserved. Subject to the specified dropout mechanism, the observed dropout proportions, and under assumption of normally distributed outcomes, these can be inferred from the observed SDs, and used to calculate the relevant bias terms (derivations and R code in Appendix Q). While the TM bias formulae are derived under assumption of normally distributed outcomes and the 46 month BDI‐II scores are skewed, the formulae are sufficient for establishing bias direction and relative changes in bias size across different dropout scenarios.

Plausibility of assumptions for CCA, MI, and TM estimators

The validity of each method rests on the underlying assumptions the estimator makes about data characteristics, which cannot (usually) be tested. We used the bias formulae derived in Sections 2.3 and 2.4 to compare TM and CCA estimator behavior across four plausible dropout scenarios (Figure 1) and to identify the most plausible direction of bias in the CoBalT data with respect to the CBT treatment effect estimated at 46 months. We used this information to interpret the results of the sensitivity analysis in Section 4.4.

FIGURE 1

Trimmed means (TM) and complete case analysis (CCA) estimators across four scenarios with varying dropout patterns and treatment arm distributions, for 52.3% dropout in the usual care group (pink), 41.9% dropout in the CBT group (grey) and 52.3% higher value trimming. (1) The location shift assumption and strong MNAR assumption are satisfied, with equal treatment group SDs and strictly higher value dropout. (2) The location shift assumption is satisfied, but the strong MNAR assumption is violated in the usual care group. (3) The location shift assumption is violated, with , and the strong MNAR assumption is satisfied. (4) The location shift assumption is violated with , and the strong MNAR assumption is violated in the usual care group and to a lesser degree in the CBT group The CCA estimate will be unbiased if the outcomes at 46 months are MAR conditional on the minimization variables included in the model, and biased if the outcomes are MNAR. Dropout of more depressed patients in the CBT group will result in underestimation of the CBT complete case mean, with the CCA estimator consequently overestimating the beneficial effect of CBT (Figure 1(2)). The TM estimator will be unbiased if the location shift assumption and strong MNAR assumption hold (Figure 1(1)). From the CoBalT data, we suspect dropout of patients with more severe depressive symptoms in the CBT group (higher value dropout) and (approximately) homogeneous (MCAR) dropout in the UC group. This will, under higher value trimming, lead to the inclusion of too many high values in the trimmed fraction, and consequently to overestimation of the UC trimmed mean and the treatment effect (Figure 1(2)). Intermediate BDI‐II measurements at earlier times suggest a comparatively higher SD for the CBT group, implying that the location shift assumption may be violated (Figure 1(3)), with the larger spread of values leading to underestimation of the CBT trimmed mean and overestimation of the beneficial effect of CBT. Violation of the location shift assumption, with , combined with violation of the strong MNAR assumption in the UC group, will exacerbate the bias, leading to a substantial overestimation of the CBT treatment effect. This is illustrated in Figure 1(4), where the bias is somewhat mitigated by allowing for the possibility that the strong MNAR assumption may also be violated, if to a much lesser extent, in the CBT group. MI estimates of the CBT treatment effect at 46 months will be unbiased if the outcomes are MAR, conditional on the variables included in the imputation model. , Higher‐value dropout in the CBT group, if it cannot be explained by the observed imputation model variables, will result in the imputation of BDI‐II values that are too low in the CBT group, but approximately correct in the UC group, leading to underestimation of the CBT imputed mean. As with the CCA estimator, this will result in a bias away from the null, with the MI estimator overestimating the benefit of CBT treatment. We consider two MI models (Section 4.2), the first including a number of covariates and various depression/anxiety scores measured at baseline, and operating under the assumption that the outcomes at 46 months are missing independently of the true, unobserved outcome, conditional on the outcome measured at baseline and the other covariates. The second MI model additionally includes depression/anxiety measurements at 6 and 12 months, and assumes that the outcomes at 46 months are MAR given all earlier outcome measurements and other covariates. This assumption differs slightly for patients who also have incomplete outcomes at earlier times, with the MAR assumption then implying that the outcomes at 46 months are missing independently of the true outcome, conditional on only the observed previous outcome measures and the other covariates. Under the hypothesis of outcome‐dependent dropout, we would expect the first model to be biased. Under the plausible assumption that a patient's earlier outcomes are correlated with the one measured at 46 months, we would expect this bias to be partly mitigated in the second imputation model, by the inclusion of previous observed depression measures. Unless we are willing to assume, however, that earlier measurements are fully predictive of the final outcome and that dropout at 46 months will not be affected by a patient's BDI‐II score in the period between 12 and 46 months, it is unlikely that the missing outcomes are fully MAR conditional on the imputation variables, resulting in some MNAR bias remaining. In summary, our bias formulae suggest that under plausible assumptions, TM and CCA will overestimate the treatment effect. Consideration of MI properties suggest that both MI models will do the same, with a lesser bias for the MI model including intermediate depression/anxiety measures.

Results

The CCA treatment effect estimate for CBT of 3.89 (95% CI: 6.95, 0.83) is comparable to the one of the main CoBalT follow‐up analysis (Table 6). Slightly smaller effects were estimated by the MI models, comparable to the attenuation observed in the MI sensitivity analysis of the CoBalT follow‐up analysis. The TM estimator, in contrast, estimated a considerably bigger beneficial CBT treatment effect of 8.26 (95% CI: 11.20, 5.32). These results are in line with our expectations regarding the BDI‐II score and dropout characteristics, detailed in Section 4.3.

TABLE 6

	β^	SE	95% CI
TM	−8.26	1.47	(−11.2, −5.03)
CCA	−3.89	1.53	(−6.92, −0.87)
MI (baseline)	−3.33	1.39	(−6.07, −0.58)
MI (intermediate)	−2.56	1.37	(−5.65, −0.08)

Treatment effect estimate (), SE, and 95% confidence interval (95% CI), for the trimmed means (TM) estimator, complete case analysis (CCA), multiple imputation model with baseline outcome measurements (MI baseline) and multiple imputation model with intermediate outcome measurements (MI intermediate) From the CoBalT data, we have reason to suspect higher value dropout in the CBT group and approximately homogeneous dropout in the UC group. Table 7 takes a closer look at the TM estimator bias components calculated under three different assumptions about the dropout mechanism. In Scenario A, we assume completely directional, highest value dropout in the CBT group, with all dropout values (41.9%) restricted to the top of the distribution, and entirely homogeneous dropout in the UC group, with the dropout (53.3%) spread equally across the distribution. In Scenarios B and C, we make less stringent assumptions, with in Scenario B dropout in the top 60% for the CBT group, and in Scenario C dropout in the top 60% of the CBT group and top 80% of the UC group. We obtain a total bias, of 17.53, 11.09, and 7.1 for Scenarios A, B, and C, respectively, and calculate a bias‐adjusted estimate, which can be interpreted as an upper bound for the treatment effect estimate, , with the lower bound given by the TM estimate obtained from the data (. For all three scenarios, the CCA and MI estimates (Table 6 ) fall within the bounds, with the bias‐adjusted estimate of Scenario C most comparable, at 1.16. Our sensitivity analysis, comparing CCA, TM, and MI, is consistent with a beneficial CBT treatment effect, but suggests that it may be more modest than indicated by the CCA estimator.

TABLE 7

Trimmed mean estimator treatment effect bounds and bias components under different dropout assumptions

	Group	Spread	B^LS	B^SM0	B^SM1	B^t	β^tBA	TM estimate bounds
A	UC	1	−7.17	−10.36	0	−17.53	9.27	[−8.26, 9.27]
A	CBT	0.419	−7.17	−10.36	0	−17.53	9.27	[−8.26, 9.27]
B	UC	1	−1.14	−10.36	0.41	−11.09	2.83	[−8.26, 2.83]
B	CBT	0.6	−1.14	−10.36	0.41	−11.09	2.83	[−8.26, 2.83]
C	UC	0.8	−2.01	−5.5	0.41	−7.1	−1.16	[−8.26, −1.16]
C	CBT	0.6	−2.01	−5.5	0.41	−7.1	−1.16	[−8.26, −1.16]

Note: (A) CBT group dropout restricted to the top 41.9% of the distribution and homogeneous UC group dropout; (B) CBT group dropout in the top 60% of the distribution and homogenous UC group dropout; (C) CBT group dropout in the top 60% of the distribution and UC group dropout in the top 80%. Shown are the bias components for violation of the location shift assumption (), for violation of the strong MNAR assumption in the UC group (), for violation of the strong MNAR assumption in the CBT group (), the total bias (), the bias adjusted estimate (), and the maximum bounds of the TM estimate, .

Trimmed mean estimator treatment effect bounds and bias components under different dropout assumptions Note: (A) CBT group dropout restricted to the top 41.9% of the distribution and homogeneous UC group dropout; (B) CBT group dropout in the top 60% of the distribution and homogenous UC group dropout; (C) CBT group dropout in the top 60% of the distribution and UC group dropout in the top 80%. Shown are the bias components for violation of the location shift assumption (), for violation of the strong MNAR assumption in the UC group (), for violation of the strong MNAR assumption in the CBT group (), the total bias (), the bias adjusted estimate (), and the maximum bounds of the TM estimate, .

DISCUSSION

In this article we discuss the TM estimator and its use in a sensitivity analysis, when missing data is present in a single continuous outcome variable. Our work extends existing TM literature by deriving formulae that can be used to calculate the bias resulting from violations of the location shift and strong MNAR assumptions, establish bias direction, and also, under certain additional assumptions, calculate a limit for the maximum expected bias. In Section 4.3, we show how these formulae can be used to aid interpretation of sensitivity analyses. In addition to this, we describe an adjustment to the estimator that relaxes the location shift assumption, under the assumption of normally distributed outcomes. While our results apply to normally distributed outcomes, bias formulae can be derived in a similar manner for other distributions. In Section 3, we demonstrate in simulation that the TM estimator remains unbiased when outcomes are non‐normally distributed, when adjusting for other covariates in the model and when performing MI on covariates that are MAR, conditional on the other covariates in the imputation model. Additionally, we show that the TM estimator remains unbiased in the presence of covariate‐dependent dropout in addition to strong MNAR dropout, given that the former is not treatment‐dependent. Missing data is a common feature in RCTs and may result in biased inference, with any analysis reliant on unverifiable assumptions about the relationship between observed and missing data. To increase confidence in the primary results, their robustness should be assessed by performing sensitivity analyses under a range of plausible alternative assumptions. , , , , , While conducting such analyses in the presence of missing data is recommend practice, only a minority of affected trials report performing any kind of sensitivity analysis. Six review articles describing the handling of missing data in RCTs found that only 22% of the 649 trials reviewed reported some kind of sensitivity analysis. , , , , , The majority retained the missingness assumptions of the original analysis, with only a small subset relaxing the original assumptions, and almost none considering a MNAR mechanism. Applying sensitivity analyses can be limited by the complexity of the methods, and a lack of transparency about the assumptions underlying each method. Common choices for such analyses are SM or pattern mixture model (PMM) based approaches. SMs require the specification of the probability of being missing, conditional on the outcome. An example is the weighting approach, described by Carpenter et al, which combines MI with SM for data with missingness in a continuous outcome. Here, a logit SM is specified, given by a linear combination of some function of a fully observed covariate, X, and a partially observed outcome, Y. Such a model requires specification of the general mechanism (logit), and the contributions of X and Y, the latter of which is given by a user‐specified parameter that attempts to capture the unknown MNAR missingness mechanism. This approach combines SM with MI by imputing the datasets using standard MI procedure, and then re‐weighting observations with the SM probabilities prior to estimation. , PMMs specify the relationship between observed and missing outcomes. An example is the delta‐adjusted PMM, which can be used in conjunction with MI, and assumes, for a continuous outcome, that patients who drop out have a mean outcome that differs by a fixed amount, delta, to the ones who remain on study. , , This parameter, delta, characterizes the unknown relationship between observed and unobserved data, and is typically informed by expert knowledge (eg, the minimum clinically important difference). , , , Choosing plausible values and distributions for sensitivity analysis parameters is far from straightforward. At times, a tipping point analysis is employed, which uses a range of delta values and examines the point at which the material conclusions change. If this tipping point delta is clinically implausible, this encourages greater confidence in the primary results. , A simpler alternative to such SM‐ and PMM‐based approaches is to specify some extreme scenarios, and, under these, re‐estimate the main analysis results. An example of such an extreme scenario analysis is the combination approach of best‐worst and worst‐best case sensitivity analyses, in which beneficial outcomes are assigned to dropouts in one group and harmful ones to dropouts in the other group, and vice versa, yielding two estimates under opposing assumptions. , The TM estimator, operating under the assumption of strict directional dropout and requiring only the specification of the size of trimming fraction, can be considered part of this family of easily implemented extreme‐case scenario estimators. Such simple bias‐sensitivity analyses will often be sufficient to assess the robustness of inferences to potential bias sources, but, in the event of ambiguity, can be followed up with a more nuanced and fine‐tuned sensitivity analysis (eg, from the family of SM or PMM models). While the TM estimator is simpler to implement than SM and PMM, requiring no explicit specification of a missingness model or numerical sensitivity parameters, it is theoretically similar. The TM estimator can be thought of as a threshold SM, with the threshold value informed by the size of the trimming fraction, and with the probability of missing set to zero in the trimmed fraction and left unspecified in the part of the distribution that is trimmed away. A similar parallel can be drawn between TM and the delta‐adjusted PMM, with the delta mean difference between observed and missing values implicit in the former and a direct result of the choice of trimming fraction, which is bounded by the largest proportion of missingness observed in either treatment group. For example, given 20% dropout in the comparator group, the minimum trimming fraction is 0.2, and the maximum delta is given by the difference between the means of the upper 80% and lower 20% of the outcome value distribution. In delta‐adjusted PMMs two parameters are unknown: the mean difference, delta, and the variance of the missing values. Common practice is to set the latter equal to the observed sample variance, and to specify the mean difference, with the choice of delta value unrestricted by the model. The TM estimator, in contrast, requires the specification of only a single bounded parameter and makes no such assumption about the variance of the missing and observed values, instead assuming that the underlying treatment group SDs are equal (the “location shift assumption”). The TM estimator will give an unbiased estimate of the true treatment effect under the strong MNAR assumption and the location shift assumption. The TM estimator is only unbiased for the particular case of MNAR data where the trimmed fraction remains unaffected by dropout. The TM estimator will generally be biased, even under the null, when outcomes are MAR. An exception to this is when the MAR dropout is non‐differential across treatment groups. Specifically, in an RCT, where baseline covariates can be reasonably expected to be equally distributed across the treatment arms, MAR dropout that is dependent on such a covariate will not result in bias. In practice, missing data will frequently be characterized by more than a single missingness mechanism (eg, missing outcomes may be a mix of MNAR and both treatment‐ and covariate‐dependent MAR). Ocampo et al sought to resolve this by combining the TM estimator with MI. This approach, however, relies on the assumption that missing values determined by MAR and MNAR can be distinguished using auxiliary information and its success will be heavily affected by the reliability and interpretation of the available information. As assumptions underlying sensitivity analyses cannot be verified, it is important to know how different underlying data structures affect a given estimator. While previous studies have described the TM assumptions, , , the biases arising from the violation of these strict assumptions have not previously been characterized. By quantifying the bias resulting from violation of these assumptions and establishing bias direction under a range of plausible data scenarios, we show how conclusions can be drawn from a sensitivity analysis, moving beyond a comparison of estimates. For example, in the CoBalT analysis (Section 4.4), the TM estimator, under very strict assumptions, estimated a beneficial effect of CBT treatment of approximately 8. However, when considering more plausible data generating scenarios and relaxing these assumptions, we obtained a bias‐corrected estimate that was much smaller. In order to identify relevant sensitivity analyses, a framework, such as the one employed for the CoBalT analysis in Section 4.3, should be followed. There, we used bias formulae to interpret preliminary observations from the data (eg, higher value dropout and greater SD in the CBT group), and identify plausible bias directions. Our initial bias results suggested that all estimators would be biased away from the null and overestimate the CBT treatment effect. The logical next step would involve identifying sensitivity analyses that might be biased in the opposite direction for a plausible data scenario. The TM estimator will be particularly useful in cases where it can be expected to be biased in the opposite direction to the CCA analysis. For example, if we had seen evidence of higher value dropout in both groups (ie, previous depression being higher for those with missing outcome data in both UC and CBT groups) and evidence of similar SDs in the two arms, then we would expect the CCA estimator to underestimate the treatment effect given greater dropout in the comparator group, and overestimate the effect given higher dropout in the treatment group, while the TM estimator would remain unbiased for both. Commonly applied sensitivity analyses employ complex models, for which bias quantification methods are not readily available. In contrast, the TM estimator is a simple approach, for which, as we show, analytic expressions are available for the bias resulting from missingness. In this article, we show how the TM estimator can be used as a sensitivity analysis, by using bias formulae to interpret results in context of information available from the data, as we show in our CoBalT analysis. We recommend performing a TM sensitivity analysis when directional dropout is plausible, using the bias formulae to establish bias direction and interpret estimates.

CONFLICT OF INTEREST

The authors declare no conflicts of interest. Appendix. Supporting information Click here for additional data file.

29 in total

1. A bound for publication bias based on the fraction of unpublished studies.

Authors: John Copas; Dan Jackson
Journal: Biometrics Date: 2004-03 Impact factor: 2.571

2. Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values.

Authors: Ian R White; John B Carlin
Journal: Stat Med Date: 2010-12-10 Impact factor: 2.373

Review 3. Missing data: A statistical framework for practice.

Authors: James R Carpenter; Melanie Smuk
Journal: Biom J Date: 2021-02-24 Impact factor: 2.207

4. Sensitivity analysis for clinical trials with missing continuous outcome data using controlled multiple imputation: A practical guide.

Authors: Suzie Cro; Tim P Morris; Michael G Kenward; James R Carpenter
Journal: Stat Med Date: 2020-05-17 Impact factor: 2.373

Review 5. A systematic survey on reporting and methods for handling missing participant data for continuous outcomes in randomized controlled trials.

Authors: Yuqing Zhang; Ivan D Flórez; Luis E Colunga Lozano; Fazila Abu Bakar Aloweni; Sean Alexander Kennedy; Aihua Li; Samantha Craigie; Shiyuan Zhang; Arnav Agarwal; Luciane C Lopes; Tahira Devji; Wojtek Wiercioch; John J Riva; Mengxiao Wang; Xuejing Jin; Yutong Fei; Paul Alexander; Gian Paolo Morgano; Yuan Zhang; Alonso Carrasco-Labra; Lara A Kahale; Elie A Akl; Holger J Schünemann; Lehana Thabane; Gordon H Guyatt
Journal: J Clin Epidemiol Date: 2017-06-03 Impact factor: 6.437

6. Long-term effectiveness and cost-effectiveness of cognitive behavioural therapy as an adjunct to pharmacotherapy for treatment-resistant depression in primary care: follow-up of the CoBalT randomised controlled trial.

Authors: Nicola J Wiles; Laura Thomas; Nicholas Turner; Kirsty Garfield; Daphne Kounali; John Campbell; David Kessler; Willem Kuyken; Glyn Lewis; Jill Morrison; Chris Williams; Tim J Peters; Sandra Hollinghurst
Journal: Lancet Psychiatry Date: 2016-01-07 Impact factor: 27.083

7. Sensitivity analysis after multiple imputation under missing at random: a weighting approach.

Authors: James R Carpenter; Michael G Kenward; Ian R White
Journal: Stat Methods Med Res Date: 2007-06 Impact factor: 3.021

8. When and how should multiple imputation be used for handling missing data in randomised clinical trials - a practical guide with flowcharts.

Authors: Janus Christian Jakobsen; Christian Gluud; Jørn Wetterslev; Per Winkel
Journal: BMC Med Res Methodol Date: 2017-12-06 Impact factor: 4.615

9. Identifying treatment effects using trimmed means when data are missing not at random.

Authors: Alex Ocampo; Heinz Schmidli; Peter Quarg; Francesca Callegari; Marcello Pagano
Journal: Pharm Stat Date: 2021-06-24 Impact factor: 1.234

Review 10. The current practice of handling and reporting missing outcome data in eight widely used PROMs in RCT publications: a review of the current literature.

Authors: Ines Rombach; Oliver Rivero-Arias; Alastair M Gray; Crispin Jenkinson; Órlaith Burke
Journal: Qual Life Res Date: 2016-01-28 Impact factor: 4.147

1 in total

1. Sensitivity to missing not at random dropout in clinical trials: Use and interpretation of the trimmed means estimator.

Authors: Audinga-Dea Hazewinkel; Jack Bowden; Kaitlin H Wade; Tom Palmer; Nicola J Wiles; Kate Tilling
Journal: Stat Med Date: 2022-01-31 Impact factor: 2.497

1 in total