| Literature DB >> 20711246 |
Dan Jackson, Ian R White, Morven Leese.
Abstract
When a randomized controlled trial has missing outcome data, any analysis is based on untestable assumptions, e.g. that the data are missing at random, or less commonly on other assumptions about the missing data mechanism. Given such assumptions, there is an extensive literature on suitable methods of analysis. However, little is known about what assumptions are appropriate. We use two sources of ancillary data to explore the missing data mechanism in a trial of adherence therapy in patients with schizophrenia: carer-reported (proxy) outcomes and the number of contact attempts. This requires additional assumptions to be made whose plausibility we discuss. Proxy outcomes are found to be unhelpful in this trial because they are insufficiently associated with patient outcome and because the ancillary assumptions are implausible. The number of attempts required to achieve a follow-up interview is helpful and suggests that these data are unlikely to depart far from being missing at random. We also perform sensitivity analyses to departures from missingness at random, based on the investigators' prior beliefs elicited at the start of the trial. Wider use of techniques such as these will help to inform the choice of suitable assumptions for the analysis of randomized controlled trials.Entities:
Year: 2010 PMID: 20711246 PMCID: PMC2916212 DOI: 10.1111/j.1467-985X.2009.00627.x
Source DB: PubMed Journal: J R Stat Soc Ser A Stat Soc ISSN: 0964-1998 Impact factor: 2.483
Pattern of missingness for the QUATRO trial
| Baseline only | 26 | 11 | 37 |
| Final only | 10 | 8 | 18 |
| Neither outcome | 3 | 2 | 5 |
| Both outcomes | 165 | 184 | 349 |
| Total | 204 | 205 | 409 |
Fig. 1Expert opinion relating missing and observed quality of life in (a) the intervention and (b) the control groups
Fig. 2Directed acyclic graph of the model used for the QUATRO data
Results from the sensitivity analysis†
| Δ | Δ | ||||
|---|---|---|---|---|---|
| A | 0 | 0 | −0.01 (2.12) | 0.12 (3.04) | −0.33 (1.08) |
| B | 0.5 | 0 | −0.01 (2.12) | 0.14 (3.04) | −0.33 (1.09) |
| C | 1 | 0 | −0.02 (2.12) | 0.11 (3.06) | −0.33 (1.08) |
| D | 0 | 0.5 | −4.49 (2.13) | −4.51 (3.07) | −0.65 (1.10) |
| E | 0 | 1 | −8.69 (2.16) | −9.04 (3.08) | −0.95 (1.11) |
| F | 0 | 0 | −0.05 (2.14) | −8.88 (3.02) | 0.25 (1.10) |
| G | 0 | 1 | −8.49 (2.12) | 0.35 (3.11) | −1.53 (1.10) |
Δ(Y1,T) and Δ(Y1,C) represent the expectation of the posterior distribution of the difference between the means of the missing and observed Y1 in the intervention and control groups respectively, and denotes the intervention effect. Standard deviations are in parentheses.
In the intervention group.
In the control group.
Fig. 3Plot of final score against baseline score, assuming that unobserved data are missing at random: ○, participants where both scores are observed; •, participants where one of these scores has been replaced by the posterior mean; ⋮, …., 95% posterior credible intervals for the imputed scores
Fig. 4Plot of final score against baseline score, assuming model E in Table 2: ○, participants where both scores are observed; •, participants where one of these scores has been replaced by the posterior mean; ⋮, …., 95% posterior credible intervals for the imputed scores
Results from the sensitivity analysis including the proxy scores†
| Δ | Δ | ||||
|---|---|---|---|---|---|
| A | 0 | 0 | 0.08 (2.10) | −0.03 (3.01) | −0.30 (1.09) |
| B | 0 | 0.5 | 0.13 (2.11) | −0.08 (3.00) | −0.29 (1.08) |
| C | 0 | 1 | 0.12 (2.11) | −0.09 (3.02) | −0.29 (1.09) |
| D | 0.5 | 0 | −4.40 (2.11) | −4.66 (3.03) | −0.63 (1.09) |
| E | 1 | 0 | −8.61 (2.14) | −9.15 (3.05) | −0.93 (1.11) |
| F | 0 | 0 | −0.05 (2.13) | −8.75 (3.00) | 0.25 (1.09) |
| G | 1 | 0 | −8.27 (2.10) | −0.09 (3.04) | −1.46 (1.10) |
Baseline scores are assumed missing at random, i.e. δ=0. Δ(Y1,T) and Δ(Y1,C) represent the expectation of the posterior distribution of the difference between the means of the missing and observed Y1 in the intervention and control groups respectively, and denotes the intervention effect. Standard deviations are in parentheses.
In the intervention group.
In the control group.
Mean final MCS by number of contact attempts and trial arm†
| 1 | 40.7 (87) | 42.4 (83) |
| 2 | 40.2 (96) | 41.3 (96) |
| 3 | 38.6 (9) | 38.7 (10) |
| More than 3 | 36.2 (12) | 36.8 (16) |
The numbers of participants in each category are in parentheses.
Results from the models using the information concerning the repeated attempts to contact participants†
| Δ | Δ | ||||
|---|---|---|---|---|---|
| A | −1.54 (2.39) | −1.04 (3.13) | 0.22 (0.17) | — | −0.46 (1.09) |
| B | −2.57 (2.60) | −0.60 (3.20) | 0.14 (0.19) | 0.25 (0.29) | −0.64 (1.11) |
| C | −2.64 (2.52) | −0.94 (3.16) | 0.15 (0.19) | 0.28 (0.28) | −0.61 (1.11) |
Baseline scores are assumed missing at random. Δ(Y1,T) and Δ(Y1,C) represent the expectation of the posterior distribution of the difference between the means of the missing and observed Y1 in the intervention and control groups respectively, and denotes the intervention effect. Standard deviations are in parentheses.