| Literature DB >> 24058309 |
James D Dziura1, Lori A Post, Qing Zhao, Zhixuan Fu, Peter Peduzzi.
Abstract
Randomized clinical trials are the gold standard for evaluating interventions as randomized assignment equalizes known and unknown characteristics between intervention groups. However, when participants miss visits, the ability to conduct an intent-to-treat analysis and draw conclusions about a causal link is compromised. As guidance to those performing clinical trials, this review is a non-technical overview of the consequences of missing data and a prescription for its treatment beyond the typical analytic approaches to the entire research process. Examples of bias from incorrect analysis with missing data and discussion of the advantages/disadvantages of analytic methods are given. As no single analysis is definitive when missing data occurs, strategies for its prevention throughout the course of a trial are presented. We aim to convey an appreciation for how missing data influences results and an understanding of the need for careful consideration of missing data during the design, planning, conduct, and analytic stages.Entities:
Keywords: MAR; MCAR; MNAR; clinical trial; intent to treat; missing data; study design
Mesh:
Year: 2013 PMID: 24058309 PMCID: PMC3767219
Source DB: PubMed Journal: Yale J Biol Med ISSN: 0044-0086
Common examples of the three missing data mechanisms in clinical trials.
|
|
|
| MCAR | Administrative censoring: follow-up is terminated because the study has ended. |
| Migration-study participants move and are unable to complete visits. | |
| Random failure of the experimental instrument (e.g. test tube break, equipment failure) | |
| MAR | Missing data caused by features of the study design such as participants being removed from the trial if their conditions are not controlled sufficiently well according to protocol criteria. |
| Dropout based on recorded side-effects. | |
| Dropout based on known baseline characteristics. | |
| MNAR | Dropout based on the unobserved response (e.g., a person not responding to treatment is more likely not to provide an observation). |
| Participants miss a visit because they’ve had an outcome. |
MCAR Example: Results from a hypothetical clinical trial evaluating the effect of treatment on improvement in body fat (Outcome).
|
| ||||
|
| ||||
|
|
|
| ||
| • Risk Ratio=(30/40)/(20/40)=1.5 | ||||
| A | 30 | 10 | 40 | • SE(Ln(RR))=0.18 |
| B | 20 | 20 | 40 | • 95% CI =1.05,2.15 |
| 50 | 30 | 80 | • Chi-Square=5.33 P=0.02 | |
|
| ||||
|
| ||||
|
| ||||
|
|
|
| ||
| • Risk Ratio=(21/28)/(14/28)=1.5 | ||||
| A | 21 | 7 | 28 | • SE(Ln(RR))=0.22 |
| B | 14 | 14 | 28 | • 95% CI =0.98,2.30 |
| 35 | 21 | 56 | • Chi-Square=3.73 P=0.053 | |
*Each cell from Table 2a is multiplied by 70% to obtain 30% missing data from an MCAR mechanism in Table 2b.
MAR example: Results from a hypothetical clinical trial evaluating the effect of treatment on improvement in body fat (Outcome).
|
| ||||
| Men |
| |||
|
| ||||
|
|
|
| ||
| • Outcome in A = 225/300 = 0.75 | ||||
| A | 225 | 75 | 300 | • Outcome in B = 150/300 = 0.50 |
| B | 150 | 150 | 300 | • Risk Ratio = 1.5 |
| 375 | 225 | 600 | ||
|
| ||||
| Women |
| |||
|
| ||||
|
|
|
| ||
| • Outcome in A = 90/300 = 0.30 | ||||
| A | 90 | 210 | 300 | • Outcome in B = 60/300 = 0.20 |
| B | 60 | 240 | 300 | • Risk Ratio = 1.5 |
| 150 | 450 | 600 | ||
|
| ||||
| Total |
| |||
|
| ||||
|
|
|
| • Outcome in A = 315/600 = 0.525 | |
| • Outcome in B = 210/600 = 0.35 | ||||
| A | 315 | 285 | 600 | • Risk Ratio =1.5 |
| B | 210 | 390 | 600 | • 95%CI=1.31, 1.71 |
| 525 | 675 | 1200 | • Mantel Haenszel RR=1.5 | |
| • 95% CI=1.33, 1.70 | ||||
|
| ||||
|
| ||||
| Men |
| |||
|
| ||||
|
|
|
| ||
| • Outcome in A = 180/240 = 0.75 | ||||
| A | 180 | 60 | 240 | • Outcome in B = 120/240 = 0.50 |
| B | 120 | 120 | 240 | • Risk Ratio = 1.5 |
| 300 | 180 | 480 | ||
|
| ||||
| Women |
| |||
|
| ||||
|
|
|
| ||
| • Outcome in A = 81/270 = 0.30 | ||||
| A | 81 | 189 | 270 | • Outcome in B = 27/135 = 0.20 |
| B | 27 | 108 | 135 | • Risk Ratio =1.5 |
| 108 | 360 | 405 | ||
|
| ||||
| Total |
| |||
|
| ||||
|
|
|
| • Outcome in A = 261/510 = 0.51 | |
| • Outcome in B = 147/375 = 0.39 | ||||
| A | 261 | 249 | 510 | • Risk Ratio = 1.31 |
| B | 147 | 228 | 375 | • 95% CI=1.12, 1.52 |
| 408 | 477 | 885 | • Mantel Haenszel RR=1.5 | |
| • 95% CI=1.30, 1.73 | ||||
*Probabilities of missing are dependent on the combination of treatment and gender to mimic an MAR mechanism
Probability Missing for Men in Trt A = 0.20
Probability Missing for Men in Trt B = 0.20
Probability Missing for Women in Trt A = 0.10
Probability Missing for Women in Trt B = 0.55
MNAR example: Results from a hypothetical clinical trial evaluating the effect of treatment on improvement in body fat.
|
| ||||
|
| ||||
|
|
|
| ||
| • Risk Ratio=(315/600)/(210/600)=1.5 | ||||
| A | 315 | 285 | 600 | • 95% CI =1.31, 1.71 |
| B | 210 | 390 | 600 | • Chi-Square=37.3 P<0.001 |
| 525 | 675 | 1200 | ||
|
| ||||
|
| ||||
|
| ||||
|
| ||||
|
|
|
| ||
| • Risk Ratio=(189/474)/(210/444)=0.84 | ||||
| A | 189 | 285 | 474 | • 95% CI =0.73, 0.98 |
| B | 210 | 234 | 444 | • Chi-Square=5.14 P=0.02 |
| 399 | 519 | 918 | ||
*Probabilities of missing are dependent on the treatment and outcome to mimic an MNAR mechanism
Probability Missing for Outcome “Y” in Trt A = 0.40
Probability Missing for Outcome “N” in Trt A = 0.00
Probability Missing for Outcome “Y” in Trt B = 0.00
Probability Missing for Outcome “N” in Trt B = 0.40
Figure 1A summary of the acceptable and unacceptable analytic methods for types of missing data. Green boxes show methods giving unbiased estimates of treatment effects and correct estimates of standard errors and p-values, yellow boxes show methods giving only unbiased estimates of treatment effects, red boxes show unacceptable methods. *Preferred method as it uses all available data.
Approaches to handling and preventing missing data during trial design, planning, conduct and analysis.
|
| |
| Anticipate Expected Missing Data | 1. Estimate the expected amount of missing data and likely reasons for it. |
| 2. Account for missing data in the sample size calculations and develop a suitable pre-specified analytic plan. | |
| Methods to Encourage Participant Retention | 3. Limit burden to participant by reducing required visits and amount of data collected. |
| 4. Adopt data collection methods that don’t require face to face visits. | |
| 5. Utilize run-in periods, ascertainable treatment outcomes, shorter follow-up periods, randomized withdrawal designs where appropriate. | |
| 6. Budget for monetary incentives for participants that are weighted toward study completion. | |
|
| |
|
| |
|
| |
| Study Documentation | 7. Develop detailed study documentation in the form of manual of operations addressing all aspects of the study including screening procedures, training requirements, methods of communication, delivery of treatment, schedule and windows for assessments, and data collection/entry/editing procedures. |
| Informed Consent | 8. Develop an informed consent that distinguishes the difference between withdrawing from the treatment and withdrawing from the study. |
| Study Sites | 9. Select study sites with strong track records for enrolling, following, and completing participants. |
| 10. Adopt a reimbursement mechanism that encourages study completion. | |
| Training Study Personnel | 11. Train/certify study personnel for participant enrollment, data collection, data entry, delivery of treatment, etc. prior to enrollment with re-certification throughout trial if necessary. |
| 12. Highlight the continued collection of data in participants that are not adherent to treatment but remain in the study. | |
| Pilot Study | 13. Test operational aspects of the trial (e.g., enrollment, retention, clarity of study manuals and data collection instruments, study burden on participants, randomization, treatment delivery). |
|
| |
|
| |
|
| |
| Create Monitoring Reports | 14. Develop monitoring reports to regularly track amounts of missing data at the levels of the study site and study personnel. |
| 15. Keep track of reasons for withdrawal from the study or intervention. | |
| Enhance Participant Contact | 16. Utilize approaches to keep the study participants engaged in the study including incentives, visit reminders, newsletters, and intermittent phone calls to monitor status. |
| 17. Outline procedures for contacting individuals with missed visits in manual of operations. Identify and intervene in participants that are likely to drop out. | |
| Data Entry and Management | 18. Timely data entry allows earlier detection of problems with missing data. |
| 19. Implement a verification process requiring fields to be checked for accuracy and all discrepancies resolved before data entry. | |
| Communication | 20. Devise an efficient method of communication with study personnel for identifying and resolving unanticipated issues that arise during the study. |
|
| |
|
| |
|
| |
| Explore Missing Data | 21. The amount of missing data, missing data patterns and variables associated with missingness will help to inform the primary and sensitivity analyses. |
| Use All Available Data | 22. For primary analysis, use methods that make use of all available data such as multiple imputation or likelihood-based approaches. These methods make weaker assumptions about the missing data compared to complete case analysis. |
| 23. For primary analysis, avoid the use of ad-hoc solutions (e.g., last observation carried forward) as they make unreasonable assumptions about the mechanism that produced the missing data. | |
| Perform sensitivity analysis | 24. Use methods such as pattern mixture or selection models to examine robustness of conclusions to reasonable MNAR mechanisms. |