| Literature DB >> 32892743 |
Brennan C Kahan1, Gordon Forbes2, Suzie Cro3.
Abstract
Results from clinical trials can be susceptible to bias if investigators choose their analysis approach after seeing trial data, as this can allow them to perform multiple analyses and then choose the method that provides the most favourable result (commonly referred to as 'p-hacking'). Pre-specification of the planned analysis approach is essential to help reduce such bias, as it ensures analytical methods are chosen in advance of seeing the trial data. For this reason, guidelines such as SPIRIT (Standard Protocol Items: Recommendations for Interventional Trials) and ICH-E9 (International Conference for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use) require the statistical methods for a trial's primary outcome be pre-specified in the trial protocol. However, pre-specification is only effective if done in a way that does not allow p-hacking. For example, investigators may pre-specify a certain statistical method such as multiple imputation, but give little detail on how it will be implemented. Because there are many different ways to perform multiple imputation, this approach to pre-specification is ineffective, as it still allows investigators to analyse the data in different ways before deciding on a final approach. In this article, we describe a five-point framework (the Pre-SPEC framework) for designing a pre-specified analysis approach that does not allow p-hacking. This framework was designed based on the principles in the SPIRIT and ICH-E9 guidelines and is intended to be used in conjunction with these guidelines to help investigators design the statistical analysis strategy for the trial's primary outcome in the trial protocol.Entities:
Keywords: Bias; Pre-specification; Randomised trial; Transparency; p-hacking
Year: 2020 PMID: 32892743 PMCID: PMC7487509 DOI: 10.1186/s12916-020-01706-7
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
Common issues in pre-specifying statistical analysis approaches in clinical trial protocols
| Estimated prevalence | |||
|---|---|---|---|
| Issue | Problems associated with issue | Aspect | Prevalencea |
| Omitting an aspect of the analysis approach | Investigators could run multiple analyses, and selectively report the most favourable | Analysis population: Analysis model: Covariates: Missing data: | 27-47% 11-20% 27% 66-77% |
| Insufficient detail around an aspect of the analysis approach | Investigators could run multiple analyses, and selectively report the most favourable | Analysis population: Analysis model: Covariates: Missing data: | 64% 42% 23% 17%b |
| Analysis approach allows some aspects of the final analysis to be subjectively chosen based on trial data | Investigators could run multiple analyses, and selectively report the most favourable | Analysis model: Covariates: | 19% 8% |
| Multiple analysis approaches specified, without one being identified as the primary | Investigators could selectively report the most favourable result, or to elevate its importance compared to less favourable results. | Analysis population: Analysis model: Covariates: Missing data: | 11% 11% 9% 2% |
aBased on references [5] and [2]; one study evaluated protocols and published results for 70 randomised trials approved by the ethics committees for Copenhagen and Frederiksberg, Denmark in 1994-5; the other study evaluated 100 protocols of randomised trials indexed in PubMed November 2016.
b15/99 protocols gave insufficient detail around how they planned to implement multiple imputation, 2/99 protocols but gave insufficient detail around their planned inverse probability weighting procedure
Framework for pre-specifying a statistical analysis strategy (Pre-SPEC)
| Pre-specify the analysis strategy before recruitment to the trial begins. | |
| Specify a single primary analysis strategy. | |
| Each aspect of the planned analysis should be covered, including analysis population, statistical model, covariates, and handling of missing data. | |
| Provide sufficient detail to allow a third party to independently perform the analysis (ideally through statistical code). | |
| For adaptive analysis strategies which use the trial data to inform some aspect of the analysis, use deterministic decision rules that prevent analysis choices being driven by results. |