| Literature DB >> 34853905 |
Anders Granholm1, Waleed Alhazzani2,3, Lennie P G Derde4,5, Derek C Angus6, Fernando G Zampieri7, Naomi E Hammond8,9, Rob Mac Sweeney10, Sheila N Myatra11, Elie Azoulay12, Kathryn Rowan13, Paul J Young14,15,16,17, Anders Perner18, Morten Hylander Møller18.
Abstract
Randomised clinical trials (RCTs) are the gold standard for providing unbiased evidence of intervention effects. Here, we provide an overview of the history of RCTs and discuss the major challenges and limitations of current critical care RCTs, including overly optimistic effect sizes; unnuanced conclusions based on dichotomization of results; limited focus on patient-centred outcomes other than mortality; lack of flexibility and ability to adapt, increasing the risk of inconclusive results and limiting knowledge gains before trial completion; and inefficiency due to lack of re-use of trial infrastructure. We discuss recent developments in critical care RCTs and novel methods that may provide solutions to some of these challenges, including a research programme approach (consecutive, complementary studies of multiple types rather than individual, independent studies), and novel design and analysis methods. These include standardization of trial protocols; alternative outcome choices and use of core outcome sets; increased acceptance of uncertainty, probabilistic interpretations and use of Bayesian statistics; novel approaches to assessing heterogeneity of treatment effects; adaptation and platform trials; and increased integration between clinical trials and clinical practice. We outline the advantages and discuss the potential methodological and practical disadvantages with these approaches. With this review, we aim to inform clinicians and researchers about conventional and novel RCTs, including the rationale for choosing one or the other methodological approach based on a thorough discussion of pros and cons. Importantly, the most central feature remains the randomisation, which provides unparalleled restriction of confounding compared to non-randomised designs by reducing confounding to chance.Entities:
Keywords: Clinical trials; Critical care; Intensive care; Randomized clinical trials
Mesh:
Year: 2021 PMID: 34853905 PMCID: PMC8636283 DOI: 10.1007/s00134-021-06587-9
Source DB: PubMed Journal: Intensive Care Med ISSN: 0342-4642 Impact factor: 41.787
Fig. 1Timeline of important milestones in the general history of clinical trials based on references [2, 3]. A historical timeline of key critical care studies and RCTs is available elsewhere [6]
Fig. 2Overview of different study types and their role in clinical research programmes. In general, pre-clinical studies can provide necessary background or laboratory knowledge that may be used to generate hypotheses later assessed in clinical trials. Summarising existing evidence prior to start of clinical studies is sensible, to identify knowledge gaps, avoid duplication of efforts, and inform further clinical studies. Surveys may identify existing beliefs, practices and attitudes towards further studies; cross-sectional studies and cohort studies can describe prevalence, outcomes, predictors/risk factors and current practice. Randomised clinical trials remain the gold standard for intervention comparisons but may also provide data for secondary studies not necessarily focussing on the randomised intervention comparison. Before randomised clinical trials aimed at assessing efficacy or effectiveness of an intervention are conducted, pilot/feasibility trials may be conducted to prepare larger trials and assess protocol delivery and feasibility. Following the conduct of a randomised clinical trial, relevant systematic reviews and clinical practice guidelines should be updated as necessary, to ease implementation of trial results into clinical practice. Of note, the process is not always linear and unidirectional, and different study types may be conducted at different temporal stages during a research programme. Translational research may incorporate pre-clinical and laboratory studies and clinical studies, including non-randomised cohort studies and randomised clinical trials. Similarly, clinical studies may be used to collect data or samples that are further analysed outside the clinical setting
Fig. 3Direction of probabilities in frequentist (A) and Bayesian (B) analyses. This figure illustrates the direction of probabilities in frequentist (conventional) and Bayesian statistical analyses. A Frequentist P values, Pr(data | H0): probability of obtaining data (illustrated with a spreadsheet) at least as extreme as what was observed given the assumption that the null hypothesis (illustrated with a light bulb with 0 next to it) is correct. This mean that frequentist statistical tests assume that the null hypothesis (generally, that there is exactly no difference between interventions) is true. It then calculates the probability of obtaining a result at least as extreme (i.e., a difference that is at least as large as what was observed) under the assumption that there is no difference. Low P values thus provide direct evidence against the null hypothesis, but only indirect evidence related to the hypothesis of interest (i.e., that there is a difference), which makes them difficult to interpret. With more frequent analyses, there is an increased risk of obtaining results that would be surprising if the null hypothesis is true, and thus, with more tests or interim analyses, the risk of rejection the null hypothesis due to chance (a type I error) increases. B Bayesian probabilities, Pr(H | data): the probability of any hypothesis of interest (illustrated with a light bulb; e.g., that there is benefit with the intervention) given the data collected. Bayesian probabilities thus provide direct evidence for any hypothesis of interest, and the probabilities for multiple hypotheses, e.g. any benefit, clinically important benefit, or a difference smaller than what is considered clinically important, can be calculated from the same posterior distribution without any additional analyses or multiplicity issues. If further data are collected, the posterior probability distribution is updated and replaces the old posterior probability distribution. For both frequentist and Bayesian models, these probabilities are calculated according to a defined model and all its included assumptions—and for Bayesian analyses also a defined prior probability distribution—all of which are assumed to be correct or appropriate for the results to be trusted. Abbreviations and explanations: data: the results/difference observed; H: a hypothesis of interest; H0: a null hypothesis (i.e., that there is no difference). Pr: probability; |: should be read as “given”
Fig. 4Heterogeneity of treatment effects in clinical trial. Forest plot illustrating a fictive clinical trial enrolling 4603 patients. In this trial, the average treatment effect may be considered neutral with a relative risk (RR) of 0.96 and 95% confidence interval of 0.90–1.04 (or inconclusive, if this interval included clinically relevant effects). The trial population consists of three fictive subgroups with heterogeneity of treatment effects: A, with an intervention effect that is neutral (or inconclusive), similarly to the pooled result; B, with substantial benefit from the intervention; and C, with substantial harm from the intervention. If only the average intervention effect is assessed, it may be concluded – based on the apparent neutral overall result – that whether the intervention or control is used has little influence on patient outcomes, and it may be missed that the intervention provides substantial benefit in some patients and substantial harm in others. Similarly, an intervention with an overall beneficial effect may be more beneficial in some subgroups than others and may provide harm in some patients, and vice versa
Methodological features that may improve clinical trials benefits and challenges
| Feature | Benefits and advantages | Challenges and disadvantages |
|---|---|---|
| Larger trials (increased sample sizes) | Decreased uncertainty, increased precision Easier to detect potential subgroup differences Less chance of inconclusive results (i.e., greater precision and less uncertainty); results from fewer large RCTs are easier to compare than results from many smaller RCTs Easier to address safety concerns if properly monitored, as larger trials have higher chances of detecting rare adverse events Increased generalisability/external validity in multicentre/international RCTs | Economic: trial cost and optimal use of overall research resources Collaboration: increased workload coordinating, different regulatory requirements including different handling of consent procedures and reporting of adverse events, challenges with coordination due to language and time zone differences Comparability: potential differences in standard of care/available resources in international trials Academic challenges: less individually led projects due to increased collaboration – group authorships may be less attractive in settings where individual author positions are valued (e.g., grant applications) |
| Standardisation and meta-analyses | Increased comparability/less heterogeneity Less competition between trials Meta-analysis may be more sensible – less statistical inconsistency may lead to more precise results Prospective meta-analyses or meta-trials may provide quicker answers than individual conventional RCTs and meta-analyses, especially if trialists share data earlier, and adequate certainty is obtained before individual trials finish | Agreement between investigators on the design and variables could be challenging and time-consuming; compromises may be necessary for standardisations; core outcome sets may improve this Data not routinely collected in one setting may be required due to standardisation, potentially increasing workload in some centres If adequate standardisation is not possible, comparisons in meta-analyses may be difficult Differences in populations, interventions, comparators, outcomes, concurrent treatments and changes over time may hamper interpretation of meta-analyses |
| Research programmes | Complete research programmes including multiple study types may lead to better RCTs focussing on more relevant questions Evidence synthesis prior to trial conduct puts trials into context and may help identify the largest knowledge gaps or where new trials are not necessary | Research programmes may require substantial resources and time until an eventual trial can start; in most situations this will be sensible, but may not be possible during pandemics or emergencies and may require additional resources and funding |
| Outcome choices | Choosing non-dichotomous or non-mortality outcomes carrying more information may lead to more efficient or conclusive trials and smaller sample size requirements Outcomes with more levels than just dead/alive may convey important information on how well survivors fare | Definition and handling of death is challenging, including appropriate “weighting” of death, and clinical interpretation if mortality is treated in a special manner (e.g., if days alive without life support is analysed as an ordinal variable with death treated as worse than 0 days) Many non-mortality patient-important outcomes have skewed distributions complicating many common statistical (parametric) analyses and estimations of differences on an interpretable scale (including in meta-analyses) Difficulties in interpretation if effects on mortality and other parts of the outcome are in different directions (for composite outcomes and days alive without life support and similar outcomes) Risk of choosing less patient-important outcomes or surrogate outcomes |
| Avoiding dichotomisation of results, probabilistic interpretations | Nuanced conclusions; assessing evidence as a continuum avoids risk of incorrect “absence of evidence interpreted as evidence of absence” errors Using Bayesian methods allow incorporation of previous results or scepticism and easier propagation of uncertainty to subsequent calculations The same level of evidence may not be required to change clinical practice for all interventions—this depends on price, risk of adverse events, availability, character of intervention, invasiveness, etc.; these considerations apply to both trials and clinical practice guidelines | Probabilistic interpretations do not solve the primary issues of many trials; lack of dichotomisation does not in itself increase the certainty of evidence While conventional significance thresholds are arbitrary, they are widely used; changing methods may lead some researchers to opt for less strict thresholds or allow increased “spin” in conclusions. Disagreements in interpretation may increase if there is no standard threshold and pre-specified criteria for success for e.g. approving new interventions and standardised policy responses may be warranted Non-dichotomous and more detailed interpretations of trial results may be more difficult to communicate to non-researchers and non-experts Prior selection in Bayesian analyses adds additional complexity; results may be unduly influenced by strong(er) priors not shared by other researchers. Sensible priors (often non- or weakly informative priors are used in the primary Bayesian analyses of critical care trials), transparently reported, ideally pre-specified, and with adequate sensitivity analyses performed is warranted |
| Improved HTE analyses | Predictive HTE analyses and other approaches considering multiple patient characteristics simultaneously or overall risk may better reflect clinical reality than one-variable-at-a-time subgroup analyses Hierarchical models may limit the risk of exaggerated results and chance findings in smaller subgroups and increase precision due to borrowing of information Assessment of HTE according to variables of interest on the continuous scale may better detect dose–response relationships than categorised subgroup analyses | Subgroup or HTE analyses, regardless of approach, generally requires more patients—trials may still be underpowered to detect differences The more analyses conducted, the greater risk of chance findings—this may be mitigated, but not completely solved, by the discussed approaches Requires careful consideration of whether HTE analyses should be conducted on the absolute or relative scales; when the baseline risk differs between groups, there will always be HTE on either the relative or the absolute scale (often, intervention effects are most consistent on the relative scale) |
| Adaptation | Adaptive sample sizes/stopping rules may lead to optimally sized trials, more likely to reach conclusive evidence Adaptive arm adding/dropping may increase overall trial efficiency Adaptive randomisation may increase chance of getting better interventions in some situations, which may make trial participation more attractive to patients Adaptive enrichment may enable trials to better detect differences in responses and tailor interventions to different subpopulations or phenotypes | Logistic and economic challenges in planning and funding trials without fixed sample sizes; alternative financing models may be necessary Planning may be more difficult; instead of simple sample size calculations, advanced statistical simulation may be necessary to estimate required sizes and risk of random errors, requiring increased collaboration with statisticians and increased training of clinician-researchers Pre-specified criteria for stopping/adaptation necessary; may be difficult to define Adaptation requires more real-time data collection and verification, increasing data registration burden on individual sites Adaptations may be complex to implement and communicate Outcomes with longer follow-up lead to slower adaptation compared to shorter-term outcomes, which may add additional complexity. Consequently, the use of shorter-term outcomes to guide adaptive trials instead of the outcome of primary interest may be considered in some situations Risk of adaptations based on chance findings/fluctuations may require restraints of adaptation to avoid random errors, which is difficult to plan and handle While adaptive trials may be more likely to reach conclusive evidence if continued until a stopping rule is reached, they may need to be substantially larger to confirm or refute all clinically relevant effects (as is the case for conventional trials, too) |
| Adaptive platform trials | Increased efficiency, and potentially similar advantages as for adaptive conventional trials May decrease time to clinical adaptation and enable “learning while doing” Reuse of trial infrastructure and embedding in electronic health records and clinical practice may increase efficiency and decrease cost Potential improvement of informed consent procedures compared to consent when co-enrolment in multiple trials occurs Familiarity and consistency with a common platform design may be easier in practice than repeated conduction of independent RCTs | Same challenges as adaptive trials in general Potential regulatory issues; less well-known design may complicate approvals May take longer time to setup and implement than regular trials More complex – may be more difficult to implement and train staff, more difficult to explain to patients/potential complication of consent procedures, relatives and other stakeholders, may be more difficult to work with for non-researcher clinicians Standards for conducting and reporting less developed; may be more difficult to report and explain results Additional complexity with time drift/temporal variation and response-adaptive randomisation and potential re-use of non-concurrent controls requires adequate statistical handling to avoid bias Potential challenges with workload/stress of perpetual trials |
| Embedding trials in clinical practice and registers | Tighter integration of clinical practice and clinical trials may lead to faster improvements in patient care Embedding clinical trials in electronic health records may reduce data-collection burden and cost and alert clinicians and researchers of eligible patients and clinical events Register-based trials (including register-based cluster-randomised trials) may reduce data-collection burden and trial cost by using clinical registers already in place | Register-based data-collection may not be as easily standardised without changing individual registers; compromises based on availability in registers may be necessary Embedding trials in registers or electronic health records poses additional challenges with different electronic health record software and across borders Data quality and completeness in registers may not be as good as when data are prospectively collected for all variables Limited long-term outcome data generally available in registers due to additional complexity of data collection |
HTE heterogeneity of treatment effects; RCT randomised clinical trial
| In this review, the primary challenges of conventional randomised clinical trials in critical care are discussed. This is followed by discussion of potential solutions and novel trial methods, including the challenges and potential disadvantages of using these methods. |