Roloff et al provide conditional power formulas for a future meta‐analysis based on an already existing meta‐analysis judged to be inconclusive and use them to determine sample sizes and a number of additional clinical trials to arrive at a conclusive updated meta‐analysis including all studies.1 In the following, we discuss the implications of this meta‐analysis‐based research strategy in comparison with a stand‐alone study‐based research strategy. The calculations by Roloff et al are done under the assumption of a fixed‐effects model (FEM) and random‐effects meta‐analysis model (REM). The main difference between both models is the assumption of heterogeneity, and therefore, whether there is a common underlying effect size or whether study effects are following a distribution with mean . Hence, inference is focused on the common effect size under a FEM and on the mean of the distribution of all (heterogeneous) effect sizes under a REM. In all following formulas, the variance estimates of already observed studies and new studies and the heterogeneity are
,
, and , respectively. The study‐effect estimates of the former studies and new studies are defined as and .First, we consider the special case, that only 1 new study will be conducted, which is planned to detect a prespecified effect with a power of 80% and therefore provides stand‐alone evidence against the null hypothesis (stand‐alone study). If the meta‐analysis is updated with this new trial by using the FEM, the treatment effect will be estimated asThe assumption of a REM and equal heterogeneity in the old and new meta‐analysis leads to the treatment effect estimateRoloff et al calculate the power of the updated meta‐analysis after including a fixed number of equally sized studies, which do not have to be planned (powered) to be conclusive on their own. 1 This suggests the segmentation of 1 stand‐alone study with variance
into smaller studies that each is 1/ times the size of the stand‐alone study with variance
that will be included in the updated meta‐analysis.In cases where a FEM is used for the analysis, it does not make a difference whether the stand‐alone study is included as 1 study or beforehand split into smaller substudies. On the other hand, the treatment effect in the updated REM meta‐analysis including the k smaller studies is estimated asHere, the distribution of the treatment effect is dependent on . Increasing downweigths the heterogeneity parameter and decreases the variance of the estimated treatment effect. Therefore, in (3) leads to a narrower confidence interval compared with (2) and a power gain.1 The magnitude of the power gain by variance reduction can be seen by maximal segmentation: For > 0 and being the total sample size of 1 additional trial segmented into trials, the variance in (3) is monotonically decreasing in , and for , increasing the variance in (3) converges to
and heterogeneity is reduced up to a
‐fold.To illustrate the conditional power approach for a REM, Roloff et al consider a systematic review of the role of preoperative chemotherapy for esophageal cancer including data from 8 studies involving 1729 patients as an example of a meta‐analysis with moderate heterogeneity (I
2 = 40.2%).2 This meta‐analysis reported a hazard ratio for the comparison of preoperative chemotherapy versus surgery alone of 0.88 (95% CI: 0.75 to 1.04). Roloff et al calculate the conditional power of the updated meta‐analysis by using a more optimistic effect of 0.82. We chose the observed effect in the inconclusive meta‐analysis of 0.88 as the best assumption for the updated meta‐analysis effect estimate and calculated that at least 7 additional studies (with a total number of additional events of roughly 18,000) have to be conducted to reach a conditional power of 80% in the updated meta‐analysis. In contrast, a stand‐alone study with a total of 1921 events had a power of 80% to detect a hazard ratio of 0.88 at a significance level of 5%.Taking the segmentation of a stand‐alone study to the extremes is not what would be expected in reality, but it highlights the question: Is it appropriate to gain power for the updated meta‐analysis by increasing the number of planned future studies while reducing the power of each of these planned future trials?The use of study segmentation and subsequent meta‐analysis as a strategy for future research raises some issues:At least 2 adequate and well‐controlled studies each clearly demonstrating efficacy are demanded as a prerequisite for drug licensing by default.3, 4 The meta‐analysis‐based research strategy here opens a door for concluding that a drug should be considered efficacious in a situation where no individual study ever met its primary objective. This strategy is currently not supported in the field of drug licensing, where replicated randomized controlled trial results are considered higher‐level evidence than meta‐analysis results.Roloff et al assume that it would be possible to understand (and then replicate) the conditions under which heterogeneity has been observed in the inconclusive meta‐analysis. They exemplify this situation with multiregional clinical trials, where heterogeneity between regions has been observed. From a purely evidentiary perspective in most of these trials, however, reasons for heterogeneity could be identified that allowed separate decision making in homogeneous subgroups (eg, Platelet Inhibition and Patient Outcomes and high‐dose aspirin,5, 6 PASS, and glomerular filtration rate mutation7).In contrast, if reasons for heterogeneity of study results can be identified, a better strategy is to model this heterogeneity or conduct studies in respective homogeneous subgroups of the population. The gain in power that results from the application of the conditional power formulas in the REM meta‐analysis leads to a purely technical reduction of heterogeneity without additional insights into the causes of heterogeneous study‐specific treatment effects.A research strategy in drug licensing based on randomized controlled stand‐alone trials could be as follows: (a) Given a homogeneous meta‐analysis, which shows a nonsignificant relevant treatment effect, we advocate the conduct of one additional stand‐alone trial based on the observed effect. Here, the additional stand‐alone trial will inevitably give the updated FEM meta‐analysis a sufficient power, as well. (b) Given a heterogeneous meta‐analysis, which features a nonsignificant relevant treatment effect, a logical research strategy would be to first make some attempts to better understand the potential reasons for heterogeneity (eg, by using subgroups as a means to understand who benefits at which risks) and then conduct 1 additional study with well‐defined inclusion and exclusion criteria. Additionally, evidence synthesis methods can be applied to get a wider picture and learn about the heterogeneity of effects, external validity, and generalizability.However, the conditional power approach might be a useful tool in identifying heterogeneity that cannot be ignored at the planning stage of a future trial. Whenever an updated meta‐analysis cannot reach sufficient power after the inclusion of 1 additional stand‐alone study, Roloff's method could indicate substantial heterogeneity worth exploring. For decision making in drug licensing, however, “Individual clinical trials should always be designed to satisfy their objectives and […] stand‐alone studies (should not be) substituted by a meta‐analysis of trials of inadequate size.”3
Authors: Kenneth W Mahaffey; Daniel M Wojdyla; Kevin Carroll; Richard C Becker; Robert F Storey; Dominick J Angiolillo; Claes Held; Christopher P Cannon; Stefan James; Karen S Pieper; Jay Horrow; Robert A Harrington; Lars Wallentin Journal: Circulation Date: 2011-06-27 Impact factor: 29.690
Authors: Stefan James; Axel Akerblom; Christopher P Cannon; Håkan Emanuelsson; Steen Husted; Hugo Katus; Allan Skene; Philippe Gabriel Steg; Robert F Storey; Robert Harrington; Richard Becker; Lars Wallentin Journal: Am Heart J Date: 2009-04 Impact factor: 4.749