Reinier Cornelis Anthonius van Linschoten1,2, Rachel West3, Desirée van Noord3, Nikki van Leeuwen4. 1. Gastroenterology & Hepatology, Franciscus Gasthuis en Vlietland, Rotterdam, The Netherlands r.linschoten@franciscus.nl. 2. Gastroenterology & Hepatology, Erasmus Medical Center, Rotterdam, The Netherlands. 3. Gastroenterology & Hepatology, Franciscus Gasthuis en Vlietland, Rotterdam, The Netherlands. 4. Public Health, Erasmus Medical Center, Rotterdam, The Netherlands.
It is with great interest that we read the report of Yoshida et al
1 on the effect of second-generation narrow band imaging compared to white light imaging on detecting early gastric cancer in high-risk patients. The trial was expertly designed with a large patient population and, although superiority of narrow band imaging could not be proven, has important implications for further research on this topic. However, a significant issue concerning the analyses attracted our interest and we would like to comment on it.The primary outcome, the difference in proportion of patients in whom early gastric cancer was diagnosed, failed to reach statistical significance (p=0.412). This difference in proportions was tested for significance using Fisher’s exact test. This might not have been the proper method for analysis, as patients in the study were randomised using minimisation with a random component, stratified by institution, age and indication of endoscopy.Imbalance of risk factors between treatment and control arms can occur by chance under normal randomisation, possibly leading to confounded treatment estimates. Stratification and minimisation are useful methods to ensure balance of risk factors between treatment arms.2–4 These methods can be beneficial in small and large trials, but for trials larger than 1000 patients little effect of minimisation on imbalance was found as compared with simple randomisation.4One of the assumptions of Fisher’s exact test is that samples are random and independent, which is not the case in this study. The problem that occurs with stratified or minimised randomisation is clustering between treatment groups which introduces positive correlation between observations. The correlation between observations violates the independence assumption and will lead to standard errors (SE) that are biased upwards because tests for independent samples do not account for this correlation and will overestimate the variance of the treatment effect.As a SE that is biased upwards leads to inflated p values, not accounting for these balancing variables in the analysis may lead to wrongfully not rejecting the null hypothesis. This effect can be considerable, as a reanalysis of a large trial showed twofold to fourfold increases in p values and a simulation study showed reductions in power of up to 30 percentage points.5 6 Thus, studies using either of the balancing methods should adjust their analysis for the balancing variables used in the randomisation procedure.5Considering the report of Yoshida et al, this could mean that they may have erroneously concluded that second-generation narrow band imaging was not superior to white light imaging. We cannot determine the precise effect adjustment would have had in the study by Yoshida et al as a reanalysis requires the individual patient data. While adjustment is possible using Fisher’s exact test,7 8 we suggest performing logistic regression analysis as this allows adjustment for multiple minimisation variables, does not rely on inefficient stratification,8 and can be used to determine the confidence interval around the treatment effect estimate.Unadjusted analyses in balanced randomised trials seem to be a recurring phenomenon. In 2012, a systematic review showed that only 26% of trials published in leading journals that used a balancing method correctly adjusted for all balancing factors.6 But even in more recent trials analyses are often not adjusted for balancing factors, as shown by the study of Yoshida et al, but also by other trials in the leading journals of gastroenterology and hepatology.1 9 10 When used correctly minimisation and stratification are powerful tools for balancing randomised trials and improving the validity of studies. However, this has important consequences for data analysis. As such, we urge trialists to include the balancing variables as adjustment factors in their statistical analyses.
Authors: Anthony Y B Teoh; Masayuki Kitano; Takao Itoi; Manuel Pérez-Miranda; Takeshi Ogura; Shannon Melissa Chan; Carlos Serna-Higuera; Shunsuke Omoto; Raul Torres-Yuste; Takayoshi Tsuichiya; Ka Tak Wong; Chi-Ho Leung; Philip Wai Yan Chiu; Enders Kwok Wai Ng; James Yun Wong Lau Journal: Gut Date: 2020-03-12 Impact factor: 23.059
Authors: Selina R Cox; James O Lindsay; Sébastien Fromentin; Andrew J Stagg; Neil E McCarthy; Nathalie Galleron; Samar B Ibraim; Hugo Roume; Florence Levenez; Nicolas Pons; Nicolas Maziers; Miranda C Lomer; S Dusko Ehrlich; Peter M Irving; Kevin Whelan Journal: Gastroenterology Date: 2019-10-02 Impact factor: 22.682