Literature DB >> 28199677

Use of a Spin Strategy Can Result in Unreliable Research Findings.

Abstract

Entities: Disease Species

Year: 2017 PMID： 28199677 PMCID： PMC5492806 DOI： 10.1093/ijnp/pyx013

Source DB: PubMed Journal: Int J Neuropsychopharmacol ISSN： 1461-1457 Impact factor: 5.176

× No keyword cloud information.

We read the article by Yamanouchi and colleagues with great interest and applaud the authors’ efforts to assess the noninferiority of an antipsychotic tapering regimen compared with usual care in patients with schizophrenia who were treated with 2 or more antipsychotics (Yamanouchi et al., 2015). The results of the trial showed that there were no statistically significant differences in the co-primary outcomes (i.e., quality of life and severity of psychiatric symptoms) between the 2 groups despite the posthoc calculation with large statistical power and therefore supported the noninferiority hypothesis that a tapering regimen is not worse than usual care. Although the results of the trial had important implications for mental health policy in Japan, we believe that the results should be interpreted with caution due to several discrepancies between the study protocol submitted to their funding agency in April 2011 (Iwata, 2011) and the article published in March 2015 (Yamanouchi et al., 2015) of the same trial launched in November 2010 (Table 1). We used the study protocol (Iwata, 2011) instead of the trial registration document (Yamanouchi, 2010) to understand the original study design, because the trial registration document had inadequate information about the study design.

Table 1.

Three Discrepancies between the Study Protocol Submitted to Their Funding Agency in April 2011 and the Article Published in March 2015 of the Same Trial Launched in November 2010

Concern	Original study design	Published study design
Quality of life	Primary outcome to test a superiority hypothesis	Co-primary outcome to test a noninferiority hypothesis
Sample size calculation	400	142
Severity of psychiatric symptoms	Secondary outcome to test a noninferiority hypothesis	Co-primary outcome to test a noninferiority hypothesis

Three Discrepancies between the Study Protocol Submitted to Their Funding Agency in April 2011 and the Article Published in March 2015 of the Same Trial Launched in November 2010 The first discrepancy was the stated use of quality of life, as assessed using the EuroQoL. In the study protocol, quality of life was specified as a primary outcome to test the superiority hypothesis that a tapering regimen is superior to usual care. However, in the subsequent publication, the superiority hypothesis was changed to a noninferiority hypothesis without any notice. The second discrepancy was the sample size calculation. In the study protocol, the sample size calculation for quality of life based on the use of a t test showed that a total sample size of 400 (200 in each group) would be required to provide 80% power at a 2-sided significance level of 5% to detect a standardized mean difference of 0.30, assuming a dropout rate of 10% (our sample size estimation of 392 was generated by using R version 3.2.2). In addition, the sample size calculation for symptom severity based on the use of the t test showed that a total sample size of 400 (200 in each group) would be required to provide 80% power at a 1-sided significance level of 2.5% to detect a mean difference of 0.8 (standard deviation of 5.8) with a noninferiority margin of 1.0, assuming a dropout rate of 10% (our sample size estimation of 363 was generated by using the TrialSize package in R). However, in the subsequent publication, the required total sample size was reduced from 400 to 142, without an explanation for the protocol modifications. The assumptions for the sample size calculation in the published article were completely different from those in the study protocol. In the published article, the authors assumed the use of repeated measures ANOVA, a power of 80% at a significance level of 2.5% to detect an effect size of 0.20 (called Cohen’s f) without specification of a noninferiority margin, the number of repeated measurement to be 6, a correlation among repeated measures of 0.50, and a nonsphericity correction of 1. Although the authors asserted that the sample size calculation was for testing the noninferiority hypothesis, we believe that the sample size calculation was for testing the superiority hypothesis. Now, we consider the statistical power with the reduced sample size. When assuming that all parameters except the statistical power and dropout rate are set to the same values as those in the study protocol (i.e., sample size of 142 and dropout rate of 0%), the statistical power is reduced from 80% to 39% for quality of life and to 42% for symptom severity. To calculate more conservative estimates, when assuming the sample size is set at the actual enrolled number of 163 participants, statistical power is reduced from 80% to 44% for quality of life and to 47% for symptom severity. The third discrepancy was the importance of the severity of psychiatric symptoms, as assessed using the Manchester scale. In the study protocol, symptom severity was specified as a secondary outcome to test the noninferiority hypothesis. However, in the subsequent publication, symptom severity was insidiously upgraded to a co-primary outcome. In conclusion, we believe that the trial was a “negative trial,” with misleading reporting practices that are not negligible. In fact, the results of the trial do not support the noninferiority hypothesis in the published article due to the modified hypothesis, inadequate statistical power, and selective outcome reporting. Such a misleading reporting practice is called “spin,” which is defined as specific reporting strategies that could distort the interpretation of results and mislead readers, for whatever motive (Boutron et al., 2010; Hernandez et al., 2013). One possible reason for the use of the spin strategy is that the research team consisted entirely of clinicians (i.e., 6 psychiatrists and a pharmacist) without epidemiologist or bio-statistician team members. In general, researchers with inadequate knowledge of epidemiology and/or biostatistics could be tempted to unintentionally use a spin strategy when their results have negative findings. However, unreliable results can be a consequence of the use of a spin strategy, and the editor could decide to retract the article (Cassani et al., 2015, 2016; Dimova and Allison, 2016). The retraction guideline developed by the Committee on Publication Ethics recommends that journal editors should consider retracting a publication if they have clear evidence that the findings are unreliable, either as a result of misconduct or honest error (Committee on Publication Ethics, 2009). A trial for which the superiority hypothesis is changed to a noninferiority hypothesis is typically vulnerable to unreliable findings; therefore, editors should consider retracting the publication in such situations. Researchers, reviewers, and editors should be aware of the need to ensure that the reported results are reliable and their conclusions are an appropriate reflection of the design and results.

Statement of Interest

During the past 3 years, Y.O. has served as a chairperson of the Reporting Quality Initiative of Researchers in Clinical Epidemiology; an associate editor of the Journal of Epidemiology, the Japanese Journal of Behavior Therapy, and the Japanese Journal of Cognitive Therapy; and an adviser for reviewers of the Journal of Japan Academy of Gerontological Nursing. He received personal fees from Janssen Pharmaceuticals, Inc., the Medical Technology Association, and Cando Inc. He has received research grants from the Japan Agency for Medical Research and Development; Ministry of Health, Labour and Welfare; Japan Society for the Promotion of Science; Institute for Health Economics and Policy; and Mental Health and Morita Therapy.

6 in total

1. Reporting and interpretation of randomized controlled trials with statistically nonsignificant results for primary outcomes.

Authors: Isabelle Boutron; Susan Dutton; Philippe Ravaud; Douglas G Altman
Journal: JAMA Date: 2010-05-26 Impact factor: 56.272

Review 2. Deficient reporting and interpretation of non-inferiority randomized clinical trials in HIV patients: a systematic review.

Authors: Adrian V Hernandez; Vinay Pasupuleti; Abhishek Deshpande; Priyaleela Thota; Jaime A Collins; Jose E Vidal
Journal: PLoS One Date: 2013-05-03 Impact factor: 3.240

3. Impact of weight loss diet associated with flaxseed on inflammatory markers in men with cardiovascular risk factors: a clinical study.

Authors: Roberta Soares Lara Cassani; Priscila Giacomo Fassini; Jose Henrique Silvah; Cristiane Maria Mártires Lima; Júlio Sérgio Marchini
Journal: Nutr J Date: 2015-01-10 Impact factor: 3.271

4. Evaluation of the individual safe correction of antipsychotic agent polypharmacy in Japanese patients with chronic schizophrenia: validation of safe corrections for antipsychotic polypharmacy and the high-dose method.

Authors: Yoshio Yamanouchi; Tsuruhei Sukegawa; Ataru Inagaki; Toshiya Inada; Takashi Yoshio; Reiji Yoshimura; Nakao Iwata
Journal: Int J Neuropsychopharmacol Date: 2014-12-11 Impact factor: 5.176

5. Retraction Note: Impact of weight loss diet associated with flaxseed on inflammatory markers in men with cardiovascular risk factors: a clinical study.

Authors: Roberta Soares Lara Cassani; Priscila Giacomo Fassini; Jose Henrique Silvah; Cristiane Maria Mártires Lima; Júlio Sérgio Marchini
Journal: Nutr J Date: 2016-06-07 Impact factor: 3.271

6. Inappropriate statistical method in a parallel-group randomized controlled trial results in unsubstantiated conclusions.

Authors: Rositsa B Dimova; David B Allison
Journal: Nutr J Date: 2016-06-06 Impact factor: 3.271

6 in total

1 in total

1. Response to Dr. Okumura.

Authors: Yoshio Yamanouchi; Tsuruhei Sukegawa; Ataru Inagaki; Toshiya Inada; Takashi Yoshio; Reiji Yoshimura; Nakao Iwata
Journal: Int J Neuropsychopharmacol Date: 2017-07-01 Impact factor: 5.176

1 in total