Literature DB >> 20339903

The ongoing tyranny of statistical significance testing in biomedical research.

Andreas Stang1, Charles Poole, Oliver Kuss.   

Abstract

Since its introduction into the biomedical literature, statistical significance testing (abbreviated as SST) caused much debate. The aim of this perspective article is to review frequent fallacies and misuses of SST in the biomedical field and to review a potential way out of the fallacies and misuses associated with SSTs. Two frequentist schools of statistical inference merged to form SST as it is practised nowadays: the Fisher and the Neyman-Pearson school. The P-value is both reported quantitatively and checked against the alpha-level to produce a qualitative dichotomous measure (significant/nonsignificant). However, a P-value mixes the estimated effect size with its estimated precision. Obviously, it is not possible to measure these two things with one single number. For the valid interpretation of SSTs, a variety of presumptions and requirements have to be met. We point here to four of them: study size, correct statistical model, correct causal model, and absence of bias and confounding. It has been stated that the P-value is perhaps the most misunderstood statistical concept in clinical research. As in the social sciences, the tyranny of SST is still highly prevalent in the biomedical literature even after decades of warnings against SST. The ubiquitous misuse and tyranny of SST threatens scientific discoveries and may even impede scientific progress. In the worst case, misuse of significance testing may even harm patients who eventually are incorrectly treated because of improper handling of P-values. For a proper interpretation of study results, both estimated effect size and estimated precision are necessary ingredients.

Entities:  

Mesh:

Year:  2010        PMID: 20339903     DOI: 10.1007/s10654-010-9440-x

Source DB:  PubMed          Journal:  Eur J Epidemiol        ISSN: 0393-2990            Impact factor:   8.082


  17 in total

Review 1.  Sifting the evidence-what's wrong with significance tests?

Authors:  J A Sterne; G Davey Smith
Journal:  BMJ       Date:  2001-01-27

2.  Low P-values or narrow confidence intervals: which are more durable?

Authors:  C Poole
Journal:  Epidemiology       Date:  2001-05       Impact factor: 4.822

3.  Commentary: This study failed?

Authors:  Charles Poole; Ulrike Peters; Dora Il'yasova; Lenore Arab
Journal:  Int J Epidemiol       Date:  2003-08       Impact factor: 7.196

4.  Treating COPD--the TORCH trial, P values, and the Dodo.

Authors:  Klaus F Rabe
Journal:  N Engl J Med       Date:  2007-02-22       Impact factor: 91.245

Review 5.  A dirty dozen: twelve p-value misconceptions.

Authors:  Steven Goodman
Journal:  Semin Hematol       Date:  2008-07       Impact factor: 3.851

6.  Significance questing.

Authors:  K J Rothman
Journal:  Ann Intern Med       Date:  1986-09       Impact factor: 25.391

7.  Effects of moderate alcohol consumption on cognitive function in women.

Authors:  Meir J Stampfer; Jae Hee Kang; Jennifer Chen; Rebecca Cherry; Francine Grodstein
Journal:  N Engl J Med       Date:  2005-01-20       Impact factor: 91.245

8.  The use of predicted confidence intervals when planning experiments and the misuse of power when interpreting results.

Authors:  S N Goodman; J A Berlin
Journal:  Ann Intern Med       Date:  1994-08-01       Impact factor: 25.391

9.  Risks and benefits of estrogen plus progestin in healthy postmenopausal women: principal results From the Women's Health Initiative randomized controlled trial.

Authors:  Jacques E Rossouw; Garnet L Anderson; Ross L Prentice; Andrea Z LaCroix; Charles Kooperberg; Marcia L Stefanick; Rebecca D Jackson; Shirley A A Beresford; Barbara V Howard; Karen C Johnson; Jane Morley Kotchen; Judith Ockene
Journal:  JAMA       Date:  2002-07-17       Impact factor: 56.272

10.  Correlation of cytogenetic abnormalities with the outcome of patients with uveal melanoma.

Authors:  V A White; J D Chambers; P D Courtright; W Y Chang; D E Horsman
Journal:  Cancer       Date:  1998-07-15       Impact factor: 6.860

View more
  54 in total

1.  Randomized controlled trials-an indispensible part of clinical research.

Authors:  Andreas Stang
Journal:  Dtsch Arztebl Int       Date:  2011-09-30       Impact factor: 5.594

2.  Statistical perspectives: all together NOT.

Authors:  Will G Hopkins; Alan M Batterham; Franco M Impellizzeri; David B Pyne; David S Rowlands
Journal:  J Physiol       Date:  2011-11-01       Impact factor: 5.182

3.  Statistical perspectives: all together NOT.

Authors:  Will G Hopkins; Alan M Batterham; Franco M Impellizzeri; David B Pyne; David S Rowlands
Journal:  Br J Pharmacol       Date:  2012-02       Impact factor: 8.739

4.  Erratum to: Letter to the Editor: The ongoing tyranny of statistical significance testing in biomedical research.

Authors:  Tyler J VanderWeele
Journal:  Eur J Epidemiol       Date:  2010-12       Impact factor: 8.082

5.  Multiple hypothesis testing in proteomics: a strategy for experimental work.

Authors:  Angel P Diz; Antonio Carvajal-Rodríguez; David O F Skibinski
Journal:  Mol Cell Proteomics       Date:  2011-03       Impact factor: 5.911

Review 6.  Single-Case Research Methods: History and Suitability for a Psychological Science in Need of Alternatives.

Authors:  Camilo Hurtado-Parrado; Wilson López-López
Journal:  Integr Psychol Behav Sci       Date:  2015-09

7.  Cancer incidence among priests: 45 years of follow-up in four Nordic countries.

Authors:  Andreas Stang; Jan Ivar Martinsen; Kristina Kjaerheim; Elisabete Weiderpass; Pär Sparén; Laufey Tryggvadóttir; Eero Pukkala
Journal:  Eur J Epidemiol       Date:  2011-12-27       Impact factor: 8.082

8.  Placebo effects and the common cold: a randomized controlled trial.

Authors:  Bruce Barrett; Roger Brown; Dave Rakel; David Rabago; Lucille Marchand; Jo Scheder; Marlon Mundt; Gay Thomas; Shari Barlow
Journal:  Ann Fam Med       Date:  2011 Jul-Aug       Impact factor: 5.166

9.  Association of blood pressure and aortic distensibility with P wave indices and PR interval: the multi-ethnic study of atherosclerosis (MESA).

Authors:  Alvaro Alonso; Elsayed Z Soliman; Lin Y Chen; David A Bluemke; Susan R Heckbert
Journal:  J Electrocardiol       Date:  2013-02-26       Impact factor: 1.438

10.  Assessing the component associations of the healthy worker survivor bias: occupational asbestos exposure and lung cancer mortality.

Authors:  Ashley I Naimi; Stephen R Cole; Michael G Hudgens; M Alan Brookhart; David B Richardson
Journal:  Ann Epidemiol       Date:  2013-06       Impact factor: 3.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.