Literature DB >> 34050436

Permutation tests are robust and powerful at 0.5% and 5% significance levels.

Kimihiro Noguchi1, Frank Konietschke2,3, Fernando Marmolejo-Ramos4, Markus Pauly5.   

Abstract

Recent replication crisis has led to a number of ad hoc suggestions to decrease the chance of making false positive findings. Among them, Johnson (Proceedings of the National Academy of Sciences, 110, 19313-19317, 2013) and Benjamin et al. (Nature Human Behaviour, 2, 6-10 2018) recommend using the significance level of α = 0.005 (0.5%) as opposed to the conventional 0.05 (5%) level. Even though their suggestion is easy to implement, it is unclear whether or not the commonly used statistical tests are robust and/or powerful at such a small significance level. Therefore, the main aim of our study is to investigate the robustness and power curve behaviors of independent (unpaired) two-sample tests for metric and ordinal data at nominal significance levels of α = 0.005 and α = 0.05. Through an extensive simulation study, it is found that the permutation versions of the Welch t-test and the Brunner-Munzel test are particularly robust and powerful while the commonly used two-sample tests which utilize t-distribution tend to be either liberal or conservative, and have peculiar power curve behaviors under skewed distributions with variance heterogeneity.
© 2021. The Psychonomic Society, Inc.

Entities:  

Keywords:  Nonparametric tests; Permutation tests; Replication crisis; Reproducibility issue; Robust statistics; Statistical evidence; Statistical significance

Mesh:

Year:  2021        PMID: 34050436     DOI: 10.3758/s13428-021-01595-5

Source DB:  PubMed          Journal:  Behav Res Methods        ISSN: 1554-351X


  20 in total

Review 1.  On effect size.

Authors:  Ken Kelley; Kristopher J Preacher
Journal:  Psychol Methods       Date:  2012-04-30

2.  The Wilcoxon-Mann-Whitney test under scrutiny.

Authors:  Morten W Fagerland; Leiv Sandvik
Journal:  Stat Med       Date:  2009-05-01       Impact factor: 2.373

3.  Moving beyond P values: data analysis with estimation graphics.

Authors:  Joses Ho; Tayfun Tumkaya; Sameer Aryal; Hyungwon Choi; Adam Claridge-Chang
Journal:  Nat Methods       Date:  2019-07       Impact factor: 28.547

4.  Redefine statistical significance.

Authors:  Daniel J Benjamin; James O Berger; Magnus Johannesson; Brian A Nosek; E-J Wagenmakers; Richard Berk; Kenneth A Bollen; Björn Brembs; Lawrence Brown; Colin Camerer; David Cesarini; Christopher D Chambers; Merlise Clyde; Thomas D Cook; Paul De Boeck; Zoltan Dienes; Anna Dreber; Kenny Easwaran; Charles Efferson; Ernst Fehr; Fiona Fidler; Andy P Field; Malcolm Forster; Edward I George; Richard Gonzalez; Steven Goodman; Edwin Green; Donald P Green; Anthony G Greenwald; Jarrod D Hadfield; Larry V Hedges; Leonhard Held; Teck Hua Ho; Herbert Hoijtink; Daniel J Hruschka; Kosuke Imai; Guido Imbens; John P A Ioannidis; Minjeong Jeon; James Holland Jones; Michael Kirchler; David Laibson; John List; Roderick Little; Arthur Lupia; Edouard Machery; Scott E Maxwell; Michael McCarthy; Don A Moore; Stephen L Morgan; Marcus Munafó; Shinichi Nakagawa; Brendan Nyhan; Timothy H Parker; Luis Pericchi; Marco Perugini; Jeff Rouder; Judith Rousseau; Victoria Savalei; Felix D Schönbrodt; Thomas Sellke; Betsy Sinclair; Dustin Tingley; Trisha Van Zandt; Simine Vazire; Duncan J Watts; Christopher Winship; Robert L Wolpert; Yu Xie; Cristobal Young; Jonathan Zinman; Valen E Johnson
Journal:  Nat Hum Behav       Date:  2018-01

5.  1,500 scientists lift the lid on reproducibility.

Authors:  Monya Baker
Journal:  Nature       Date:  2016-05-26       Impact factor: 49.962

6.  Four simple ways to increase power without increasing the sample size.

Authors:  Stanley E Lazic
Journal:  Lab Anim       Date:  2018-04-08       Impact factor: 2.471

7.  Sample size in four areas of psychological research.

Authors:  C B Holmes
Journal:  Trans Kans Acad Sci       Date:  1983-07

8.  Policy: NIH plans to enhance reproducibility.

Authors:  Francis S Collins; Lawrence A Tabak
Journal:  Nature       Date:  2014-01-30       Impact factor: 49.962

9.  Why most published research findings are false.

Authors:  John P A Ioannidis
Journal:  PLoS Med       Date:  2005-08-30       Impact factor: 11.613

Review 10.  Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs.

Authors:  Daniël Lakens
Journal:  Front Psychol       Date:  2013-11-26
View more
  1 in total

1.  Advice on comparing two independent samples of circular data in biology.

Authors:  Lukas Landler; Graeme D Ruxton; E Pascal Malkemper
Journal:  Sci Rep       Date:  2021-10-13       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.